Troubleshooting Ubuntu 20 and a fresh install of Amd drivers

Keith Myers
Keith Myers
Joined: 11 Feb 11
Posts: 2,430
Credit: 6,881,450,831
RAC: 24,109,665

I've never had any issues

I've never had any issues with a Linux host suspending on me, unasked unless provoked by power outages.

Run 24/7 with Performance governor.

Have the Suspend & Power Button in the Power Settings option set for suspend when on battery (UPS) to  keep from discharging the battery too far which will hurt it after 20 minutes.

Have not enabled Suspend when plugged in.  Simple setup.

 

cecht
cecht
Joined: 7 Mar 18
Posts: 1,091
Credit: 1,622,271,706
RAC: 2,451,395

I just came out of a rabbit

I just came out of a rabbit hole that I dropped into after a botched upgrade of Ubuntu and AMDGPU drivers. (Basically, I failed to follow the good advice in this thread.) In all the discussions and fixes described here, in "Linux kernel 5.10 + AMDGPU + Radeon 20.45 = Frequent Gnome Crashes", and in "A quick guide: How to install OpenCL for AMD GPUs on Linux Kubuntu 18.04 (and similar distro)" I had a problem that hadn't been covered. Regardless of the combination of kernels, AMDGPU versions, and successful AMDGPU removal and installation options, the drivers were not recognizing my GPUs.  I finally found a solution on a cryptominer's forum. It turns out that somewhere in my fumbling the upgrade and recovery attempts, amdgpu package was blacklisted by the system which prevented amdgpu from loading.

The Fix: check to see if amdgpu is blacklisted, if so, delete the file and reboot.

$ ls /etc/modprobe.d/ alsa-base.conf blacklist-ath_pci.conf blacklist-framebuffer.conf blacklist-rare-network.conf iwlwifi.conf amd64-microcode-blacklist.conf blacklist.conf blacklist-modem.conf blacklist-amdgpu.conf dkms.conf blacklist-firewire.conf blacklist-oss.conf intel-microcode-blacklist.conf 
$ cat /etc/modprobe.d/blacklist-amdgpu.conf blacklist amdgpu 
$ sudo rm /etc/modprobe.d/blacklist-amdgpu.conf
$ reboot

I'm now running Ubuntu 20.04.3 with a 5.4 kernel and just the opencl=legacy component of AMDGPU 20.10, which is where I had (desperately) wound things back to before discovering the blacklist fix. Now to decide whether to upgrade things once again or let the sleeping dog lie. My other host is running fine with Ubuntu 20.04.2, a 5.11 kernel, and the opencl=rocr component of AMDGPU 20.10.

Ideas are not fixed, nor should they be; we live in model-dependent reality.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.