Regarding sdma ring hangs: if you still have access to the affected machine using ssh, it would be helpful to add a comment with the following information: - the last dmesg lines (at least the "[drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=9871, emitted seq=9873" one) - the output of : umr -R sdma0 (or sdma1 depending on which one failed) Thanks!