Re: GPU hanging after upgrade to 14.1

From: Andrea Venturoli <ml_at_netfence.it>
Date: Mon, 07 Oct 2024 08:53:55 UTC

On 10/3/24 15:27, Andrea Venturoli wrote:

>  > drmn0: [drm] GPU HANG: ecode 9:1:85dffdfb, in MainThread [100791]
>  > drmn0: [drm] Resetting rcs0 for preemption time out
>  > drmn0: [drm] Xorg[100791] context reset due to GPU hang
> 
> or:
> 
>  > drmn0: [drm] Resetting rcs0 for CS error
>  > drmn0: [drm] MainThread[100791] context reset due to GPU hang
>  > drmn0: [drm] GPU HANG: ecode 9:1:00280001, in MainThread [100791]
>  > drmn0: [drm] GPU HANG: ecode 9:1:85dfffff, in MainThread [100791]
>  > drmn0: [drm] Resetting rcs0 for preemption time out
>  > drmn0: [drm] Xorg[100791] context reset due to GPU hang
> 
> I've seen a couple of bug reports, but those do not look the same to me.

New messages today: Thunderbird started displaying strangely, so I 
switched to the first console and saw:

> Oct  7 09:15:14 hector kernel: drmn0: [drm] *ERROR* Atomic update failure on pipe A (start=8394 end=8395) time 1675 us, min 1073, max 1079, scanline start 1001, end 1109
> Oct  7 09:17:14 hector kernel: drmn0: [drm] *ERROR* Atomic update failure on pipe A (start=14986 end=14987) time 2280 us, min 1073, max 1079, scanline start 980, end 1090
> Oct  7 09:18:14 hector kernel: drmn0: [drm] *ERROR* Atomic update failure on pipe A (start=18846 end=18847) time 1864 us, min 1073, max 1079, scanline start 962, end 1089
> Oct  7 09:19:05 hector kernel: drmn0: [drm] *ERROR* Atomic update failure on pipe A (start=22251 end=22252) time 1625 us, min 1073, max 1079, scanline start 999, end 1108

After a while, however, it started working again without the need to 
kill the X server.


  bye
	av.