[Bug 259670] graphics/drm-kmod: sporadic GPU hang with i915kms

From: <bugzilla-noreply_at_freebsd.org>
Date: Thu, 03 Feb 2022 09:56:52 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=259670

--- Comment #10 from Oleh Vinichenko <moonlapse81@gmail.com> ---
I pulled the latest drm-kmod changes, compiled them, and ran the system for a
while until it rebooted (no crash core was saved). The panic is recorded in
/var/log/messages:

Feb  3 11:44:25 chaklunchik kernel: drmn0: GPU HANG: ecode 6:1:bb13ffff, in
Renderer [101226]
Feb  3 11:44:25 chaklunchik kernel: drmn0: Resetting chip for stopped heartbeat
on rcs0
Feb  3 11:44:25 chaklunchik kernel: drmn0: Renderer[101226] context reset due
to GPU hang
Feb  3 11:44:25 chaklunchik kernel: drmn0: GPU HANG: ecode 6:1:00000000, in
Renderer [101226]
Feb  3 11:44:25 chaklunchik kernel:
Feb  3 11:44:25 chaklunchik syslogd: last message repeated 1 times
Feb  3 11:44:25 chaklunchik kernel: Fatal trap 12: page fault while in kernel
mode
Feb  3 11:44:25 chaklunchik kernel: cpuid = 5; apic id = 05
Feb  3 11:44:25 chaklunchik kernel: fault virtual address       = 0x61
Feb  3 11:44:25 chaklunchik kernel: fault code          = supervisor read data,
page not present
Feb  3 11:44:25 chaklunchik kernel: instruction pointer =
0x20:0xffffffff80986ac7
Feb  3 11:44:25 chaklunchik kernel: stack pointer               =
0x28:0xfffffe001b190b70
Feb  3 11:44:25 chaklunchik kernel: frame pointer               =
0x28:0xfffffe001b190bb0
Feb  3 11:44:25 chaklunchik kernel: code segment                = base 0x0,
limit 0xfffff, type 0x1b
Feb  3 11:44:25 chaklunchik kernel:                     = DPL 0, pres 1, long
1, def32 0, gran 1
Feb  3 11:44:25 chaklunchik kernel: processor eflags    = interrupt enabled,
resume, IOPL = 0
Feb  3 11:44:25 chaklunchik kernel: current process             = 0
(linuxkpi_short_wq_8)
Feb  3 11:44:25 chaklunchik kernel: trap number         = 12
Feb  3 11:44:25 chaklunchik kernel: panic: page fault
Feb  3 11:44:25 chaklunchik kernel: cpuid = 5
Feb  3 11:44:25 chaklunchik kernel: time = 1643881438
Feb  3 11:44:25 chaklunchik kernel: KDB: stack backtrace:
Feb  3 11:44:25 chaklunchik kernel: #0 0xffffffff8076258b at kdb_backtrace+0x6b
Feb  3 11:44:25 chaklunchik kernel: #1 0xffffffff8071646f at vpanic+0x17f
Feb  3 11:44:25 chaklunchik kernel: #2 0xffffffff807162e3 at panic+0x43
Feb  3 11:44:25 chaklunchik kernel: #3 0xffffffff80a2e455 at trap_fatal+0x385
Feb  3 11:44:25 chaklunchik kernel: #4 0xffffffff80a2e4af at trap_pfault+0x4f
Feb  3 11:44:25 chaklunchik kernel: #5 0xffffffff80a07528 at calltrap+0x8
Feb  3 11:44:25 chaklunchik kernel: #6 0xffffffff80986bbd at kmem_free+0x2d
Feb  3 11:44:25 chaklunchik kernel: #7 0xffffffff81d731ac at
__i915_gpu_coredump_free+0x12c
Feb  3 11:44:25 chaklunchik kernel: #8 0xffffffff81d473f9 at
intel_gt_handle_error+0xa9
Feb  3 11:44:25 chaklunchik kernel: #9 0xffffffff81d34e50 at heartbeat+0x110
Feb  3 11:44:25 chaklunchik kernel: #10 0xffffffff808df36d at
linux_work_fn+0xed
Feb  3 11:44:25 chaklunchik kernel: #11 0xffffffff807770e7 at
taskqueue_run_locked+0x187
Feb  3 11:44:25 chaklunchik kernel: #12 0xffffffff80778402 at
taskqueue_thread_loop+0xc2
Feb  3 11:44:25 chaklunchik kernel: #13 0xffffffff806d4082 at fork_exit+0x82
Feb  3 11:44:25 chaklunchik kernel: #14 0xffffffff80a0859e at
fork_trampoline+0xe
I am not sure whether any of these crashes are helpful. When I try the
supported drm-kmod versions (that is, the ones from ports), I have the same
issues. Maybe I could try OpenBSD, which has drm synced with
linux-kernel-5.15.x, to look for any improvements.
