[Bug 278958] zfs panic: page fault in sync_dnodes_task

From: <bugzilla-noreply_at_freebsd.org>
Date: Mon, 13 May 2024 19:17:26 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=278958

            Bug ID: 278958
           Summary: zfs panic: page fault in sync_dnodes_task
           Product: Base System
           Version: 14.0-RELEASE
          Hardware: amd64
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: kern
          Assignee: bugs@FreeBSD.org
          Reporter: nunziotocci2000@gmail.com

Created attachment 250626
  --> https://bugs.freebsd.org/bugzilla/attachment.cgi?id=250626&action=edit
core.txt

Fatal trap 12: page fault while in kernel mode
cpuid = 29; apic id = 1d
fault virtual address   = 0x0
fault code              = supervisor write data, page not present
instruction pointer     = 0x20:0xffffffff820975a1
stack pointer           = 0x28:0xfffffe022b901de0
frame pointer           = 0x28:0xfffffe022b901de0
code segment            = base 0x0, limit 0xfffff, type 0x1b
                        = DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags        = interrupt enabled, resume, IOPL = 0
current process         = 6 (dp_sync_taskq_17)
rdi: fffff8000234cf60 rsi: fffff800022f8328 rdx: 0000000000000000
rcx: 0000000000000000  r8: 0000000000000000  r9: fffffe027bbf1a00
rax: 00000000000000e8 rbx: 0000000000000270 rbp: fffffe022b901de0
r10: 0000000000000000 r11: 0000000098ff24fe r12: fffff800022f8328
r13: 0000000000000000 r14: fffff8000234cf40 r15: fffff80aa5726c00
trap number             = 12
panic: page fault
cpuid = 29
time = 1715524597
KDB: stack backtrace:
#0 0xffffffff80b9009d at kdb_backtrace+0x5d
#1 0xffffffff80b431a2 at vpanic+0x132
#2 0xffffffff80b43063 at panic+0x43
#3 0xffffffff8100c85c at trap_fatal+0x40c
#4 0xffffffff8100c8af at trap_pfault+0x4f
#5 0xffffffff80fe3ac8 at calltrap+0x8
#6 0xffffffff82105083 at sync_dnodes_task+0x63
#7 0xffffffff8209addf at taskq_run+0x1f
#8 0xffffffff80ba5992 at taskqueue_run_locked+0x182
#9 0xffffffff80ba6c22 at taskqueue_thread_loop+0xc2
#10 0xffffffff80afdb7f at fork_exit+0x7f
#11 0xffffffff80fe4b2e at fork_trampoline+0xe
Uptime: 4d20h39m45s



kgdb backtrace:

#0  __curthread () at /usr/src/sys/amd64/include/pcpu_aux.h:57
#1  doadump (textdump=<optimized out>) at /usr/src/sys/kern/kern_shutdown.c:405
#2  0xffffffff80b42d37 in kern_reboot (howto=260)
    at /usr/src/sys/kern/kern_shutdown.c:526
#3  0xffffffff80b4320f in vpanic (fmt=0xffffffff81136b3b "%s",
    ap=ap@entry=0xfffffe022b901c30) at /usr/src/sys/kern/kern_shutdown.c:970
#4  0xffffffff80b43063 in panic (fmt=<unavailable>)
    at /usr/src/sys/kern/kern_shutdown.c:894
#5  0xffffffff8100c85c in trap_fatal (frame=0xfffffe022b901d20, eva=0)
    at /usr/src/sys/amd64/amd64/trap.c:952
#6  0xffffffff8100c8af in trap_pfault (frame=0xfffffe022b901d20,
    usermode=false, signo=<optimized out>, ucode=<optimized out>)
    at /usr/src/sys/amd64/amd64/trap.c:760
#7  <signal handler called>
#8  0xffffffff820975a1 in list_remove (list=0xfffff8000234cf60,
    object=object@entry=0xfffff800022f8328)
    at /usr/src/sys/contrib/openzfs/module/os/freebsd/spl/list.c:127
#9  0xffffffff8216158e in multilist_sublist_remove (
    mls=mls@entry=0xfffff8000234cf40, obj=obj@entry=0xfffff800022f8328)
    at /usr/src/sys/contrib/openzfs/module/zfs/multilist.c:363
#10 0xffffffff82105083 in dmu_objset_sync_dnodes (list=0xfffff8000234cf40,
    tx=0xfffff80aa5726c00)
    at /usr/src/sys/contrib/openzfs/module/zfs/dmu_objset.c:1557
#11 sync_dnodes_task (arg=0xfffff8083ac22e60)
    at /usr/src/sys/contrib/openzfs/module/zfs/dmu_objset.c:1638
#12 0xffffffff8209addf in taskq_run (arg=0xfffff8005728f900,
    pending=<optimized out>)
    at /usr/src/sys/contrib/openzfs/module/os/freebsd/spl/spl_taskq.c:320
#13 0xffffffff80ba5992 in taskqueue_run_locked (
    queue=queue@entry=0xfffff80002356300)
    at /usr/src/sys/kern/subr_taskqueue.c:512
#14 0xffffffff80ba6c22 in taskqueue_thread_loop (
    arg=arg@entry=0xfffff80003a91620) at /usr/src/sys/kern/subr_taskqueue.c:824
#15 0xffffffff80afdb7f in fork_exit (
    callout=0xffffffff80ba6b60 <taskqueue_thread_loop>,
    arg=0xfffff80003a91620, frame=0xfffffe022b901f40)
    at /usr/src/sys/kern/kern_fork.c:1160
#16 <signal handler called>

See attached for core.txt

This seems to happen intermittently while running a backup, which is performed
by a remote computer running `zfs send` through SSH.

If there's anything else you'd like to see please let me know. I have a full
vmcore as well if needed (11GB). I am also able to run kgdb to inspect said
vmcore.

-- 
You are receiving this mail because:
You are the assignee for the bug.