[Bug 263607] [panic] [arm64] [13.1-RC4] very early panic after Release APs

From: <bugzilla-noreply_at_freebsd.org>
Date: Wed, 27 Apr 2022 12:50:04 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=263607

            Bug ID: 263607
           Summary: [panic] [arm64] [13.1-RC4] very early panic after
                    Release APs
           Product: Base System
           Version: 13.1-RELEASE
          Hardware: Any
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: arm
          Assignee: freebsd-arm@FreeBSD.org
          Reporter: dch@freebsd.org

...
CPU 14: ARM Neoverse-N1 r3p1 affinity: 14
CPU 15: ARM Neoverse-N1 r3p1 affinity: 15
Release APs...Trying to mount root from
zfs:zroot/ROOT/14.0-CURRENT-20220329.190055 []...
done
timeout stopping cpus
panic: Assertion v != tid failed at /usr/src/sys/kern/kern_mutex.c:920
cpuid = 13
time = 1
KDB: stack backtrace:
db_trace_self() at db_trace_self
KDB: enter: panic
[ thread pid 0 tid 100035 ]
Stopped at      kdb_enter+0x44: undefined       f902011f
db>





CPU  1: ARM Neoverse-N1 r3p1 affinity:  1
CPU  2: ARM Neoverse-N1 r3p1 affinity:  2
CPU  3: ARM Neoverse-N1 r3p1 affinity:  3
------ sometimes we hang here for several seconds before panic --------
Release APs...Trying to mount root from
zfs:zroot/ROOT/14.0-CURRENT-20220421.064527 []...
done
panic: vm_fault_lookup: fault on nofault entry, addr: 0xffff00004038d000
cpuid = 2
time = 1
KDB: stack backtrace:
db_trace_self() at db_trace_self
KDB: enter: panic
[ thread pid 0 tid 100014 ]
Stopped at      kdb_enter+0x40: undefined       f902027f
db> bt
Tracing pid 0 tid 100014 td 0xffffa0000297f000
db_trace_self() at db_trace_self


Sometimes I get more, though:

CPU  3: ARM Neoverse-N1 r3p1 affinity:  3
Release APs...done
  x0:                0
  x1:               68
  x2:              101
  x3:                0
  x4:              201
  x5: ffffa005caec6e98
  x6: 1de7ec7edbadc0de
  x7:         768a5bc7
  x8:         deadc0de
  x9: ffffa0002195f568
 x10:                0
 x11:               1a
 x12:               1a
 x13: ffffa0002195f56c
 x14:                0
 x15:                1
 x16:                8
 x17:                0
 x18: ffff0000403af630 ($d.1 + 3ec80328)
 x19: ffff000040498200 ($d.1 + 3ed68ef8)
 x20: ffffa0002195ffd8
 x21:              101
 x22: ffffa0002195f000
 x23:                a
 x24:                0
 x25:                0
 x26:              101
 x27:               7c
 x28: ffffa00021960fd8
 x29:               fe
  sp: ffff0000403af630
  lr:               b9
 elr:               b9
spsr:         60400045
 far: ffff000043048000 ($d.1 + 41918cf8)


e123e2294cb50deb288916b79a8c05a006f8bca3 occasionally but same fails


CPU 13: ARM Neoverse-N1 r3p1 affinity: 13
CPU 14: ARM Neoverse-N1 r3p1 affinity: 14
CPU 15: ARM Neoverse-N1 r3p1 affinity: 15
Release APs...done
timeout stopping cpus
panic: data abort with spinlock held
cpuid = 14
time = 1
KDB: stack backtrace:
db_trace_self() at db_trace_self
KDB: enter: panic
[ thread pid 0 tid 100031 ]
Stopped at      kdb_enter+0x44: undefined       f902011f
db>



CPU 15: ARM Neoverse-N1 r3p1 affinity: 15
Release APs...done
panic: Assertion v != tid failed at /usr/src/sys/kern/kern_mutex.c:920
cpuid = 10
time = 1
KDB: stack backtrace:
db_trace_self() at db_trace_self
db_trace_self_wrapper() at db_trace_self_wrapper+0x30
vpanic() at vpanic+0x174
panic() at panic+0x44
thread_lock_flags_() at thread_lock_flags_+0x1dc
sched_preempt() at sched_preempt+0x38
arm_gic_v3_intr() at arm_gic_v3_intr+0xe8
intr_irq_handler() at intr_irq_handler+0x80
handle_el1h_irq() at handle_el1h_irq+0xc
--- interrupt
data_abort() at data_abort+0x148
handle_el1h_sync() at handle_el1h_sync+0x10
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
handle_el1h_sync() at handle_el1h_sync+0x8
--- exception, esr 0x96000006
thread_lock_block() at thread_lock_block+0x38
sched_switch() at sched_switch+0x12c
mi_switch() at mi_switch+0x184
spinlock_exit() at spinlock_exit+0x60
__mtx_unlock_flags() at __mtx_unlock_flags+0x154
vm_page_zone_import() at vm_page_zone_import+0xe4
zone_alloc_item() at zone_alloc_item+0xb4
vm_page_alloc_noobj_domain() at vm_page_alloc_noobj_domain+0xd4
uma_small_alloc() at uma_small_alloc+0x64
keg_alloc_slab() at keg_alloc_slab+0xbc
zone_import() at zone_import+0x10c
cache_alloc() at cache_alloc+0x3ac
cache_alloc_retry() at cache_alloc_retry+0x2c
malloc() at malloc+0x94
sbuf_new() at sbuf_new+0x6c
vfs_mountroot() at vfs_mountroot+0x60
start_init() at start_init+0x28
fork_exit() at fork_exit+0x74
fork_trampoline() at fork_trampoline+0x14
KDB: enter: panic
[ thread pid 1 tid 100002 ]
Stopped at      kdb_enter+0x44: undefined       f902011f
db>

-- 
You are receiving this mail because:
You are the assignee for the bug.