[Bug 282713] Process enters in STOP state and doesn't respond to any signal.

From: <bugzilla-noreply_at_freebsd.org>
Date: Fri, 22 Nov 2024 10:29:09 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=282713

--- Comment #11 from Rupesh Pilania <rupeshpilania@gmail.com> ---

  PID USERNAME    THR PRI NICE   SIZE    RES STATE    C   TIME    WCPU COMMAND
 1214 root          1 102    0   146M   109M CPU6     6 936:26  99.78% parent-process

./procstat -t 1214
  PID    TID COMM                TDNAME              CPU  PRI STATE   WCHAN
 1214 100136 heimdall            -                     6  202 run     -
 ./procstat -k 1214
  PID    TID COMM                TDNAME              KSTACK
 1214 100136 heimdall            -                   <running>
 ./procstat -kk 1214
  PID    TID COMM                TDNAME              KSTACK
 1214 100136 heimdall            -                   <running>

All child processes get stuck at umtxqb when we try to kill them.


 dut-c600v-02:rtestuser 20] ./procstat -t 2803
  PID    TID COMM                TDNAME              CPU  PRI STATE   WCHAN
 2803 100353 child-1      -                    -1  120 stop    -
 2803 100515 child-1      -                    -1  120 stop    -
 2803 100516 child-1      -                    -1  152 stop    sbwait
 2803 100519 child-1      -                    -1  120 stop    usem
 2803 100520 child-1      -                    -1  120 stop    usem
 2803 100524 child-1      -                    -1  120 stop    usem
 2803 100526 child-1      -                    -1  120 stop    usem
 2803 100532 child-1      -                    -1  120 stop    usem
 2803 100533 child-1      -                    -1  120 stop    usem
 2803 100534 child-1      -                    -1  120 stop    usem
 2803 100535 child-1      -                    -1  120 stop    usem
 2803 100536 child-1      -                    -1  120 stop    umtxqb
 2803 100537 child-1      -                    -1  120 stop    usem
 2803 100540 child-1      -                    -1  120 stop    usem
 2803 100541 child-1      -                    -1  120 stop    usem
 2803 100543 child-1      -                    -1  120 stop    usem
 2803 100544 child-1      -                    -1  120 stop    usem
 2803 100552 child-1      -                    -1  120 stop    usem
 2803 100553 child-1      -                    -1  120 stop    usem
 2803 100555 child-1      -                    -1  120 stop    usem
 2803 100561 child-1      -                    -1  120 stop    usem
 2803 100562 child-1      -                    -1  120 stop    usem
 2803 100563 child-1      -                    -1  120 stop    usem
 2803 100564 child-1      -                    -1  120 stop    usem
 2803 100565 child-1      -                    -1  120 stop    usem
 2803 100568 child-1      -                    -1  120 stop    usem
 2803 100570 child-1      -                    -1  120 stop    usem
 2803 100572 child-1      -                    -1  120 stop    usem
 2803 100581 child-1      -                    -1  120 stop    usem
 2803 100582 child-1      -                    -1  120 stop    usem
 2803 100583 child-1      -                    -1  120 stop    usem
 2803 100610 child-1      -                    -1  120 stop    usem
 2803 100615 child-1      -                    -1  120 stop    usem
 2803 100616 child-1      -                    -1  120 stop    -
 2803 100756 child-1      -                    -1  120 stop    -
 2803 100921 child-1      -                    -1  120 stop    sbwait
 2803 100922 child-1      -                    -1  120 stop    uwait
 2803 100923 child-1      -                    -1  120 stop    uwait
 2803 100924 child-1      -                    -1  120 stop    -
 2803 100925 child-1      -                    -1  120 stop    -
 2803 100930 child-1      -                    -1  120 stop    uwait
 2803 101143 child-1      -                    -1  120 stop    sbwait
 2803 102579 child-1      -                    -1  120 stop    uwait
 2803 394294 child-1      -                    -1  120 stop    uwait
 2803 577659 child-1      -                    -1  120 stop    usem
dut-c600v-02:rtestuser 21]
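
To make the distribution of wait channels in the listing above easier to read, the threads can be tallied by WCHAN. The following is only a rough sketch: the function name `tally_wchan` and the `sample` string are made up for illustration, and it assumes the COMM and TDNAME columns contain no spaces, so each data row splits into exactly 8 fields.

```python
from collections import Counter


def tally_wchan(procstat_t_output: str) -> Counter:
    """Count threads per wait channel in `procstat -t` text output."""
    counts = Counter()
    for line in procstat_t_output.splitlines():
        fields = line.split()
        # Data rows have 8 columns and start with a numeric PID; this
        # skips the header row and any shell prompt lines.
        # Assumption: COMM and TDNAME contain no whitespace.
        if len(fields) == 8 and fields[0].isdigit():
            counts[fields[-1]] += 1
    return counts


# A few rows copied from the listing above, used as sample input.
sample = """\
  PID    TID COMM                TDNAME              CPU  PRI STATE   WCHAN
 2803 100353 child-1      -                    -1  120 stop    -
 2803 100516 child-1      -                    -1  152 stop    sbwait
 2803 100519 child-1      -                    -1  120 stop    usem
 2803 100536 child-1      -                    -1  120 stop    umtxqb
 2803 100922 child-1      -                    -1  120 stop    uwait
"""

if __name__ == "__main__":
    for wchan, n in tally_wchan(sample).most_common():
        print(f"{wchan:10s} {n}")
```

Feeding it the saved output of `procstat -t 2803` instead of `sample` would give one count per wait channel, which makes the single thread in umtxqb stand out from the others.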
dut-c600v-02:rtestuser 21] ./procstat -kk 2803
  PID    TID COMM                TDNAME              KSTACK
 2803 100353 child-1      -                   mi_switch+0xc1
thread_suspend_check+0x260 exit1+0xaa sigexit+0x133 postsig+0x251
kern_sigsuspend+0x2b7 sys_sigsuspend+0x31 amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100515 child-1      -                   mi_switch+0xc1
thread_suspend_switch+0xc0 cursig+0x651 ast+0x2df doreti_ast+0x1f
 2803 100516 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be sbwait+0x4c
soreceive_generic+0x307 soreceive+0x44 kern_recvit+0x1ba sys_recvfrom+0x86
amd64_syscall+0x741 fast_syscall_common+0xf8
 2803 100519 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100520 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100524 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100526 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100532 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100533 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100534 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100535 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100536 child-1      -                   mi_switch+0xc1 _sleep+0x1cb
__umtx_op_sem2_wait+0x287 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100537 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100540 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100541 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100543 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100544 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100552 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100553 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100555 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100561 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100562 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100563 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100564 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100565 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100568 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100570 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100572 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100581 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100582 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100583 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100610 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100615 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8
 2803 100616 child-1      -                   mi_switch+0xc1
thread_suspend_check+0x260 ast+0x376 doreti_ast+0x1f
 2803 100756 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be sbwait+0x4c
soreceive_generic+0x307 soreceive+0x44 kern_recvit+0x1ba sys_recvfrom+0x86
amd64_syscall+0x741 fast_syscall_common+0xf8
 2803 100921 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be sbwait+0x4c
soreceive_generic+0x307 soreceive+0x44 kern_recvit+0x1ba sys_recvfrom+0x86
amd64_syscall+0x741 fast_syscall_common+0xf8
 2803 100922 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
do_wait+0x41c __umtx_op_wait_uint_private+0x54 sys__umtx_op+0x7a
amd64_syscall+0x741 fast_syscall_common+0xf8
 2803 100923 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
do_wait+0x41c __umtx_op_wait_uint_private+0x54 sys__umtx_op+0x7a
amd64_syscall+0x741 fast_syscall_common+0xf8
 2803 100924 child-1      -                   mi_switch+0xc1
thread_suspend_check+0x260 ast+0x376 doreti_ast+0x1f
 2803 100925 child-1      -                   mi_switch+0xc1
thread_suspend_check+0x260 ast+0x376 doreti_ast+0x1f
 2803 100930 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
do_wait+0x41c __umtx_op_wait_uint_private+0x54 sys__umtx_op+0x7a
amd64_syscall+0x741 fast_syscall_common+0xf8
 2803 101143 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be sbwait+0x4c
soreceive_generic+0x307 soreceive+0x44 kern_recvit+0x1ba sys_recvfrom+0x86
amd64_syscall+0x741 fast_syscall_common+0xf8
 2803 102579 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
do_wait+0x41c __umtx_op_wait_uint_private+0x54 sys__umtx_op+0x7a
amd64_syscall+0x741 fast_syscall_common+0xf8
 2803 394294 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
do_wait+0x41c __umtx_op_wait_uint_private+0x54 sys__umtx_op+0x7a
amd64_syscall+0x741 fast_syscall_common+0xf8
 2803 577659 child-1      -                   mi_switch+0xc1
sleepq_catch_signals+0x2e6 sleepq_wait_sig+0x9 _sleep+0x1be umtxq_sleep+0x132
__umtx_op_sem2_wait+0x6f8 sys__umtx_op+0x7a amd64_syscall+0x741
fast_syscall_common+0xf8

We did not try killing the parent.
See https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=200992 comment 10. It looks
like we are hitting a hypervisor issue.
