stop_cpus_hard when multiple CPUs are panicking from an NMI

Neel Natu neelnatu at gmail.com
Fri Nov 16 02:49:13 UTC 2012


Hi,

On Thu, Nov 15, 2012 at 3:41 PM, Attilio Rao <attilio at freebsd.org> wrote:
> On Thu, Nov 15, 2012 at 10:58 PM, Ryan Stone <rysto32 at gmail.com> wrote:
>> At work we have some custom watchdog hardware that sends an NMI upon
>> expiry.  We've modified the kernel to panic when it receives the watchdog
>> NMI.  I've been trying the "stop_scheduler_on_panic" mode, and I've
>> discovered that when my watchdog expires, the system gets completely
>> wedged.  After some digging, I've discovered is that I have multiple CPUs
>> getting the watchdog NMI and trying to panic concurrently.  One of the CPUs
>> wins, and the rest spin forever in this code:
>
> Quick question: can you control the way your watchdog sends the NMI?
> Like only to BSP rather than broadcast, etc.

It can be done but you really want the NMI to be broadcast since the
watchdog is firing because some cpu is "hung" and you don't know which
one it is.

best
Neel

> This is tied to the very unique situation that you cannot really
> deliver the (second) NMI.
>
> Attilio
>
>
> --
> Peace can only be achieved by understanding - A. Einstein
> _______________________________________________
> freebsd-hackers at freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
> To unsubscribe, send any mail to "freebsd-hackers-unsubscribe at freebsd.org"


More information about the freebsd-hackers mailing list