deadlock or bad disk ? RELENG_8

Mike Tancsa mike at sentex.net
Mon Jul 19 12:41:39 UTC 2010


At 11:58 PM 7/18/2010, Jeremy Chadwick wrote:

>So I believe this indicates the message only gets printed during swapin,
>not swapout.  Meaning it's happening during an I/O read from da0.

Yes, and from my existing ssh sessions, it would _seem_ no disk IO 
was completing.  ie I tried a killall -9 watchdogd which would need 
to load killall from the disk, read whatever its linked against. 
However, after hitting enter it was just blocking on trying to read. 
So I would describe it as if the entire system was waiting from that 
"swapper Indefinite wait" to finish, or I could not read anything 
from drives associated with that controller.



>So what's hz?  Well, I want to assume it's kern.hz, which defaults to
>1000.  1000*20 = 20000, so the timeout would be 20000/1000 = 20 seconds.
>That's a pretty long time to be waiting for an I/O read to return.

I think the messages were printing to the serial console faster than 
that, but I could be wrong. If it happens again, I will time it


         ---Mike





--------------------------------------------------------------------
Mike Tancsa,                                      tel +1 519 651 3400
Sentex Communications,                            mike at sentex.net
Providing Internet since 1994                    www.sentex.net
Cambridge, Ontario Canada                         www.sentex.net/mike



More information about the freebsd-stable mailing list