FreeBSD 11 i386 disk deadlock (I think)

David Cross dcrosstech at gmail.com
Tue Nov 22 02:33:05 UTC 2016


I noticed this situation on a Soekris machine I have (geode processor),  It
has happened twice, once in installworld ,once in buildworld.  I am not
sure how to trigger it more easily, but the symptoms are quite
straightforward.

Both times this happened in objcopy.  And objcopy was stuck in the
"vm_wait" wait-channel.  The first time I was able to control-C it (after a
few Control-Ts that showed it was not progressing)  and everything came
back, I was then able to re-run the installworld (that had been the source
of the objcopy) without incident.  That I was able to Control-C it, and
re-run it without a disk-wait process, or blocking in the same spot is
telling to me.

The second time I had wandered away and by the time I got back I was no
longer able to interact with the machine.  It still passed traffic, but any
attempt to open a new ssh connection was greeted with an opened TCP socket
and IMMEDIATE reset/termination.  I was forced to reset it.

While this was happening, it was successfully routing 6 megabytes/second
through its interfaces.

I am going to try to run this again to see if I can get any more
information, it is apparently fairly easy for me to trip.


More information about the freebsd-hackers mailing list