drive failure during rebuild causes page fault
Joe Rhett
jrhett at meer.net
Sun Dec 12 22:03:51 PST 2004
And here's where I found even more interesting stuff. (again with the
sil3114 controller)
If you detach a channel and then attach the channel, a new raid device gets
created. And the removed drive shows up in the new array...
# atacontrol create RAID0 ad6 ad8
# atacontrol detach 4
Dec 12 21:55:18 sandbox kernel: ad8: deleted from ar0 disk1
Dec 12 21:55:18 sandbox kernel: ar0: WARNING - mirror lost
Dec 12 21:55:18 sandbox kernel: ad8: WARNING - removed from configuration
sandbox# atacontrol status 1
atacontrol: ioctl(ATARAIDSTATUS): Device not configured
Okay, ar0 is broken, and raid array 1 doesn't exist.
# atacontrol attach 4
Dec 12 21:55:57 sandbox kernel: ad8: 76319MB <ST380013AS/3.18> [155061/16/63] at ata4-master SATA150
sandbox# atacontrol status 1
ar1: ATA RAID1 subdisks: DOWN ad8 status: BROKEN
Hm? Where did this array come from?
Okay, so now someone will tell me that I'm doing things all out of order,
which I suspect. But that leaves the obvious that "Others will do this"
and there is no documentation to suggest otherwise.
What about a command to show the current list of raid arrays? either make
'atacontrol status' return the status of all arrays in the system, or
make a new command that will list out which arrays are available. I only
stumbled on this because I mistyped a number and then realized that I was
looking at the wrong thing (and the wrong thing should not exist!)
On Sun, Dec 12, 2004 at 09:42:00PM -0800, Joe Rhett wrote:
> And another, I can now confirm that it is fairly easy to kill 5.3-release
> during the rebuilding process. The following steps will cause a kernel
> page fault consistently:
>
> atacontrol create RAID0 ad6 ad10
> atacontrol detach 5
> log: ad10 deleted from ar0 disk1
> log: ad10 WARNING - removed from configuration
> atacontrol addspare 0 ad8
> log: ad8 inserted into ar0 disk1 as spare
> atacontrol rebuild 0
> atacontrol detach 4
> log: ad8 deleted from ar0 disk1
> log: ad8 WARNING - removed from configuration
>
> Fatal trap 12: page fault while in kernel mode
> fault virtual address = 0x10
> ....
> current process = 1063 (rebuilding ar0 1%)
> trap number = 12
> panic: page fault
>
> (tell me if you want or need anything I skipped above. Got lazy cause I
> had to type it in by hand...)
>
> --
> Joe Rhett
> Senior Geek
> Meer.net
> _______________________________________________
> freebsd-stable at freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-stable
> To unsubscribe, send any mail to "freebsd-stable-unsubscribe at freebsd.org"
--
Joe Rhett
Senior Geek
Meer.net
More information about the freebsd-stable
mailing list