Problems with UP2000+
Douglas K. Rand
rand at meridian-enviro.com
Tue Aug 15 21:56:42 UTC 2006
We've got a Microway UP2000+ system that's been working just fine for
the last year. That is, until it seems to have developed some hardware
related problems. It started with:
dc0: watchdog timeout
dc0: watchdog timeout
dc0: watchdog timeout
dc0: watchdog timeout
dc0: watchdog timeout
dc0: watchdog timeout
dc0: watchdog timeout
ahc0: Timedout SCBs already complete. Interrupts may not be functioning.
ahc0: Timedout SCBs already complete. Interrupts may not be functioning.
dc0: watchdog timeout
dc0: watchdog timeout
Interestingly the system doesn't crash or completely hang. It stops
for a bit, considers the answer to the ultimate question (it isn't
fast enough to think about the actual question) and then works for a
few minutes. Rinse and repeat.
And then a few hours later it started having SCSI problems:
ahc0: Recovery Initiated
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
ahc0: Dumping Card State while idle, at SEQADDR 0x18
Card was paused
ACCUM = 0x68, SINDEX = 0x48, DINDEX = 0xe4, ARG_2 = 0x1a
HCNT = 0x0 SCBPTR = 0x68
SCSISIGI[0xa6]:(REQI|BSYI|MSGI|CDI) ERROR[0x0] SCSIBUSL[0x0]
LASTPHASE[0x1]:(P_BUSFREE) SCSISEQ[0x1a]:(ENAUTOATNP|ENAUTOATNO|ENRSELI)
SBLKCTL[0xa]:(SELWIDE|SELBUSB) SCSIRATE[0x0] SEQCTL[0x10]:(FASTMODE)
SEQ_FLAGS[0xc0]:(NO_CDB_SENT|NOT_IDENTIFIED) SSTAT0[0x0]
SSTAT1[0x13]:(REQINIT|PHASECHG|PHASEMIS) SSTAT2[0x0]
SSTAT3[0x0] SIMODE0[0x8]:(ENSWRAP) SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO)
SXFRCTL0[0x80]:(DFON) DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
STACK: 0x0 0x154 0x16a 0x17
SCB count = 192
Kernel NEXTQSCB = 107
Card NEXTQSCB = 107
QINFIFO entries:
Waiting Queue entries: 104:104
Disconnected Queue entries:
QOUTFIFO entries:
Sequencer Free SCB List:
Sequencer SCB Info:
Well, first thing we tried was to replace the NIC. Got a fxp from the
shelf and tried that. It took 5 hours for it to have problems:
ahc0: Timedout SCBs already complete. Interrupts may not be functioning.
ahc0: Timedout SCBs already complete. Interrupts may not be functioning.
fxp0: device timeout
fxp0: device timeout
I had heard that the onboard SCSI sometimes go bad on these
motherboards, so I grabbed an Adaptec 2940UW from the shelf and tried
that. (Lucky for me the BIOS was "new" enough to be able to boot from
the 2940UW.) That lasted about 57 hours, but still ended up with the
same problem:
fxp0: device timeout
ahc1: Timedout SCBs already complete. Interrupts may not be functioning.
ahc1: Timedout SCBs already complete. Interrupts may not be functioning.
fxp0: device timeout
ahc1: Timedout SCBs already complete. Interrupts may not be functioning.
ahc1: Timedout SCBs already complete. Interrupts may not be functioning.
ahc1:A:1: no active SCB for reconnecting target - issuing BUS DEVICE RESET
SAVED_SCSIID == 0x17, SAVED_LUN == 0x0, ARG_1 == 0x17 ACCUM = 0x0
SEQ_FLAGS == 0xc0, SCBPTR == 0x6, BTT == 0xff, SINDEX == 0x31
SCSIID == 0x17, SCB_SCSIID == 0x17, SCB_LUN == 0x0, SCB_TAG == 0xff, SCB_CONTROL == 0x0
SCSIBUSL == 0x17, SCSISIGI == 0xe6
SXFRCTL0 == 0x88
SEQCTL == 0x10
We are now in the process of trying different PCI slots for things, so
far with out any luck. And trying the system with one of the three
power supplies turned off.
I was wondering if anybody might have any suggestions. This is a
fairly nice system with a pair of 833 MHz 21264D CPUs and 2 GB of
RAM. And while I'll admit it might not be worth the power it
consumes, I still like it.
Here's the boot log:
*** keyboard not plugged in...
2048 Meg of system memory
Simple COM1 Debug Disabled
Onboard Adaptec Enabled
initializing GCT/FRU at 1da000
Testing the System
Testing the Memory
memory_test none
Memory test skipped...
Testing the Disks (read only)
[-- rand at localhost bumped alcor at alcor.meridian-enviro.com -- Tue Aug 15 15:44:48 2006]
[-- MARK -- Tue Aug 15 15:45:00 2006]
Testing ew* devices.
UP2000+ 833 MHz Console A5.8-65, 31-JAN-2001 02:21:31
CPU 0 booting
(boot dkb0.0.0.9.0)
block 0 of dkb0.0.0.9.0 is a valid boot block
reading 15 blocks from dkb0.0.0.9.0
bootstrap code read in
base = 200000, image_start = 0, image_bytes = 1e00
initializing HWRPB at 2000
initializing page table at 3ff4a000
initializing machine state
setting affinity to the primary CPU
jumping to bootstrap code
Loading /boot/loader
Consoles: SRM firmware console
VMS PAL rev: 0x2004200010154
OSF PAL rev: 0x200400002014f
Switch to OSF PAL code succeeded.
FreeBSD/alpha SRM disk boot, Revision 1.2
(rand at kuo.meridian-enviro.com, Wed May 10 14:49:39 CDT 2006)
Memory: 2097152 k
Loading /boot/defaults/loader.conf
/boot/kernel/kernel data=0x3f4270+0x32320 syms=[0x8+0x4fa40+0x8+0x43026]
Hit [Enter] to boot immediately, or any other key for command prompt.
^MBooting [/boot/kernel/kernel] in 9 seconds...
Type '?' for a list of commands, 'help' for more detailed help.
OK boot -s
Entering /boot/kernel/kernel at 0xfffffc0000341110...
KDB: debugger backends: ddb
KDB: current backend: ddb
Copyright (c) 1992-2006 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 6.1-RELEASE #13: Thu Aug 10 17:09:40 CDT 2006
root@:/usr/obj/usr/src/sys/KUO
ST6600
UP2000+ 833 MHz, 833MHz
8192 byte page size, 2 processors.
CPU: EV68CX (21264D) major=13 minor=3 extensions=0x1307<BWX,FIX,CIX,MVI,PRECISE>
OSF PAL rev: 0x200400002014f
real memory = 2144616448 (2045 MB)
avail memory = 2097733632 (2000 MB)
tsunami0: <21271 Core Logic chipset>
pcib0: <21271 PCI host bus adapter> on tsunami0
pci0: <PCI bus> on pcib0
isab0: <PCI-ISA bridge> at device 5.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <Cypress 82C693 ATA controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0x10200-0x1020f irq 238 at dev
ice 5.1 on pci0
ata0: <ATA channel 0> on atapci0
ata0: interrupting at ISA irq 14
ata1: <ATA channel 1> on atapci0
ata1: interrupting at ISA irq 15
atapci1: <GENERIC ATA controller> port 0x170-0x177,0x374-0x377 mem 0x1020000-0x102ffff irq 239 at device 5.2 on
pci0
atapci1: unable to map interrupt
device_attach: atapci1 attach returned 6
pci0: <serial bus, USB> at device 5.3 (no driver attached)
ahc0: <Adaptec aic7890/91 Ultra2 SCSI adapter> port 0x10000-0x100ff mem 0x1041000-0x1041fff irq 19 at device 6.0
on pci0
ahc0: interrupting at TSUNAMI irq 19
ahc0: [GIANT-LOCKED]
aic7890/91: Ultra2 Wide Channel A, SCSI Id=7, 255 SCBs
ahc1: <Adaptec 2940 Ultra SCSI adapter> port 0x10100-0x101ff mem 0x1042000-0x1042fff irq 23 at device 9.0 on pci
0
ahc1: interrupting at TSUNAMI irq 23
ahc1: [GIANT-LOCKED]
aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
pcib1: <21271 PCI host bus adapter> on tsunami0
pci1: <PCI bus> on pcib1
dc0: <Intel 21143 10/100BaseTX> port 0x10000-0x1007f mem 0x1040000-0x10403ff irq 39 at device 9.0 on pci1
miibus0: <MII bus> on dc0
ukphy0: <Generic IEEE 802.3u media interface> on miibus0
ukphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
dc0: Ethernet address: 00:c0:f0:6a:bb:4c
dc0: interrupting at TSUNAMI irq 39
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
atkbd0: interrupting at ISA irq 1
atkbd0: [GIANT-LOCKED]
mcclock0: <MC146818A real time clock> at port 0x70-0x71 on isa0
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A, console
sio0: interrupting at ISA irq 4
sio1 at port 0x2f8-0x2ff irq 3 flags 0x80 on isa0
sio1: type 16550A
sio1: interrupting at ISA irq 3
Timecounter "i8254" frequency 1193182 Hz quality 0
Timecounter "alpha" frequency 833333333 Hz quality 800
Timecounters tick every 0.976 msec
da0 at ahc1 bus 0 target 0 lun 0
da0: <SEAGATE ST173404LW 0004> Fixed Direct Access SCSI-3 device
da0: 20.000MB/s transfers (10.000MHz, offset 8, 16bit), Tagged Queueing Enabled
da0: 70007MB (143374738 512 byte sectors: 255H 63S/T 8924C)
da1 at ahc1 bus 0 target 1 lun 0
da1: <SEAGATE ST173404LW 0004> Fixed Direct Access SCSI-3 device
da1: 20.000MB/s transfers (10.000MHz, offset 8, 16bit), Tagged Queueing Enabled
da1: 70007MB (143374738 512 byte sectors: 255H 63S/T 8924C)
GEOM_MIRROR: Device gm0 created (id=653487903).
GEOM_MIRROR: Device gm0: provider da0 detected.
GEOM_MIRROR: Device gm0: provider da1 detected.
GEOM_MIRROR: Device gm0: provider da1 activated.
GEOM_MIRROR: Device gm0: provider da0 activated.
GEOM_MIRROR: Device gm0: provider mirror/gm0 launched.
Trying to mount root from ufs:/dev/mirror/gm0a
Enter full pathname of shell or RETURN for /bin/sh:
More information about the freebsd-alpha
mailing list