Bug 113234 - SCSI driver atp870u: fatal I/O ERRORS
Status: RESOLVED FIXED
Alias: None
Product: SUSE LINUX 10.0
Classification: openSUSE
Component: Kernel
Version: Beta 3
Hardware: i586 SUSE Other
Priority: P5 - None
Severity: Critical
Target Milestone: ---
Assignee: Jens Axboe
QA Contact: E-mail List
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2005-08-26 08:42 UTC by Klaus Wagner
Modified: 2005-09-01 09:34 UTC
CC List: 1 user

See Also:
Found By: Other
Services Priority:
Business Priority:
Blocker: ---
Marketing QA Status: ---
IT Deployment: ---


Attachments
/var/log/boot.msg of SUSE 10.0b3 installation (17.91 KB, text/plain)
2005-08-26 08:45 UTC, Klaus Wagner
dmesg output upon the SCSI I/O ERRORS (1.94 KB, application/octet-stream)
2005-08-26 08:48 UTC, Klaus Wagner
Remove the huge diff from 2.6.12 (99.34 KB, patch)
2005-08-31 12:41 UTC, Jens Axboe
Tarball: lspci, hwinfo about AEC67160 on ProLiant ML310G2 (3.91 KB, application/x-gzip)
2005-08-31 14:27 UTC, Klaus Wagner

Description Klaus Wagner 2005-08-26 08:42:52 UTC
The combination of ACARD AEC-67160 SCSI host and hard disk: QUANTUM
ATLAS10K2-TY184L (driver: atp870u V2.6+ac) does not work anymore with
SUSE 10.0 beta3 (error msgs attached). Verified both on my workstation
(Dell Precision WS340) and a new HP ProLiant ML310 G2.

Immediately afterwards, the same SCSI host and disk passed a bonnie
test under the SuSE 9.0 update kernel 2.4.21-266, so the hardware
is not at fault.

Going to attach the error messages I get upon the first attempt to access
the disk during SUSE 10.0b3 installation...
Comment 1 Klaus Wagner 2005-08-26 08:45:49 UTC
Created attachment 47722
/var/log/boot.msg of SUSE 10.0b3 installation

boot.msg is intended to document the normal boot
process. No error messages yet in here.
Comment 2 Klaus Wagner 2005-08-26 08:48:04 UTC
Created attachment 47723
dmesg output upon the SCSI I/O ERRORS

This is the collection of error msgs issued by
the atp870u driver. It coincided with the first
mkfs attempt on a /dev/sda partition. Needless to say
the installation got stuck here.
Comment 3 Klaus Wagner 2005-08-26 08:50:48 UTC
I could attempt to start a SUSE 9.1 or 9.3 installation on this
hardware in order to narrow down when it stopped working.
If you're interested please say so. Ditto for more logs (hwinfo etc.).
Comment 4 Olaf Kirch 2005-08-26 09:04:52 UTC
Jens, any idea? 
Comment 5 Jens Axboe 2005-08-26 09:08:26 UTC
Klaus, if you could try 9.1 through 9.3 it would be very helpful.
Comment 6 Klaus Wagner 2005-08-26 10:26:28 UTC
Tested on the HP ProLiant ML310 G2:  all is well with

9.1: installation kernel 2.6.4-52
9.3: installation kernel 2.6.11.4-20a

In each case I copied a few hundred MB from the installation medium to the
hard disk in question without encountering any problems or error msgs.

On the other hand, with 10.0b3, not even the "Installation -- safe settings"
kernel parameters (acpi=off barrier=off ... ) prevent the messages
attached in comment #2 and the subsequent hang.
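
The results so far bracket the regression. A minimal sketch of that narrowing, using only the kernel versions reported in this bug (the helper names are made up for illustration, and the 10.0b3 entry is the version of its default installation kernel):

```python
def version_key(version):
    """Turn a kernel version such as '2.6.11.4-20a' into a comparable tuple of ints.

    Only the dotted numeric prefix is used; everything after the first '-'
    (release suffix, -rc tags) is ignored.
    """
    numeric = version.split("-", 1)[0]
    return tuple(int(part) for part in numeric.split("."))


# Installation kernels and outcomes as reported in this bug.
results = {
    "2.6.4-52": "good",            # SUSE 9.1
    "2.6.11.4-20a": "good",        # SUSE 9.3
    "2.6.13-rc6-git13-4": "bad",   # SUSE 10.0 beta3
}


def regression_window(results):
    """Return (newest kernel that worked, oldest kernel that failed)."""
    good = [v for v, outcome in results.items() if outcome == "good"]
    bad = [v for v, outcome in results.items() if outcome == "bad"]
    return max(good, key=version_key), min(bad, key=version_key)


print(regression_window(results))
```

The comparison deliberately ignores -rc and distribution suffixes, which is enough for these three data points: the breakage lies somewhere after 2.6.11.4 and no later than 2.6.13-rc6.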
Comment 7 Klaus Wagner 2005-08-26 10:27:10 UTC
(sorry, forgot to tag previous comment as "provides needed info")
Comment 8 Klaus Wagner 2005-08-31 10:43:59 UTC
Raising severity to "critical" after consulting QA (taking also into
account the up-coming SLES SP3).
Comment 9 Jens Axboe 2005-08-31 12:40:07 UTC
SP3 is not affected; this looks like a bug introduced in 2.6.12 or 2.6.13. I'm
attaching a patch for testing.
Comment 10 Jens Axboe 2005-08-31 12:41:05 UTC
Created attachment 48308
Remove the huge diff from 2.6.12

2.6.12 introduced support for a new card; let's test if it broke the older card
in the process. Please test a kernel with this applied!
Comment 11 Klaus Wagner 2005-08-31 14:24:59 UTC
OK, tested in the following way:

   - used kernel-source-2.6.13_rc6_git13-4 from SL 10.0b3
   - patched drivers/scsi/atp870u.[ch] using patch from comment #10
   - retrieved /proc/config.gz from running kernel 2.6.13-rc6-git13-4-default
     (from SL 10.0b3)
   - built the modules from drivers/scsi according to instructions in 
     www.suse.de/~agruen/kernel-doc/; retrieved drivers/scsi/atp870u.ko

   - Started a new "manual" SL 10.0b3 installation on HP ProLiant ML310 G2
     using mini-ISO CD ("manual" in order to avoid premature loading of new,
     bad atp870u driver which would mess things up immediately)
   - Loaded modules for SCSI (mptscsih) and Ethernet (tg3).
   - Started installation via ftp from dist.suse.de
   - As soon as the "Software Agreement Request" popped up (prior to
     HW detection and driver loading!): loaded patched atp870u.ko
     module manually.
   - Then "yes" to "Software Agreement Request".
   - Now YaST2 started, did HW detection, found all disks.
   - Selected again the disk connected to the AEC67160 for installation.

_Result_: this time disk access works with no errors, installation is
          going smoothly (currently still in progress). 

I'll add more info about the AEC67160 SCSI controller in a moment...
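
For anyone wanting to repeat the module rebuild, the build part of the steps above could be sketched roughly as follows. This is not the exact procedure from the kernel-doc page; the -p1 strip level, the `make` targets chosen and the patch file name `atp870u-revert.patch` are assumptions made for illustration.

```python
import shlex


def build_steps(patch_file, module_dir="drivers/scsi"):
    """Return the commands (as argument lists), in order, for rebuilding one
    driver directory. Meant to be run from the top of the kernel source tree.
    """
    return [
        # Apply the test patch; the -p1 strip level is an assumption.
        ["patch", "-p1", "-i", patch_file],
        # Reuse the configuration of the running kernel.
        ["sh", "-c", "zcat /proc/config.gz > .config"],
        ["make", "oldconfig"],
        # Prepare the tree for module builds, then build only module_dir.
        ["make", "modules_prepare"],
        ["make", "M=" + module_dir, "modules"],
    ]


if __name__ == "__main__":
    # Dry run: print the commands instead of executing them.
    # The patch file name is hypothetical.
    for step in build_steps("atp870u-revert.patch"):
        print(" ".join(shlex.quote(arg) for arg in step))
```

The script only prints the commands, so nothing is touched until they are run by hand; the resulting drivers/scsi/atp870u.ko is what was loaded manually during the installation.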
Comment 12 Klaus Wagner 2005-08-31 14:27:09 UTC
Created attachment 48328
Tarball: lspci, hwinfo about AEC67160 on ProLiant ML310G2
Comment 13 Jens Axboe 2005-09-01 08:24:09 UTC
Ok, so that is promising. The question is how to proceed with this. We don't
even know if support for the new card works, so I'd be inclined to back out the
patch so we at least don't have regressions in this area and report the issue to
the acard maintainer.

Andreas, what do you think?
Comment 14 Klaus Wagner 2005-09-01 08:47:16 UTC
Agreed (unsurprisingly, since I'd profit from your proposal ;-).
Of course, I don't have the authority to decide anything, though.

Add-on note: Here, at least, no more recent atp870u hardware seems available
while, on the other hand, the AEC67160 even got a SuSE certification once.


Comment 15 Andreas Jaeger 2005-09-01 08:48:12 UTC
Jens, go ahead.
Comment 16 Jens Axboe 2005-09-01 09:34:26 UTC
Done, committed.