* [BUG] 2.6.23-rc3 can't see sd partitions on Alpha @ 2007-11-18 5:20 Bob Tracy 2007-11-25 6:38 ` Andrew Morton 0 siblings, 1 reply; 40+ messages in thread From: Bob Tracy @ 2007-11-18 5:20 UTC (permalink / raw) To: linux-kernel Completely reproducible... 2.6.23-rc3 kernel boots, and normal messages are seen on console as far as disks found and partitions on each. However, once /dev is populated and the boottime scripts attempt to check filesystem status, no partitions on either of the two disks attached to the SCSI controller are seen. Dropping into a single-user root shell confirms the sudden "blindness": fdisk can't open /dev/sda. When I reboot on 2.6.24-rc2, everything works normally. System environment is Debian Etch. Both 2.6.24-rc2 and -rc3 were built from the respective unaltered kernel.org source trees, using the same kernel configuration modulo saying "no" to CONFIG_SENSORS_I5K_AMB and CONFIG_PID_NS in -rc3. No problems with -rc3 on a x86 box. -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-11-18 5:20 [BUG] 2.6.23-rc3 can't see sd partitions on Alpha Bob Tracy @ 2007-11-25 6:38 ` Andrew Morton 2007-11-25 12:15 ` Rafael J. Wysocki 2007-11-26 13:48 ` Bob Tracy 0 siblings, 2 replies; 40+ messages in thread From: Andrew Morton @ 2007-11-25 6:38 UTC (permalink / raw) To: Bob Tracy Cc: linux-kernel, Rafael J. Wysocki, Richard Henderson, Ivan Kokshaysky On Sat, 17 Nov 2007 23:20:36 -0600 (CST) rct@gherkin.frus.com (Bob Tracy) wrote: > Completely reproducible... 2.6.23-rc3 kernel boots, and normal messages > are seen on console as far as disks found and partitions on each. However, > once /dev is populated and the boottime scripts attempt to check filesystem > status, no partitions on either of the two disks attached to the SCSI > controller are seen. Dropping into a single-user root shell confirms > the sudden "blindness": fdisk can't open /dev/sda. > > When I reboot on 2.6.24-rc2, everything works normally. > > System environment is Debian Etch. Both 2.6.24-rc2 and -rc3 were built > from the respective unaltered kernel.org source trees, using the same > kernel configuration modulo saying "no" to CONFIG_SENSORS_I5K_AMB and > CONFIG_PID_NS in -rc3. No problems with -rc3 on a x86 box. Could be something change in sysfs. Please double-check the config options, make sure that something important didn't get disabled. Failing that, it would be great if you could bisect this down to the offending commit. http://www.kernel.org/doc/local/git-quick.html has help. Richard, Ivan: have you seen anything like this? Meanwhile, I guess we should track this as another post-2.6.23 regression please. Thanks. ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-11-25 6:38 ` Andrew Morton @ 2007-11-25 12:15 ` Rafael J. Wysocki 2007-11-26 13:48 ` Bob Tracy 1 sibling, 0 replies; 40+ messages in thread From: Rafael J. Wysocki @ 2007-11-25 12:15 UTC (permalink / raw) To: Andrew Morton; +Cc: Bob Tracy, linux-kernel, Richard Henderson, Ivan Kokshaysky On Sunday, 25 of November 2007, Andrew Morton wrote: > On Sat, 17 Nov 2007 23:20:36 -0600 (CST) rct@gherkin.frus.com (Bob Tracy) wrote: > > > Completely reproducible... 2.6.23-rc3 kernel boots, and normal messages > > are seen on console as far as disks found and partitions on each. However, > > once /dev is populated and the boottime scripts attempt to check filesystem > > status, no partitions on either of the two disks attached to the SCSI > > controller are seen. Dropping into a single-user root shell confirms > > the sudden "blindness": fdisk can't open /dev/sda. > > > > When I reboot on 2.6.24-rc2, everything works normally. > > > > System environment is Debian Etch. Both 2.6.24-rc2 and -rc3 were built > > from the respective unaltered kernel.org source trees, using the same > > kernel configuration modulo saying "no" to CONFIG_SENSORS_I5K_AMB and > > CONFIG_PID_NS in -rc3. No problems with -rc3 on a x86 box. > > Could be something change in sysfs. Please double-check the config > options, make sure that something important didn't get disabled. > > Failing that, it would be great if you could bisect this down to the > offending commit. http://www.kernel.org/doc/local/git-quick.html has help. > > Richard, Ivan: have you seen anything like this? > > Meanwhile, I guess we should track this as another post-2.6.23 regression > please. Added to the list, http://bugzilla.kernel.org/show_bug.cgi?id=9457 . Thanks, Rafael ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-11-25 6:38 ` Andrew Morton 2007-11-25 12:15 ` Rafael J. Wysocki @ 2007-11-26 13:48 ` Bob Tracy 2007-11-30 22:30 ` Michael Cree 1 sibling, 1 reply; 40+ messages in thread From: Bob Tracy @ 2007-11-26 13:48 UTC (permalink / raw) To: Andrew Morton Cc: linux-kernel, Rafael J. Wysocki, Richard Henderson, Ivan Kokshaysky Andrew Morton wrote: > Could be something change in sysfs. Please double-check the config > options, make sure that something important didn't get disabled. > > Failing that, it would be great if you could bisect this down to the > offending commit. http://www.kernel.org/doc/local/git-quick.html has help. I was out of town last week, and will be out this week as well. Won't be able to do the bisection until next week at the earliest, but I have remote access to the box if there's anything useful to be done that doesn't require a reboot. No logs available for the "no sd access" case: I'd have to rig up something to record the console output during boot if that's needed. Here's hoping someone else is seeing this or can replicate it in the meantime. -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-11-26 13:48 ` Bob Tracy @ 2007-11-30 22:30 ` Michael Cree 2007-11-30 22:42 ` Andrew Morton 0 siblings, 1 reply; 40+ messages in thread From: Michael Cree @ 2007-11-30 22:30 UTC (permalink / raw) To: Bob Tracy Cc: Andrew Morton, linux-kernel, Rafael J. Wysocki, Richard Henderson, Ivan Kokshaysky Bob Tracy wrote: > Andrew Morton wrote: >> Could be something change in sysfs. Please double-check the config >> options, make sure that something important didn't get disabled. >> > Here's > hoping someone else is seeing this or can replicate it in the meantime. Snap. 2.6.24-rc2 works fine. 2.6.24-rc3 boots on Alpha but once /dev is populated no partitions of the scsi sub-system are seen. Looks like ide sub-system similarly affected. Managed to get boot log. Follows below (with output of various /proc info). Cheerz Michael. Linux version 2.6.24-rc3 (mjc@alpha) (gcc version 4.1.3 20071019 (prerelease) (Debian 4.1.2-17)) #1 Mon Nov 26 19:28:58 NZDT 2007 Booting on Tsunami variation Monet using machine vector Monet from SRM Major Options: EV67 LEGACY_START VERBOSE_MCHECK Command line: ro root=/dev/sda3 console=ttyS0 memcluster 0, usage 1, start 0, end 215 memcluster 1, usage 0, start 215, end 131062 memcluster 2, usage 1, start 131062, end 131072 freeing pages 215:384 freeing pages 930:131062 reserving pages 930:932 4096K Bcache detected; load hit latency 21 cycles, load miss latency 127 cycles Console graphics on hose 0 Built 1 zonelists in Zone order, mobility grouping on. Total pages: 130167 Kernel command line: ro root=/dev/sda3 console=ttyS0 PID hash table entries: 4096 (order: 12, 32768 bytes) Using epoch = 2000 Turning on RTC interrupts. Console: colour VGA+ 80x25 console [ttyS0] enabled Dentry cache hash table entries: 131072 (order: 7, 1048576 bytes) Inode-cache hash table entries: 65536 (order: 6, 524288 bytes) Memory: 1030896k/1048496k available (2786k kernel code, 15216k reserved, 370k data, 168k init) Mount-cache hash table entries: 512 net_namespace: 120 bytes NET: Registered protocol family 16 PCI: Bridge: 0001:01:08.0 IO window: 8000-8fff MEM window: 09000000-090fffff PREFETCH window: disabled. SMC37c669 Super I/O Controller found @ 0x3f0 Linux Plug and Play Support v0.97 (c) Adam Belay SCSI subsystem initialized NET: Registered protocol family 2 IP route cache hash table entries: 8192 (order: 3, 65536 bytes) TCP established hash table entries: 32768 (order: 6, 524288 bytes) TCP bind hash table entries: 32768 (order: 5, 262144 bytes) TCP: Hash tables configured (established 32768 bind 32768) TCP reno registered srm_env: version 0.0.6 loaded successfully io scheduler noop registered io scheduler cfq registered (default) tridentfb: Trident framebuffer 0.7.8-NEWAPI initializing isapnp: Scanning for PnP cards... isapnp: No Plug & Play device found rtc: SRM (post-2000) epoch (2000) detected Real Time Clock Driver v1.12ac Serial: 8250/16550 driver $Revision: 1.90 $ 4 ports, IRQ sharing enabled serial8250: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A serial8250: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A Floppy drive(s): fd0 is 2.88M FDC 0 is a post-1991 82077 Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2 ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx CY82C693: IDE controller (0x1080:0xc693 rev 0x00) at PCI slot 0000:00:07.1 CY82C693: not 100% native mode: will probe irqs later CY82C693U driver v0.34 99-13-12 Andreas S. Krebs (akrebs@altavista.net) ide0: BM-DMA at 0x8400-0x8407, BIOS settings: hda:pio, hdb:pio CY82C693: port 0x01f0 already claimed by ide0 ALI15X3: IDE controller (0x10b9:0x5228 rev 0xc6) at PCI slot 0001:02:09.1 ALI15X3: 100% native mode on irq 28 ide1: BM-DMA at 0x200008410-0x200008417, BIOS settings: hdc:DMA, hdd:DMA ide2: BM-DMA at 0x200008418-0x20000841f, BIOS settings: hde:pio, hdf:pio hdf: LITE-ON DVDRW SOHW-1653S, ATAPI CD/DVD-ROM drive hde: ST3200822A, ATA DISK drive ide2 at 0x200008438-0x20000843f,0x20000844e on irq 28 hde: max request size: 512KiB hde: 390721968 sectors (200049 MB) w/8192KiB Cache, CHS=24321/255/63, UDMA(100) hde: cache flushes supported hde: hde1 qla1280: QLA1040 found on PCI bus 1, dev 6 scsi(0:0): Resetting SCSI BUS scsi0 : QLogic QLA1040 PCI to SCSI Host Adapter Firmware version: 7.65.06, Driver version 3.26 serio: i8042 KBD port at 0x60,0x64 irq 1 serio: i8042 AUX port at 0x60,0x64 irq 12 mice: PS/2 mouse device common for all mice scsi 0:0:1:0: Direct-Access SEAGATE ST336706LW 0109 PQ: 0 ANSI: 3 scsi(0:0:1:0): Sync: period 10, offset 12, Wide input: AT Raw Set 2 keyboard as /devices/platform/i8042/serio0/input/input0 atkbd.c: keyboard reset failed on isa0060/serio1 TCP cubic registered Initializing XFRM netlink socket NET: Registered protocol family 1 NET: Registered protocol family 17 NET: Registered protocol family 15 scsi: waiting for bus probes to complete ... sd 0:0:1:0: [sda] 71687370 512-byte hardware sectors (36704 MB) sd 0:0:1:0: [sda] Write Protect is off sd 0:0:1:0: [sda] Write cache: enabled, read cache: enabled, supports DPO and FUA sd 0:0:1:0: [sda] 71687370 512-byte hardware sectors (36704 MB) sd 0:0:1:0: [sda] Write Protect is off sd 0:0:1:0: [sda] Write cache: enabled, read cache: enabled, supports DPO and FUA sda: sda1 sda2 sda3 sda4 sda5 sd 0:0:1:0: [sda] Attached SCSI disk kjournald starting. Commit interval 5 seconds EXT3-fs: mounted filesystem with ordered data mode. VFS: Mounted root (ext3 filesystem) readonly. Freeing unused kernel memory: 168k freed Mount failed for selinuxfs on /selinux: No such file or directory INIT: version 2.86 booting Starting the hotplug events dispatcher: udevd. Synthesizing the initial hotplug events...done. Waiting for /dev to be fully populated...scsi_id[321]: scsi_id: unable to access '/block' end_request: I/O error, dev fd0, sector 0 scsi_id[371]: scsi_id: unable to access '/block' usbcore: registered new interface driver usbfs usbcore: registered new interface driver hub Linux Tulip driver version 1.1.15 (Feb 27, 2007) tulip0: EEPROM default media type Autosense. tulip0: Index #0 - Media 10baseT (#0) described by a 21142 Serial PHY (2) block. tulip0: Index #1 - Media 10base2 (#1) described by a 21142 Serial PHY (2) block. tulip0: Index #2 - Media AUI (#2) described by a 21142 Serial PHY (2) block. tulip0: Index #3 - Media 10baseT-FDX (#4) described by a 21142 Serial PHY (2) block. tulip0: Index #4 - Media 100baseTx (#3) described by a 21143 SYM PHY (4) block. tulip0: Index #5 - Media 100baseTx-FDX (#5) described by a 21143 SYM PHY (4) block. eth0: Digital DS21142/43 Tulip rev 65 at Port 0x200009400, 08:00:2b:87:4c:b0, IRQ 45. Linux video capture interface: v2.00 scsi_id[402]: scsi_id: unable to access '/block' input: PC Speaker as /devices/platform/pcspkr/input/input1 scsi_id[422]: scsi_id: unable to access '/block' sata_uli 0001:02:09.0: version 1.3 scsi1 : sata_uli scsi2 : sata_uli ata1: SATA max UDMA/133 cmd 0x200008420 ctl 0x200008440 bmdma 0x200008400 irq 28 ata2: SATA max UDMA/133 cmd 0x200008428 ctl 0x200008444 bmdma 0x200008408 irq 28 usbcore: registered new device driver usb hdf: ATAPI 48X DVD-ROM DVD-R CD-R/RW drive, 2048kB Cache, UDMA(66) Uniform CD-ROM driver Revision: 3.20 scsi_id[469]: scsi_id: unable to access '/block' ata1: SATA link down (SStatus 0 SControl 300) ohci_hcd 0000:00:07.3: OHCI Host Controller ohci_hcd 0000:00:07.3: new USB bus registered, assigned bus number 1 ohci_hcd 0000:00:07.3: irq 10, io mem 0x09050000 scsi_id[490]: scsi_id: unable toivtv: Start initialization, version 1.1.0 access '/block' scsi_id[510]: scsi_id: unable to access '/block' scsi_id[532]: scsi_id: unable to access '/block' scsi_id[552]: scsi_id: unable to access '/block' usb usb1: configuration #1 chosen from 1 choice hub 1-0:1.0: USB hub found hub 1-0:1.0: 2 ports detected scsi_id[600]: scsi_id: unable to access '/block' ivtv0: Initializing card #0 ivtv0: Autodetected Hauppauge card (cx23415 based) ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 300) ata2.00: ATA-7: ST3320620AS, 3.AAK, max UDMA/133 ata2.00: 625142448 sectors, multi 0: LBA48 NCQ (depth 0/32) tveeprom 0-0050: Hauppauge model 48139, rev K257, serial# 2962352 tveeprom 0-0050: tuner model is Philips FM1216 ME MK3 (idx 57, type 38) tveeprom 0-0050: TV standards PAL(B/G) PAL(I) SECAM(L/L') PAL(D/D1/K) (eeprom 0x74) tveeprom 0-0050: audio processor is MSP4418 (idx 25) tveeprom 0-0050: decoder processor is SAA7115 (idx 19) tveeprom 0-0050: has radio, has IR receiver, has no IR transmitter ivtv0: Autodetected Hauppauge WinTV PVR-350 ata2.00: configured for UDMA/133 usb 1-1: new full speed USB device using ohci_hcd and address 2 scsi 2:0:0:0: Direct-Access ATA ST3320620AS 3.AA PQ: 0 ANSI: 5 sd 2:0:0:0: [sdb] 625142448 512-byte hardware sectors (320073 MB) sd 2:0:0:0: [sdb] Write Protect is off sd 2:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA sd 2:0:0:0: [sdb] 625142448 512-byte hardware sectors (320073 MB) sd 2:0:0:0: [sdb] Write Protect is off sd 2:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA sdb:<6>usb 1-1: configuration #1 chosen from 1 choice hub 1-1:1.0: USB hub found sdb1 sdb2 sdb3 sdb4 tuner 0-0043: chip found @ 0x86 (ivtv i2c driver #0) tda9887 0-0043: tda988[5/6/7] found @ 0x43 (tuner) tuner 0-0043: type set to tda9887 hub 1-1:1.0: 4 ports detected sd 2:0:0:0: [sdb] Attached SCSI disk scsi_id[686]: scusb 1-1.3: new low speed USB device using ohci_hcd and address 3 si_id: unable to access '/block' scsi_id[707]: scusb 1-1.3: configuration #1 chosen from 1 choice si_id: unable to access '/block' tuner 0-0061: chip found @ 0xc2 (ivtv i2c driver #0) scsi_id[727]: scsi_id: unable to access '/block' scsi_id[739]: scsi_id: unable tousb 1-1.4: new full speed USB device using ohci_hcd and address 4 access '/block' scsi_id[752]: scsi_id: unable tosaa7115 0-0021: saa7115 found (1f7115d0e100000) @ 0x42 (ivtv i2c driver #0) access '/block' saa7127 0-0044: saa7129 found @ 0x88 (ivtv i2c driver #0) msp3400 0-0040: MSP4418G-B3 found @ 0x80 (ivtv i2c driver #0) msp3400 0-0040: MSP4418G-B3 supports nicam and radio, mode is autodetect and autoselect usb 1-1.4: configuration #1 chosen from 1 choice input: HID 04d9:0499 as /devices/pci0000:00/0000:00:07.3/usb1/1-1/1-1.3/1-1.3:1.0/input/input2 input: USB HID v1.10 Mouse [HID 04d9:0499] on usb-0000:00:07.3-1.3 usbcore: registered new interface driver usbhid drivers/hid/usbhid/hid-core.c: v2.6:USB HID core driver usblp0: USB Bidirectional printer dev 4 if 1 alt 0 proto 2 vid 0x03F0 pid 0x3B11 usbcore: registered new interface driver usblp Initializing USB Mass Storage driver... tuner-simple 0-0061: type set to 38 (Philips PAL/SECAM multi (FM1216ME MK3)) tuner 0-0061: type set to Philips PAL/SECAM m ivtv0: Registered device video0 for encoder MPG (4096 kB) ivtv0: Registered device video32 for encoder YUV (2048 kB) ivtv0: Registered device vbi0 for encoder VBI (1024 kB) ivtv0: Registered device video24 for encoder PCM (320 kB) ivtv0: Registered device radio0 for encoder radio ivtv0: Registered device video16 for decoder MPG (1024 kB) ivtv0: Registered device vbi8 for decoder VBI (64 kB) ivtv0: Registered device vbi16 for decoder VOUT ivtv0: Registered device video48 for decoder YUV (1024 kB) ivtv0: Initialized card #0: Hauppauge WinTV PVR-350 ivtv: End initialization scsi3 : SCSI emulation for USB Mass Storage devices usbcore: registered new interface driver usb-storage USB Mass Storage support registered. done. Setting parameters of disc: (none). Setting the system clock. System Clock set to: Fri Nov 30 22:01:02 UTC 2007. Activating swap:swapon: cannot canonicalize /dev/sdb2: No such file or directory swapon on /dev/sdb2 swapon: cannot stat /dev/sdb2: No such file or directory ^[[31mfailed!^[[39;49m Will now check root file system:fsck 1.40.2 (12-Jul-2007) [/sbin/fsck.ext3 (1) -- /lib/init/rw/rootdev] fsck.ext3 -a -C0 /lib/init/rw/rootdev /lib/init/rw/rootdev: clean, 192EXT3 FS on sda3, 086/1251712 fileinternal journal s, 1377761/2500000 blocks .. Setting the system clock. System Clock set to: Fri Nov 30 22:01:03 UTC 2007. Cleaning up ifupdown.... Loading device-mapper support. Loading device-mapper support. Will now check all file systems. fsck 1.40.2 (12-Jul-2007) Checking all file systems. [/sbin/fsck.ext2 (1) -- /boot] fsck.ext2 -a -C0 /dev/sda2 [/sbin/fsck.ext3 (2) -- /video] fsck.ext3 -a /dev/sdb4 fsck.ext3: No such file or directory while trying to open /dev/sdb4 /dev/sdb4: The superblock could not be read or does not describe a correct ext2 filesystem. If the device is valid and it really contains an ext2 filesystem (and not swap or ufs or something else), then the superblock is corrupt, and you might try running e2fsck with an alternate superblock: e2fsck -b 8193 <device> fsck.ext2: No such file or directory while trying to open /dev/sda2 /dev/sda2: The superblock could not be read or does not describe a correct ext2 filesystem. If the device is valid and it really contains an ext2 filesystem (and not swap or ufs or something else), then the superblock is corrupt, and you might try running e2fsck with an alternate superblock: e2fsck -b 8193 <device> [/sbin/fsck.ext3 (1) -- /home] fsck.ext3 -a -C0 /dev/sda5 fsck.ext3: No such file or directory while trying to open /dev/sda5 /dev/sda5: The superblock could not be read or does not describe a correct ext2 filesystem. If the device is valid and it really contains an ext2 filesystem (and not swap or ufs or something else), then the superblock is corrupt, and you might try running e2fsck with an alternate superblock: e2fsck -b 8193 <device> [/sbin/fsck.ext3 (1) -- /data2] fsck.ext3 -a -C0 /dev/hde1 fsck.ext3: No such file or directory while trying to open /dev/hde1 /dev/hde1: The superblock could not be read or does not describe a correct ext2 filesystem. If the device is valid and it really contains an ext2 filesystem (and not swap or ufs or something else), then the superblock is corrupt, and you might try running e2fsck with an alternate superblock: e2fsck -b 8193 <device> fsck died with exit status 8 ^[[31m*^[[39;49m File system check failed. A log is being saved in /var/log/fsck/checkfs if that location is writable. Please repair the file system manually. ^[[33mscsi 3:0:0:0: Direct-Access HP 1.00 PQ: 0 ANSI: 2 *^[[39;49m A maintenance shell will now be startesd 3:0:0:0: [sdc] Attached SCSI removable disk d. CONTROL-D will terminate this shell and resume system boot. Give root password for maintenance (or type Control-D to continue): scsi_id[1121]: scsi_id: unable to access '/block' alpha:~# df -k Filesystem 1K-blocks Used Available Use% Mounted on /dev/sda3 9842848 5353892 3988956 58% / tmpfs 515864 0 515864 0% /lib/init/rw udev 10240 48 10192 1% /dev tmpfs 515864 0 515864 0% /dev/shm alpha:~# cat /proc/mounts rootfs / rootfs rw 0 0 /dev/root / ext3 rw,errors=remount-ro,data=ordered 0 0 tmpfs /lib/init/rw tmpfs rw,nosuid 0 0 proc /proc proc rw,nosuid,nodev,noexec 0 0 sysfs /sys sysfs rw,nosuid,nodev,noexec 0 0 /dev/root /dev/.static/dev ext3 rw,errors=remount-ro,data=ordered 0 0 tmpfs /dev tmpfs rw 0 0 tmpfs /dev/shm tmpfs rw,nosuid,nodev 0 0 devpts /dev/pts devpts rw,nosuid,noexec 0 0 alpha:~# ls -l /dev/s* crw-rw---- 1 root audio 14, 1 Dec 1 11:00 /dev/sequencer crw-rw---- 1 root audio 14, 8 Dec 1 11:00 /dev/sequencer2 lrwxrwxrwx 1 root root 24 Dec 1 11:00 /dev/sndstat -> /proc/asound/oss/sndstat lrwxrwxrwx 1 root root 15 Dec 1 11:00 /dev/stderr -> /proc/self/fd/2 lrwxrwxrwx 1 root root 15 Dec 1 11:00 /dev/stdin -> /proc/self/fd/0 lrwxrwxrwx 1 root root 15 Dec 1 11:00 /dev/stdout -> /proc/self/fd/1 /dev/shm: total 0 /dev/snd: total 0 crw-rw---- 1 root audio 116, 0 Dec 1 11:00 controlC0 crw-rw---- 1 root audio 116, 32 Dec 1 11:01 controlC1 crw-rw---- 1 root audio 116, 36 Dec 1 11:01 hwC1D0 crw-rw---- 1 root audio 116, 8 Dec 1 11:00 midiC0D0 crw-rw---- 1 root audio 116, 40 Dec 1 11:01 midiC1D0 crw-rw---- 1 root audio 116, 24 Dec 1 11:00 pcmC0D0c crw-rw---- 1 root audio 116, 16 Dec 1 11:00 pcmC0D0p crw-rw---- 1 root audio 116, 56 Dec 1 11:01 pcmC1D0c crw-rw---- 1 root audio 116, 48 Dec 1 11:01 pcmC1D0p crw-rw---- 1 root audio 116, 49 Dec 1 11:01 pcmC1D1p crw-rw---- 1 root audio 116, 58 Dec 1 11:01 pcmC1D2c crw-rw---- 1 root audio 116, 50 Dec 1 11:01 pcmC1D2p crw-rw---- 1 root audio 116, 1 Dec 1 11:00 seq crw-rw---- 1 root audio 116, 33 Dec 1 11:00 timer alpha:~# cat /proc/cpuinfo cpu : Alpha cpu model : EV67 cpu variation : 7 cpu revision : 0 cpu serial number : system type : Tsunami system variation : Monet system revision : 0 system serial number : AY0 cycle frequency [Hz] : 666666666 timer frequency [Hz] : 1024.00 page size [bytes] : 8192 phys. address bits : 44 max. addr. space # : 255 BogoMIPS : 1326.92 kernel unaligned acc : 0 (pc=0,va=0) user unaligned acc : 0 (pc=0,va=0) platform string : COMPAQ Professional Workstation XP1000 cpus detected : 1 L1 Icache : 64K, 2-way, 64b line L1 Dcache : 64K, 2-way, 64b line L2 cache : 4096K, 1-way, 64b line L3 cache : n/a alpha:~# cat /proc/modules usb_storage 48849 0 - Live 0xfffffffc003f2000 usblp 15598 0 - Live 0xfffffffc003d2000 msp3400 35516 0 - Live 0xfffffffc00372000 saa7127 13680 0 - Live 0xfffffffc003c2000 saa7115 18260 0 - Live 0xfffffffc003ba000 usbhid 24300 0 - Live 0xfffffffc00380000 snd_cmipci 42696 0 - Live 0xfffffffc003ac000 snd_es18xx 43100 0 - Live 0xfffffffc0039e000 tuner 46548 0 - Live 0xfffffffc0038c000 tea5767 7044 1 tuner, Live 0xfffffffc00358000 snd_pcm_oss 31181 0 - Live 0xfffffffc0034e000 tda8290 13588 1 tuner, Live 0xfffffffc00338000 tuner_simple 9592 1 tuner, Live 0xfffffffc00340000 snd_mixer_oss 20609 1 snd_pcm_oss, Live 0xfffffffc00346000 mt20xx 14104 1 tuner, Live 0xfffffffc00290000 tea5761 5892 1 tuner, Live 0xfffffffc002f4000 snd_pcm 78993 3 snd_cmipci,snd_es18xx,snd_pcm_oss, Live 0xfffffffc0035c000 snd_page_alloc 10984 1 snd_pcm, Live 0xfffffffc002de000 snd_opl3_lib 12041 2 snd_cmipci,snd_es18xx, Live 0xfffffffc002d8000 snd_hwdep 9904 1 snd_opl3_lib, Live 0xfffffffc002d2000 snd_mpu401_uart 9784 2 snd_cmipci,snd_es18xx, Live 0xfffffffc002cc000 snd_seq_dummy 3852 0 - Live 0xfffffffc002b4000 snd_seq_oss 42876 0 - Live 0xfffffffc002e6000 snd_seq_midi 9192 0 - Live 0xfffffffc002ae000 snd_seq_midi_event 8256 2 snd_seq_oss,snd_seq_midi, Live 0xfffffffc002a8000 snd_seq 68264 6 snd_seq_dummy,snd_seq_oss,snd_seq_midi,snd_seq_midi_event, Live 0xfffffffc00324000 snd_timer 24984 3 snd_pcm,snd_opl3_lib,snd_seq, Live 0xfffffffc002c2000 snd_rawmidi 26433 2 snd_mpu401_uart,snd_seq_midi, Live 0xfffffffc002b8000 snd_seq_device 9080 6 snd_opl3_lib,snd_seq_dummy,snd_seq_oss,snd_seq_midi,snd_seq,snd_rawmidi, Live 0xfffffffc00298000 ivtv 166124 0 - Live 0xfffffffc002f8000 ohci_hcd 25196 0 - Live 0xfffffffc0029e000 firmware_class 10420 1 ivtv, Live 0xfffffffc0026e000 i2c_algo_bit 8196 1 ivtv, Live 0xfffffffc00248000 snd 64992 13 snd_cmipci,snd_es18xx,snd_pcm_oss,snd_mixer_oss,snd_pcm,snd_opl3_lib,snd_hwdep,snd_mpu401_uart,snd_seq_oss,snd_seq,snd_timer,snd_rawmidi,snd_seq_device, Live 0xfffffffc0025c000 ide_cd 46456 0 - Live 0xfffffffc00282000 soundcore 7744 1 snd, Live 0xfffffffc0024e000 cdrom 42408 1 ide_cd, Live 0xfffffffc00274000 cx2341x 14156 1 ivtv, Live 0xfffffffc001f6000 tveeprom 18336 1 ivtv, Live 0xfffffffc00254000 sata_uli 6740 0 - Live 0xfffffffc00240000 i2c_core 25850 12 msp3400,saa7127,saa7115,tuner,tea5767,tda8290,tuner_simple,mt20xx,tea5761,ivtv,i2c_algo_bit,tveeprom, Live 0xfffffffc00208000 pcspkr 3584 0 - Live 0xfffffffc00200000 serio_raw 7556 0 - Live 0xfffffffc001fc000 evdev 12617 0 - Live 0xfffffffc001de000 videodev 31449 1 ivtv, Live 0xfffffffc001ca000 v4l2_common 19560 7 msp3400,saa7127,saa7115,tuner,ivtv,cx2341x,videodev, Live 0xfffffffc001d6000 v4l1_compat 14948 2 ivtv,videodev, Live 0xfffffffc001c0000 tulip 57160 0 - Live 0xfffffffc001e6000 pata_cypress 6040 0 - Live 0xfffffffc001c6000 usbcore 158580 5 usb_storage,usblp,usbhid,ohci_hcd, Live 0xfffffffc00212000 alpha:~# cat /proc/ioports 00000000-01ffffff : PCI IO bus 0 00000000-0000001f : dma1 00000020-0000003f : pic1 00000040-0000005f : timer 00000060-0000006f : keyboard 00000070-00000080 : rtc 00000070-00000077 : rtc 000000a0-000000bf : pic2 000000c0-000000df : dma2 00000220-0000022f : ES18xx 000002f8-000002ff : serial 00000330-00000331 : MPU401 UART 000003f2-000003f5 : floppy 000003f7-000003f7 : floppy DIR 000003f8-000003ff : serial 00008000-000080ff : 0000:00:0b.0 00008400-0000840f : 0000:00:07.1 00008400-00008407 : ide0 200000000-201ffffff : PCI IO bus 1 200008000-200008fff : PCI Bus #02 200008000-2000080ff : 0001:02:0a.0 200008000-2000080ff : CMI8738-MC6 200008400-20000840f : 0001:02:09.0 200008400-20000840f : libata 200008410-20000841f : 0001:02:09.1 200008410-200008417 : ide1 200008418-20000841f : ide2 200008420-200008427 : 0001:02:09.0 200008420-200008427 : libata 200008428-20000842f : 0001:02:09.0 200008428-20000842f : libata 200008430-200008437 : 0001:02:09.1 200008438-20000843f : 0001:02:09.1 200008438-20000843f : ide2 200008440-200008443 : 0001:02:09.0 200008440-200008443 : libata 200008444-200008447 : 0001:02:09.0 200008444-200008447 : libata 200008448-20000844b : 0001:02:09.1 20000844c-20000844f : 0001:02:09.1 20000844e-20000844e : ide2 200009000-2000090ff : 0001:01:06.0 200009400-20000947f : 0001:01:03.0 200009400-20000947f : tulip alpha:~# cat /proc/iomem 00000000-3fffffff : PCI mem bus 0 09000000-0901ffff : 0000:00:0b.0 09020000-0902ffff : 0000:00:07.2 09030000-0903ffff : 0000:00:0b.0 09040000-0904ffff : 0000:00:0b.1 09050000-09050fff : 0000:00:07.3 09050000-09050fff : ohci_hcd 0c000000-0fffffff : 0000:00:0c.0 0c000000-0c7fffff : ivtv encoder 0d000000-0d7fffff : ivtv decoder 0e000000-0e00ffff : ivtv registers 10000000-17ffffff : 0000:00:0b.0 18000000-1fffffff : 0000:00:0b.1 200000000-23fffffff : PCI mem bus 1 209000000-2090fffff : PCI Bus #02 209000000-20900ffff : 0001:02:09.0 209100000-20913ffff : 0001:01:03.0 209140000-20914ffff : 0001:01:06.0 209150000-209150fff : 0001:01:06.0 209151000-2091513ff : 0001:01:03.0 209151000-2091513ff : tulip alpha:~# lspci -vvv 0000:00:07.0 ISA bridge: Contaq Microsystems 82c693 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 0 0000:00:07.1 IDE interface: Contaq Microsystems 82c693 (prog-if 80 [Master]) Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 0 Interrupt: pin A routed to IRQ 14 Region 0: I/O ports at 01f0 [size=8] Region 1: I/O ports at 03f4 [size=1] Region 2: [virtual] Memory at 00000170 (32-bit, non-prefetchable) [disabled] [size=8] Region 3: [virtual] Memory at 00000370 (type 3, non-prefetchable) [disabled] [size=1] Region 4: I/O ports at 8400 [size=16] 0000:00:07.2 IDE interface: Contaq Microsystems 82c693 (prog-if 00 []) Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 0 Interrupt: pin B routed to IRQ 15 Region 0: I/O ports at 01f0 [size=8] Region 1: I/O ports at 03f4 [size=1] Region 2: [virtual] Memory at 00000170 (32-bit, non-prefetchable) [disabled] [size=8] Region 3: [virtual] Memory at 00000370 (type 3, non-prefetchable) [disabled] [size=1] Region 4: Memory at 09020000 (32-bit, non-prefetchable) [disabled] [size=64K] 0000:00:07.3 USB Controller: Contaq Microsystems 82c693 (prog-if 10 [OHCI]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 248, Cache Line Size: 32 bytes Interrupt: pin A routed to IRQ 10 Region 0: Memory at 09050000 (32-bit, non-prefetchable) [size=4K] 0000:00:0b.0 VGA compatible controller: ATI Technologies Inc RV280 [Radeon 9200 PRO] (rev 01) (prog-if 00 [VGA]) Subsystem: ATI Technologies Inc RV280 [Radeon 9200 PRO] Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 255 (2000ns min), Cache Line Size: 64 bytes Interrupt: pin A routed to IRQ 40 Region 0: Memory at 10000000 (32-bit, prefetchable) [size=128M] Region 1: I/O ports at 8000 [size=256] Region 2: Memory at 09030000 (32-bit, non-prefetchable) [size=64K] Expansion ROM at 09000000 [disabled] [size=128K] Capabilities: [50] Power Management version 2 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- 0000:00:0b.1 Display controller: ATI Technologies Inc RV280 [Radeon 9200 PRO] (Secondary) (rev 01) Subsystem: ATI Technologies Inc Unknown device 5961 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 255 (2000ns min), Cache Line Size: 64 bytes Region 0: Memory at 18000000 (32-bit, prefetchable) [size=128M] Region 1: Memory at 09040000 (32-bit, non-prefetchable) [size=64K] Capabilities: [50] Power Management version 2 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- 0000:00:0c.0 Multimedia video controller: Internext Compression Inc iTVC15 MPEG-2 Encoder (rev 01) Subsystem: Hauppauge computer works Inc. WinTV PVR-350 Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 252 (32000ns min, 2000ns max), Cache Line Size: 64 bytes Interrupt: pin A routed to IRQ 36 Region 0: Memory at 0c000000 (32-bit, prefetchable) [size=64M] Capabilities: [44] Power Management version 2 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- 0001:01:03.0 Ethernet controller: Digital Equipment Corporation DECchip 21142/43 (rev 41) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 255 (5000ns min, 10000ns max), Cache Line Size: 64 bytes Interrupt: pin A routed to IRQ 45 Region 0: I/O ports at 200009400 [size=128] Region 1: Memory at 209151000 (32-bit, non-prefetchable) [size=1K] Expansion ROM at 209100000 [disabled] [size=256K] 0001:01:06.0 SCSI storage controller: QLogic Corp. ISP1020 Fast-wide SCSI (rev 06) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 248, Cache Line Size: 64 bytes Interrupt: pin A routed to IRQ 47 Region 0: I/O ports at 200009000 [size=256] Region 1: Memory at 209150000 (32-bit, non-prefetchable) [size=4K] Expansion ROM at 209140000 [disabled] [size=64K] Capabilities: [44] Power Management version 1 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- 0001:01:08.0 PCI bridge: Digital Equipment Corporation DECchip 21152 (rev 03) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 255, Cache Line Size: 64 bytes Bus: primary=01, secondary=02, subordinate=02, sec-latency=255 I/O behind bridge: 00008000-00008fff Memory behind bridge: 09000000-090fffff Prefetchable memory behind bridge: 00000000fff00000-00000000000fffff Secondary status: 66MHz- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort+ <SERR- <PERR- BridgeCtl: Parity+ SERR+ NoISA- VGA- MAbort- >Reset- FastB2B- Capabilities: [dc] Power Management version 1 Flags: PMEClk- DSI- D1- D2- AuxCurrent=220mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Bridge: PM- B3+ 0001:02:09.0 Mass storage controller: ALi Corporation ALi M5281 Serial ATA / RAID Host Controller (rev a1) (prog-if 85) Subsystem: ALi Corporation Unknown device 5283 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 255, Cache Line Size: 64 bytes Interrupt: pin A routed to IRQ 28 Region 0: I/O ports at 200008420 [size=8] Region 1: I/O ports at 200008440 [size=4] Region 2: I/O ports at 200008428 [size=8] Region 3: I/O ports at 200008444 [size=4] Region 4: I/O ports at 200008400 [size=16] Expansion ROM at 209000000 [disabled] [size=64K] 0001:02:09.1 Mass storage controller: ALi Corporation M5228 ALi ATA/RAID Controller (rev c6) (prog-if 85) Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 255 Interrupt: pin A routed to IRQ 28 Region 0: I/O ports at 200008430 [size=8] Region 1: I/O ports at 200008448 [size=4] Region 2: I/O ports at 200008438 [size=8] Region 3: I/O ports at 20000844c [size=4] Region 4: I/O ports at 200008410 [size=16] 0001:02:0a.0 Multimedia audio controller: C-Media Electronics Inc CM8738 (rev 10) Subsystem: C-Media Electronics Inc CMI8738/C3DX PCI Audio Device Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 255 (500ns min, 6000ns max) Interrupt: pin A routed to IRQ 24 Region 0: I/O ports at 200008000 [size=256] Capabilities: [c0] Power Management version 2 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- alpha:~# cat /proc/scsi/scsi Attached devices: Host: scsi0 Channel: 00 Id: 01 Lun: 00 Vendor: SEAGATE Model: ST336706LW Rev: 0109 Type: Direct-Access ANSI SCSI revision: 03 Host: scsi2 Channel: 00 Id: 00 Lun: 00 Vendor: ATA Model: ST3320620AS Rev: 3.AA Type: Direct-Access ANSI SCSI revision: 05 Host: scsi3 Channel: 00 Id: 00 Lun: 00 Vendor: HP Model: Rev: 1.00 Type: Direct-Access ANSI SCSI revision: 02 ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-11-30 22:30 ` Michael Cree @ 2007-11-30 22:42 ` Andrew Morton 2007-11-30 23:26 ` Rafael J. Wysocki 2007-12-02 20:53 ` Michael Cree 0 siblings, 2 replies; 40+ messages in thread From: Andrew Morton @ 2007-11-30 22:42 UTC (permalink / raw) To: Michael Cree Cc: rct, linux-kernel, rjw, rth, ink, linux-scsi, Kay Sievers, Greg KH On Sat, 01 Dec 2007 11:30:01 +1300 Michael Cree <mcree@orcon.net.nz> wrote: > Bob Tracy wrote: > > Andrew Morton wrote: > >> Could be something change in sysfs. Please double-check the config > >> options, make sure that something important didn't get disabled. > >> > > Here's > > hoping someone else is seeing this or can replicate it in the meantime. > > Snap. > > 2.6.24-rc2 works fine. 2.6.24-rc3 boots on Alpha but once /dev is > populated no partitions of the scsi sub-system are seen. Looks like ide > sub-system similarly affected. Rafael, I assume you have this regression in the list? > Managed to get boot log. Follows below (with output of various /proc info). > > Cheerz > Michael. > > > Linux version 2.6.24-rc3 (mjc@alpha) (gcc version 4.1.3 20071019 > (prerelease) (Debian 4.1.2-17)) #1 Mon Nov 26 19:28:58 NZDT 2007 > Booting on Tsunami variation Monet using machine vector Monet from SRM > Major Options: EV67 LEGACY_START VERBOSE_MCHECK > Command line: ro root=/dev/sda3 console=ttyS0 > memcluster 0, usage 1, start 0, end 215 > memcluster 1, usage 0, start 215, end 131062 > memcluster 2, usage 1, start 131062, end 131072 > freeing pages 215:384 > freeing pages 930:131062 > reserving pages 930:932 > 4096K Bcache detected; load hit latency 21 cycles, load miss latency 127 > cycles > Console graphics on hose 0 > Built 1 zonelists in Zone order, mobility grouping on. Total pages: 130167 > Kernel command line: ro root=/dev/sda3 console=ttyS0 > PID hash table entries: 4096 (order: 12, 32768 bytes) > Using epoch = 2000 > Turning on RTC interrupts. > Console: colour VGA+ 80x25 > console [ttyS0] enabled > Dentry cache hash table entries: 131072 (order: 7, 1048576 bytes) > Inode-cache hash table entries: 65536 (order: 6, 524288 bytes) > Memory: 1030896k/1048496k available (2786k kernel code, 15216k reserved, > 370k data, 168k init) > Mount-cache hash table entries: 512 > net_namespace: 120 bytes > NET: Registered protocol family 16 > PCI: Bridge: 0001:01:08.0 > IO window: 8000-8fff > MEM window: 09000000-090fffff > PREFETCH window: disabled. > SMC37c669 Super I/O Controller found @ 0x3f0 > Linux Plug and Play Support v0.97 (c) Adam Belay > SCSI subsystem initialized > NET: Registered protocol family 2 > IP route cache hash table entries: 8192 (order: 3, 65536 bytes) > TCP established hash table entries: 32768 (order: 6, 524288 bytes) > TCP bind hash table entries: 32768 (order: 5, 262144 bytes) > TCP: Hash tables configured (established 32768 bind 32768) > TCP reno registered > srm_env: version 0.0.6 loaded successfully > io scheduler noop registered > io scheduler cfq registered (default) > tridentfb: Trident framebuffer 0.7.8-NEWAPI initializing > isapnp: Scanning for PnP cards... > isapnp: No Plug & Play device found > rtc: SRM (post-2000) epoch (2000) detected > Real Time Clock Driver v1.12ac > Serial: 8250/16550 driver $Revision: 1.90 $ 4 ports, IRQ sharing enabled > serial8250: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A > serial8250: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A > Floppy drive(s): fd0 is 2.88M > FDC 0 is a post-1991 82077 > Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2 > ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx > CY82C693: IDE controller (0x1080:0xc693 rev 0x00) at PCI slot 0000:00:07.1 > CY82C693: not 100% native mode: will probe irqs later > CY82C693U driver v0.34 99-13-12 Andreas S. Krebs (akrebs@altavista.net) > ide0: BM-DMA at 0x8400-0x8407, BIOS settings: hda:pio, hdb:pio > CY82C693: port 0x01f0 already claimed by ide0 > ALI15X3: IDE controller (0x10b9:0x5228 rev 0xc6) at PCI slot 0001:02:09.1 > ALI15X3: 100% native mode on irq 28 > ide1: BM-DMA at 0x200008410-0x200008417, BIOS settings: hdc:DMA, > hdd:DMA > ide2: BM-DMA at 0x200008418-0x20000841f, BIOS settings: hde:pio, > hdf:pio > hdf: LITE-ON DVDRW SOHW-1653S, ATAPI CD/DVD-ROM drive > hde: ST3200822A, ATA DISK drive > ide2 at 0x200008438-0x20000843f,0x20000844e on irq 28 > hde: max request size: 512KiB > hde: 390721968 sectors (200049 MB) w/8192KiB Cache, CHS=24321/255/63, > UDMA(100) > hde: cache flushes supported > hde: hde1 > qla1280: QLA1040 found on PCI bus 1, dev 6 > scsi(0:0): Resetting SCSI BUS > scsi0 : QLogic QLA1040 PCI to SCSI Host Adapter > Firmware version: 7.65.06, Driver version 3.26 > serio: i8042 KBD port at 0x60,0x64 irq 1 > serio: i8042 AUX port at 0x60,0x64 irq 12 > mice: PS/2 mouse device common for all mice > scsi 0:0:1:0: Direct-Access SEAGATE ST336706LW 0109 PQ: 0 ANSI: 3 > scsi(0:0:1:0): Sync: period 10, offset 12, Wide > input: AT Raw Set 2 keyboard as /devices/platform/i8042/serio0/input/input0 > atkbd.c: keyboard reset failed on isa0060/serio1 > TCP cubic registered > Initializing XFRM netlink socket > NET: Registered protocol family 1 > NET: Registered protocol family 17 > NET: Registered protocol family 15 > scsi: waiting for bus probes to complete ... > sd 0:0:1:0: [sda] 71687370 512-byte hardware sectors (36704 MB) > sd 0:0:1:0: [sda] Write Protect is off > sd 0:0:1:0: [sda] Write cache: enabled, read cache: enabled, supports > DPO and FUA > sd 0:0:1:0: [sda] 71687370 512-byte hardware sectors (36704 MB) > sd 0:0:1:0: [sda] Write Protect is off > sd 0:0:1:0: [sda] Write cache: enabled, read cache: enabled, supports > DPO and FUA > sda: sda1 sda2 sda3 sda4 sda5 > sd 0:0:1:0: [sda] Attached SCSI disk > kjournald starting. Commit interval 5 seconds > EXT3-fs: mounted filesystem with ordered data mode. > VFS: Mounted root (ext3 filesystem) readonly. > Freeing unused kernel memory: 168k freed > Mount failed for selinuxfs on /selinux: No such file or directory > INIT: version 2.86 booting > Starting the hotplug events dispatcher: udevd. > Synthesizing the initial hotplug events...done. > Waiting for /dev to be fully populated...scsi_id[321]: scsi_id: unable > to access '/block' > end_request: I/O error, dev fd0, sector 0 > scsi_id[371]: scsi_id: unable to access '/block' > usbcore: registered new interface driver usbfs > usbcore: registered new interface driver hub > Linux Tulip driver version 1.1.15 (Feb 27, 2007) > tulip0: EEPROM default media type Autosense. > tulip0: Index #0 - Media 10baseT (#0) described by a 21142 Serial PHY > (2) block. > tulip0: Index #1 - Media 10base2 (#1) described by a 21142 Serial PHY > (2) block. > tulip0: Index #2 - Media AUI (#2) described by a 21142 Serial PHY (2) > block. > tulip0: Index #3 - Media 10baseT-FDX (#4) described by a 21142 Serial > PHY (2) block. > tulip0: Index #4 - Media 100baseTx (#3) described by a 21143 SYM PHY > (4) block. > tulip0: Index #5 - Media 100baseTx-FDX (#5) described by a 21143 SYM > PHY (4) block. > eth0: Digital DS21142/43 Tulip rev 65 at Port 0x200009400, > 08:00:2b:87:4c:b0, IRQ 45. > Linux video capture interface: v2.00 > scsi_id[402]: scsi_id: unable to access '/block' I guess this is where things go bad. scsi_id is part of udev. Perhaps some sysfs nodes aren't being created correctly. Random guess: what is your setting of CONFIG_SCSI_SCAN_ASYNC and what happens if you invert it? > input: PC Speaker as /devices/platform/pcspkr/input/input1 > scsi_id[422]: scsi_id: unable to access '/block' > sata_uli 0001:02:09.0: version 1.3 > scsi1 : sata_uli > scsi2 : sata_uli > ata1: SATA max UDMA/133 cmd 0x200008420 ctl 0x200008440 bmdma > 0x200008400 irq 28 > ata2: SATA max UDMA/133 cmd 0x200008428 ctl 0x200008444 bmdma > 0x200008408 irq 28 > usbcore: registered new device driver usb > hdf: ATAPI 48X DVD-ROM DVD-R CD-R/RW drive, 2048kB Cache, UDMA(66) > Uniform CD-ROM driver Revision: 3.20 > scsi_id[469]: scsi_id: unable to access '/block' > ata1: SATA link down (SStatus 0 SControl 300) > ohci_hcd 0000:00:07.3: OHCI Host Controller > ohci_hcd 0000:00:07.3: new USB bus registered, assigned bus number 1 > ohci_hcd 0000:00:07.3: irq 10, io mem 0x09050000 > scsi_id[490]: scsi_id: unable toivtv: Start initialization, version 1.1.0 > access '/block' > scsi_id[510]: scsi_id: unable to access '/block' > scsi_id[532]: scsi_id: unable to access '/block' > scsi_id[552]: scsi_id: unable to access '/block' > usb usb1: configuration #1 chosen from 1 choice > hub 1-0:1.0: USB hub found > hub 1-0:1.0: 2 ports detected > scsi_id[600]: scsi_id: unable to access '/block' > ivtv0: Initializing card #0 > ivtv0: Autodetected Hauppauge card (cx23415 based) > ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 300) > ata2.00: ATA-7: ST3320620AS, 3.AAK, max UDMA/133 > ... ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-11-30 22:42 ` Andrew Morton @ 2007-11-30 23:26 ` Rafael J. Wysocki 2007-12-02 20:53 ` Michael Cree 1 sibling, 0 replies; 40+ messages in thread From: Rafael J. Wysocki @ 2007-11-30 23:26 UTC (permalink / raw) To: Andrew Morton Cc: Michael Cree, rct, linux-kernel, rth, ink, linux-scsi, Kay Sievers, Greg KH On Friday, 30 of November 2007, Andrew Morton wrote: > On Sat, 01 Dec 2007 11:30:01 +1300 > Michael Cree <mcree@orcon.net.nz> wrote: > > > Bob Tracy wrote: > > > Andrew Morton wrote: > > >> Could be something change in sysfs. Please double-check the config > > >> options, make sure that something important didn't get disabled. > > >> > > > Here's > > > hoping someone else is seeing this or can replicate it in the meantime. > > > > Snap. > > > > 2.6.24-rc2 works fine. 2.6.24-rc3 boots on Alpha but once /dev is > > populated no partitions of the scsi sub-system are seen. Looks like ide > > sub-system similarly affected. > > Rafael, I assume you have this regression in the list? Yes, http://bugzilla.kernel.org/show_bug.cgi?id=9457 ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-11-30 22:42 ` Andrew Morton 2007-11-30 23:26 ` Rafael J. Wysocki @ 2007-12-02 20:53 ` Michael Cree 2007-12-03 1:17 ` Bob Tracy 1 sibling, 1 reply; 40+ messages in thread From: Michael Cree @ 2007-12-02 20:53 UTC (permalink / raw) To: Andrew Morton Cc: rct, linux-kernel, rjw, rth, ink, linux-scsi, Kay Sievers, Greg KH On 1/12/2007, at 11:42 AM, Andrew Morton wrote: > On Sat, 01 Dec 2007 11:30:01 +1300 > Michael Cree <mcree@orcon.net.nz> wrote: > >> Bob Tracy wrote: >>> Andrew Morton wrote: >>>> Could be something change in sysfs. Please double-check the config >>>> options, make sure that something important didn't get disabled. >>>> >>> Here's >>> hoping someone else is seeing this or can replicate it in the >>> meantime. >> >> Snap. >> >> 2.6.24-rc2 works fine. 2.6.24-rc3 boots on Alpha but once /dev is >> populated no partitions of the scsi sub-system are seen. Looks >> like ide >> sub-system similarly affected. [snip] >> eth0: Digital DS21142/43 Tulip rev 65 at Port 0x200009400, >> 08:00:2b:87:4c:b0, IRQ 45. >> Linux video capture interface: v2.00 >> scsi_id[402]: scsi_id: unable to access '/block' > > I guess this is where things go bad. Yes, that is what I thought too. > scsi_id is part of udev. Perhaps some sysfs nodes aren't being > created > correctly. > > Random guess: what is your setting of CONFIG_SCSI_SCAN_ASYNC and what > happens if you invert it? Is set to Y. Changed it to N and recompiled kernel and restarted. No change. Same problems remain. I now realise that not only SCSI drive device nodes are not appearing in /dev, but all disc nodes are not appearing. In my case all of fd0, hde (IDE disc), hdf (CD/DVD), sda (SCSI disc), sdb (SATA disc), sdc (memory card reader), and their accompanying partition nodes, do not get made in /dev. I'm not familiar with sysfs so don't know what what I should be looking for in particular, but I did have a quick noisy around /sys and noted that the above mentioned devices are all appearing at /sys/ block with what appeared to be sensible information in the subdirectories thereof. Sorry, but it is unlikely that I'll be able to look further into this problem at the moment as I am running a conference this week and the storm is about to hit... Cheers Michael. ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-02 20:53 ` Michael Cree @ 2007-12-03 1:17 ` Bob Tracy 2007-12-04 12:16 ` Ingo Molnar 0 siblings, 1 reply; 40+ messages in thread From: Bob Tracy @ 2007-12-03 1:17 UTC (permalink / raw) To: Michael Cree Cc: Andrew Morton, linux-kernel, rjw, rth, ink, linux-scsi, Kay Sievers, Greg KH Michael Cree wrote: > On 1/12/2007, at 11:42 AM, Andrew Morton wrote: > > On Sat, 01 Dec 2007 11:30:01 +1300 > > Michael Cree <mcree@orcon.net.nz> wrote: > > > >> Bob Tracy wrote: > >>> Here's > >>> hoping someone else is seeing this or can replicate it in the > >>> meantime. > >> > >> Snap. > >> > >> 2.6.24-rc2 works fine. 2.6.24-rc3 boots on Alpha but once /dev is > >> populated no partitions of the scsi sub-system are seen. Looks > >> like ide sub-system similarly affected. > > [snip] > > >> eth0: Digital DS21142/43 Tulip rev 65 at Port 0x200009400, > >> 08:00:2b:87:4c:b0, IRQ 45. > >> Linux video capture interface: v2.00 > >> scsi_id[402]: scsi_id: unable to access '/block' > > > > I guess this is where things go bad. > > Yes, that is what I thought too. Thanks for the confirmation of the error condition. As best I can recall, your boot log is substantially the same as what I saw. Finally got back in town. Starting the git-bisect process. I've got a relatively slow network connection, and the PWS 433au isn't exactly what I would call "fast" by modern standards, so bear with me while I get things set up and crank through this. The clone of the 2.6 tree will take several more hours to finish downloading. I anticipate the best pace I'll be able to manage after that is two iterations in a 24- hour period. -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-03 1:17 ` Bob Tracy @ 2007-12-04 12:16 ` Ingo Molnar 2007-12-04 15:36 ` Bob Tracy ` (2 more replies) 0 siblings, 3 replies; 40+ messages in thread From: Ingo Molnar @ 2007-12-04 12:16 UTC (permalink / raw) To: Bob Tracy Cc: Michael Cree, Andrew Morton, linux-kernel, rjw, rth, ink, linux-scsi, Kay Sievers, Greg KH * Bob Tracy <rct@gherkin.frus.com> wrote: > Finally got back in town. Starting the git-bisect process. I've got > a relatively slow network connection, and the PWS 433au isn't exactly > what I would call "fast" by modern standards, so bear with me while I > get things set up and crank through this. The clone of the 2.6 tree > will take several more hours to finish downloading. I anticipate the > best pace I'll be able to manage after that is two iterations in a 24- > hour period. once you are done with the download of the initial cloned git repository (which is 200MB+), all the bisection steps will be local and you'll be only limited by kernel rebuild speed and by bootup and testing speed, not by network bandwidth. ( once you have the cloned repository i'd suggest for you to keep it - that way you can track susequent kernels via "git-pull" and it uses a very network-efficient delta protocol. ) Ingo ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-04 12:16 ` Ingo Molnar @ 2007-12-04 15:36 ` Bob Tracy 2007-12-05 17:30 ` Bob Tracy 2007-12-07 0:16 ` Bob Tracy 2 siblings, 0 replies; 40+ messages in thread From: Bob Tracy @ 2007-12-04 15:36 UTC (permalink / raw) To: Ingo Molnar Cc: Michael Cree, Andrew Morton, linux-kernel, rjw, rth, ink, linux-scsi, Kay Sievers, Greg KH Ingo Molnar wrote: > once you are done with the download of the initial cloned git repository > (which is 200MB+), all the bisection steps will be local and you'll be > only limited by kernel rebuild speed and by bootup and testing speed, > not by network bandwidth. ACK. Have tested two kernels in the past 24 hours, and the third is building as I type this. The builds seem to be taking about 3 hours each. First two tests good, so the offending commit is somewhere in the last 25% (roughly) of the changes between -rc2 and -rc3: git says 82 revisions left to test. Might have this painted into a corner in the next day or so. I'll try to be quick about it, since -rc4 is out. > ( once you have the cloned repository i'd suggest for you to keep it - > that way you can track susequent kernels via "git-pull" and it uses a > very network-efficient delta protocol. ) Will do... I'm in the fortunate position of having enough disk space on my Alpha that I can maintain multiple trees for this kind of effort. -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-04 12:16 ` Ingo Molnar 2007-12-04 15:36 ` Bob Tracy @ 2007-12-05 17:30 ` Bob Tracy 2007-12-07 0:16 ` Bob Tracy 2 siblings, 0 replies; 40+ messages in thread From: Bob Tracy @ 2007-12-05 17:30 UTC (permalink / raw) To: Ingo Molnar Cc: Michael Cree, Andrew Morton, linux-kernel, rjw, rth, ink, linux-scsi, Kay Sievers, Greg KH Current progress: 11 revisions left to test. The current partial "git bisect log" is available per Ingo's suggestion on bugzilla. http://bugzilla.kernel.org/show_bug.cgi?id=9457 -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-04 12:16 ` Ingo Molnar 2007-12-04 15:36 ` Bob Tracy 2007-12-05 17:30 ` Bob Tracy @ 2007-12-07 0:16 ` Bob Tracy 2007-12-07 0:33 ` Andrew Morton 2007-12-07 0:44 ` Rafael J. Wysocki 2 siblings, 2 replies; 40+ messages in thread From: Bob Tracy @ 2007-12-07 0:16 UTC (permalink / raw) To: Ingo Molnar Cc: Michael Cree, Andrew Morton, linux-kernel, rjw, rth, ink, linux-scsi, Kay Sievers, Greg KH OK. Finally have this thing painted into a corner: git has identified 6f37ac793d6ba7b35d338f791974166f67fdd9ba as the first bad commit. >From "git bisect log", this corresponds to # bad: [6f37ac793d6ba7b35d338f791974166f67fdd9ba] Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6 Here's the full log: git-bisect start # good: [9aae299f7fd1888ea3a195cfe0edef17bb647415] Linux 2.6.24-rc2 git-bisect good 9aae299f7fd1888ea3a195cfe0edef17bb647415 # bad: [f05092637dc0d9a3f2249c9b283b973e6e96b7d2] Linux 2.6.24-rc3 git-bisect bad f05092637dc0d9a3f2249c9b283b973e6e96b7d2 # good: [e6a5c27f3b0fef72e528fc35e343af4b2db790ff] Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/avi/kvm git-bisect good e6a5c27f3b0fef72e528fc35e343af4b2db790ff # good: [42614fcde7bfdcbe43a7b17035c167dfebc354dd] vmstat: fix section mismatch warning git-bisect good 42614fcde7bfdcbe43a7b17035c167dfebc354dd # bad: [a052f4473603765eb6b4c19754689977601dc1d1] Merge git://git.kernel.org/pub/scm/linux/kernel/git/sam/x86 git-bisect bad a052f4473603765eb6b4c19754689977601dc1d1 # good: [d8e5219f9f5ca7518eb820db9f3d287a1d46fcf5] CRISv10 improve and bugfix fasttimer git-bisect good d8e5219f9f5ca7518eb820db9f3d287a1d46fcf5 # good: [d90bf5a976793edfa88d3bb2393f0231eb8ce1e5] [NET]: rt_check_expire() can take a long time, add a cond_resched() git-bisect good d90bf5a976793edfa88d3bb2393f0231eb8ce1e5 # good: [2a113281f5cd2febbab21a93c8943f8d3eece4d3] kconfig: use $K64BIT to set 64BIT with all*config targets git-bisect good 2a113281f5cd2febbab21a93c8943f8d3eece4d3 # good: [2e2cd8bad6e03ceea73495ee6d557044213d95de] CRISv10 memset library add lineendings to asm git-bisect good 2e2cd8bad6e03ceea73495ee6d557044213d95de # bad: [6f37ac793d6ba7b35d338f791974166f67fdd9ba] Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6 git-bisect bad 6f37ac793d6ba7b35d338f791974166f67fdd9ba # good: [2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3] CRISv10 fasttimer: Scrap INLINE and name timeval_cmp better git-bisect good 2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3 -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 0:16 ` Bob Tracy @ 2007-12-07 0:33 ` Andrew Morton 2007-12-07 5:07 ` Bob Tracy ` (2 more replies) 2007-12-07 0:44 ` Rafael J. Wysocki 1 sibling, 3 replies; 40+ messages in thread From: Andrew Morton @ 2007-12-07 0:33 UTC (permalink / raw) To: Bob Tracy Cc: mingo, mcree, linux-kernel, rjw, rth, ink, linux-scsi, kay.sievers, greg On Thu, 6 Dec 2007 18:16:12 -0600 (CST) rct@frus.com (Bob Tracy) wrote: > OK. Finally have this thing painted into a corner: git has identified > 6f37ac793d6ba7b35d338f791974166f67fdd9ba as the first bad commit. > > >From "git bisect log", this corresponds to > > # bad: [6f37ac793d6ba7b35d338f791974166f67fdd9ba] Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6 > > Here's the full log: > > git-bisect start > # good: [9aae299f7fd1888ea3a195cfe0edef17bb647415] Linux 2.6.24-rc2 > git-bisect good 9aae299f7fd1888ea3a195cfe0edef17bb647415 > # bad: [f05092637dc0d9a3f2249c9b283b973e6e96b7d2] Linux 2.6.24-rc3 > git-bisect bad f05092637dc0d9a3f2249c9b283b973e6e96b7d2 > # good: [e6a5c27f3b0fef72e528fc35e343af4b2db790ff] Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/avi/kvm > git-bisect good e6a5c27f3b0fef72e528fc35e343af4b2db790ff > # good: [42614fcde7bfdcbe43a7b17035c167dfebc354dd] vmstat: fix section mismatch warning > git-bisect good 42614fcde7bfdcbe43a7b17035c167dfebc354dd > # bad: [a052f4473603765eb6b4c19754689977601dc1d1] Merge git://git.kernel.org/pub/scm/linux/kernel/git/sam/x86 > git-bisect bad a052f4473603765eb6b4c19754689977601dc1d1 > # good: [d8e5219f9f5ca7518eb820db9f3d287a1d46fcf5] CRISv10 improve and bugfix fasttimer > git-bisect good d8e5219f9f5ca7518eb820db9f3d287a1d46fcf5 > # good: [d90bf5a976793edfa88d3bb2393f0231eb8ce1e5] [NET]: rt_check_expire() can take a long time, add a cond_resched() > git-bisect good d90bf5a976793edfa88d3bb2393f0231eb8ce1e5 > # good: [2a113281f5cd2febbab21a93c8943f8d3eece4d3] kconfig: use $K64BIT to set 64BIT with all*config targets > git-bisect good 2a113281f5cd2febbab21a93c8943f8d3eece4d3 > # good: [2e2cd8bad6e03ceea73495ee6d557044213d95de] CRISv10 memset library add lineendings to asm > git-bisect good 2e2cd8bad6e03ceea73495ee6d557044213d95de > # bad: [6f37ac793d6ba7b35d338f791974166f67fdd9ba] Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6 > git-bisect bad 6f37ac793d6ba7b35d338f791974166f67fdd9ba > # good: [2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3] CRISv10 fasttimer: Scrap INLINE and name timeval_cmp better > git-bisect good 2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3 commit 6f37ac793d6ba7b35d338f791974166f67fdd9ba Merge: 2f1f53b... d90bf5a... Author: Linus Torvalds <torvalds@woody.linux-foundation.org> Date: Wed Nov 14 18:51:48 2007 -0800 Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/n * 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6: [NET]: rt_check_expire() can take a long time, add a cond_resched() [ISDN] sc: Really, really fix warning [ISDN] sc: Fix sndpkt to have the correct number of arguments [TCP] FRTO: Clear frto_highmark only after process_frto that uses it [NET]: Remove notifier block from chain when register_netdevice_notifier f [FS_ENET]: Fix module build. [TCP]: Make sure write_queue_from does not begin with NULL ptr [TCP]: Fix size calculation in sk_stream_alloc_pskb [S2IO]: Fixed memory leak when MSI-X vector allocation fails [BONDING]: Fix resource use after free [SYSCTL]: Fix warning for token-ring from sysctl checker [NET] random : secure_tcp_sequence_number should not assume CONFIG_KTIME_S [IWLWIFI]: Not correctly dealing with hotunplug. [TCP] FRTO: Plug potential LOST-bit leak [TCP] FRTO: Limit snd_cwnd if TCP was application limited [E1000]: Fix schedule while atomic when called from mii-tool. [NETX]: Fix build failure added by 2.6.24 statistics cleanup. [EP93xx_ETH]: Build fix after 2.6.24 NAPI changes. [PKT_SCHED]: Check subqueue status before calling hard_start_xmit I'm struggling to see how any of those could have broken block device mounting on alpha. Are you sure you bisected right? ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 0:33 ` Andrew Morton @ 2007-12-07 5:07 ` Bob Tracy 2007-12-07 10:26 ` Andrew Morton 2007-12-07 11:40 ` Ingo Molnar 2007-12-07 5:42 ` Bob Tracy 2007-12-07 9:33 ` Ingo Molnar 2 siblings, 2 replies; 40+ messages in thread From: Bob Tracy @ 2007-12-07 5:07 UTC (permalink / raw) To: Andrew Morton Cc: Bob Tracy, mingo, mcree, linux-kernel, rjw, rth, ink, linux-scsi, kay.sievers, greg Andrew Morton wrote: > commit 6f37ac793d6ba7b35d338f791974166f67fdd9ba > Merge: 2f1f53b... d90bf5a... > Author: Linus Torvalds <torvalds@woody.linux-foundation.org> > Date: Wed Nov 14 18:51:48 2007 -0800 > > Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/n > > * 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6: > [NET]: rt_check_expire() can take a long time, add a cond_resched() > [ISDN] sc: Really, really fix warning > [ISDN] sc: Fix sndpkt to have the correct number of arguments > [TCP] FRTO: Clear frto_highmark only after process_frto that uses it > [NET]: Remove notifier block from chain when register_netdevice_notifier f > [FS_ENET]: Fix module build. > [TCP]: Make sure write_queue_from does not begin with NULL ptr > [TCP]: Fix size calculation in sk_stream_alloc_pskb > [S2IO]: Fixed memory leak when MSI-X vector allocation fails > [BONDING]: Fix resource use after free > [SYSCTL]: Fix warning for token-ring from sysctl checker > [NET] random : secure_tcp_sequence_number should not assume CONFIG_KTIME_S > [IWLWIFI]: Not correctly dealing with hotunplug. > [TCP] FRTO: Plug potential LOST-bit leak > [TCP] FRTO: Limit snd_cwnd if TCP was application limited > [E1000]: Fix schedule while atomic when called from mii-tool. > [NETX]: Fix build failure added by 2.6.24 statistics cleanup. > [EP93xx_ETH]: Build fix after 2.6.24 NAPI changes. > [PKT_SCHED]: Check subqueue status before calling hard_start_xmit > > I'm struggling to see how any of those could have broken block device > mounting on alpha. Are you sure you bisected right? Based on what's in that commit, it *does* appear something went wrong with bisection. If the implicated commit is the next one in time sequence relative to # good: [2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3] CRISv10 fasttimer: Scrap INLINE and name timeval_cmp better then the test of whether I bisected correctly is as simple as applying the commit and seeing if things break, because I'm running on the kernel corresponding to 2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3 right now. Let me give that a try and I'll report back. Worst case, I'll have to start over and write off the past four days... Sorry about this... -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 5:07 ` Bob Tracy @ 2007-12-07 10:26 ` Andrew Morton 2007-12-07 11:37 ` Ingo Molnar ` (2 more replies) 2007-12-07 11:40 ` Ingo Molnar 1 sibling, 3 replies; 40+ messages in thread From: Andrew Morton @ 2007-12-07 10:26 UTC (permalink / raw) To: Bob Tracy Cc: mingo, mcree, linux-kernel, rjw, rth, ink, linux-scsi, kay.sievers, greg On Thu, 6 Dec 2007 23:07:08 -0600 (CST) rct@frus.com (Bob Tracy) wrote: > Andrew Morton wrote: > > commit 6f37ac793d6ba7b35d338f791974166f67fdd9ba > > Merge: 2f1f53b... d90bf5a... > > Author: Linus Torvalds <torvalds@woody.linux-foundation.org> > > Date: Wed Nov 14 18:51:48 2007 -0800 > > > > Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/n > > > > * 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6: > > [NET]: rt_check_expire() can take a long time, add a cond_resched() > > [ISDN] sc: Really, really fix warning > > [ISDN] sc: Fix sndpkt to have the correct number of arguments > > [TCP] FRTO: Clear frto_highmark only after process_frto that uses it > > [NET]: Remove notifier block from chain when register_netdevice_notifier f > > [FS_ENET]: Fix module build. > > [TCP]: Make sure write_queue_from does not begin with NULL ptr > > [TCP]: Fix size calculation in sk_stream_alloc_pskb > > [S2IO]: Fixed memory leak when MSI-X vector allocation fails > > [BONDING]: Fix resource use after free > > [SYSCTL]: Fix warning for token-ring from sysctl checker > > [NET] random : secure_tcp_sequence_number should not assume CONFIG_KTIME_S > > [IWLWIFI]: Not correctly dealing with hotunplug. > > [TCP] FRTO: Plug potential LOST-bit leak > > [TCP] FRTO: Limit snd_cwnd if TCP was application limited > > [E1000]: Fix schedule while atomic when called from mii-tool. > > [NETX]: Fix build failure added by 2.6.24 statistics cleanup. > > [EP93xx_ETH]: Build fix after 2.6.24 NAPI changes. > > [PKT_SCHED]: Check subqueue status before calling hard_start_xmit > > > > I'm struggling to see how any of those could have broken block device > > mounting on alpha. Are you sure you bisected right? > > Based on what's in that commit, it *does* appear something went wrong > with bisection. If the implicated commit is the next one in time > sequence relative to > > # good: [2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3] CRISv10 fasttimer: Scrap INLINE and name timeval_cmp better > > then the test of whether I bisected correctly is as simple as applying > the commit and seeing if things break, because I'm running on the > kernel corresponding to 2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3 right > now. Let me give that a try and I'll report back. Worst case, I'll > have to start over and write off the past four days... Gad. I trust the second time will be faster. git-bisect _is_ very error prone. I find one of the problems is that each step is so far apart in time that you forget what you were doing. Did I remember to test that iteration? Did I install the right kernel? etc. > Sorry about this... Not appropriate ;) Thanks for helping out. ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 10:26 ` Andrew Morton @ 2007-12-07 11:37 ` Ingo Molnar 2007-12-07 13:39 ` Bob Tracy 2007-12-07 14:55 ` Bob Tracy 2 siblings, 0 replies; 40+ messages in thread From: Ingo Molnar @ 2007-12-07 11:37 UTC (permalink / raw) To: Andrew Morton Cc: Bob Tracy, mcree, linux-kernel, rjw, rth, ink, linux-scsi, kay.sievers, greg * Andrew Morton <akpm@linux-foundation.org> wrote: > > then the test of whether I bisected correctly is as simple as > > applying the commit and seeing if things break, because I'm running > > on the kernel corresponding to > > 2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3 right now. Let me give > > that a try and I'll report back. Worst case, I'll have to start > > over and write off the past four days... > > Gad. I trust the second time will be faster. > > git-bisect _is_ very error prone. I find one of the problems is that > each step is so far apart in time that you forget what you were doing. > Did I remember to test that iteration? Did I install the right > kernel? etc. i have a fully automated bootup-hang bisection script. It is based on "git-bisect run". I run the script, it builds and boots kernels fully automatically, and when the bootup fails (the script notices that via the serial log, which it continuously watches - or via a timeout, if the system does not come up within 10 minutes it's a "bad" kernel), the script raises my attention via a beep and i power cycle the test box. (yeah, i should make use of a managed power outlet to 100% automate it) So i dont have to a single manual decision anytime during the bisection. But the scripts are very much tied to my ad-hoc test environment so it would not be of much general use. Ingo ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 10:26 ` Andrew Morton 2007-12-07 11:37 ` Ingo Molnar @ 2007-12-07 13:39 ` Bob Tracy 2007-12-07 14:55 ` Bob Tracy 2 siblings, 0 replies; 40+ messages in thread From: Bob Tracy @ 2007-12-07 13:39 UTC (permalink / raw) To: Andrew Morton Cc: Bob Tracy, mingo, mcree, linux-kernel, rjw, rth, ink, linux-scsi, kay.sievers, greg Andrew Morton wrote: > On Thu, 6 Dec 2007 23:07:08 -0600 (CST) rct@frus.com (Bob Tracy) wrote: > > Andrew Morton wrote: > > > commit 6f37ac793d6ba7b35d338f791974166f67fdd9ba > > > Merge: 2f1f53b... d90bf5a... > > > Author: Linus Torvalds <torvalds@woody.linux-foundation.org> > > > Date: Wed Nov 14 18:51:48 2007 -0800 > > > > > > Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/n > > > > > > * 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6: > > > (omitted for brevity) > > > > > > I'm struggling to see how any of those could have broken block device > > > mounting on alpha. Are you sure you bisected right? > > > > Based on what's in that commit, it *does* appear something went wrong > > with bisection. If the implicated commit is the next one in time > > sequence relative to > > > > # good: [2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3] CRISv10 fasttimer: Scrap INLINE and name timeval_cmp better > > > > then the test of whether I bisected correctly is as simple as applying > > the commit and seeing if things break, because I'm running on the > > kernel corresponding to 2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3 right > > now. Let me give that a try and I'll report back. Worst case, I'll > > have to start over and write off the past four days... > > Gad. I trust the second time will be faster. > > git-bisect _is_ very error prone. I find one of the problems is that each > step is so far apart in time that you forget what you were doing. Did I > remember to test that iteration? Did I install the right kernel? etc. > > > Sorry about this... > > Not appropriate ;) Thanks for helping out. Thanks for the kind words... The above-mentioned test verified that the bisection was/is correct: 2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3 works, and 6f37ac793d6ba7b35d338f791974166f67fdd9ba doesn't. Now I've got to figure out why. "git diff 2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3 6f37ac793d6ba7b35d338f791974166f67fdd9ba" produced a relatively short patch (18,437 bytes). The list of involved files: diff --git a/drivers/char/random.c b/drivers/char/random.c diff --git a/drivers/isdn/sc/card.h b/drivers/isdn/sc/card.h diff --git a/drivers/isdn/sc/packet.c b/drivers/isdn/sc/packet.c diff --git a/drivers/isdn/sc/shmem.c b/drivers/isdn/sc/shmem.c diff --git a/drivers/net/arm/ep93xx_eth.c b/drivers/net/arm/ep93xx_eth.c diff --git a/drivers/net/bonding/bond_main.c b/drivers/net/bonding/bond_main.c diff --git a/drivers/net/e1000/e1000_main.c b/drivers/net/e1000/e1000_main.c diff --git a/drivers/net/fs_enet/Kconfig b/drivers/net/fs_enet/Kconfig diff --git a/drivers/net/fs_enet/Makefile b/drivers/net/fs_enet/Makefile diff --git a/drivers/net/netx-eth.c b/drivers/net/netx-eth.c diff --git a/drivers/net/s2io.c b/drivers/net/s2io.c diff --git a/drivers/net/wireless/iwlwifi/iwl3945-base.c b/drivers/net/wireless/iwlwifi/iwl3945-base.c diff --git a/include/net/sock.h b/include/net/sock.h diff --git a/kernel/sysctl_check.c b/kernel/sysctl_check.c diff --git a/net/core/dev.c b/net/core/dev.c diff --git a/net/ipv4/route.c b/net/ipv4/route.c diff --git a/net/ipv4/tcp_input.c b/net/ipv4/tcp_input.c diff --git a/net/sched/sch_generic.c b/net/sched/sch_generic.c Current state of the source tree is the 6f37ac... version, so I'll start backing out the above diffs in related groups and continue until I've got a working kernel. For lack of an obvious target, I'll start with the seemingly innocuous change to sysctl_check.c. I'll report back when I've got something. -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 10:26 ` Andrew Morton 2007-12-07 11:37 ` Ingo Molnar 2007-12-07 13:39 ` Bob Tracy @ 2007-12-07 14:55 ` Bob Tracy 2007-12-07 15:05 ` Ingo Molnar 2007-12-07 20:43 ` Michael Cree 2 siblings, 2 replies; 40+ messages in thread From: Bob Tracy @ 2007-12-07 14:55 UTC (permalink / raw) To: Andrew Morton Cc: Bob Tracy, mingo, mcree, linux-kernel, rjw, rth, ink, linux-scsi, kay.sievers, greg I wrote: > "git diff 2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3 6f37ac793d6ba7b35d338f791974166f67fdd9ba" > produced a relatively short patch (18,437 bytes). The list of involved > files: > > (omitted) > > Current state of the source tree is the 6f37ac... version, so I'll start > backing out the above diffs in related groups and continue until I've got > a working kernel. For lack of an obvious target, I'll start with the > seemingly innocuous change to sysctl_check.c. I'll report back when I've > got something. That was quick :-). Backing out the sysctl_check.c diff gives me a working kernel. Beats the #$%@! out of me how/why, though. Michael Cree: could you try backing out the diff below from your 2.6.24-rc3 tree and see if things are now working for you? Here's "uname -a", just to confirm (maybe) I'm running on what I say works: Linux smirkin 2.6.24-rc2-g6f37ac79-dirty #2 Fri Dec 7 08:03:12 CST 2007 alpha Here's the diff I backed out (patch -R). It's short... diff --git a/kernel/sysctl_check.c b/kernel/sysctl_check.c index 5a2f2b2..4abc6d2 100644 --- a/kernel/sysctl_check.c +++ b/kernel/sysctl_check.c @@ -738,7 +738,7 @@ static struct trans_ctl_table trans_net_table[] = { { NET_ROSE, "rose", trans_net_rose_table }, { NET_IPV6, "ipv6", trans_net_ipv6_table }, { NET_X25, "x25", trans_net_x25_table }, - { NET_TR, "tr", trans_net_tr_table }, + { NET_TR, "token-ring", trans_net_tr_table }, { NET_DECNET, "decnet", trans_net_decnet_table }, /* NET_ECONET not used */ { NET_SCTP, "sctp", trans_net_sctp_table }, -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply related [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 14:55 ` Bob Tracy @ 2007-12-07 15:05 ` Ingo Molnar 2007-12-07 16:59 ` Bob Tracy 2007-12-07 20:43 ` Michael Cree 1 sibling, 1 reply; 40+ messages in thread From: Ingo Molnar @ 2007-12-07 15:05 UTC (permalink / raw) To: Bob Tracy Cc: Andrew Morton, mcree, linux-kernel, rjw, rth, ink, linux-scsi, kay.sievers, greg * Bob Tracy <rct@frus.com> wrote: > > Current state of the source tree is the 6f37ac... version, so I'll > > start backing out the above diffs in related groups and continue > > until I've got a working kernel. For lack of an obvious target, > > I'll start with the seemingly innocuous change to sysctl_check.c. > > I'll report back when I've got something. > > That was quick :-). Backing out the sysctl_check.c diff gives me a > working kernel. Beats the #$%@! out of me how/why, though. > > Michael Cree: could you try backing out the diff below from your > 2.6.24-rc3 tree and see if things are now working for you? > > Here's "uname -a", just to confirm (maybe) I'm running on what I say > works: > > Linux smirkin 2.6.24-rc2-g6f37ac79-dirty #2 Fri Dec 7 08:03:12 CST 2007 alpha > > Here's the diff I backed out (patch -R). It's short... > > diff --git a/kernel/sysctl_check.c b/kernel/sysctl_check.c > index 5a2f2b2..4abc6d2 100644 > --- a/kernel/sysctl_check.c > +++ b/kernel/sysctl_check.c > @@ -738,7 +738,7 @@ static struct trans_ctl_table trans_net_table[] = { > { NET_ROSE, "rose", trans_net_rose_table }, > { NET_IPV6, "ipv6", trans_net_ipv6_table }, > { NET_X25, "x25", trans_net_x25_table }, > - { NET_TR, "tr", trans_net_tr_table }, > + { NET_TR, "token-ring", trans_net_tr_table }, > { NET_DECNET, "decnet", trans_net_decnet_table }, > /* NET_ECONET not used */ > { NET_SCTP, "sctp", trans_net_sctp_table }, reverting this makes the kernel image shorter by 8 bytes - so perhaps some alignment issue somewhere? Or something gets overflown? Does any of this get actually used by your bootup? Ingo ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 15:05 ` Ingo Molnar @ 2007-12-07 16:59 ` Bob Tracy 2007-12-07 18:06 ` Ingo Molnar 0 siblings, 1 reply; 40+ messages in thread From: Bob Tracy @ 2007-12-07 16:59 UTC (permalink / raw) To: Ingo Molnar Cc: Bob Tracy, Andrew Morton, mcree, linux-kernel, rjw, rth, ink, linux-scsi, kay.sievers, greg Ingo Molnar wrote: > > * Bob Tracy <rct@frus.com> wrote: > > > > Current state of the source tree is the 6f37ac... version, so I'll > > > start backing out the above diffs in related groups and continue > > > until I've got a working kernel. For lack of an obvious target, > > > I'll start with the seemingly innocuous change to sysctl_check.c. > > > I'll report back when I've got something. > > > > That was quick :-). Backing out the sysctl_check.c diff gives me a > > working kernel. Beats the #$%@! out of me how/why, though. > > > > Michael Cree: could you try backing out the diff below from your > > 2.6.24-rc3 tree and see if things are now working for you? > > > > Here's "uname -a", just to confirm (maybe) I'm running on what I say > > works: > > > > Linux smirkin 2.6.24-rc2-g6f37ac79-dirty #2 Fri Dec 7 08:03:12 CST 2007 alpha > > > > Here's the diff I backed out (patch -R). It's short... > > > > diff --git a/kernel/sysctl_check.c b/kernel/sysctl_check.c > > index 5a2f2b2..4abc6d2 100644 > > --- a/kernel/sysctl_check.c > > +++ b/kernel/sysctl_check.c > > @@ -738,7 +738,7 @@ static struct trans_ctl_table trans_net_table[] = { > > { NET_ROSE, "rose", trans_net_rose_table }, > > { NET_IPV6, "ipv6", trans_net_ipv6_table }, > > { NET_X25, "x25", trans_net_x25_table }, > > - { NET_TR, "tr", trans_net_tr_table }, > > + { NET_TR, "token-ring", trans_net_tr_table }, > > { NET_DECNET, "decnet", trans_net_decnet_table }, > > /* NET_ECONET not used */ > > { NET_SCTP, "sctp", trans_net_sctp_table }, > > reverting this makes the kernel image shorter by 8 bytes - so perhaps > some alignment issue somewhere? Or something gets overflown? Does any of > this get actually used by your bootup? Dunno... The dmesg output is not terribly useful here, because most of the "interesting" stuff concerning udev startup that appears on the console never makes it into a log. Note that, for the bad cases, I don't see the same console output that Michael reported, although the net effect is the same: the partitions don't get found, so I'm offered the chance to enter my root password and do some poking around, and when I do, none of the block devices are present under /dev. I'm open to suggestions on how to take this analysis further. Michael indicated he's running a conference this week, so I don't know when he'll be able to come up for air. -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 16:59 ` Bob Tracy @ 2007-12-07 18:06 ` Ingo Molnar 2007-12-07 18:19 ` Kay Sievers 0 siblings, 1 reply; 40+ messages in thread From: Ingo Molnar @ 2007-12-07 18:06 UTC (permalink / raw) To: Bob Tracy Cc: Andrew Morton, mcree, linux-kernel, rjw, rth, ink, linux-scsi, kay.sievers, greg * Bob Tracy <rct@frus.com> wrote: > Ingo Molnar wrote: > > > > * Bob Tracy <rct@frus.com> wrote: > > > > > > Current state of the source tree is the 6f37ac... version, so I'll > > > > start backing out the above diffs in related groups and continue > > > > until I've got a working kernel. For lack of an obvious target, > > > > I'll start with the seemingly innocuous change to sysctl_check.c. > > > > I'll report back when I've got something. > > > > > > That was quick :-). Backing out the sysctl_check.c diff gives me a > > > working kernel. Beats the #$%@! out of me how/why, though. > > > > > > Michael Cree: could you try backing out the diff below from your > > > 2.6.24-rc3 tree and see if things are now working for you? > > > > > > Here's "uname -a", just to confirm (maybe) I'm running on what I say > > > works: > > > > > > Linux smirkin 2.6.24-rc2-g6f37ac79-dirty #2 Fri Dec 7 08:03:12 CST 2007 alpha > > > > > > Here's the diff I backed out (patch -R). It's short... > > > > > > diff --git a/kernel/sysctl_check.c b/kernel/sysctl_check.c > > > index 5a2f2b2..4abc6d2 100644 > > > --- a/kernel/sysctl_check.c > > > +++ b/kernel/sysctl_check.c > > > @@ -738,7 +738,7 @@ static struct trans_ctl_table trans_net_table[] = { > > > { NET_ROSE, "rose", trans_net_rose_table }, > > > { NET_IPV6, "ipv6", trans_net_ipv6_table }, > > > { NET_X25, "x25", trans_net_x25_table }, > > > - { NET_TR, "tr", trans_net_tr_table }, > > > + { NET_TR, "token-ring", trans_net_tr_table }, > > > { NET_DECNET, "decnet", trans_net_decnet_table }, > > > /* NET_ECONET not used */ > > > { NET_SCTP, "sctp", trans_net_sctp_table }, > > > > reverting this makes the kernel image shorter by 8 bytes - so > > perhaps some alignment issue somewhere? Or something gets overflown? > > Does any of this get actually used by your bootup? > > Dunno... The dmesg output is not terribly useful here, because most > of the "interesting" stuff concerning udev startup that appears on the > console never makes it into a log. Note that, for the bad cases, I > don't see the same console output that Michael reported, although the > net effect is the same: the partitions don't get found, so I'm offered > the chance to enter my root password and do some poking around, and > when I do, none of the block devices are present under /dev. > > I'm open to suggestions on how to take this analysis further. Michael > indicated he's running a conference this week, so I don't know when > he'll be able to come up for air. i'm not sure how to do direct debugging on udev, so i can only guess about what effect on the kernel side could have caused this. One bad hack would be to "probe" udevd's behavior by changing the NET_TR entry in various ways: "tr" -> "token-ring" # breaks "tr" -> "tr" # works "tr" -> "token-rin0" # ? (1) "tr" -> "TR" # ? (2) the question is, does tweak (1) and tweak (2) work or break? but it would be a lot more effective i guess to get some udevd expert's attention on this ... Ingo ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 18:06 ` Ingo Molnar @ 2007-12-07 18:19 ` Kay Sievers 2007-12-07 19:36 ` Bob Tracy 0 siblings, 1 reply; 40+ messages in thread From: Kay Sievers @ 2007-12-07 18:19 UTC (permalink / raw) To: Ingo Molnar Cc: Bob Tracy, Andrew Morton, mcree, linux-kernel, rjw, rth, ink, linux-scsi, greg On Fri, 2007-12-07 at 19:06 +0100, Ingo Molnar wrote: > * Bob Tracy <rct@frus.com> wrote: > > > Ingo Molnar wrote: > > > > > > * Bob Tracy <rct@frus.com> wrote: > > > > > > > > Current state of the source tree is the 6f37ac... version, so I'll > > > > > start backing out the above diffs in related groups and continue > > > > > until I've got a working kernel. For lack of an obvious target, > > > > > I'll start with the seemingly innocuous change to sysctl_check.c. > > > > > I'll report back when I've got something. > > > > > > > > That was quick :-). Backing out the sysctl_check.c diff gives me a > > > > working kernel. Beats the #$%@! out of me how/why, though. > > > > > > > > Michael Cree: could you try backing out the diff below from your > > > > 2.6.24-rc3 tree and see if things are now working for you? > > > > > > > > Here's "uname -a", just to confirm (maybe) I'm running on what I say > > > > works: > > > > > > > > Linux smirkin 2.6.24-rc2-g6f37ac79-dirty #2 Fri Dec 7 08:03:12 CST 2007 alpha > > > > > > > > Here's the diff I backed out (patch -R). It's short... > > > > > > > > diff --git a/kernel/sysctl_check.c b/kernel/sysctl_check.c > > > > index 5a2f2b2..4abc6d2 100644 > > > > --- a/kernel/sysctl_check.c > > > > +++ b/kernel/sysctl_check.c > > > > @@ -738,7 +738,7 @@ static struct trans_ctl_table trans_net_table[] = { > > > > { NET_ROSE, "rose", trans_net_rose_table }, > > > > { NET_IPV6, "ipv6", trans_net_ipv6_table }, > > > > { NET_X25, "x25", trans_net_x25_table }, > > > > - { NET_TR, "tr", trans_net_tr_table }, > > > > + { NET_TR, "token-ring", trans_net_tr_table }, > > > > { NET_DECNET, "decnet", trans_net_decnet_table }, > > > > /* NET_ECONET not used */ > > > > { NET_SCTP, "sctp", trans_net_sctp_table }, > > > > > > reverting this makes the kernel image shorter by 8 bytes - so > > > perhaps some alignment issue somewhere? Or something gets overflown? > > > Does any of this get actually used by your bootup? > > > > Dunno... The dmesg output is not terribly useful here, because most > > of the "interesting" stuff concerning udev startup that appears on the > > console never makes it into a log. Note that, for the bad cases, I > > don't see the same console output that Michael reported, although the > > net effect is the same: the partitions don't get found, so I'm offered > > the chance to enter my root password and do some poking around, and > > when I do, none of the block devices are present under /dev. > > > > I'm open to suggestions on how to take this analysis further. Michael > > indicated he's running a conference this week, so I don't know when > > he'll be able to come up for air. > > i'm not sure how to do direct debugging on udev, so i can only guess > about what effect on the kernel side could have caused this. One bad > hack would be to "probe" udevd's behavior by changing the NET_TR entry > in various ways: > > "tr" -> "token-ring" # breaks > "tr" -> "tr" # works > "tr" -> "token-rin0" # ? (1) > "tr" -> "TR" # ? (2) > > the question is, does tweak (1) and tweak (2) work or break? > > but it would be a lot more effective i guess to get some udevd expert's > attention on this ... Could we get the output of: ls -l /sys/block/sda/ and: grep . /sys/block/sda/*/dev ? Kay ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 18:19 ` Kay Sievers @ 2007-12-07 19:36 ` Bob Tracy 0 siblings, 0 replies; 40+ messages in thread From: Bob Tracy @ 2007-12-07 19:36 UTC (permalink / raw) To: Kay Sievers Cc: Ingo Molnar, Bob Tracy, Andrew Morton, mcree, linux-kernel, rjw, rth, ink, linux-scsi, greg Kay Sievers wrote: > On Fri, 2007-12-07 at 19:06 +0100, Ingo Molnar wrote: > > i'm not sure how to do direct debugging on udev, so i can only guess > > about what effect on the kernel side could have caused this. One bad > > hack would be to "probe" udevd's behavior by changing the NET_TR entry > > in various ways: > > > > "tr" -> "token-ring" # breaks > > "tr" -> "tr" # works > > "tr" -> "token-rin0" # ? (1) > > "tr" -> "TR" # ? (2) > > > > the question is, does tweak (1) and tweak (2) work or break? > > > > but it would be a lot more effective i guess to get some udevd expert's > > attention on this ... > > Could we get the output of: > ls -l /sys/block/sda/ > and: > grep . /sys/block/sda/*/dev > ? Here are the requested items for the 2.6.24-rc2-g6f37ac79-dirty kernel (the working one with the sysctl_check.c patch reverted): smirkin:/# ls -l /sys/block/sda total 0 -r--r--r-- 1 root root 8192 Dec 7 08:36 capability -r--r--r-- 1 root root 8192 Dec 7 08:36 dev lrwxrwxrwx 1 root root 0 Dec 7 08:36 device -> ../../devices/pci0000:00/0000:00:14.0/0000:01:09.0/host0/target0:0:0/0:0:0:0 drwxr-xr-x 2 root root 0 Dec 7 08:36 holders drwxr-xr-x 3 root root 0 Dec 7 08:36 queue -r--r--r-- 1 root root 8192 Dec 7 08:36 range -r--r--r-- 1 root root 8192 Dec 7 08:36 removable drwxr-xr-x 3 root root 0 Dec 7 08:36 sda1 drwxr-xr-x 3 root root 0 Dec 7 08:36 sda2 drwxr-xr-x 3 root root 0 Dec 7 08:36 sda3 drwxr-xr-x 3 root root 0 Dec 7 08:36 sda4 drwxr-xr-x 3 root root 0 Dec 7 08:36 sda5 drwxr-xr-x 3 root root 0 Dec 7 08:36 sda6 drwxr-xr-x 3 root root 0 Dec 7 08:36 sda7 -r--r--r-- 1 root root 8192 Dec 7 08:36 size drwxr-xr-x 2 root root 0 Dec 7 08:36 slaves -r--r--r-- 1 root root 8192 Dec 7 08:36 stat lrwxrwxrwx 1 root root 0 Dec 7 08:36 subsystem -> ../../block --w------- 1 root root 8192 Dec 7 08:36 uevent smirkin:/# grep . /sys/block/sda/*/dev /sys/block/sda/sda1/dev:8:1 /sys/block/sda/sda2/dev:8:2 /sys/block/sda/sda3/dev:8:3 /sys/block/sda/sda4/dev:8:4 /sys/block/sda/sda5/dev:8:5 /sys/block/sda/sda6/dev:8:6 /sys/block/sda/sda7/dev:8:7 Assuming /sys/block even exists for the non-working case, I'll forward that info in a few hours when I can get home to reboot the machine. -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 14:55 ` Bob Tracy 2007-12-07 15:05 ` Ingo Molnar @ 2007-12-07 20:43 ` Michael Cree 2007-12-07 21:19 ` Kay Sievers 1 sibling, 1 reply; 40+ messages in thread From: Michael Cree @ 2007-12-07 20:43 UTC (permalink / raw) To: Bob Tracy Cc: Andrew Morton, mingo, linux-kernel, rjw, rth, ink, linux-scsi, kay.sievers, greg Bob Tracy wrote: > That was quick :-). Backing out the sysctl_check.c diff gives me a > working kernel. Beats the #$%@! out of me how/why, though. > > Michael Cree: could you try backing out the diff below from your > 2.6.24-rc3 tree and see if things are now working for you? Yes (conference is now over). I backed out the sysctl_check patch from 2.6.24-rc3 and, indeed, got a working kernel. The working kernel (was probably 2.6.24-rc3 less sysctl_check patch, but might have been a 2.6.23 variant) has the following in /sys/block alpha:~# ls -l /sys/block/ total 0 drwxr-xr-x 5 root root 0 2007-12-08 08:55 fd0 drwxr-xr-x 6 root root 0 2007-12-08 08:55 hde drwxr-xr-x 5 root root 0 2007-12-08 08:55 hdf drwxr-xr-x 10 root root 0 2007-12-08 08:55 sda drwxr-xr-x 9 root root 0 2007-12-08 08:55 sdb alpha:~# ls -l /sys/block/sda total 0 -r--r--r-- 1 root root 8192 2007-12-08 08:55 capability -r--r--r-- 1 root root 8192 2007-12-08 08:55 dev lrwxrwxrwx 1 root root 0 2007-12-08 08:55 device -> ../../devices/pci0001:01/0001:01:06.0/host0/target0:0:1/0:0:1:0 drwxr-xr-x 2 root root 0 2007-12-08 08:55 holders drwxr-xr-x 3 root root 0 2007-12-08 08:55 queue -r--r--r-- 1 root root 8192 2007-12-08 08:55 range -r--r--r-- 1 root root 8192 2007-12-08 08:55 removable drwxr-xr-x 3 root root 0 2007-12-08 08:55 sda1 drwxr-xr-x 3 root root 0 2007-12-08 08:55 sda2 drwxr-xr-x 3 root root 0 2007-12-08 08:55 sda3 drwxr-xr-x 3 root root 0 2007-12-08 08:55 sda4 drwxr-xr-x 3 root root 0 2007-12-08 08:55 sda5 -r--r--r-- 1 root root 8192 2007-12-08 08:55 size drwxr-xr-x 2 root root 0 2007-12-08 08:55 slaves -r--r--r-- 1 root root 8192 2007-12-08 08:55 stat lrwxrwxrwx 1 root root 0 2007-12-08 08:55 subsystem -> ../../block --w------- 1 root root 8192 2007-12-08 08:55 uevent alpha:~# grep . /sys/block/sda/*/dev /sys/block/sda/sda1/dev:8:1 /sys/block/sda/sda2/dev:8:2 /sys/block/sda/sda3/dev:8:3 /sys/block/sda/sda4/dev:8:4 /sys/block/sda/sda5/dev:8:5 The broken kernel (2.6.24-rc3) has the following in /sys/block alpha:~# ls -l /sys/block/ total 0 drwxr-xr-x 5 root root 0 Dec 8 09:22 fd0 drwxr-xr-x 6 root root 0 Dec 8 09:22 hde drwxr-xr-x 5 root root 0 Dec 8 09:23 hdf drwxr-xr-x 10 root root 0 Dec 8 09:22 sda drwxr-xr-x 9 root root 0 Dec 8 09:23 sdb alpha:~# ls -l /sys/block/sda total 0 -r--r--r-- 1 root root 8192 Dec 8 09:22 capability -r--r--r-- 1 root root 8192 Dec 8 09:22 dev lrwxrwxrwx 1 root root 0 Dec 8 09:23 device -> ../../devices/pci0001:01/0001:01:06.0/host0/target0:0:1/0:0:1:0 drwxr-xr-x 2 root root 0 Dec 8 09:22 holders drwxr-xr-x 3 root root 0 Dec 8 09:22 queue -r--r--r-- 1 root root 8192 Dec 8 09:22 range -r--r--r-- 1 root root 8192 Dec 8 09:22 removable drwxr-xr-x 3 root root 0 Dec 8 09:22 sda1 drwxr-xr-x 3 root root 0 Dec 8 09:22 sda2 drwxr-xr-x 3 root root 0 Dec 8 09:22 sda3 drwxr-xr-x 3 root root 0 Dec 8 09:22 sda4 drwxr-xr-x 3 root root 0 Dec 8 09:22 sda5 -r--r--r-- 1 root root 8192 Dec 8 09:22 size drwxr-xr-x 2 root root 0 Dec 8 09:22 slaves -r--r--r-- 1 root root 8192 Dec 8 09:22 stat lrwxrwxrwx 1 root root 0 Dec 8 09:22 subsystem -> ../../block --w------- 1 root root 8192 Dec 8 09:22 uevent alpha:~# grep . /sys/block/sda/*/dev /sys/block/sda/sda1/dev:8:1 /sys/block/sda/sda2/dev:8:2 /sys/block/sda/sda3/dev:8:3 /sys/block/sda/sda4/dev:8:4 /sys/block/sda/sda5/dev:8:5 I failed to spot any difference (other than the formatting of the dates) between the two! Cheers Michael. ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 20:43 ` Michael Cree @ 2007-12-07 21:19 ` Kay Sievers 2007-12-07 22:39 ` Bob Tracy ` (2 more replies) 0 siblings, 3 replies; 40+ messages in thread From: Kay Sievers @ 2007-12-07 21:19 UTC (permalink / raw) To: Michael Cree Cc: Bob Tracy, Andrew Morton, mingo, linux-kernel, rjw, rth, ink, linux-scsi, greg On Sat, 2007-12-08 at 09:43 +1300, Michael Cree wrote: > Bob Tracy wrote: > > That was quick :-). Backing out the sysctl_check.c diff gives me a > > working kernel. Beats the #$%@! out of me how/why, though. > > > > Michael Cree: could you try backing out the diff below from your > > 2.6.24-rc3 tree and see if things are now working for you? > > Yes (conference is now over). I backed out the sysctl_check patch from > 2.6.24-rc3 and, indeed, got a working kernel. > > The working kernel (was probably 2.6.24-rc3 less sysctl_check patch, but > might have been a 2.6.23 variant) has the following in /sys/block > alpha:~# grep . /sys/block/sda/*/dev > /sys/block/sda/sda1/dev:8:1 > /sys/block/sda/sda2/dev:8:2 > /sys/block/sda/sda3/dev:8:3 > /sys/block/sda/sda4/dev:8:4 > /sys/block/sda/sda5/dev:8:5 > > I failed to spot any difference (other than the formatting of the dates) > between the two! Yeah, that looks all fine. What distro is that, and what's the udev version? You are booting your kernel with an initramfs? Is the udev daemon (still) running while it fails? If you run /sbin/udevtrigger, do the nodes appear? Kay ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 21:19 ` Kay Sievers @ 2007-12-07 22:39 ` Bob Tracy 2007-12-08 4:53 ` Bob Tracy 2007-12-08 5:05 ` Bob Tracy 2 siblings, 0 replies; 40+ messages in thread From: Bob Tracy @ 2007-12-07 22:39 UTC (permalink / raw) To: Kay Sievers Cc: Michael Cree, Bob Tracy, Andrew Morton, mingo, linux-kernel, rjw, rth, ink, linux-scsi, greg Kay Sievers wrote: > Yeah, that looks all fine. > > What distro is that, and what's the udev version? Mine is Debian Etch, normally with the latest released or -rcX kernel from kernel.org. Updates current as of about 18 hours ago. Udev package version is 0.105-4. The RELEASE-NOTES file in /usr/share/doc/udev says "udev 105". > You are booting your kernel with an initramfs? Not in my case: everything I need at boot time is built-in. > Is the udev daemon (still) running while it fails? > > If you run /sbin/udevtrigger, do the nodes appear? I can answer the above later when I'm back in front of the machine, but even in the "not good" case, I still see the following messages from the /etc/rcS.d/S03udev file: Starting the hotplug events dispatcher udevd. Synthesizing the initial hotplug events. This is where udevtrigger gets called, followed by the load_input_modules and create_dev_makedev functions, then... Waiting for /dev to be fully populated. which is where udevsettle gets called. None of the above appear to be exiting abnormally for the bad case, but I'll definitely take a closer look at what MAKEDEV (/dev/MAKEDEV --> /sbin/MAKEDEV) is doing. In particular, Debian MAKEDEV is looking at /proc/devices to decide what to do, so maybe "cat /proc/devices" would be useful to look at for the broken case. -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 21:19 ` Kay Sievers 2007-12-07 22:39 ` Bob Tracy @ 2007-12-08 4:53 ` Bob Tracy 2007-12-08 5:05 ` Bob Tracy 2 siblings, 0 replies; 40+ messages in thread From: Bob Tracy @ 2007-12-08 4:53 UTC (permalink / raw) To: Kay Sievers Cc: Michael Cree, Bob Tracy, Andrew Morton, mingo, linux-kernel, rjw, rth, ink, linux-scsi, greg Kay Sievers wrote: > Is the udev daemon (still) running while it fails? Yes. > If you run /sbin/udevtrigger, do the nodes appear? No. Exit status is 0, and there are no errors. Everything looks fine under /sys/block, and there doesn't seem to be a problem with /proc/devices either. -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 21:19 ` Kay Sievers 2007-12-07 22:39 ` Bob Tracy 2007-12-08 4:53 ` Bob Tracy @ 2007-12-08 5:05 ` Bob Tracy 2007-12-08 15:48 ` Kay Sievers 2 siblings, 1 reply; 40+ messages in thread From: Bob Tracy @ 2007-12-08 5:05 UTC (permalink / raw) To: Kay Sievers Cc: Michael Cree, Bob Tracy, Andrew Morton, mingo, linux-kernel, rjw, rth, ink, linux-scsi, greg Kay Sievers wrote: > Is the udev daemon (still) running while it fails? Yes, and there's something else I forgot to mention that may be significant... For the bad case, in addition to udevd, "ps -ef" shows a "sh -e /lib/udev/net.agent" running with a PPID of 1. This process doesn't exit until I reboot. If this is normal under the circumstances, please disregard. -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-08 5:05 ` Bob Tracy @ 2007-12-08 15:48 ` Kay Sievers 2007-12-09 0:51 ` Michael Cree 2007-12-10 15:05 ` Bob Tracy 0 siblings, 2 replies; 40+ messages in thread From: Kay Sievers @ 2007-12-08 15:48 UTC (permalink / raw) To: Bob Tracy Cc: Michael Cree, Andrew Morton, mingo, linux-kernel, rjw, rth, ink, linux-scsi, greg On Fri, 2007-12-07 at 23:05 -0600, Bob Tracy wrote: > Kay Sievers wrote: > > Is the udev daemon (still) running while it fails? > > Yes, and there's something else I forgot to mention that may be > significant... For the bad case, in addition to udevd, "ps -ef" > shows a "sh -e /lib/udev/net.agent" running with a PPID of 1. This > process doesn't exit until I reboot. If this is normal under the > circumstances, please disregard. Does SysRq-T show where it hangs? Kay ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-08 15:48 ` Kay Sievers @ 2007-12-09 0:51 ` Michael Cree 2007-12-09 4:19 ` Bob Tracy 2007-12-10 15:05 ` Bob Tracy 1 sibling, 1 reply; 40+ messages in thread From: Michael Cree @ 2007-12-09 0:51 UTC (permalink / raw) To: Kay Sievers Cc: Bob Tracy, Andrew Morton, mingo, linux-kernel, rjw, rth, ink, linux-scsi, greg Kay Sievers wrote: > On Fri, 2007-12-07 at 23:05 -0600, Bob Tracy wrote: >> Kay Sievers wrote: >>> Is the udev daemon (still) running while it fails? >> Yes, and there's something else I forgot to mention that may be >> significant... For the bad case, in addition to udevd, "ps -ef" >> shows a "sh -e /lib/udev/net.agent" running with a PPID of 1. This >> process doesn't exit until I reboot. If this is normal under the >> circumstances, please disregard. > > Does SysRq-T show where it hangs? Ummm... No. I didn't have the CONFIG_MAGIC_SYSRQ flag set, so I set it, and recompiled the kernel. Guess what - now the system comes up normally without any problem. The block devices appear in /dev. To recap: without CONFIG_MAGIC_SYSRQ on the 2.6.24-rc3 kernel the missing block devices error in /dev occurs and the init scripts fall over on startup, and with CONFIG_MAGIC_SYSRQ the system comes up normally. To answer the earlier questions about distro, and udev version, my system is similar to Bob's, except that I am running Debian testing/lenny which comes with udev version 114 (dpkg reports udev version 0.114-2). I am running an EV67 variant CPU. I do not run an initramfs - I have the necessary drivers for the various discs compiled into the kernel and use the root kernel option to point to the required root partition. When running the broken kernel udev is running (according to 'ps') and executing /sbin/udevtrigger manually generates a number of errors of the form: scsi_id[<pid>]: scsi_id: unable to access '/block' The missing /dev/* entries do not appear. Cheerz Michael. ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-09 0:51 ` Michael Cree @ 2007-12-09 4:19 ` Bob Tracy 2007-12-09 18:07 ` Ivan Kokshaysky 0 siblings, 1 reply; 40+ messages in thread From: Bob Tracy @ 2007-12-09 4:19 UTC (permalink / raw) To: Michael Cree Cc: Kay Sievers, Bob Tracy, Andrew Morton, mingo, linux-kernel, rjw, rth, ink, linux-scsi, greg Michael Cree wrote: > Kay Sievers wrote: > > On Fri, 2007-12-07 at 23:05 -0600, Bob Tracy wrote: > >> Kay Sievers wrote: > >>> Is the udev daemon (still) running while it fails? > >> Yes, and there's something else I forgot to mention that may be > >> significant... For the bad case, in addition to udevd, "ps -ef" > >> shows a "sh -e /lib/udev/net.agent" running with a PPID of 1. This > >> process doesn't exit until I reboot. If this is normal under the > >> circumstances, please disregard. > > > > Does SysRq-T show where it hangs? > > Ummm... No. I didn't have the CONFIG_MAGIC_SYSRQ flag set, so I set it, > and recompiled the kernel. Guess what - now the system comes up > normally without any problem. The block devices appear in /dev. To > recap: without CONFIG_MAGIC_SYSRQ on the 2.6.24-rc3 kernel the missing > block devices error in /dev occurs and the init scripts fall over on > startup, and with CONFIG_MAGIC_SYSRQ the system comes up normally. I *do* have CONFIG_MAGIC_SYSRQ set. Anyone care to bet whether my machine starts working again if I disable it? Sheesh... The "kernel alignment issue" theory is making sense... We change the size of an initialized variable with the patch, and the problem shows up. We shift starting addresses a different way by tweaking kernel options, and two wrongs make a right? I've seen it happen, and tracking this down isn't going to be easy. Anyone want to wade through the different System.map files and hazard a guess where we're leaving the rails? Here's a very brief diff excerpt between the System.map files corresponding to "sysctl_check patch reverted" (the -dirty version) and "with sysctl_check patch". At least they agree up to line 10870 :-) ... --- /boot/System.map-2.6.24-rc2-g6f37ac79-dirty 2007-12-07 08:03:50.000000000 -0 600 +++ System.map 2007-12-07 13:43:37.000000000 -0600 @@ -10868,9414 +10868,9414 @@ fffffc0000684b00 R kallsyms_markers fffffc0000684d00 R kallsyms_token_table fffffc0000685100 R kallsyms_token_index -fffffc00006f61e0 r __pci_fixup_PCI_VENDOR_ID_SERVERWORKSPCI_DEVICE_ID_SERVERWORKS_CSB5IDEquirk_svwks_csb5ide -fffffc00006f61e0 R __start_pci_fixups_early -fffffc00006f61f0 r __pci_fixup_PCI_VENDOR_ID_INTELPCI_DEVICE_ID_INTEL_82801CA_10quirk_ide_samemode (...) -fffffc0000716120 r __param_bic_scale -fffffc0000716148 r __param_tcp_friendliness -fffffc0000716170 R __end_rodata -fffffc0000716170 R __stop___param +fffffc00006f61f0 r __pci_fixup_PCI_VENDOR_ID_SERVERWORKSPCI_DEVICE_ID_SERVERWORKS_CSB5IDEquirk_svwks_csb5ide +fffffc00006f61f0 R __start_pci_fixups_early +fffffc00006f6200 r __pci_fixup_PCI_VENDOR_ID_INTELPCI_DEVICE_ID_INTEL_82801CA_10quirk_ide_samemode (...) +fffffc0000716130 r __param_bic_scale +fffffc0000716158 r __param_tcp_friendliness +fffffc0000716180 R __end_rodata +fffffc0000716180 R __stop___param fffffc0000718000 A __init_begin fffffc0000718000 T _sinittext fffffc0000718000 t set_reset_devices > When running the broken kernel udev is running (according to 'ps') and > executing /sbin/udevtrigger manually generates a number of errors of the > form: > > scsi_id[<pid>]: scsi_id: unable to access '/block' > > The missing /dev/* entries do not appear. I don't get the errors that Michael is seeing, and udevtrigger seems to be exiting without errors (return code 0). The last part is the same: the missing /dev/* entries do not appear. --Bob T. ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-09 4:19 ` Bob Tracy @ 2007-12-09 18:07 ` Ivan Kokshaysky 2007-12-10 15:08 ` Bob Tracy 0 siblings, 1 reply; 40+ messages in thread From: Ivan Kokshaysky @ 2007-12-09 18:07 UTC (permalink / raw) To: Bob Tracy Cc: Michael Cree, Kay Sievers, Andrew Morton, mingo, linux-kernel, rjw, rth, linux-scsi, greg On Sat, Dec 08, 2007 at 10:19:39PM -0600, Bob Tracy wrote: > I *do* have CONFIG_MAGIC_SYSRQ set. Anyone care to bet whether my > machine starts working again if I disable it? Sheesh... Incredible... Toggling CONFIG_MAGIC_SYSRQ works for me too, so I'm finally able to reproduce the problem (which is the main positive result so far ;-) There are lots of possible reasons why this happens, but at the moment I honestly have no idea. For now I have reassigned the bug #9457 to myself and will gradually hack into udev... Ivan. ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-09 18:07 ` Ivan Kokshaysky @ 2007-12-10 15:08 ` Bob Tracy 2007-12-10 23:12 ` Ivan Kokshaysky 0 siblings, 1 reply; 40+ messages in thread From: Bob Tracy @ 2007-12-10 15:08 UTC (permalink / raw) To: Ivan Kokshaysky Cc: Bob Tracy, Michael Cree, Kay Sievers, Andrew Morton, mingo, linux-kernel, rjw, rth, linux-scsi, greg Ivan Kokshaysky wrote: > On Sat, Dec 08, 2007 at 10:19:39PM -0600, Bob Tracy wrote: > > I *do* have CONFIG_MAGIC_SYSRQ set. Anyone care to bet whether my > > machine starts working again if I disable it? Sheesh... > > Incredible... > > Toggling CONFIG_MAGIC_SYSRQ works for me too, so I'm finally able > to reproduce the problem (which is the main positive result so far ;-) > > There are lots of possible reasons why this happens, but at the > moment I honestly have no idea. > For now I have reassigned the bug #9457 to myself and will gradually hack > into udev... Thanks... Let me know if there's anything useful I can do to help. --Bob T. ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-10 15:08 ` Bob Tracy @ 2007-12-10 23:12 ` Ivan Kokshaysky 0 siblings, 0 replies; 40+ messages in thread From: Ivan Kokshaysky @ 2007-12-10 23:12 UTC (permalink / raw) To: Bob Tracy Cc: Michael Cree, Kay Sievers, Andrew Morton, mingo, linux-kernel, rjw, rth, linux-scsi, greg On Mon, Dec 10, 2007 at 09:08:53AM -0600, Bob Tracy wrote: > Ivan Kokshaysky wrote: > > For now I have reassigned the bug #9457 to myself and will gradually hack > > into udev... > > Thanks... Let me know if there's anything useful I can do to help. It turns out to be yet another strncpy() bug that indeed shows up only with certain src/dst alignments and breaks kobject_get_path(). Ugh... Hopefully I'll have a patch tomorrow. Ivan. ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-08 15:48 ` Kay Sievers 2007-12-09 0:51 ` Michael Cree @ 2007-12-10 15:05 ` Bob Tracy 1 sibling, 0 replies; 40+ messages in thread From: Bob Tracy @ 2007-12-10 15:05 UTC (permalink / raw) To: Kay Sievers Cc: Bob Tracy, Michael Cree, Andrew Morton, mingo, linux-kernel, rjw, rth, ink, linux-scsi, greg Kay Sievers wrote: > On Fri, 2007-12-07 at 23:05 -0600, Bob Tracy wrote: > > Kay Sievers wrote: > > > Is the udev daemon (still) running while it fails? > > > > Yes, and there's something else I forgot to mention that may be > > significant... For the bad case, in addition to udevd, "ps -ef" > > shows a "sh -e /lib/udev/net.agent" running with a PPID of 1. This > > process doesn't exit until I reboot. If this is normal under the > > circumstances, please disregard. > > Does SysRq-T show where it hangs? A quick comparison of the trace sections for udevd and net.agent indicates those traces are identical: none of the function names in the traces appear to be what you might be looking for, i.e., the processes appear to have been waiting for an event of some kind, and woke up long enough to process the SysRq-T keyboard interrupt and the corresponding action. Hmm... Ok... The state information itself is probably more useful in this context. Here's the info for net.agent: net.agent S fffffc000032c37c 0 895 1 fffffc0000743b10 0000000000000010 fffffc00004f3b5c fffffc00007601a8 0000000000000001 0000000000000074 fffffc0000747758 fffffc00230f0000 0000000000000007 0000000000000007 fffffc00004f390c 0000000000000010 fffffc00004e9eb8 fffffc00230f0000 0000000000000014 fffffc0023085140 0000000000000001 0000000000000014 fffffc00001de000 0000000000000001 fffffc000055dcfc fffffc0023085140 fffffc00232788c0 0000000000000001 Addresses of presumed interest from System.map: fffffc000032c000 t do_wait fffffc0000740000 D init_thread_union fffffc00004f3b40 t sysrq_handle_showstate fffffc00007601a8 d sysrq_showstate_op fffffc0000747758 D console_printk fffffc002... not in System.map (module?) -- I'll track this down later if needed. fffffc00004f3850 T __handle_sysrq fffffc00004e9850 t kbd_event fffffc00001... not in System.map (?? begins with fffffc0000300000 A swapper_pg_dir) fffffc000055dc30 t input_pass_event --Bob T. ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 5:07 ` Bob Tracy 2007-12-07 10:26 ` Andrew Morton @ 2007-12-07 11:40 ` Ingo Molnar 1 sibling, 0 replies; 40+ messages in thread From: Ingo Molnar @ 2007-12-07 11:40 UTC (permalink / raw) To: Bob Tracy Cc: Andrew Morton, mcree, linux-kernel, rjw, rth, ink, linux-scsi, kay.sievers, greg * Bob Tracy <rct@frus.com> wrote: > > I'm struggling to see how any of those could have broken block > > device mounting on alpha. Are you sure you bisected right? > > Based on what's in that commit, it *does* appear something went wrong > with bisection. If the implicated commit is the next one in time > sequence relative to > > # good: [2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3] CRISv10 fasttimer: Scrap INLINE and name timeval_cmp better > > then the test of whether I bisected correctly is as simple as applying > the commit and seeing if things break, because I'm running on the > kernel corresponding to 2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3 right > now. Let me give that a try and I'll report back. Worst case, I'll > have to start over and write off the past four days... generally it's easier to just go "back in time" and re-try the last known "good" and last-known "bad" commit IDs to establish that they are indeed correctly identified. if they are not then step back one more in the bisection log. No need to spend another 4 days on this, if most of the bisection is OK. You can replay a corrected git bisection log quickly, by doing: git-bisect reset git-bisect < bisect.log Ingo ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 0:33 ` Andrew Morton 2007-12-07 5:07 ` Bob Tracy @ 2007-12-07 5:42 ` Bob Tracy 2007-12-07 9:33 ` Ingo Molnar 2 siblings, 0 replies; 40+ messages in thread From: Bob Tracy @ 2007-12-07 5:42 UTC (permalink / raw) To: Andrew Morton Cc: Bob Tracy, mingo, mcree, linux-kernel, rjw, rth, ink, linux-scsi, kay.sievers, greg I wrote: > If the implicated commit is the next one in time > sequence relative to > > # good: [2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3] CRISv10 fasttimer: Scrap INLINE and name timeval_cmp better > > then the test of whether I bisected correctly is as simple as applying > the commit and seeing if things break, because I'm running on the > kernel corresponding to 2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3 right > now. Let me give that a try and I'll report back. Verified that 6f37ac793d6ba7b35d338f791974166f67fdd9ba is the next commit after the "good" kernel I'm running now. The build is running, and I should have an answer for us in a few hours. -- ------------------------------------------------------------------------ Bob Tracy | "They couldn't hit an elephant at this dist- " rct@frus.com | - Last words of Union General John Sedgwick, | Battle of Spotsylvania Court House, U.S. Civil War ------------------------------------------------------------------------ ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 0:33 ` Andrew Morton 2007-12-07 5:07 ` Bob Tracy 2007-12-07 5:42 ` Bob Tracy @ 2007-12-07 9:33 ` Ingo Molnar 2 siblings, 0 replies; 40+ messages in thread From: Ingo Molnar @ 2007-12-07 9:33 UTC (permalink / raw) To: Andrew Morton Cc: Bob Tracy, mcree, linux-kernel, rjw, rth, ink, linux-scsi, kay.sievers, greg * Andrew Morton <akpm@linux-foundation.org> wrote: > > # bad: [6f37ac793d6ba7b35d338f791974166f67fdd9ba] Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6 > > git-bisect bad 6f37ac793d6ba7b35d338f791974166f67fdd9ba > > # good: [2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3] CRISv10 fasttimer: Scrap INLINE and name timeval_cmp better > > git-bisect good 2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3 > I'm struggling to see how any of those could have broken block device > mounting on alpha. Are you sure you bisected right? the bisection log looks healthy so far - with nicely alternating good/bad bisection points. Barring the possibility that the bug is non-deterministic, i'd guess the bisection points are OK, at least judging from their statistical properties. but ... i went over the diffs too, and i fail to see how they could affect the bootup path of an Alpha box, which i suspect has no networking dependency up to the failure point. Ingo ^ permalink raw reply [flat|nested] 40+ messages in thread
* Re: [BUG] 2.6.23-rc3 can't see sd partitions on Alpha 2007-12-07 0:16 ` Bob Tracy 2007-12-07 0:33 ` Andrew Morton @ 2007-12-07 0:44 ` Rafael J. Wysocki 1 sibling, 0 replies; 40+ messages in thread From: Rafael J. Wysocki @ 2007-12-07 0:44 UTC (permalink / raw) To: Bob Tracy Cc: Ingo Molnar, Michael Cree, Andrew Morton, linux-kernel, rth, ink, linux-scsi, Kay Sievers, Greg KH On Friday, 7 of December 2007, Bob Tracy wrote: > OK. Finally have this thing painted into a corner: git has identified > 6f37ac793d6ba7b35d338f791974166f67fdd9ba as the first bad commit. > > From "git bisect log", this corresponds to > > # bad: [6f37ac793d6ba7b35d338f791974166f67fdd9ba] Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6 Something's gone wrong, as this commit doesn't modify code. > Here's the full log: > > git-bisect start > # good: [9aae299f7fd1888ea3a195cfe0edef17bb647415] Linux 2.6.24-rc2 > git-bisect good 9aae299f7fd1888ea3a195cfe0edef17bb647415 > # bad: [f05092637dc0d9a3f2249c9b283b973e6e96b7d2] Linux 2.6.24-rc3 > git-bisect bad f05092637dc0d9a3f2249c9b283b973e6e96b7d2 > # good: [e6a5c27f3b0fef72e528fc35e343af4b2db790ff] Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/avi/kvm > git-bisect good e6a5c27f3b0fef72e528fc35e343af4b2db790ff > # good: [42614fcde7bfdcbe43a7b17035c167dfebc354dd] vmstat: fix section mismatch warning > git-bisect good 42614fcde7bfdcbe43a7b17035c167dfebc354dd > # bad: [a052f4473603765eb6b4c19754689977601dc1d1] Merge git://git.kernel.org/pub/scm/linux/kernel/git/sam/x86 > git-bisect bad a052f4473603765eb6b4c19754689977601dc1d1 > # good: [d8e5219f9f5ca7518eb820db9f3d287a1d46fcf5] CRISv10 improve and bugfix fasttimer > git-bisect good d8e5219f9f5ca7518eb820db9f3d287a1d46fcf5 > # good: [d90bf5a976793edfa88d3bb2393f0231eb8ce1e5] [NET]: rt_check_expire() can take a long time, add a cond_resched() > git-bisect good d90bf5a976793edfa88d3bb2393f0231eb8ce1e5 > # good: [2a113281f5cd2febbab21a93c8943f8d3eece4d3] kconfig: use $K64BIT to set 64BIT with all*config targets > git-bisect good 2a113281f5cd2febbab21a93c8943f8d3eece4d3 > # good: [2e2cd8bad6e03ceea73495ee6d557044213d95de] CRISv10 memset library add lineendings to asm > git-bisect good 2e2cd8bad6e03ceea73495ee6d557044213d95de > # bad: [6f37ac793d6ba7b35d338f791974166f67fdd9ba] Merge branch 'master' of master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6 > git-bisect bad 6f37ac793d6ba7b35d338f791974166f67fdd9ba > # good: [2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3] CRISv10 fasttimer: Scrap INLINE and name timeval_cmp better > git-bisect good 2f1f53bdc6531696934f6ee7bbdfa2ab4f4f62a3 ^ permalink raw reply [flat|nested] 40+ messages in thread
end of thread, other threads:[~2007-12-10 23:12 UTC | newest] Thread overview: 40+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2007-11-18 5:20 [BUG] 2.6.23-rc3 can't see sd partitions on Alpha Bob Tracy 2007-11-25 6:38 ` Andrew Morton 2007-11-25 12:15 ` Rafael J. Wysocki 2007-11-26 13:48 ` Bob Tracy 2007-11-30 22:30 ` Michael Cree 2007-11-30 22:42 ` Andrew Morton 2007-11-30 23:26 ` Rafael J. Wysocki 2007-12-02 20:53 ` Michael Cree 2007-12-03 1:17 ` Bob Tracy 2007-12-04 12:16 ` Ingo Molnar 2007-12-04 15:36 ` Bob Tracy 2007-12-05 17:30 ` Bob Tracy 2007-12-07 0:16 ` Bob Tracy 2007-12-07 0:33 ` Andrew Morton 2007-12-07 5:07 ` Bob Tracy 2007-12-07 10:26 ` Andrew Morton 2007-12-07 11:37 ` Ingo Molnar 2007-12-07 13:39 ` Bob Tracy 2007-12-07 14:55 ` Bob Tracy 2007-12-07 15:05 ` Ingo Molnar 2007-12-07 16:59 ` Bob Tracy 2007-12-07 18:06 ` Ingo Molnar 2007-12-07 18:19 ` Kay Sievers 2007-12-07 19:36 ` Bob Tracy 2007-12-07 20:43 ` Michael Cree 2007-12-07 21:19 ` Kay Sievers 2007-12-07 22:39 ` Bob Tracy 2007-12-08 4:53 ` Bob Tracy 2007-12-08 5:05 ` Bob Tracy 2007-12-08 15:48 ` Kay Sievers 2007-12-09 0:51 ` Michael Cree 2007-12-09 4:19 ` Bob Tracy 2007-12-09 18:07 ` Ivan Kokshaysky 2007-12-10 15:08 ` Bob Tracy 2007-12-10 23:12 ` Ivan Kokshaysky 2007-12-10 15:05 ` Bob Tracy 2007-12-07 11:40 ` Ingo Molnar 2007-12-07 5:42 ` Bob Tracy 2007-12-07 9:33 ` Ingo Molnar 2007-12-07 0:44 ` Rafael J. Wysocki
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).