* linux-next: Tree for May 8 @ 2009-05-08 8:02 Stephen Rothwell 2009-05-08 12:52 ` [PowerPC] Next May 8 boot failure: OOPS during ibmveth module init Sachin Sant 0 siblings, 1 reply; 10+ messages in thread From: Stephen Rothwell @ 2009-05-08 8:02 UTC (permalink / raw) To: linux-next; +Cc: LKML [-- Attachment #1: Type: text/plain, Size: 7549 bytes --] Hi all, Changes since 20090507: New tree: bluetooth The infiniband tree gained a build failure so I used the version from next-20090507. The net tree gained a conflict against Linus' tree. The vfs tree lost its build failure, but gained a conflict against the fuse tree. ---------------------------------------------------------------------------- I have created today's linux-next tree at git://git.kernel.org/pub/scm/linux/kernel/git/sfr/linux-next.git (patches at http://www.kernel.org/pub/linux/kernel/people/sfr/linux-next/). If you are tracking the linux-next tree using git, you should not use "git pull" to do so as that will try to merge the new linux-next release with the old one. You should use "git fetch" as mentioned in the FAQ on the wiki (see below). You can see which trees have been included by looking in the Next/Trees file in the source. There are also quilt-import.log and merge.log files in the Next directory. Between each merge, the tree was built with a ppc64_defconfig for powerpc and an allmodconfig for x86_64. After the final fixups (if any), it is also built with powerpc allnoconfig (32 and 64 bit), ppc44x_defconfig and allyesconfig (minus CONFIG_PROFILE_ALL_BRANCHES) and i386, sparc and sparc64 defconfig. These builds also have CONFIG_ENABLE_WARN_DEPRECATED, CONFIG_ENABLE_MUST_CHECK and CONFIG_DEBUG_INFO disabled when necessary. Below is a summary of the state of the merge. We are up to 136 trees (counting Linus' and 18 trees of patches pending for Linus' tree), more are welcome (even if they are currently empty). Thanks to those who have contributed, and to those who haven't, please do. Status of my local build tests will be at http://kisskb.ellerman.id.au/linux-next . If maintainers want to give advice about cross compilers/configs that work, we are always open to add more builds. Thanks to Jan Dittmer for adding the linux-next tree to his build tests at http://l4x.org/k/ , the guys at http://test.kernel.org/ and Randy Dunlap for doing many randconfig builds. There is a wiki covering stuff to do with linux-next at http://linux.f-seidel.de/linux-next/pmwiki/ . Thanks to Frank Seidel. -- Cheers, Stephen Rothwell sfr@canb.auug.org.au $ git checkout master $ git reset --hard stable Merging origin/master Merging fixes/fixes Merging arm-current/master Merging m68k-current/for-linus Merging powerpc-merge/merge Merging sparc-current/master Merging scsi-rc-fixes/master Merging net-current/master Merging sound-current/for-linus Merging pci-current/for-linus Merging wireless-current/master Merging kbuild-current/master Merging quilt/driver-core.current Merging quilt/usb.current Merging cpufreq-current/fixes Merging input-current/for-linus Merging md-current/for-linus Merging audit-current/for-linus Merging crypto-current/master Merging dwmw2/master Merging arm/devel CONFLICT (content): Merge conflict in arch/arm/Kconfig Merging avr32/avr32-arch Merging blackfin/for-linus Merging cris/for-next Merging ia64/test Merging m68k/for-next Merging m68knommu/for-next Merging mips/mips-for-linux-next Merging parisc/master Merging powerpc/next Merging 4xx/next Merging galak/next Merging pxa/for-next Merging s390/features Merging sh/master Merging sparc/master Merging x86/auto-x86-next Merging xtensa/master Merging configfs/linux-next Merging ext4/next Merging fatfs/master Merging fuse/for-next Merging gfs2/master Merging jfs/next Merging nfs/linux-next Merging nfsd/nfsd-next Merging nilfs2/for-next Merging ocfs2/linux-next Merging squashfs/master Merging v9fs/for-next CONFLICT (content): Merge conflict in net/9p/protocol.c Merging ubifs/linux-next Merging xfs/master Merging tip-core/auto-core-next Merging cpus4096/auto-cpus4096-next Merging tracing/auto-tracing-next Merging genirq/auto-genirq-next Merging safe-poison-pointers/auto-safe-poison-pointers-next Merging sched/auto-sched-next Merging stackprotector/auto-stackprotector-next Merging timers/auto-timers-next Merging pci/linux-next Merging quilt/device-mapper Merging hid/for-next Merging quilt/i2c Merging quilt/jdelvare-hwmon Merging quilt/kernel-doc Merging v4l-dvb/master Merging quota/for_next Merging kbuild/master [master f3434eb] Revert "initconst adjustments" Merging ide/for-next Merging libata/NEXT Merging infiniband/for-next $ git reset --hard HEAD^ Merging refs/next/20090507/infiniband Merging acpi/test Merging ieee1394/for-next Merging ubi/linux-next Merging kvm/master Merging dlm/next Merging scsi/master CONFLICT (content): Merge conflict in include/Kbuild Merging async_tx/next Merging udf/for_next Merging net/master CONFLICT (content): Merge conflict in include/linux/mmc/sdio_ids.h CONFLICT (content): Merge conflict in include/net/tcp.h Merging wireless/master [master 1f48946] Revert "rfkill: remove user_claim stuff" Merging mtd/master Merging crypto/master Merging vfs/for-next CONFLICT (content): Merge conflict in fs/fuse/inode.c Merging sound/for-next Merging cpufreq/next Merging quilt/rr CONFLICT (content): Merge conflict in arch/x86/kernel/cpu/cpufreq/acpi-cpufreq.c Merging cifs/master Merging mmc/next Merging input/next Merging bkl-removal/bkl-removal Merging lsm/for-next Merging block/for-next CONFLICT (content): Merge conflict in drivers/block/hd.c CONFLICT (content): Merge conflict in drivers/block/mg_disk.c Merging embedded/master Merging firmware/master Merging pcmcia/master Merging battery/master Merging leds/for-mm Merging backlight/for-mm Merging kgdb/kgdb-next Merging slab/for-next Merging uclinux/for-next Merging md/for-next Merging mfd/for-next Merging hdlc/hdlc-next Merging drm/drm-next Merging voltage/for-next Merging security-testing/next Merging lblnet/master Merging quilt/ttydev Merging agp/agp-next Merging generic-ipi/auto-generic-ipi-next Merging oprofile/auto-oprofile-next Merging fastboot/auto-fastboot-next Merging sparseirq/auto-sparseirq-next Merging iommu/auto-iommu-next Merging uwb/for-upstream Merging watchdog/master Merging bdev/master Merging dwmw2-iommu/master CONFLICT (content): Merge conflict in drivers/pci/intel-iommu.c Merging cputime/cputime Merging osd/linux-next Merging jc_docs/docs-next Merging nommu/master Merging trivial/for-next Merging audit/for-next Merging omap/for-next Merging quilt/aoe Merging kmemleak/kmemleak CONFLICT (delete/modify): arch/x86/kernel/vmlinux_32.lds.S deleted in HEAD and modified in kmemleak/kmemleak. Version kmemleak/kmemleak of arch/x86/kernel/vmlinux_32.lds.S left in tree. CONFLICT (delete/modify): arch/x86/kernel/vmlinux_64.lds.S deleted in HEAD and modified in kmemleak/kmemleak. Version kmemleak/kmemleak of arch/x86/kernel/vmlinux_64.lds.S left in tree. CONFLICT (content): Merge conflict in lib/Kconfig.debug CONFLICT (content): Merge conflict in mm/slob.c $ git rm -f arch/x86/kernel/vmlinux_32.lds.S arch/x86/kernel/vmlinux_64.lds.S Applying: kmemleak: fixup for x86 vmlinux.ld.S merge Merging suspend/linux-next Merging bluetooth/master Merging quilt/driver-core Merging quilt/usb CONFLICT (content): Merge conflict in drivers/usb/serial/sierra.c Merging quilt/staging CONFLICT (content): Merge conflict in drivers/staging/rt2860/common/mlme.c CONFLICT (content): Merge conflict in drivers/staging/rt2870/common/mlme.c CONFLICT (content): Merge conflict in drivers/staging/rt3070/common/mlme.c Merging scsi-post-merge/master [-- Attachment #2: Type: application/pgp-signature, Size: 197 bytes --] ^ permalink raw reply [flat|nested] 10+ messages in thread
* [PowerPC] Next May 8 boot failure: OOPS during ibmveth module init 2009-05-08 8:02 linux-next: Tree for May 8 Stephen Rothwell @ 2009-05-08 12:52 ` Sachin Sant 2009-05-08 19:57 ` David Miller 0 siblings, 1 reply; 10+ messages in thread From: Sachin Sant @ 2009-05-08 12:52 UTC (permalink / raw) To: linux-next; +Cc: Stephen Rothwell, netdev, linuxppc-dev [-- Attachment #1: Type: text/plain, Size: 5383 bytes --] Todays Next failed to boot on a Power6 JS22 blade with following oops. Unable to handle kernel paging request for data at address 0x654af306c04b990 Faulting instruction address: 0xc00000000003a740 Oops: Kernel access of bad area, sig: 11 [#1] SMP NR_CPUS=1024 NUMA pSeries Modules linked in: ibmveth(+) sg sd_mod crc_t10dif ibmvscsic scsi_transport_srp scsi_tgt scsi_mod NIP: c00000000003a740 LR: c000000000361e20 CTR: 0000000000000000 REGS: c000000042af6e80 TRAP: 0300 Not tainted (2.6.30-rc4-next-20090508) MSR: 8000000000009032 <EE,ME,IR,DR> CR: 28222286 XER: 20000001 DAR: 0654af306c04b990, DSISR: 0000000040000000 TASK = c0000000428084d0[590] 'modprobe' THREAD: c000000042af4000 CPU: 0 GPR00: c000000000361e10 c000000042af7100 c000000000eb8190 c00000004427cc80 GPR04: 0654af306c04b990 0000000000000006 0000000000000000 0000000000000002 GPR08: c00000004427cc00 0000000000000088 0000000000000280 000000000000007c GPR12: 0000000084222284 c000000000f92400 0000000000000000 0000000000000000 GPR16: 0000000000000000 0000000000000000 d000000000ed4346 d000000000ed3aa0 GPR20: d000000000ed6358 c00000004427cc00 00000000ffffffff 0000000000000000 GPR24: 0000000000000000 c0000000402d0000 0000000000000010 c00000004194b400 GPR28: 0000000000000006 0654af306c04b990 c000000000e44f18 0000000000000000 NIP [c00000000003a740] .memcpy+0x240/0x278 LR [c000000000361e20] .__nla_put+0x30/0x4c Call Trace: [c000000042af7100] [c000000000361e10] .__nla_put+0x20/0x4c (unreliable) [c000000042af7190] [c000000000361e88] .nla_put+0x4c/0x60 [c000000042af7200] [c00000000053993c] .rtnl_fill_ifinfo+0x308/0x614 [c000000042af7300] [c00000000053a118] .rtmsg_ifinfo+0x104/0x198 [c000000042af73b0] [c00000000053a244] .rtnetlink_event+0x98/0xb0 [c000000042af7430] [c0000000005c8330] .notifier_call_chain+0x68/0xdc [c000000042af74d0] [c000000000530488] .register_netdevice+0x390/0x418 [c000000042af75a0] [c000000000530568] .register_netdev+0x58/0x80 [c000000042af7630] [d000000000ed2da4] .ibmveth_probe+0x2c8/0x3a4 [ibmveth] [c000000042af7730] [c000000000023208] .vio_bus_probe+0x2f0/0x358 [c000000042af77f0] [c000000000462be0] .driver_probe_device+0xd4/0x1bc [c000000042af7890] [c000000000462d5c] .__driver_attach+0x94/0xd8 [c000000042af7920] [c000000000462164] .bus_for_each_dev+0x80/0xe8 [c000000042af79d0] [c0000000004629b0] .driver_attach+0x28/0x40 [c000000042af7a50] [c000000000461808] .bus_add_driver+0xdc/0x27c [c000000042af7af0] [c0000000004631d0] .driver_register+0xf0/0x1b0 [c000000042af7b90] [c000000000025178] .vio_register_driver+0x44/0x60 [c000000042af7c20] [d000000000ed2ed4] .ibmveth_module_init+0x54/0xa60 [ibmveth] [c000000042af7ca0] [c0000000000092c0] .do_one_initcall+0x80/0x19c [c000000042af7d90] [c0000000000bf884] .SyS_init_module+0xe0/0x248 [c000000042af7e30] [c000000000008534] syscall_exit+0x0/0x40 Instruction dump: 7cb01120 7c862214 7c661a14 4bfffe04 409c001c 80040000 81240004 38840008 90030000 91230004 38630008 409d0014 <80040000> 38840004 90030000 38630004 ---[ end trace 695e9dc0c5a9da2f ]--- udevd-event[587]: '/sbin/modprobe' abnormal exit Unable to handle kernel paging request for data at address 0x654af306c04b990 Faulting instruction address: 0xc000000000543ce0 Oops: Kernel access of bad area, sig: 11 [#2] SMP NR_CPUS=1024 NUMA pSeries Modules linked in: ibmveth(+) sg sd_mod crc_t10dif ibmvscsic scsi_transport_srp scsi_tgt scsi_mod NIP: c000000000543ce0 LR: c000000000543dcc CTR: c00000000053dd98 REGS: c0000000408b3740 TRAP: 0300 Tainted: G D (2.6.30-rc4-next-20090508) MSR: 8000000000009032 <EE,ME,IR,DR> CR: 24042428 XER: 00000001 DAR: 0654af306c04b990, DSISR: 0000000040000000 TASK = c00000004053d880[752] 'udevd' THREAD: c0000000408b0000 CPU: 3 GPR00: c000000000543dcc c0000000408b39c0 c000000000eb8190 c0000000448e0000 GPR04: 0000000000010000 c000000000796dec 0000000000000006 0000000000000000 GPR08: c00000004265cf34 c000000000ea7250 c00000004265cf34 0000000000000000 GPR12: 0000000044042488 c000000000f92a00 0000000000000001 0000000000000001 GPR16: 00000000100372dc 00000000100374e0 00000000100376f8 0000000000000000 GPR20: 0000000010036ec8 0000000000000000 00000fffdb3a6618 0000000000000200 GPR24: 0000000000000006 0000000000000005 c0000000448f0000 c0000000448e0000 GPR28: 0654af306c04b990 0000000000000000 c000000000e45318 c0000000448e0000 NIP [c000000000543ce0] ._format_mac_addr+0x54/0xd4 LR [c000000000543dcc] .sysfs_format_mac+0x30/0x6c Call Trace: [c0000000408b39c0] [c00000000010c9a4] .__alloc_pages_internal+0x1b8/0x590 (unreliable) [c0000000408b3a70] [c000000000543dcc] .sysfs_format_mac+0x30/0x6c [c0000000408b3b00] [c00000000053dde8] .show_address+0x50/0x88 [c0000000408b3b90] [c00000000045ead4] .dev_attr_show+0x4c/0x94 [c0000000408b3c20] [c0000000001bce48] .sysfs_read_file+0x10c/0x1d0 [c0000000408b3ce0] [c00000000014c9a0] .vfs_read+0xd0/0x1bc [c0000000408b3d80] [c00000000014cb94] .SyS_read+0x58/0xa0 [c0000000408b3e30] [c000000000008534] syscall_exit+0x0/0x40 Instruction dump: f8010010 ebc2cc58 f821ff51 7c7b1b78 7cd83378 7cbc2b78 7f432214 7c7f1b78 3ba00000 3b26ffff 48000044 e8be8000 <88dc0000> 3b9c0001 4be0ff89 60000000 ---[ end trace 695e9dc0c5a9da30 ]--- attempt to access beyond end of device Next May 7 with same config boots fine. Thanks -Sachin -- --------------------------------- Sachin Sant IBM Linux Technology Center India Systems and Technology Labs Bangalore, India --------------------------------- [-- Attachment #2: log --] [-- Type: text/plain, Size: 13642 bytes --] boot: next Using 007ba4b8 bytes for initrd buffer Please wait, loading kernel... Allocated 01500000 bytes for kernel @ 02200000 Elf64 kernel loaded... Loading ramdisk... ramdisk loaded 007ba4b8 @ 03700000 OF stdout device is: /vdevice/vty@30000000 Preparing to boot Linux version 2.6.30-rc4-next-20090508 (root@mjs22lp5) (gcc version 4.3.2 [gcc-4_3-branch revision 141291] (SUSE Linux) ) #3 SMP Fri May 8 17:04:14 IST 2009 Calling ibm,client-architecture... done command line: root=/dev/sda3 sysrq=8 memory layout at init: alloc_bottom : 0000000003ec0000 alloc_top : 0000000008000000 alloc_top_hi : 0000000008000000 rmo_top : 0000000008000000 ram_top : 0000000008000000 instantiating rtas at 0x0000000007630000... done boot cpu hw idx 0000000000000000 starting cpu hw idx 0000000000000002... done copying OF device tree... Building dt strings... Building dt structure... Device tree strings 0x0000000003ed0000 -> 0x0000000003ed1585 Device tree struct 0x0000000003ee0000 -> 0x0000000003ef0000 Calling quiesce... returning from prom_init Phyp-dump disabled at boot time Using pSeries machine description Using 1TB segments Found initrd at 0xc000000003700000:0xc000000003eba4b8 console [udbg0] enabled Partition configured for 8 cpus. CPU maps initialized for 2 threads per core Starting Linux PPC64 #3 SMP Fri May 8 17:04:14 IST 2009 ----------------------------------------------------- ppc64_pft_size = 0x19 physicalMemorySize = 0x80000000 htab_hash_mask = 0x3ffff ----------------------------------------------------- Initializing cgroup subsys cpuset Initializing cgroup subsys cpu Linux version 2.6.30-rc4-next-20090508 (root@mjs22lp5) (gcc version 4.3.2 [gcc-4_3-branch revision 141291] (SUSE Linux) ) #3 SMP Fri May 8 17:04:14 IST 2009 [boot]0012 Setup Arch EEH: No capable adapters found PPC64 nvram contains 15360 bytes Zone PFN ranges: DMA 0x00000000 -> 0x00008000 Normal 0x00008000 -> 0x00008000 Movable zone start PFN for each node early_node_map[3] active PFN ranges 1: 0x00000000 -> 0x00000800 0: 0x00000800 -> 0x00004600 1: 0x00004600 -> 0x00008000 [boot]0015 Setup Done Built 2 zonelists in Node order, mobility grouping on. Total pages: 32726 Policy zone: DMA Kernel command line: root=/dev/sda3 sysrq=8 Experimental hierarchical RCU implementation. Experimental hierarchical RCU init done. NR_IRQS:512 [boot]0020 XICS Init [boot]0021 XICS Done PID hash table entries: 4096 (order: 12, 32768 bytes) clocksource: timebase mult[7d0000] shift[22] registered Console: colour dummy device 80x25 console handover: boot [udbg0] -> real [hvc0] allocated 1310720 bytes of page_cgroup please try cgroup_disable=memory option if you don't want freeing bootmem node 0 freeing bootmem node 1 Memory: 2032704k/2097152k available (13056k kernel code, 69440k reserved, 2048k data, 4268k bss, 4672k init) Calibrating delay loop... 1022.36 BogoMIPS (lpj=5111808) Security Framework initialized SELinux: Disabled at boot. Dentry cache hash table entries: 262144 (order: 5, 2097152 bytes) Inode-cache hash table entries: 131072 (order: 4, 1048576 bytes) Mount-cache hash table entries: 4096 Initializing cgroup subsys ns Initializing cgroup subsys cpuacct Initializing cgroup subsys memory Initializing cgroup subsys devices Initializing cgroup subsys freezer Processor 1 found. Processor 2 found. Processor 3 found. Brought up 4 CPUs net_namespace: 1888 bytes NET: Registered protocol family 16 IBM eBus Device Driver PCI: Probing PCI hardware bio: create slab <bio-0> at 0 usbcore: registered new interface driver usbfs usbcore: registered new interface driver hub usbcore: registered new device driver usb Failed to register trace events module notifier NET: Registered protocol family 2 IP route cache hash table entries: 16384 (order: 1, 131072 bytes) TCP established hash table entries: 65536 (order: 4, 1048576 bytes) TCP bind hash table entries: 65536 (order: 4, 1048576 bytes) TCP: Hash tables configured (established 65536 bind 65536) TCP reno registered NET: Registered protocol family 1 Unpacking initramfs... Freeing initrd memory: 7913k freed IOMMU table initialized, virtual merging enabled audit: initializing netlink socket (disabled) type=2000 audit(1241782614.530:1): initialized HugeTLB registered 16 MB page size, pre-allocated 0 pages HugeTLB registered 16 GB page size, pre-allocated 0 pages VFS: Disk quotas dquot_6.5.2 Dquot-cache hash table entries: 8192 (order 0, 65536 bytes) Btrfs loaded msgmni has been set to 3984 alg: No test for stdrng (krng) Block layer SCSI generic (bsg) driver version 0.4 loaded (major 254) io scheduler noop registered io scheduler anticipatory registered io scheduler deadline registered io scheduler cfq registered (default) pci_hotplug: PCI Hot Plug PCI Core version: 0.5 rpaphp: RPA HOT Plug PCI Controller Driver version: 0.1 Generic RTC Driver v1.07 Serial: 8250/16550 driver, 4 ports, IRQ sharing disabled pmac_zilog: 0.6 (Benjamin Herrenschmidt <benh@kernel.crashing.org>) input: Macintosh mouse button emulation as /devices/virtual/input/input0 Uniform Multi-Platform E-IDE driver ide-gd driver 1.18 ehci_hcd: USB 2.0 'Enhanced' Host Controller (EHCI) Driver ohci_hcd: USB 1.1 'Open' Host Controller (OHCI) Driver mice: PS/2 mouse device common for all mice EDAC MC: Ver: 2.1.0 May 8 2009 usbcore: registered new interface driver hiddev usbcore: registered new interface driver usbhid usbhid: v2.6:USB HID core driver TCP cubic registered NET: Registered protocol family 15 registered taskstats version 1 Freeing unused kernel memory: 4672k freed doing fast boot SysRq : Changing Loglevel Loglevel set to 8 SCSI subsystem initialized vio_register_driver: driver ibmvscsi registering ibmvscsi 30000002: SRP_VERSION: 16.a scsi0 : IBM POWER Virtual SCSI Adapter 1.5.8 ibmvscsi 30000002: partner initialization complete ibmvscsi 30000002: sent SRP login ibmvscsi 30000002: SRP_LOGIN succeeded ibmvscsi 30000002: host srp version: 16.a, host partition 06-1C12A (1), OS 3, max io 262144 scsi 0:0:1:0: Direct-Access AIX VDASD 0001 PQ: 0 ANSI: 3 scsi 0:0:2:0: CD-ROM AIX VOPTA PQ: 0 ANSI: 4 Creating device nodes with udev udevd version 128 started Driver 'sd' needs updating - please use bus_type methods sd 0:0:1:0: [sda] 33554432 512-byte hardware sectors: (17.1 GB/16.0 GiB) sd 0:0:1:0: [sda] Write Protect is off sd 0:0:1:0: [sda] Mode Sense: 17 00 00 08 sd 0:0:1:0: [sda] Cache data unavailable sd 0:0:1:0: [sda] Assuming drive cache: write through sd 0:0:1:0: [sda] Cache data unavailable sd 0:0:1:0: [sda] Assuming drive cache: write through sda: sda1 sda2 sda3 sd 0:0:1:0: [sda] Attached SCSI disk Boot logging started on /dev/hvc0(/dev/console) at Fri May 8 11:36:54 2009 Waiting for device /dev/sda3 to appear: ok showconsole: Warning: the ioctl TIOCGDEV is not known by the kernel fsck 1.41.1 (01-Sep-2008) [/sbin/fsck.ext3 (1) -- /] fsck.ext3 -a /dev/sda3 /dev/sda3: clean, 263693/983040 files, 2840945/3929888 blocks fsck succeeded. Mounting root device read-write. Mounting root /dev/sda3 mount -o rw,acl,user_xattr -t ext3 /dev/sda3 /root kjournald starting. Commit interval 5 seconds EXT3 FS on sda3, internal journal EXT3-fs: mounted filesystem with writeback data mode. mount: can't find /root/proc in /etc/fstab or /etc/mtab INIT: version 2.86 booting System Boot Control: Running /etc/init.d/boot Mounting procfs at /proc done Mounting sysfs at /sys done Mounting debugfs at /sys/kernel/debug done Remounting tmpfs at /dev done Initializing /dev done Mounting devpts at /dev/pts done Starting udevd: udevd version 128 started done Loading drivers, configuring devices: sd 0:0:1:0: Attached scsi generic sg0 type 0 scsi 0:0:2:0: Attached scsi generic sg1 type 5 drivers/net/ibmveth.c: ibmveth: IBM i/pSeries Virtual Ethernet Driver 1.03 vio_register_driver: driver ibmveth registering Unable to handle kernel paging request for data at address 0x654af306c04b990 Faulting instruction address: 0xc00000000003a740 Oops: Kernel access of bad area, sig: 11 [#1] SMP NR_CPUS=1024 NUMA pSeries Modules linked in: ibmveth(+) sg sd_mod crc_t10dif ibmvscsic scsi_transport_srp scsi_tgt scsi_mod NIP: c00000000003a740 LR: c000000000361e20 CTR: 0000000000000000 REGS: c000000042af6e80 TRAP: 0300 Not tainted (2.6.30-rc4-next-20090508) MSR: 8000000000009032 <EE,ME,IR,DR> CR: 28222286 XER: 20000001 DAR: 0654af306c04b990, DSISR: 0000000040000000 TASK = c0000000428084d0[590] 'modprobe' THREAD: c000000042af4000 CPU: 0 GPR00: c000000000361e10 c000000042af7100 c000000000eb8190 c00000004427cc80 GPR04: 0654af306c04b990 0000000000000006 0000000000000000 0000000000000002 GPR08: c00000004427cc00 0000000000000088 0000000000000280 000000000000007c GPR12: 0000000084222284 c000000000f92400 0000000000000000 0000000000000000 GPR16: 0000000000000000 0000000000000000 d000000000ed4346 d000000000ed3aa0 GPR20: d000000000ed6358 c00000004427cc00 00000000ffffffff 0000000000000000 GPR24: 0000000000000000 c0000000402d0000 0000000000000010 c00000004194b400 GPR28: 0000000000000006 0654af306c04b990 c000000000e44f18 0000000000000000 NIP [c00000000003a740] .memcpy+0x240/0x278 LR [c000000000361e20] .__nla_put+0x30/0x4c Call Trace: [c000000042af7100] [c000000000361e10] .__nla_put+0x20/0x4c (unreliable) [c000000042af7190] [c000000000361e88] .nla_put+0x4c/0x60 [c000000042af7200] [c00000000053993c] .rtnl_fill_ifinfo+0x308/0x614 [c000000042af7300] [c00000000053a118] .rtmsg_ifinfo+0x104/0x198 [c000000042af73b0] [c00000000053a244] .rtnetlink_event+0x98/0xb0 [c000000042af7430] [c0000000005c8330] .notifier_call_chain+0x68/0xdc [c000000042af74d0] [c000000000530488] .register_netdevice+0x390/0x418 [c000000042af75a0] [c000000000530568] .register_netdev+0x58/0x80 [c000000042af7630] [d000000000ed2da4] .ibmveth_probe+0x2c8/0x3a4 [ibmveth] [c000000042af7730] [c000000000023208] .vio_bus_probe+0x2f0/0x358 [c000000042af77f0] [c000000000462be0] .driver_probe_device+0xd4/0x1bc [c000000042af7890] [c000000000462d5c] .__driver_attach+0x94/0xd8 [c000000042af7920] [c000000000462164] .bus_for_each_dev+0x80/0xe8 [c000000042af79d0] [c0000000004629b0] .driver_attach+0x28/0x40 [c000000042af7a50] [c000000000461808] .bus_add_driver+0xdc/0x27c [c000000042af7af0] [c0000000004631d0] .driver_register+0xf0/0x1b0 [c000000042af7b90] [c000000000025178] .vio_register_driver+0x44/0x60 [c000000042af7c20] [d000000000ed2ed4] .ibmveth_module_init+0x54/0xa60 [ibmveth] [c000000042af7ca0] [c0000000000092c0] .do_one_initcall+0x80/0x19c [c000000042af7d90] [c0000000000bf884] .SyS_init_module+0xe0/0x248 [c000000042af7e30] [c000000000008534] syscall_exit+0x0/0x40 Instruction dump: 7cb01120 7c862214 7c661a14 4bfffe04 409c001c 80040000 81240004 38840008 90030000 91230004 38630008 409d0014 <80040000> 38840004 90030000 38630004 ---[ end trace 695e9dc0c5a9da2f ]--- udevd-event[587]: '/sbin/modprobe' abnormal exit Unable to handle kernel paging request for data at address 0x654af306c04b990 Faulting instruction address: 0xc000000000543ce0 Oops: Kernel access of bad area, sig: 11 [#2] SMP NR_CPUS=1024 NUMA pSeries Modules linked in: ibmveth(+) sg sd_mod crc_t10dif ibmvscsic scsi_transport_srp scsi_tgt scsi_mod NIP: c000000000543ce0 LR: c000000000543dcc CTR: c00000000053dd98 REGS: c0000000408b3740 TRAP: 0300 Tainted: G D (2.6.30-rc4-next-20090508) MSR: 8000000000009032 <EE,ME,IR,DR> CR: 24042428 XER: 00000001 DAR: 0654af306c04b990, DSISR: 0000000040000000 TASK = c00000004053d880[752] 'udevd' THREAD: c0000000408b0000 CPU: 3 GPR00: c000000000543dcc c0000000408b39c0 c000000000eb8190 c0000000448e0000 GPR04: 0000000000010000 c000000000796dec 0000000000000006 0000000000000000 GPR08: c00000004265cf34 c000000000ea7250 c00000004265cf34 0000000000000000 GPR12: 0000000044042488 c000000000f92a00 0000000000000001 0000000000000001 GPR16: 00000000100372dc 00000000100374e0 00000000100376f8 0000000000000000 GPR20: 0000000010036ec8 0000000000000000 00000fffdb3a6618 0000000000000200 GPR24: 0000000000000006 0000000000000005 c0000000448f0000 c0000000448e0000 GPR28: 0654af306c04b990 0000000000000000 c000000000e45318 c0000000448e0000 NIP [c000000000543ce0] ._format_mac_addr+0x54/0xd4 LR [c000000000543dcc] .sysfs_format_mac+0x30/0x6c Call Trace: [c0000000408b39c0] [c00000000010c9a4] .__alloc_pages_internal+0x1b8/0x590 (unreliable) [c0000000408b3a70] [c000000000543dcc] .sysfs_format_mac+0x30/0x6c [c0000000408b3b00] [c00000000053dde8] .show_address+0x50/0x88 [c0000000408b3b90] [c00000000045ead4] .dev_attr_show+0x4c/0x94 [c0000000408b3c20] [c0000000001bce48] .sysfs_read_file+0x10c/0x1d0 [c0000000408b3ce0] [c00000000014c9a0] .vfs_read+0xd0/0x1bc [c0000000408b3d80] [c00000000014cb94] .SyS_read+0x58/0xa0 [c0000000408b3e30] [c000000000008534] syscall_exit+0x0/0x40 Instruction dump: f8010010 ebc2cc58 f821ff51 7c7b1b78 7cd83378 7cbc2b78 7f432214 7c7f1b78 3ba00000 3b26ffff 48000044 e8be8000 <88dc0000> 3b9c0001 4be0ff89 60000000 ---[ end trace 695e9dc0c5a9da30 ]--- attempt to access beyond end of device sda3: rw=0, want=31439208, limit=31439205 IBM eHEA ethernet device driver (Release EHEA_0101) irq: irq 590080 on host null mapped to virtual irq 256 Driver 'sr' needs updating - please use bus_type methods sr0: scsi-1 drive Uniform CD-ROM driver Revision: 3.20 sr 0:0:2:0: Attached scsi CD-ROM sr0 ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PowerPC] Next May 8 boot failure: OOPS during ibmveth module init 2009-05-08 12:52 ` [PowerPC] Next May 8 boot failure: OOPS during ibmveth module init Sachin Sant @ 2009-05-08 19:57 ` David Miller 2009-05-08 22:33 ` Jiri Pirko 0 siblings, 1 reply; 10+ messages in thread From: David Miller @ 2009-05-08 19:57 UTC (permalink / raw) To: sachinp; +Cc: linux-next, linuxppc-dev, sfr, netdev, jpirko From: Sachin Sant <sachinp@in.ibm.com> Date: Fri, 08 May 2009 18:22:48 +0530 > Todays Next failed to boot on a Power6 JS22 blade with following oops. Jiri, I suspect this might be your address list changes. Although that's just a guess. But please take a look. > Unable to handle kernel paging request for data at address > 0x654af306c04b990 > Faulting instruction address: 0xc00000000003a740 > Oops: Kernel access of bad area, sig: 11 [#1] > SMP NR_CPUS=1024 NUMA pSeries > Modules linked in: ibmveth(+) sg sd_mod crc_t10dif ibmvscsic > scsi_transport_srp scsi_tgt scsi_mod > NIP: c00000000003a740 LR: c000000000361e20 CTR: 0000000000000000 > REGS: c000000042af6e80 TRAP: 0300 Not tainted > (2.6.30-rc4-next-20090508) > MSR: 8000000000009032 <EE,ME,IR,DR> CR: 28222286 XER: 20000001 > DAR: 0654af306c04b990, DSISR: 0000000040000000 > TASK = c0000000428084d0[590] 'modprobe' THREAD: c000000042af4000 CPU: > 0 > GPR00: c000000000361e10 c000000042af7100 c000000000eb8190 > c00000004427cc80 > GPR04: 0654af306c04b990 0000000000000006 0000000000000000 > 0000000000000002 > GPR08: c00000004427cc00 0000000000000088 0000000000000280 > 000000000000007c > GPR12: 0000000084222284 c000000000f92400 0000000000000000 > 0000000000000000 > GPR16: 0000000000000000 0000000000000000 d000000000ed4346 > d000000000ed3aa0 > GPR20: d000000000ed6358 c00000004427cc00 00000000ffffffff > 0000000000000000 > GPR24: 0000000000000000 c0000000402d0000 0000000000000010 > c00000004194b400 > GPR28: 0000000000000006 0654af306c04b990 c000000000e44f18 > 0000000000000000 > NIP [c00000000003a740] .memcpy+0x240/0x278 > LR [c000000000361e20] .__nla_put+0x30/0x4c > Call Trace: > [c000000042af7100] [c000000000361e10] .__nla_put+0x20/0x4c > (unreliable) > [c000000042af7190] [c000000000361e88] .nla_put+0x4c/0x60 > [c000000042af7200] [c00000000053993c] .rtnl_fill_ifinfo+0x308/0x614 > [c000000042af7300] [c00000000053a118] .rtmsg_ifinfo+0x104/0x198 > [c000000042af73b0] [c00000000053a244] .rtnetlink_event+0x98/0xb0 > [c000000042af7430] [c0000000005c8330] .notifier_call_chain+0x68/0xdc > [c000000042af74d0] [c000000000530488] .register_netdevice+0x390/0x418 > [c000000042af75a0] [c000000000530568] .register_netdev+0x58/0x80 > [c000000042af7630] [d000000000ed2da4] .ibmveth_probe+0x2c8/0x3a4 > [ibmveth] > [c000000042af7730] [c000000000023208] .vio_bus_probe+0x2f0/0x358 > [c000000042af77f0] [c000000000462be0] .driver_probe_device+0xd4/0x1bc > [c000000042af7890] [c000000000462d5c] .__driver_attach+0x94/0xd8 > [c000000042af7920] [c000000000462164] .bus_for_each_dev+0x80/0xe8 > [c000000042af79d0] [c0000000004629b0] .driver_attach+0x28/0x40 > [c000000042af7a50] [c000000000461808] .bus_add_driver+0xdc/0x27c > [c000000042af7af0] [c0000000004631d0] .driver_register+0xf0/0x1b0 > [c000000042af7b90] [c000000000025178] .vio_register_driver+0x44/0x60 > [c000000042af7c20] [d000000000ed2ed4] .ibmveth_module_init+0x54/0xa60 > [ibmveth] > [c000000042af7ca0] [c0000000000092c0] .do_one_initcall+0x80/0x19c > [c000000042af7d90] [c0000000000bf884] .SyS_init_module+0xe0/0x248 > [c000000042af7e30] [c000000000008534] syscall_exit+0x0/0x40 > Instruction dump: > 7cb01120 7c862214 7c661a14 4bfffe04 409c001c 80040000 81240004 > 38840008 > 90030000 91230004 38630008 409d0014 <80040000> 38840004 90030000 > 38630004 > ---[ end trace 695e9dc0c5a9da2f ]--- > > udevd-event[587]: '/sbin/modprobe' abnormal exit > > Unable to handle kernel paging request for data at address > 0x654af306c04b990 > Faulting instruction address: 0xc000000000543ce0 > Oops: Kernel access of bad area, sig: 11 [#2] > SMP NR_CPUS=1024 NUMA pSeries > Modules linked in: ibmveth(+) sg sd_mod crc_t10dif ibmvscsic > scsi_transport_srp scsi_tgt scsi_mod > NIP: c000000000543ce0 LR: c000000000543dcc CTR: c00000000053dd98 > REGS: c0000000408b3740 TRAP: 0300 Tainted: G D > (2.6.30-rc4-next-20090508) > MSR: 8000000000009032 <EE,ME,IR,DR> CR: 24042428 XER: 00000001 > DAR: 0654af306c04b990, DSISR: 0000000040000000 > TASK = c00000004053d880[752] 'udevd' THREAD: c0000000408b0000 CPU: 3 > GPR00: c000000000543dcc c0000000408b39c0 c000000000eb8190 > c0000000448e0000 > GPR04: 0000000000010000 c000000000796dec 0000000000000006 > 0000000000000000 > GPR08: c00000004265cf34 c000000000ea7250 c00000004265cf34 > 0000000000000000 > GPR12: 0000000044042488 c000000000f92a00 0000000000000001 > 0000000000000001 > GPR16: 00000000100372dc 00000000100374e0 00000000100376f8 > 0000000000000000 > GPR20: 0000000010036ec8 0000000000000000 00000fffdb3a6618 > 0000000000000200 > GPR24: 0000000000000006 0000000000000005 c0000000448f0000 > c0000000448e0000 > GPR28: 0654af306c04b990 0000000000000000 c000000000e45318 > c0000000448e0000 > NIP [c000000000543ce0] ._format_mac_addr+0x54/0xd4 > LR [c000000000543dcc] .sysfs_format_mac+0x30/0x6c > Call Trace: > [c0000000408b39c0] [c00000000010c9a4] > .__alloc_pages_internal+0x1b8/0x590 (unreliable) > [c0000000408b3a70] [c000000000543dcc] .sysfs_format_mac+0x30/0x6c > [c0000000408b3b00] [c00000000053dde8] .show_address+0x50/0x88 > [c0000000408b3b90] [c00000000045ead4] .dev_attr_show+0x4c/0x94 > [c0000000408b3c20] [c0000000001bce48] .sysfs_read_file+0x10c/0x1d0 > [c0000000408b3ce0] [c00000000014c9a0] .vfs_read+0xd0/0x1bc > [c0000000408b3d80] [c00000000014cb94] .SyS_read+0x58/0xa0 > [c0000000408b3e30] [c000000000008534] syscall_exit+0x0/0x40 > Instruction dump: > f8010010 ebc2cc58 f821ff51 7c7b1b78 7cd83378 7cbc2b78 7f432214 > 7c7f1b78 > 3ba00000 3b26ffff 48000044 e8be8000 <88dc0000> 3b9c0001 4be0ff89 > 60000000 > ---[ end trace 695e9dc0c5a9da30 ]--- > > attempt to access beyond end of device > > Next May 7 with same config boots fine. > > Thanks > -Sachin > > -- > > --------------------------------- > Sachin Sant > IBM Linux Technology Center > India Systems and Technology Labs > Bangalore, India > --------------------------------- > ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PowerPC] Next May 8 boot failure: OOPS during ibmveth module init 2009-05-08 19:57 ` David Miller @ 2009-05-08 22:33 ` Jiri Pirko 2009-05-11 17:07 ` [PowerPC] Next May 8 boot failure: OOPS during ibmveth moduleinit Sachin Sant 0 siblings, 1 reply; 10+ messages in thread From: Jiri Pirko @ 2009-05-08 22:33 UTC (permalink / raw) To: David Miller; +Cc: sachinp, linux-next, linuxppc-dev, sfr, netdev Fri, May 08, 2009 at 09:57:22PM CEST, davem@davemloft.net wrote: >From: Sachin Sant <sachinp@in.ibm.com> >Date: Fri, 08 May 2009 18:22:48 +0530 > >> Todays Next failed to boot on a Power6 JS22 blade with following oops. > >Jiri, I suspect this might be your address list changes. > >Although that's just a guess. But please take a look. Hmm, only thing I see might cause the problem would be if calling __hw_addr_add in dev_addr_init fails, then dev->dev_addr would contain zeroes (which looks this is not the case). But in this case the oops would appear earlier (in ibmveth_probe dev_addr memcpy). Will do the patch which checks the result to behave correctly in case of oom but this imho this wouldn't help. Strange, I will dig into this more tomorrow. > >> Unable to handle kernel paging request for data at address >> 0x654af306c04b990 >> Faulting instruction address: 0xc00000000003a740 >> Oops: Kernel access of bad area, sig: 11 [#1] >> SMP NR_CPUS=1024 NUMA pSeries >> Modules linked in: ibmveth(+) sg sd_mod crc_t10dif ibmvscsic >> scsi_transport_srp scsi_tgt scsi_mod >> NIP: c00000000003a740 LR: c000000000361e20 CTR: 0000000000000000 >> REGS: c000000042af6e80 TRAP: 0300 Not tainted >> (2.6.30-rc4-next-20090508) >> MSR: 8000000000009032 <EE,ME,IR,DR> CR: 28222286 XER: 20000001 >> DAR: 0654af306c04b990, DSISR: 0000000040000000 >> TASK = c0000000428084d0[590] 'modprobe' THREAD: c000000042af4000 CPU: >> 0 >> GPR00: c000000000361e10 c000000042af7100 c000000000eb8190 >> c00000004427cc80 >> GPR04: 0654af306c04b990 0000000000000006 0000000000000000 >> 0000000000000002 >> GPR08: c00000004427cc00 0000000000000088 0000000000000280 >> 000000000000007c >> GPR12: 0000000084222284 c000000000f92400 0000000000000000 >> 0000000000000000 >> GPR16: 0000000000000000 0000000000000000 d000000000ed4346 >> d000000000ed3aa0 >> GPR20: d000000000ed6358 c00000004427cc00 00000000ffffffff >> 0000000000000000 >> GPR24: 0000000000000000 c0000000402d0000 0000000000000010 >> c00000004194b400 >> GPR28: 0000000000000006 0654af306c04b990 c000000000e44f18 >> 0000000000000000 >> NIP [c00000000003a740] .memcpy+0x240/0x278 >> LR [c000000000361e20] .__nla_put+0x30/0x4c >> Call Trace: >> [c000000042af7100] [c000000000361e10] .__nla_put+0x20/0x4c >> (unreliable) >> [c000000042af7190] [c000000000361e88] .nla_put+0x4c/0x60 >> [c000000042af7200] [c00000000053993c] .rtnl_fill_ifinfo+0x308/0x614 >> [c000000042af7300] [c00000000053a118] .rtmsg_ifinfo+0x104/0x198 >> [c000000042af73b0] [c00000000053a244] .rtnetlink_event+0x98/0xb0 >> [c000000042af7430] [c0000000005c8330] .notifier_call_chain+0x68/0xdc >> [c000000042af74d0] [c000000000530488] .register_netdevice+0x390/0x418 >> [c000000042af75a0] [c000000000530568] .register_netdev+0x58/0x80 >> [c000000042af7630] [d000000000ed2da4] .ibmveth_probe+0x2c8/0x3a4 >> [ibmveth] >> [c000000042af7730] [c000000000023208] .vio_bus_probe+0x2f0/0x358 >> [c000000042af77f0] [c000000000462be0] .driver_probe_device+0xd4/0x1bc >> [c000000042af7890] [c000000000462d5c] .__driver_attach+0x94/0xd8 >> [c000000042af7920] [c000000000462164] .bus_for_each_dev+0x80/0xe8 >> [c000000042af79d0] [c0000000004629b0] .driver_attach+0x28/0x40 >> [c000000042af7a50] [c000000000461808] .bus_add_driver+0xdc/0x27c >> [c000000042af7af0] [c0000000004631d0] .driver_register+0xf0/0x1b0 >> [c000000042af7b90] [c000000000025178] .vio_register_driver+0x44/0x60 >> [c000000042af7c20] [d000000000ed2ed4] .ibmveth_module_init+0x54/0xa60 >> [ibmveth] >> [c000000042af7ca0] [c0000000000092c0] .do_one_initcall+0x80/0x19c >> [c000000042af7d90] [c0000000000bf884] .SyS_init_module+0xe0/0x248 >> [c000000042af7e30] [c000000000008534] syscall_exit+0x0/0x40 >> Instruction dump: >> 7cb01120 7c862214 7c661a14 4bfffe04 409c001c 80040000 81240004 >> 38840008 >> 90030000 91230004 38630008 409d0014 <80040000> 38840004 90030000 >> 38630004 >> ---[ end trace 695e9dc0c5a9da2f ]--- >> >> udevd-event[587]: '/sbin/modprobe' abnormal exit >> >> Unable to handle kernel paging request for data at address >> 0x654af306c04b990 >> Faulting instruction address: 0xc000000000543ce0 >> Oops: Kernel access of bad area, sig: 11 [#2] >> SMP NR_CPUS=1024 NUMA pSeries >> Modules linked in: ibmveth(+) sg sd_mod crc_t10dif ibmvscsic >> scsi_transport_srp scsi_tgt scsi_mod >> NIP: c000000000543ce0 LR: c000000000543dcc CTR: c00000000053dd98 >> REGS: c0000000408b3740 TRAP: 0300 Tainted: G D >> (2.6.30-rc4-next-20090508) >> MSR: 8000000000009032 <EE,ME,IR,DR> CR: 24042428 XER: 00000001 >> DAR: 0654af306c04b990, DSISR: 0000000040000000 >> TASK = c00000004053d880[752] 'udevd' THREAD: c0000000408b0000 CPU: 3 >> GPR00: c000000000543dcc c0000000408b39c0 c000000000eb8190 >> c0000000448e0000 >> GPR04: 0000000000010000 c000000000796dec 0000000000000006 >> 0000000000000000 >> GPR08: c00000004265cf34 c000000000ea7250 c00000004265cf34 >> 0000000000000000 >> GPR12: 0000000044042488 c000000000f92a00 0000000000000001 >> 0000000000000001 >> GPR16: 00000000100372dc 00000000100374e0 00000000100376f8 >> 0000000000000000 >> GPR20: 0000000010036ec8 0000000000000000 00000fffdb3a6618 >> 0000000000000200 >> GPR24: 0000000000000006 0000000000000005 c0000000448f0000 >> c0000000448e0000 >> GPR28: 0654af306c04b990 0000000000000000 c000000000e45318 >> c0000000448e0000 >> NIP [c000000000543ce0] ._format_mac_addr+0x54/0xd4 >> LR [c000000000543dcc] .sysfs_format_mac+0x30/0x6c >> Call Trace: >> [c0000000408b39c0] [c00000000010c9a4] >> .__alloc_pages_internal+0x1b8/0x590 (unreliable) >> [c0000000408b3a70] [c000000000543dcc] .sysfs_format_mac+0x30/0x6c >> [c0000000408b3b00] [c00000000053dde8] .show_address+0x50/0x88 >> [c0000000408b3b90] [c00000000045ead4] .dev_attr_show+0x4c/0x94 >> [c0000000408b3c20] [c0000000001bce48] .sysfs_read_file+0x10c/0x1d0 >> [c0000000408b3ce0] [c00000000014c9a0] .vfs_read+0xd0/0x1bc >> [c0000000408b3d80] [c00000000014cb94] .SyS_read+0x58/0xa0 >> [c0000000408b3e30] [c000000000008534] syscall_exit+0x0/0x40 >> Instruction dump: >> f8010010 ebc2cc58 f821ff51 7c7b1b78 7cd83378 7cbc2b78 7f432214 >> 7c7f1b78 >> 3ba00000 3b26ffff 48000044 e8be8000 <88dc0000> 3b9c0001 4be0ff89 >> 60000000 >> ---[ end trace 695e9dc0c5a9da30 ]--- >> >> attempt to access beyond end of device >> >> Next May 7 with same config boots fine. >> >> Thanks >> -Sachin >> >> -- >> >> --------------------------------- >> Sachin Sant >> IBM Linux Technology Center >> India Systems and Technology Labs >> Bangalore, India >> --------------------------------- >> ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PowerPC] Next May 8 boot failure: OOPS during ibmveth moduleinit 2009-05-08 22:33 ` Jiri Pirko @ 2009-05-11 17:07 ` Sachin Sant 2009-05-12 7:04 ` Stephen Rothwell 0 siblings, 1 reply; 10+ messages in thread From: Sachin Sant @ 2009-05-11 17:07 UTC (permalink / raw) To: Jiri Pirko; +Cc: David Miller, linux-next, linuxppc-dev, sfr, netdev Jiri Pirko wrote: > Fri, May 08, 2009 at 09:57:22PM CEST, davem@davemloft.net wrote: > >> From: Sachin Sant <sachinp@in.ibm.com> >> Date: Fri, 08 May 2009 18:22:48 +0530 >> >> >>> Todays Next failed to boot on a Power6 JS22 blade with following oops. >>> >> Jiri, I suspect this might be your address list changes. >> >> Although that's just a guess. But please take a look. >> > > Hmm, only thing I see might cause the problem would be if calling __hw_addr_add > in dev_addr_init fails, then dev->dev_addr would contain zeroes (which looks > this is not the case). But in this case the oops would appear earlier (in > ibmveth_probe dev_addr memcpy). > > Will do the patch which checks the result to behave correctly in case of oom > but this imho this wouldn't help. Strange, I will dig into this more tomorrow. > I can recreate this problem with today's Next release as well. Let me know if you want me to try out any thing to debug this problem. Thanks -Sachin -- --------------------------------- Sachin Sant IBM Linux Technology Center India Systems and Technology Labs Bangalore, India --------------------------------- ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PowerPC] Next May 8 boot failure: OOPS during ibmveth moduleinit 2009-05-11 17:07 ` [PowerPC] Next May 8 boot failure: OOPS during ibmveth moduleinit Sachin Sant @ 2009-05-12 7:04 ` Stephen Rothwell 2009-05-12 7:44 ` Stephen Rothwell 0 siblings, 1 reply; 10+ messages in thread From: Stephen Rothwell @ 2009-05-12 7:04 UTC (permalink / raw) To: Sachin Sant; +Cc: Jiri Pirko, David Miller, linux-next, linuxppc-dev, netdev [-- Attachment #1: Type: text/plain, Size: 1381 bytes --] Hi all, On Mon, 11 May 2009 22:37:09 +0530 Sachin Sant <sachinp@in.ibm.com> wrote: > > Jiri Pirko wrote: > > Fri, May 08, 2009 at 09:57:22PM CEST, davem@davemloft.net wrote: > > > >> From: Sachin Sant <sachinp@in.ibm.com> > >> Date: Fri, 08 May 2009 18:22:48 +0530 > >> > >> > >>> Todays Next failed to boot on a Power6 JS22 blade with following oops. > >>> > >> Jiri, I suspect this might be your address list changes. > >> > >> Although that's just a guess. But please take a look. > >> > > > > Hmm, only thing I see might cause the problem would be if calling __hw_addr_add > > in dev_addr_init fails, then dev->dev_addr would contain zeroes (which looks > > this is not the case). But in this case the oops would appear earlier (in > > ibmveth_probe dev_addr memcpy). > > > > Will do the patch which checks the result to behave correctly in case of oom > > but this imho this wouldn't help. Strange, I will dig into this more tomorrow. Bisected down to: commit f001fde5eadd915f4858d22ed70d7040f48767cf Author: Jiri Pirko <jpirko@redhat.com> Date: Tue May 5 02:48:28 2009 +0000 net: introduce a list of device addresses dev_addr_list (v6) I can reproduce this at will on the last three linux-next trees. -- Cheers, Stephen Rothwell sfr@canb.auug.org.au http://www.canb.auug.org.au/~sfr/ [-- Attachment #2: Type: application/pgp-signature, Size: 197 bytes --] ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PowerPC] Next May 8 boot failure: OOPS during ibmveth moduleinit 2009-05-12 7:04 ` Stephen Rothwell @ 2009-05-12 7:44 ` Stephen Rothwell 2009-05-12 8:16 ` Jiri Pirko ` (2 more replies) 0 siblings, 3 replies; 10+ messages in thread From: Stephen Rothwell @ 2009-05-12 7:44 UTC (permalink / raw) To: David Miller; +Cc: Sachin Sant, Jiri Pirko, linux-next, linuxppc-dev, netdev Hi Dave, This fixes it (I wonder if this bug is lurking in any other drivers): From: Stephen Rothwell <sfr@canb.auug.org.au> Date: Tue, 12 May 2009 17:24:02 +1000 Subject: [PATCH] net/ibmveth: fix panic in probe netdev->dev_addr changed from being an array to being a pointer, so we should not take its address for memcpy(). Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au> --- drivers/net/ibmveth.c | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/drivers/net/ibmveth.c b/drivers/net/ibmveth.c index 7902e5e..8daffad 100644 --- a/drivers/net/ibmveth.c +++ b/drivers/net/ibmveth.c @@ -1285,7 +1285,7 @@ static int __devinit ibmveth_probe(struct vio_dev *dev, const struct vio_device_ netdev->features |= NETIF_F_LLTX; spin_lock_init(&adapter->stats_lock); - memcpy(&netdev->dev_addr, &adapter->mac_addr, netdev->addr_len); + memcpy(netdev->dev_addr, &adapter->mac_addr, netdev->addr_len); for(i = 0; i<IbmVethNumBufferPools; i++) { struct kobject *kobj = &adapter->rx_buff_pool[i].kobj; -- 1.6.3 -- Cheers, Stephen Rothwell sfr@canb.auug.org.au http://www.canb.auug.org.au/~sfr/ ^ permalink raw reply related [flat|nested] 10+ messages in thread
* Re: [PowerPC] Next May 8 boot failure: OOPS during ibmveth moduleinit 2009-05-12 7:44 ` Stephen Rothwell @ 2009-05-12 8:16 ` Jiri Pirko 2009-05-12 8:17 ` Wei Yongjun 2009-05-13 6:39 ` Stephen Rothwell 2 siblings, 0 replies; 10+ messages in thread From: Jiri Pirko @ 2009-05-12 8:16 UTC (permalink / raw) To: Stephen Rothwell; +Cc: linuxppc-dev, linux-next, David Miller, netdev Tue, May 12, 2009 at 09:44:51AM CEST, sfr@canb.auug.org.au wrote: >Hi Dave, > >This fixes it (I wonder if this bug is lurking in any other drivers): Grepping the sources I see some other places suffering this problem. I'll send patch for this. > >From: Stephen Rothwell <sfr@canb.auug.org.au> >Date: Tue, 12 May 2009 17:24:02 +1000 >Subject: [PATCH] net/ibmveth: fix panic in probe > >netdev->dev_addr changed from being an array to being a pointer, so we >should not take its address for memcpy(). Nice! Signed-off-by: Jiri Pirko <jpirko@redhat.com> > >Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au> >--- > drivers/net/ibmveth.c | 2 +- > 1 files changed, 1 insertions(+), 1 deletions(-) > >diff --git a/drivers/net/ibmveth.c b/drivers/net/ibmveth.c >index 7902e5e..8daffad 100644 >--- a/drivers/net/ibmveth.c >+++ b/drivers/net/ibmveth.c >@@ -1285,7 +1285,7 @@ static int __devinit ibmveth_probe(struct vio_dev *dev, const struct vio_device_ > netdev->features |= NETIF_F_LLTX; > spin_lock_init(&adapter->stats_lock); > >- memcpy(&netdev->dev_addr, &adapter->mac_addr, netdev->addr_len); >+ memcpy(netdev->dev_addr, &adapter->mac_addr, netdev->addr_len); > > for(i = 0; i<IbmVethNumBufferPools; i++) { > struct kobject *kobj = &adapter->rx_buff_pool[i].kobj; >-- >1.6.3 > > >-- >Cheers, >Stephen Rothwell sfr@canb.auug.org.au >http://www.canb.auug.org.au/~sfr/ ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PowerPC] Next May 8 boot failure: OOPS during ibmveth moduleinit 2009-05-12 7:44 ` Stephen Rothwell 2009-05-12 8:16 ` Jiri Pirko @ 2009-05-12 8:17 ` Wei Yongjun 2009-05-13 6:39 ` Stephen Rothwell 2 siblings, 0 replies; 10+ messages in thread From: Wei Yongjun @ 2009-05-12 8:17 UTC (permalink / raw) To: Stephen Rothwell Cc: David Miller, Sachin Sant, Jiri Pirko, linux-next, linuxppc-dev, netdev Stephen Rothwell wrote: > Hi Dave, > > This fixes it (I wonder if this bug is lurking in any other drivers): > Yes, there are some other exists. This spatch script can help to found this. (http://www.emn.fr/x-info/coccinelle/) # cat netdev_dev_addr.cocci @@ struct net_device *dev; @@ memcpy( - &dev->dev_addr + dev->dev_addr , ...); @@ struct net_device *dev; expression E; @@ memcpy(E, - &dev->dev_addr + dev->dev_addr , ...); @@ expression E; @@ - &E->dev_addr + E->dev_addr > From: Stephen Rothwell <sfr@canb.auug.org.au> > Date: Tue, 12 May 2009 17:24:02 +1000 > Subject: [PATCH] net/ibmveth: fix panic in probe > > netdev->dev_addr changed from being an array to being a pointer, so we > should not take its address for memcpy(). > > Signed-off-by: Stephen Rothwell <sfr@canb.auug.org.au> > --- > drivers/net/ibmveth.c | 2 +- > 1 files changed, 1 insertions(+), 1 deletions(-) > > diff --git a/drivers/net/ibmveth.c b/drivers/net/ibmveth.c > index 7902e5e..8daffad 100644 > --- a/drivers/net/ibmveth.c > +++ b/drivers/net/ibmveth.c > @@ -1285,7 +1285,7 @@ static int __devinit ibmveth_probe(struct vio_dev *dev, const struct vio_device_ > netdev->features |= NETIF_F_LLTX; > spin_lock_init(&adapter->stats_lock); > > - memcpy(&netdev->dev_addr, &adapter->mac_addr, netdev->addr_len); > + memcpy(netdev->dev_addr, &adapter->mac_addr, netdev->addr_len); > > for(i = 0; i<IbmVethNumBufferPools; i++) { > struct kobject *kobj = &adapter->rx_buff_pool[i].kobj; > maybe this line should be fix too. @@ -1368,7 +1368,7 @@ static void ibmveth_proc_unregister_driv static int ibmveth_show(struct seq_file *seq, void *v) { struct ibmveth_adapter *adapter = seq->private; - char *current_mac = ((char*) &adapter->netdev->dev_addr); + char *current_mac = ((char*) adapter->netdev->dev_addr); char *firmware_mac = ((char*) &adapter->mac_addr) ; seq_printf(seq, "%s %s\n\n", ibmveth_driver_string, ibmveth_driver_version); ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [PowerPC] Next May 8 boot failure: OOPS during ibmveth moduleinit 2009-05-12 7:44 ` Stephen Rothwell 2009-05-12 8:16 ` Jiri Pirko 2009-05-12 8:17 ` Wei Yongjun @ 2009-05-13 6:39 ` Stephen Rothwell 2 siblings, 0 replies; 10+ messages in thread From: Stephen Rothwell @ 2009-05-13 6:39 UTC (permalink / raw) To: David Miller; +Cc: Sachin Sant, Jiri Pirko, linux-next, linuxppc-dev, netdev [-- Attachment #1: Type: text/plain, Size: 546 bytes --] Hi Dave, On Tue, 12 May 2009 17:44:51 +1000 Stephen Rothwell <sfr@canb.auug.org.au> wrote: > > From: Stephen Rothwell <sfr@canb.auug.org.au> > Date: Tue, 12 May 2009 17:24:02 +1000 > Subject: [PATCH] net/ibmveth: fix panic in probe > > netdev->dev_addr changed from being an array to being a pointer, so we > should not take its address for memcpy(). I have applied this to linux-next until you decide what to do with it. -- Cheers, Stephen Rothwell sfr@canb.auug.org.au http://www.canb.auug.org.au/~sfr/ [-- Attachment #2: Type: application/pgp-signature, Size: 197 bytes --] ^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2009-05-13 6:39 UTC | newest] Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2009-05-08 8:02 linux-next: Tree for May 8 Stephen Rothwell 2009-05-08 12:52 ` [PowerPC] Next May 8 boot failure: OOPS during ibmveth module init Sachin Sant 2009-05-08 19:57 ` David Miller 2009-05-08 22:33 ` Jiri Pirko 2009-05-11 17:07 ` [PowerPC] Next May 8 boot failure: OOPS during ibmveth moduleinit Sachin Sant 2009-05-12 7:04 ` Stephen Rothwell 2009-05-12 7:44 ` Stephen Rothwell 2009-05-12 8:16 ` Jiri Pirko 2009-05-12 8:17 ` Wei Yongjun 2009-05-13 6:39 ` Stephen Rothwell
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).