#1 2013-12-16 11:06:29

eazar001
Member
Registered: 2013-07-07
Posts: 12

[Solved] Kernel BUG at include/linux/netdevice.h:501, and sky2?

Ever since some point between kernel 3.11.x and now (3.12.5-1), I have been getting a strangely inconsistent error on boot. I would estimate that about 3 out of every 4 reboots hit this error, and the boot process ends in a black screen and what appears to be a system hang. However, I can still press Ctrl+Alt+F2, log in, and dump dmesg to a boot log, which is posted below. The remaining 25% of my reboots get through and boot straight to my window manager.

The output seems to be screaming "sky2 is the culprit", since EIP points into sky2_setup_irq at the time of the error. Still, I am more inclined to believe that my hardware's interaction with the driver is responsible: if sky2 itself were broken, I doubt I would be the only one complaining about it. Nonetheless, I am quite vexed and would greatly appreciate anybody's help here. Some other possibly useful information:

Motherboard: Asus P5K-E (with suspect Ethernet controller built in)
Ethernet Controller: Marvell Technology Group Ltd. 88E8056 PCI-E Gigabit Ethernet Controller (rev 12)
Kernel: 3.12.5-1
CPU: Dual core 2.33 GHz
RAM: 2GB (recently has not passed a memtest)

[    9.699863] sky2: driver version 1.30
[    9.699960] sky2 0000:02:00.0: Yukon-2 EC Ultra chip revision 2
[    9.700020] sky2 0000:02:00.0: irq 45 for MSI/MSI-X
[    9.700498] ------------[ cut here ]------------
[    9.703391] kernel BUG at include/linux/netdevice.h:501!
[    9.703391] invalid opcode: 0000 [#1] PREEMPT SMP 
[    9.703391] Modules linked in: sky2(+) i2c_core snd_hda_codec_analog acpi_cpufreq processor button snd_hda_intel(+) snd_hda_codec snd_hwdep snd_pcm snd_page_alloc snd_timer snd soundcore shpchp intel_agp asus_atk0110 intel_gtt agpgart ext4 crc16 mbcache jbd2 sd_mod hid_logitech ff_memless usbhid hid firewire_ohci ahci libahci firewire_core crc_itu_t libata scsi_mod ehci_pci uhci_hcd ehci_hcd usbcore usb_common
[    9.703391] CPU: 0 PID: 155 Comm: dhcpcd Not tainted 3.12.5-1-ARCH #1
[    9.703391] Hardware name: System manufacturer P5K-E/P5K-E, BIOS 0906    12/10/2007
[    9.703391] task: f562f4d0 ti: f50fa000 task.ti: f50fa000
[    9.703391] EIP: 0060:[<fa99372f>] EFLAGS: 00010246 CPU: 0
[    9.703391] EIP is at sky2_setup_irq+0x9f/0xb0 [sky2]
[    9.703391] EAX: 00000000 EBX: f51c8100 ECX: 00000089 EDX: 00000000
[    9.703391] ESI: f5dfb000 EDI: 00000000 EBP: f50fbdf0 ESP: f50fbdd8
[    9.703391]  DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
[    9.830016] sky2 0000:02:00.0 eth0: addr 00:1b:fc:61:1c:91
[    9.830018] CR0: 80050033 CR2: f9133760 CR3: 356f4000 CR4: 000007d0
[    9.830018] Stack:
[    9.830018]  00000000 f51b1000 f51c8100 f51b1000 f51c8100 00000000 f50fbe2c fa996169
[    9.830018]  00000020 00000000 c1007210 00000400 f51b15b4 00000000 f50fbe1c c1007210
[    9.830018]  f51b1500 000000e0 f51b1000 fa999f40 00000000 f50fbe50 c135ed4b 00000246
[    9.830018] Call Trace:
[    9.830018]  [<fa996169>] sky2_open+0x309/0x360 [sky2]
[    9.830018]  [<c1007210>] ? via_no_dac+0x40/0x40
[    9.830018]  [<c1007210>] ? via_no_dac+0x40/0x40
[    9.830018]  [<c135ed4b>] __dev_open+0x9b/0x120
[    9.830018]  [<c1431cbe>] ? _raw_spin_unlock_bh+0x1e/0x20
[    9.830018]  [<c135efd9>] __dev_change_flags+0x89/0x150
[    9.830018]  [<c135f148>] dev_change_flags+0x18/0x50
[    9.830018]  [<c13bb8e0>] devinet_ioctl+0x5d0/0x6e0
[    9.830018]  [<c13bcced>] inet_ioctl+0x6d/0xa0
[    9.830018]  [<c1346094>] sock_ioctl+0x114/0x250
[    9.830018]  [<c1345f80>] ? sock_fasync+0x80/0x80
[    9.830018]  [<c1166416>] do_vfs_ioctl+0x2e6/0x4e0
[    9.830018]  [<c1435127>] ? __do_page_fault+0x2e7/0x580
[    9.830018]  [<c11576f6>] ? __fput+0x106/0x1f0
[    9.830018]  [<c116668f>] SyS_ioctl+0x7f/0x90
[    9.830018]  [<c143874d>] sysenter_do_call+0x12/0x28
[    9.830018] Code: 8d 74 26 00 8b 86 d8 01 00 00 83 c6 64 89 34 24 c7 44 24 04 91 97 99 fa 89 44 24 08 e8 fb d1 95 c6 89 f8 83 c4 0c 5b 5e 5f 5d c3 <0f> 0b eb 0d 90 90 90 90 90 90 90 90 90 90 90 90 90 55 89 e5 53
[    9.830018] EIP: [<fa99372f>] sky2_setup_irq+0x9f/0xb0 [sky2] SS:ESP 0068:f50fbdd8
[   10.114977] ---[ end trace 6853181d11e5e45a ]---
[   10.121597] [drm] Initialized drm 1.1.0 20060810
[   10.598013] nvidia: module license 'NVIDIA' taints kernel.
[   11.366133] ACPI Warning: 0x00000400-0x0000041f SystemIO conflicts with Region \SMRG 1 (20130725/utaddress-251)
[   11.372053] input: PC Speaker as /devices/platform/pcspkr/input/input15
[   11.389363] ACPI: If an ACPI driver is available for this device, you should use it instead of the native driver
[   11.436774] input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input18
[   11.449056] input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input17
[   11.461240] input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input16
[   11.473379] input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:01.0/0000:01:00.1/sound/card1/input14
[   11.485835] vgaarb: device changed decodes: PCI:0000:01:00.0,olddecodes=io+mem,decodes=none:owns=none
[   11.498462] [drm] Initialized nvidia-drm 0.0.0 20130102 for 0000:01:00.0 on minor 0
[   11.510829] NVRM: loading NVIDIA UNIX x86 Kernel Module  331.20  Wed Oct 30 16:59:55 PDT 2013
[   11.523659] iTCO_vendor_support: vendor-support=0
[   11.539930] gpio_ich: GPIO from 195 to 255 on gpio_ich
[   11.714329] microcode: CPU0 sig=0x6fb, pf=0x1, revision=0xb6
[   11.727418] iTCO_wdt: Intel TCO WatchDog Timer Driver v1.10
[   11.740101] iTCO_wdt: Found a ICH9R TCO device (Version=2, TCOBASE=0x0860)
[   11.753434] iTCO_wdt: initialized. heartbeat=30 sec (nowayout=0)
[   11.767642] microcode: CPU1 sig=0x6fb, pf=0x1, revision=0xb6
[   11.781056] microcode: Microcode Update Driver: v2.00 <tigran@aivazian.fsnet.co.uk>, Peter Oruba
[   12.042064] coretemp coretemp.0: Using relative temperature scale!
[   12.055598] coretemp coretemp.0: Using relative temperature scale!
[   14.944832] systemd-journald[110]: File /var/log/journal/2e32c4787d3945018666d6257394d8b8/user-1000.journal corrupted or uncleanly shut down, renaming and replacing.
[   18.610023] nvidia 0000:01:00.0: irq 46 for MSI/MSI-X
[   19.320584] NVRM: Your system is not currently configured to drive a VGA console
[   19.320590] NVRM: on the primary VGA device. The NVIDIA Linux graphics driver
[   19.320593] NVRM: requires the use of a text-mode VGA console. Use of other console
[   19.320595] NVRM: drivers including, but not limited to, vesafb, may result in
[   19.320597] NVRM: corruption and stability problems, and is not supported.
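
For anyone triaging a similar log, here is a minimal sketch of how the two key lines (the "kernel BUG at" line and the "EIP is at" line) could be pulled out of saved dmesg output. The function name summarize_oops and the regular expressions are my own assumptions, written against the 32-bit log format shown above; other architectures report RIP or a different layout and would need different patterns.

```python
import re

# Patterns are assumptions based on the log format in this post (32-bit x86, 3.12 kernel).
BUG_RE = re.compile(r"kernel BUG at (?P<file>\S+):(?P<line>\d+)!")
EIP_RE = re.compile(
    r"EIP is at (?P<func>\w+)\+0x[0-9a-f]+/0x[0-9a-f]+(?: \[(?P<module>\w+)\])?"
)


def summarize_oops(dmesg_text):
    """Return (source_file, line_number, function, module) for the first BUG, or None."""
    bug = BUG_RE.search(dmesg_text)
    if bug is None:
        return None
    # Look for the faulting function only after the BUG line.
    eip = EIP_RE.search(dmesg_text, bug.end())
    func = eip.group("func") if eip else None
    module = eip.group("module") if eip else None
    return bug.group("file"), int(bug.group("line")), func, module


# Two lines copied from the log above.
sample = """\
[    9.703391] kernel BUG at include/linux/netdevice.h:501!
[    9.703391] EIP is at sky2_setup_irq+0x9f/0xb0 [sky2]
"""

print(summarize_oops(sample))
```

On the two sample lines this reports the header file and line of the BUG check together with the function and module that EIP points into, which is the same information I used to suspect sky2.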

Last edited by eazar001 (2014-01-24 22:02:50)

#2 2013-12-16 22:21:16

eazar001
Member
Registered: 2013-07-07
Posts: 12

Re: [Solved] Kernel BUG at include/linux/netdevice.h:501, and sky2?

Oh, and I forgot to mention: when I blacklist the sky2 module, everything seems to work perfectly, with no kernel bugs or anything. However, this is obviously not a long-term solution.

#3 2013-12-17 01:13:21

eazar001
Member
Registered: 2013-07-07
Posts: 12

Re: [Solved] Kernel BUG at include/linux/netdevice.h:501, and sky2?

UPDATE: I ran some low-level troubleshooting and decided to test the linux-lts (3.10.24-1-lts) kernel. No problems at all. Everything works fine with my Ethernet adapter and the sky2 module. It appears that something changed in the newer kernels that breaks the driver for at least one Marvell adapter. Running the LTS kernel works for now, but I suppose I should file a bug report eventually. In the meantime, any insight is welcome.

#4 2014-01-24 22:02:26

eazar001
Member
Registered: 2013-07-07
Posts: 12

Re: [Solved] Kernel BUG at include/linux/netdevice.h:501, and sky2?

My issue was finally resolved, thanks to the help of a nice chap on kernel.org. Please see this link for the included patch for this module version: https://bugzilla.kernel.org/show_bug.cgi?id=67151
