
#1 2014-03-03 03:39:35

ackalker
Member
Registered: 2012-11-27
Posts: 201

Intermittent kernel BUG on boot, no display

I'm running Arch Linux x86_64 with linux 3.13.5-1 and nvidia-304xx 304.119-1. I am getting an intermittent kernel BUG on boot; some further kernel messages scroll by, and then the screen goes black.

Soft power-off still works, and after a reboot everything usually works fine.

Here's the relevant portion of the journal from a failing boot:

Mar 03 00:13:25 miki-desktop kernel: kernel BUG at drivers/cpufreq/cpufreq.c:1456!
Mar 03 00:13:25 miki-desktop kernel: invalid opcode: 0000 [#1] PREEMPT SMP 
Mar 03 00:13:25 miki-desktop kernel: Modules linked in: nvidia(PO+) snd_mpu401_uart snd_opl3_lib snd_hwdep gameport acpi_cpufreq(+) microcode(+) snd_rawmidi snd_pcm snd_page_alloc forcedeth(+) pcspkr processor i2c_nforce2 i2c_core shpchp button wmi usbip_host(C) usbip_core(C) snd_seq snd_seq_device snd_timer snd soundcore nfs lockd sunrpc fscache ext4 crc16 mbcache jbd2 usb_storage sr_mod sd_mod cdrom ata_generic pata_acpi ahci libahci libata scsi_mod ehci_pci ohci_pci ohci_hcd ehci_hcd usbcore usb_common
Mar 03 00:13:25 miki-desktop kernel: CPU: 0 PID: 149 Comm: systemd-udevd Tainted: P         C O 3.13.5-1-ARCH #1
Mar 03 00:13:25 miki-desktop kernel: Hardware name: FUJITSU SIEMENS MS-7504VP/MS-7504VP, BIOS V1.16 08/11/2008
Mar 03 00:13:25 miki-desktop kernel: task: ffff8800c34c4800 ti: ffff8800c34e2000 task.ti: ffff8800c34e2000
Mar 03 00:13:25 miki-desktop kernel: RIP: 0010:[<ffffffff813e9593>]  [<ffffffff813e9593>] cpufreq_get+0x93/0xa0
Mar 03 00:13:25 miki-desktop kernel: RSP: 0018:ffff8800c34e3c78  EFLAGS: 00010246
Mar 03 00:13:25 miki-desktop kernel: RAX: 00000000fffffffe RBX: 0000000000000000 RCX: 00000000bfebfbff
Mar 03 00:13:25 miki-desktop kernel: RDX: 0000000000000000 RSI: 0000000000000001 RDI: 0000000000000000
Mar 03 00:13:25 miki-desktop kernel: RBP: ffff8800c34e3c88 R08: ffff8800bf83afb8 R09: ffff8800bf83afbc
Mar 03 00:13:25 miki-desktop kernel: R10: 00000000000045ba R11: 0000000000000000 R12: 0000000000000000
Mar 03 00:13:25 miki-desktop kernel: R13: ffffffffa0e92b20 R14: ffffffffa0e92690 R15: 0000000000000001
Mar 03 00:13:25 miki-desktop kernel: FS:  00007f214fff0780(0000) GS:ffff88012fc00000(0000) knlGS:0000000000000000
Mar 03 00:13:25 miki-desktop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 03 00:13:25 miki-desktop kernel: CR2: 00007f214dc32000 CR3: 00000000c343c000 CR4: 00000000000407f0
Mar 03 00:13:25 miki-desktop kernel: Stack:
Mar 03 00:13:25 miki-desktop kernel:  ffff8800bf83afc8 ffff8800bff99008 ffff8800c34e3c98 ffffffffa0acd420
Mar 03 00:13:25 miki-desktop kernel:  ffff8800bf83afb8 ffffffffa0a9929b ffff8800bf83afc0 ffffffffa04f6497
Mar 03 00:13:25 miki-desktop kernel:  ffff8800c0700008 ffff8800c34e3d38 ffff8800bf838000 ffffffffa04c5c6e
Mar 03 00:13:25 miki-desktop kernel: Call Trace:
Mar 03 00:13:25 miki-desktop kernel:  [<ffffffffa0acd420>] os_get_cpu_frequency+0x10/0x20 [nvidia]
Mar 03 00:13:25 miki-desktop kernel:  [<ffffffffa0a9929b>] _nv014908rm+0x9/0x21 [nvidia]
Mar 03 00:13:25 miki-desktop kernel:  [<ffffffffa04f6497>] ? _nv001098rm+0x25b/0xbbc [nvidia]
Mar 03 00:13:25 miki-desktop kernel:  [<ffffffffa04c5c6e>] ? _nv001102rm+0x98/0xdb [nvidia]
Mar 03 00:13:25 miki-desktop kernel:  [<ffffffffa0aa3576>] ? rm_init_rm+0x24/0x7e [nvidia]
Mar 03 00:13:25 miki-desktop kernel:  [<ffffffffa0f401bb>] ? nvidia_init_module+0x1bb/0x707 [nvidia]
Mar 03 00:13:25 miki-desktop kernel:  [<ffffffffa0f40000>] ? 0xffffffffa0f3ffff
Mar 03 00:13:25 miki-desktop kernel:  [<ffffffff8100214a>] ? do_one_initcall+0xfa/0x1b0
Mar 03 00:13:25 miki-desktop kernel:  [<ffffffff81087d58>] ? __blocking_notifier_call_chain+0x58/0x70
Mar 03 00:13:25 miki-desktop kernel:  [<ffffffff810d890b>] ? load_module+0x1e2b/0x2570
Mar 03 00:13:25 miki-desktop kernel:  [<ffffffff810d5140>] ? store_uevent+0x40/0x40
Mar 03 00:13:25 miki-desktop kernel:  [<ffffffff810d90f2>] ? SyS_init_module+0xa2/0xf0
Mar 03 00:13:25 miki-desktop kernel:  [<ffffffff8152142d>] ? system_call_fastpath+0x1a/0x1f
Mar 03 00:13:25 miki-desktop kernel: Code: 00 00 4c 89 e7 e8 be f3 12 00 89 df e8 a7 e5 ff ff 4c 89 e7 89 c3 e8 6d f4 cb ff 48 c7 c7 a0 a4 89 81 e8 61 f4 cb ff 89 d8 eb 9f <0f> 0b 66 66 2e 0f 1f 84 00 00 00 00 00 66 66 66 66 90 55 48 85 
Mar 03 00:13:25 miki-desktop kernel: RIP  [<ffffffff813e9593>] cpufreq_get+0x93/0xa0
Mar 03 00:13:25 miki-desktop kernel:  RSP <ffff8800c34e3c78>

Graphics information from inxi:

$ inxi -G
Graphics:  Card: NVIDIA C73 [GeForce 7050 / nForce 630i] 
           X.Org: 1.15.0 drivers: nvidia (unloaded: fbdev,nv,vesa) Resolution: 1600x900@60.0hz 
           GLX Renderer: GeForce 7050 / nForce 630i/integrated/SSE2 GLX Version: 2.1.2 NVIDIA 304.119

The bug looks very similar to one reported on the Nvidia DevZone for the Nvidia 331.20 driver.
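For comparing a trace like the one above against other reports, here is a minimal sketch that pulls the BUG location, the faulting symbol, and the call-trace frames out of journal lines. The function name `summarize_oops` and the regular expressions are illustrative assumptions based only on the line format quoted in this post; other kernel versions may format an oops differently.

```python
import re

# Patterns derived from the log lines quoted in this post (assumption:
# other kernels may print these lines in a different layout).
BUG_RE = re.compile(r"kernel BUG at (\S+):(\d+)!")
RIP_RE = re.compile(r"RIP\b.*\]\s+([\w.]+)\+0x")
FRAME_RE = re.compile(
    r"\[<[0-9a-f]+>\] (?:\? )?([\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+(?: \[(\w+)\])?"
)


def summarize_oops(lines):
    """Collect the BUG location, faulting symbol, and call-trace frames."""
    summary = {"location": None, "rip": None, "frames": []}
    in_trace = False
    for line in lines:
        bug = BUG_RE.search(line)
        rip = RIP_RE.search(line)
        if bug:
            summary["location"] = (bug.group(1), int(bug.group(2)))
        elif rip:
            summary["rip"] = rip.group(1)
        elif "Call Trace:" in line:
            in_trace = True
        elif in_trace:
            frame = FRAME_RE.search(line)
            if frame:
                # The second item is the owning module, or None for core-kernel symbols.
                summary["frames"].append((frame.group(1), frame.group(2)))
    return summary


sample = [
    "Mar 03 00:13:25 miki-desktop kernel: kernel BUG at drivers/cpufreq/cpufreq.c:1456!",
    "Mar 03 00:13:25 miki-desktop kernel: RIP: 0010:[<ffffffff813e9593>]  [<ffffffff813e9593>] cpufreq_get+0x93/0xa0",
    "Mar 03 00:13:25 miki-desktop kernel: Call Trace:",
    "Mar 03 00:13:25 miki-desktop kernel:  [<ffffffffa0acd420>] os_get_cpu_frequency+0x10/0x20 [nvidia]",
    "Mar 03 00:13:25 miki-desktop kernel:  [<ffffffff8100214a>] ? do_one_initcall+0xfa/0x1b0",
]
summary = summarize_oops(sample)
print(summary)
```

The lines could come from something like `journalctl -k -b -1` saved to a file; the sample list here just reuses a few lines from the trace above.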

Last edited by ackalker (2014-03-03 03:54:47)

#2 2014-03-03 13:17:34

Rexilion
Member
Registered: 2013-12-23
Posts: 784

Re: Intermittent kernel BUG on boot, no display

The post you mentioned indicates it's a bug (probably a race condition) in the cpufreq code, which could be triggered by the nvidia module.

See if you can reproduce this without the nvidia module. If you can, it's a genuine kernel bug and you should report it upstream. If you cannot, you are out of luck, because the kernel developers are not going to help you if you use proprietary modules.

Another 'solution' would be to disable cpu frequency scaling in the BIOS.
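As a side note on the race-condition idea: in the "Modules linked in:" line of the trace, the letters in parentheses are per-module flags (P = proprietary, O = out-of-tree, C = staging, + = being loaded, - = being unloaded). Both nvidia and acpi_cpufreq carry a "+" there, which fits a load-time race but does not prove one. A rough sketch for listing the flagged modules follows; `flagged_modules` and `FLAG_MEANINGS` are made-up names for illustration, and newer kernels may print flags this table does not cover.

```python
import re

# Flags the kernel prints after a module name (assumption: newer kernels
# may add letters that are not listed here; those show up as "unknown").
FLAG_MEANINGS = {
    "P": "proprietary",
    "O": "out-of-tree",
    "C": "staging",
    "+": "being loaded",
    "-": "being unloaded",
}

MODULE_RE = re.compile(r"(\w+)(?:\(([^)]*)\))?")


def flagged_modules(line):
    """Map each flagged module in a 'Modules linked in:' line to its flags."""
    _, _, tail = line.partition("Modules linked in:")
    result = {}
    for name, flags in MODULE_RE.findall(tail):
        if flags:
            result[name] = [FLAG_MEANINGS.get(f, "unknown") for f in flags]
    return result


line = (
    "kernel: Modules linked in: nvidia(PO+) snd_hwdep acpi_cpufreq(+) "
    "microcode(+) forcedeth(+) processor usbip_host(C)"
)
flags = flagged_modules(line)
for name, meanings in flags.items():
    print(name, meanings)
```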


fs/super.c : "Self-destruct in 5 seconds.  Have a nice day...\n",

#3 2014-03-04 02:08:00

ackalker
Member
Registered: 2012-11-27
Posts: 201

Re: Intermittent kernel BUG on boot, no display

Yes, I know: "tainted" kernel and all. I hope to switch to the nouveau driver some day, when it is stable on my system.
I've read some older reports about similar bugs and fixes for older kernel versions, and was hoping that most had been fixed by now :)
I will see whether the problem persists or is solved in a newer kernel coming our way.

Last edited by ackalker (2014-03-04 02:10:53)

#4 2014-03-04 22:54:41

Rexilion
Member
Registered: 2013-12-23
Posts: 784

Re: Intermittent kernel BUG on boot, no display

It's too soon to point at the nvidia module. It causes a lot of havoc, but that does not imply it is the cause of every problem.


#5 2014-03-04 23:11:22

aaronp24
Member
Registered: 2014-03-04
Posts: 4

Re: Intermittent kernel BUG on boot, no display

I just debugged this same problem.  Does this patch help for you? https://patchwork.kernel.org/patch/3767101/

#6 2014-03-17 17:00:33

ackalker
Member
Registered: 2012-11-27
Posts: 201

Re: Intermittent kernel BUG on boot, no display

Thanks for your reply :)

I haven't had time yet to investigate. Has your patch been accepted upstream?

#7 2014-03-17 22:08:25

aaronp24
Member
Registered: 2014-03-04
Posts: 4

Re: Intermittent kernel BUG on boot, no display

Yeah, it ended up here: http://git.kernel.org/linus/999976e0f623.  It should be in v3.14-rc6 and up.

Last edited by aaronp24 (2014-03-17 22:08:49)

#8 2014-03-24 17:15:13

ackalker
Member
Registered: 2012-11-27
Posts: 201

Re: Intermittent kernel BUG on boot, no display

Thanks for landing this patch :-)

#9 2014-03-24 17:58:24

aaronp24
Member
Registered: 2014-03-04
Posts: 4

Re: Intermittent kernel BUG on boot, no display

Looks like it ended up in 3.13.7 too.
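To check whether a given kernel should already carry the fix, here is a small sketch that encodes only what this thread states: 3.13.7 and later in the 3.13 series, and v3.14-rc6 and up. The helper name `has_cpufreq_fix` is hypothetical, and it returns None for older series because nothing here says whether they were affected or patched. Distribution kernels may also backport fixes, so treat the answer as a hint rather than proof.

```python
import re


def has_cpufreq_fix(release):
    """Return True/False where this thread settles it, otherwise None."""
    m = re.match(r"(\d+)\.(\d+)(?:\.(\d+))?(?:-rc(\d+))?", release)
    if not m:
        return None
    major, minor = int(m.group(1)), int(m.group(2))
    patch = int(m.group(3) or 0)
    rc = int(m.group(4)) if m.group(4) else None
    if (major, minor) == (3, 13):
        return patch >= 7        # "ended up in 3.13.7 too"
    if (major, minor) == (3, 14) and rc is not None:
        return rc >= 6           # "v3.14-rc6 and up"
    if (major, minor) >= (3, 14):
        return True              # final 3.14 and anything newer
    return None                  # older series: not covered by this thread


for release in ("3.13.5-1-ARCH", "3.13.7-1-ARCH", "3.14-rc5", "3.14.0-rc6", "3.12.9-1"):
    print(release, has_cpufreq_fix(release))
```

On a running system the release string can be taken from `uname -r` or Python's `platform.release()`.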
