You are not logged in.
Hi,
for a few weeks now, i have a lots of crash when my laptop is on heavy load
(compiling, playing games (usually spring, etc)) not when watching movies (vlc))
heres the last lines of log before it crash
Jun 29 11:31:50 ZoidBox kernel: CE: hpet increasing min_delta_ns to 75936 nsec
i downgraded the nvidia driver from 185 to 173 after reading a post with similar symptoms
but it just crash again with
Jun 29 11:31:50 ZoidBox kernel: CE: hpet increasing min_delta_ns to 75936 nsec
Jun 29 11:45:17 ZoidBox kernel: ACPI Exception (evregion-0422): AE_TIME, Returned by Handler for [EmbeddedControl] [20090320]
Jun 29 11:45:17 ZoidBox kernel: ACPI Error (psparse-0537): Method parse/execution failed [\_TZ_.THR1._TMP] (Node f7013600), AE_TIME
any idea what could be the reason, or how to know it ![]()
thanks
Last edited by freakyzoidberg (2009-06-30 03:40:36)
Zoidberg
Offline
What does google say?
Try noapic.
"I'm Winston Wolfe. I solve problems."
~ Need moar games? [arch-games] ~ [aurcheck] AUR haz updates? ~
Offline
thanks will try this,
i let you know
edit:
i havent applied yet noapic at boot but here my last log before crash on spring rts
Jun 29 12:04:40 ZoidBox kernel: hda-intel: IRQ timing workaround is activated for card #0. Suggest a bigger bdl_pos_adj.
Jun 29 12:10:21 ZoidBox kernel: CE: hpet increasing min_delta_ns to 15000 nsec
Jun 29 12:12:08 ZoidBox kernel: CE: hpet increasing min_delta_ns to 22500 nsec
Jun 29 12:13:16 ZoidBox kernel: CE: hpet increasing min_delta_ns to 33750 nsec
Jun 29 12:13:43 ZoidBox kernel: CE: hpet increasing min_delta_ns to 50624 nsec
Jun 29 12:16:35 ZoidBox kernel: CE: hpet increasing min_delta_ns to 75936 nseczoidberg@ZoidBox ~ $ uname -vr
2.6.30-ARCH #1 SMP PREEMPT Fri Jun 19 21:25:17 UTC 2009Last edited by freakyzoidberg (2009-06-29 00:29:02)
Zoidberg
Offline
nop freezed again even with noapic in grub menu.lst
Zoidberg
Offline
Try adding hpet=disable in menu.lst...
Offline
heavy load + freeze ?
usually thats temperature related....
Offline
It could also be related to bad RAM. Do you have access to another stick of RAM (that you know to be good, of course) to test it?
R.
Offline
A bug report with some possible fixes:
https://bugs.launchpad.net/ubuntu/+sour … bug/267913
I got this message in my dmesg:
hda-intel: IRQ timing workaround is activated for card #0. Suggest a bigger bdl_pos_adj.
You (Nil) might try doing a "cat /proc/interrupts" at the console and see if snd-hda-intel is using the same interrupt as your SD card reader. If it is, you may be able to work around the SD card problem by adding the line "options snd-hda-intel enable_msi=1" to the bottom of your /etc/modprobe.d/alsa-base file.
I would like to add information that may shed light on this.
I too started getting this warning in the log after the upgrade from hardy to intrepidcould it have something to do with changing the time on the system?
For example, if I run hwclock --systohc + adjtimexconfig, I see the warning after a few seconds in the log. Here's the sequence:
Feb 22 01:07:45 myhost sudo: username : TTY=pts/9 ; PWD=/home/username ; USER=root
; COMMAND=/sbin/hwclock --systohc
Feb 22 01:07:47 myhost sudo: username : TTY=pts/9 ; PWD=/home/username ; USER=root
; COMMAND=/usr/sbin/adjtimexconfig
Feb 22 01:07:58 myhost kernel: [439231.900129] hda-intel: IRQ timing workaround is
activated for card #0. Suggest a bigger bdl_pos_adj.This makes me suspect that changing the hardware clock, causes the hda-intel driver to wrongly conclude that something might be wrong and activate the presumably inferior "IRQ timing workaround". A pity.
Offline
wow loads of answer:)
after setting noapic and disabling hpet, i didnt noticed any more freeze
but i wasnt able to read a simple divx with vlc too much frames drop.
so removed those.
the laptop has a few years and wasnt acting like that before (used debian on it during the 2 last years)
about the temperature
cpu is about 80 degrees when freezing and hddtemp says something laround 55-60 degrees
for the memory, i have a laptop which reduce the chance to find a stick quickly. but i ll follow your idea and will try a memcheck
cat /proc/interrupts says
CPU0 CPU1
0: 5596141 4652 IO-APIC-edge timer
1: 1929 0 IO-APIC-edge i8042
8: 445 0 IO-APIC-edge rtc0
9: 1563358 29192 IO-APIC-fasteoi acpi
12: 123 0 IO-APIC-edge i8042
14: 0 0 IO-APIC-edge ata_piix
15: 0 0 IO-APIC-edge ata_piix
16: 384113 19 IO-APIC-fasteoi uhci_hcd:usb5, ohci1394, nvidia
17: 256 0 IO-APIC-fasteoi mmc0
18: 0 0 IO-APIC-fasteoi uhci_hcd:usb4
19: 0 0 IO-APIC-fasteoi uhci_hcd:usb3
23: 161730 0 IO-APIC-fasteoi ehci_hcd:usb1, uhci_hcd:usb2
28: 61009 1 PCI-MSI-edge ahci
29: 391261 5 PCI-MSI-edge HDA Intel
30: 147922 40 PCI-MSI-edge eth0
31: 0 0 PCI-MSI-edge iwl3945
NMI: 0 0 Non-maskable interrupts
LOC: 1214800 3724442 Local timer interrupts
SPU: 0 0 Spurious interrupts
RES: 847061 1067025 Rescheduling interrupts
CAL: 429 2672 Function call interrupts
TLB: 57055 49347 TLB shootdowns
TRM: 0 0 Thermal event interrupts
ERR: 0
MIS: 0so i guess there is no IRQ mistake
what i dont understand is why an error about sound caused by a possible hdd overheat when compiling result in computer freeze ^^
i ll be away today, but will keep the post update later this week, if i find anything to fix or to narrow the problem
thanks all of you anyway for your time and answers.
Last edited by freakyzoidberg (2009-06-29 20:00:35)
Zoidberg
Offline
about the temperature
cpu is about 80 degrees when freezing and hddtemp says something laround 55-60 degrees
i suppose you mean Farenheit, otherwise, theres your problem: 80C is too hot.
Offline
nop celsius
CPU
40*C on idle, or coding/surfing
50*C while watching videos
70-75*C on 1080p videos
never saw more than 80
but looking at the spec of a core2duo it says it can go up to 90-95* without damages
what scares me the most is the hard drive which go up to 60*C but in the laptop there is no 'cooling corridor' towards the hdd
on idle the hdd is at 46*C i cleaned it like last week, could it be possible that it s system related like never caching r/w or anything ?
Zoidberg
Offline
Mmm ouch, install cpufreq and do your hardware a favor.
"I'm Winston Wolfe. I solve problems."
~ Need moar games? [arch-games] ~ [aurcheck] AUR haz updates? ~
Offline
already have ondemand :s
Zoidberg
Offline
I suppose ondemand is not the right one since it uses the max frequency when the system is on heavy load; try limiting the maximum frequency in /etc/conf.d/cpufreq or setting the 'userspace' governor.
Have you tried booting the kernel just disabling hpet?
"I'm Winston Wolfe. I solve problems."
~ Need moar games? [arch-games] ~ [aurcheck] AUR haz updates? ~
Offline
Your temps are ways to high. (all of them) Your CPU will not be damaged below the spec'ed temperatures, and somewhere before reaching that level, it will enventually simpy shutdown. even if that is not your problem, I have not seen a Core2Duo working reliably on temps higher than about 70°C. You might try mprime (http://aur.archlinux.org/packages.php?ID=6975). I have not used it on Linux, but pretty much on Windows for stability testing. If this shows any errors before your system freezes, your problem will be CPU/Ram-Overheating related. last but not least: As you mentioned, you are scared about your HDD-Temps, you really should be. I learned the hard way, that temperatures as high as that, will almost certainly result in a final headcrash. If there already are sectors damaged, you might have damaged binarys on your system, that cause the freeze. But i think this should result in a segfault and happen anytime the binary is accesed. Anyhow i have seen some bad things what dust can do to the cooling system of a notebook. You might try to open and clean it. (I recommend to google for an disassembly guide though
)
Offline
i m currently booting with :
kernel /boot/vmlinuz26 root=/dev/sda2 ro vga=791 irqpoll hpet=disableirqpoll because noapic was causing way too much perf decrease
and to fix this in my log
irq 11: nobody cared (try booting with the "irqpoll" option)i still have those stuff during boot and have already been reported by someone, and dont know if its related to my problem
Jun 30 14:50:14 ZoidBox load-modules.sh: 'acpi:PNP0C04:' is not a valid module or alias name
Jun 30 14:50:14 ZoidBox load-modules.sh: 'acpi:PNP0303:' is not a valid module or alias name
Jun 30 14:50:14 ZoidBox load-modules.sh: 'acpi:PNP0100:' is not a valid module or alias name
Jun 30 14:50:14 ZoidBox load-modules.sh: 'acpi:PNP0C02:' is not a valid module or alias name
Jun 30 14:50:14 ZoidBox load-modules.sh: 'acpi:PNP0C02:' is not a valid module or alias namei tried also only using hpet=disable but crashed again.
i set cpufreq on userspace and change /etc/conf.d/cpufreq max to 1.2GHz (real freq is 1.6Ghz)
Zoidberg
Offline
success to finish a party in spring
at the end temperature are as follow
/dev/sda: HTS541010G9SA00: 49°C
temperature: 52 Cthe begining of the end of my trouble ? ![]()
i guess the userspace + cpu hard limit did the trick
so clearly heat related
the only downside is that i now have a 1.2Ghz core 2 duo ![]()
Zoidberg
Offline
Well, you could always experiment with cooling options. I use one of those pads which you can place your laptop on, with fans running below.
Or maybe just a huge block of ice (wrapped in plastic, of course) =p
Allan-Volunteer on the (topic being discussed) mailn lists. You never get the people who matters attention on the forums.
jasonwryan-Installing Arch is a measure of your literacy. Maintaining Arch is a measure of your diligence. Contributing to Arch is a measure of your competence.
Griemak-Bleeding edge, not bleeding flat. Edge denotes falls will occur from time to time. Bring your own parachute.
Offline
is it possible to tweak sysctl to less use the harddrive ? i mean more caching or delay writting ?
could not find any suitable article on how doing that
after idling the cpu is at 34*C but the HDD remains at 45*C
Zoidberg
Offline
ok i feel quite like a fool now,
while looking at my rc.conf i noticed that laptop-mode was not set.
i honnestly think this is the reason why the laptop overheated (maybe helped by some dust of course
)
sorry guys if i make you loose your time, but thank you anyway for your help and support
glad to have join arch for a reactive community like you were on my problem ![]()
i ll mark this post as fix tomorrow if the temperature stay under an acceptable level.
Last edited by freakyzoidberg (2009-06-30 05:12:20)
Zoidberg
Offline
shouldnt there be anything related to laptop-mode and cpufreq in the begginers guide? its not too obvious people should set them up after install.
Offline
i had it at a time, and then removed it because of i dont remember what, and apparently forgot to came back to that issue
Zoidberg
Offline