You are not logged in.
$ git bisect good
Bisecting: 202 revisions left to test after this (roughly 8 steps)
[7b17fbfe4fd6b8da85b345d819cdce6ed4ee1d24] media: i2c: alvium: Move V4L2_CID_GAIN to V4L2_CID_ANALOG_GAIN
$ git describe
v6.10.2-607-g7b17fbfe4fd6
linux-stable-6.10.2.r607.g7b17fbfe4fd6-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r607.g7b17fbfe4fd6-1-x86_64.pkg.tar.zst.
Offline
Cheers Gnat. Ye I don't use arch much for past decade ; I forgot about the archive so indeed there are some downgradable versions. That's good.
I checked out the version I wanted from the ABS and built it. took 24mins whilst I was out somewhere and ye so far so good. Although from that first reboot I have ever seen like that, there was about 2 days almost from the recent kenrel update.. so I do not yet know if it's a kernel issue (or if it's ryzen only seeing this, a kernel induced hw bug). Either way, fingers crossed and gl.
Last edited by pliny (2024-08-10 21:38:16)
Offline
$ git bisect good Bisecting: 202 revisions left to test after this (roughly 8 steps) [7b17fbfe4fd6b8da85b345d819cdce6ed4ee1d24] media: i2c: alvium: Move V4L2_CID_GAIN to V4L2_CID_ANALOG_GAIN $ git describe v6.10.2-607-g7b17fbfe4fd6
linux-stable-6.10.2.r607.g7b17fbfe4fd6-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r607.g7b17fbfe4fd6-1-x86_64.pkg.tar.zst.
Seems stable
Offline
$ git bisect good
Bisecting: 101 revisions left to test after this (roughly 7 steps)
[60609323f13288b4d39d96080c3c58d06a43ccef] ASoC: codecs: wcd939x: Fix typec mux and switch leak during device removal
$ git describe
v6.10.2-708-g60609323f132
linux-stable-6.10.2.r708.g60609323f132-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r708.g60609323f132-1-x86_64.pkg.tar.zst
Offline
$ git bisect good Bisecting: 101 revisions left to test after this (roughly 7 steps) [60609323f13288b4d39d96080c3c58d06a43ccef] ASoC: codecs: wcd939x: Fix typec mux and switch leak during device removal $ git describe v6.10.2-708-g60609323f132
linux-stable-6.10.2.r708.g60609323f132-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r708.g60609323f132-1-x86_64.pkg.tar.zst
Crash
Last edited by Gnat (2024-08-11 05:13:07)
Offline
$ git bisect bad
Bisecting: 50 revisions left to test after this (roughly 6 steps)
[154d33dc8de81a9f61fa8f4d2004f0a2f2d829fe] ubi: eba: properly rollback inside self_check_eba
$ git describe
v6.10.2-657-g154d33dc8de8
linux-stable-6.10.2.r657.g154d33dc8de8-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r657.g154d33dc8de8-1-x86_64.pkg.tar.zst
Offline
$ git bisect bad Bisecting: 50 revisions left to test after this (roughly 6 steps) [154d33dc8de81a9f61fa8f4d2004f0a2f2d829fe] ubi: eba: properly rollback inside self_check_eba $ git describe v6.10.2-657-g154d33dc8de8
linux-stable-6.10.2.r657.g154d33dc8de8-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r657.g154d33dc8de8-1-x86_64.pkg.tar.zst
stable
Offline
$ git bisect good
Bisecting: 25 revisions left to test after this (roughly 5 steps)
[8192c533e89d9fb69b2490398939236b78cda79b] scsi: qla2xxx: Fix for possible memory corruption
$ git describe
v6.10.2-682-g8192c533e89d
linux-stable-6.10.2.r682.g8192c533e89d-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r682.g8192c533e89d-1-x86_64.pkg.tar.zst
Offline
$ git bisect good Bisecting: 25 revisions left to test after this (roughly 5 steps) [8192c533e89d9fb69b2490398939236b78cda79b] scsi: qla2xxx: Fix for possible memory corruption $ git describe v6.10.2-682-g8192c533e89d
linux-stable-6.10.2.r682.g8192c533e89d-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r682.g8192c533e89d-1-x86_64.pkg.tar.zst
stable
Offline
$ git bisect good
Bisecting: 12 revisions left to test after this (roughly 4 steps)
[1a802eaa152b8380070a65e1e11710d880a99291] drm/i915/gt: Do not consider preemption during execlists_dequeue for gen8
$ git describe
v6.10.2-695-g1a802eaa152b
linux-stable-6.10.2.r695.g1a802eaa152b-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r695.g1a802eaa152b-1-x86_64.pkg.tar.zst
Offline
$ git bisect good Bisecting: 12 revisions left to test after this (roughly 4 steps) [1a802eaa152b8380070a65e1e11710d880a99291] drm/i915/gt: Do not consider preemption during execlists_dequeue for gen8 $ git describe v6.10.2-695-g1a802eaa152b
linux-stable-6.10.2.r695.g1a802eaa152b-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r695.g1a802eaa152b-1-x86_64.pkg.tar.zst
stable
Offline
$ git bisect good
Bisecting: 6 revisions left to test after this (roughly 3 steps)
[9f33d44ab5ef592f2b4175d7fc87dc44291e2832] drm/amd/amdgpu: Fix uninitialized variable warnings
$ git describe
v6.10.2-701-g9f33d44ab5ef
linux-stable-6.10.2.r701.g9f33d44ab5ef-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r701.g9f33d44ab5ef-1-x86_64.pkg.tar.zst.
Offline
After installing 6.10.4 and seeing the problem get slightly worse, I downgraded as suggested in post #49, to 6.10.2 and my system is behaving normally again. No random reboots in the last several hours.
Offline
$ git bisect good Bisecting: 6 revisions left to test after this (roughly 3 steps) [9f33d44ab5ef592f2b4175d7fc87dc44291e2832] drm/amd/amdgpu: Fix uninitialized variable warnings $ git describe v6.10.2-701-g9f33d44ab5ef
linux-stable-6.10.2.r701.g9f33d44ab5ef-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r701.g9f33d44ab5ef-1-x86_64.pkg.tar.zst.
crash
Offline
drm/amd/amdgpu: Fix uninitialized variable warnings
drm/amdgpu: add missed harvest check for VCN IP v4/v5
drm/amdgpu: reset vm state machine after gpu reset(vram lost)
drm/dp_mst: Fix all mstb marked as not probed after suspend/resume
drm/udl: Remove DRM_CONNECTOR_POLL_HPD
drm/amdgpu/sdma5.2: Update wptr registers as well as doorbell
https://github.com/torvalds/linux/commi … ca37debe9f looks prone to cause a sudden power draw towards the GPU, you'd have traded in a hanging GPU for an undervolted CPU if the overall supply is tight
Online
Great, so instead of a hanging GPU (seeminly not causing much issues) many amd users with integrated gpu (even if not using the igpu at all) now have cpu crashing issues?
This workaround seems to do more harm than good...
Last edited by Gnat (2024-08-13 08:21:13)
Offline
So i am following this thread since a couple of days (when i started to noticing reboot problems) i did not have yet downgraded to the correct 6.10.2, i was hoping that 6.10.4 could've solved the issue but it seems it is not the case. I believe i did not understand what the issue is completely, so the problem seems to be how manufacturers handle power management for amd cpus/gpus? And the problem of system reboots or gpu hanging is just a consequence of that particular kernel on those "bad power supplied" cpu/gpus ?
I'd like to learn more about the topic, just out of curiosity.
Also, i have amd microcode installed on my system i don't know if i can do anything more to mitigate the problem.
Last edited by neosnakex34 (2024-08-13 09:00:48)
Offline
Kernel 6.10.4 and mesa 24.1.5-2 still crashes?
Excuse my poor English.
Offline
nb that i was Just theorizing in culprit and explanation.
We'll still have top confirm the cause (though the bisection hast now narrowed to the GPU and the other commits don't Look nearly as suspicious)
If this pans Out to be the Offending commit, that's actually "good" because the Problem isn't Just the outfall of some actual fix/Feature Implementation but rather unfortunate target shifting.
Online
Yes, they updated to 6.10.4 but had a sudden reboot still. So back to 6.9.10 which I built locally and it's all good as before.
So keep bisecting I guess
For those who don't wish to wait, downgrade to say 6.9.10 and pin linux till it's fixed in pacman.conf
Last edited by pliny (2024-08-13 10:34:33)
Offline
For those who don't wish to wait, downgrade to say 6.9.10 and pin linux till it's fixed in pacman.conf
It's not preferred to downgrade that low. 6.10.2 is the latest kernel that doesn't have the issue. Just use the
downgrade
utility, since it's much faster and easier.
Offline
$ git bisect bad
Bisecting: 2 revisions left to test after this (roughly 2 steps)
[972dd51f1857b1da0aeedcfbe4a32e9aea743542] drm/dp_mst: Fix all mstb marked as not probed after suspend/resume
$ git describe
v6.10.2-698-g972dd51f1857
linux-stable-6.10.2.r698.g972dd51f1857-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r698.g972dd51f1857-1-x86_64.pkg.tar.zst
Offline
$ git bisect bad Bisecting: 2 revisions left to test after this (roughly 2 steps) [972dd51f1857b1da0aeedcfbe4a32e9aea743542] drm/dp_mst: Fix all mstb marked as not probed after suspend/resume $ git describe v6.10.2-698-g972dd51f1857
linux-stable-6.10.2.r698.g972dd51f1857-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r698.g972dd51f1857-1-x86_64.pkg.tar.zst
crash
Offline
$ git bisect bad
Bisecting: 0 revisions left to test after this (roughly 1 step)
[60887a89986f0b762e0916acd55068d525ff63fb] drm/udl: Remove DRM_CONNECTOR_POLL_HPD
$ git describe
v6.10.2-697-g60887a89986f
linux-stable-6.10.2.r697.g60887a89986f-1-x86_64.pkg.tar.zst/linux-stable-headers-6.10.2.r697.g60887a89986f-1-x86_64.pkg.tar.zst.
Offline
After installing 6.10.4 and seeing the problem get slightly worse, I downgraded as suggested in post #49, to 6.10.2 and my system is behaving normally again. No random reboots in the last several hours.
Scratch this. 6.10.2 also has the issue though for me it's not nearly as frequent. I had another random reboot similar from when I went from 6.9 to 6.10.3.
`ooo/ OS: Arch Linux x86_64
`+oooo: Host: A7
`+oooooo: Kernel: 6.10.2-arch1-2
-+oooooo+: Uptime: 3 mins
`/:-:++oooo+: Packages: 1104 (pacman), 7 (flatpak)
`/++++/+++++++: Shell: bash 5.2.32
`/++++++++++++++: Resolution: 3840x2160
`/+++ooooooooooooo/` DE: GNOME 46.4
./ooosssso++osssssso+` WM: Mutter
.oossssso-````/ossssss+` WM Theme: Adwaita
-osssssso. :ssssssso. Theme: Adwaita-dark [GTK2/3]
:osssssss/ osssso+++. Icons: Adwaita [GTK2/3]
/ossssssss/ +ssssooo/- Terminal: gnome-terminal
`/ossssso+/:- -:/+osssso+- CPU: AMD Ryzen 9 7940HS w/ Radeon 780M Graphics (16) @ 5.263GHz
`+sso+:-` `.-/+oso: GPU: AMD ATI 65:00.0 Phoenix1
`++:. `-/+/ Memory: 8344MiB / 31377MiB
This is a minicomputer and not a laptop as might be indicated by the mobile graphics.
Last edited by dobie2564 (2024-08-13 19:33:47)
Offline