You are not logged in.

#1 2020-02-10 12:15:46

Dennis
Member
Registered: 2014-11-04
Posts: 56
Website

[workaround] NTFS parts randomly "drop out" of running system

I am experiencing issues with NTFS partitions seemingly randomly becoming inaccessible (relevant dmesg output at the bottom). I do not know what is triggering it. Already checked all partitions for errors in Windows 10.

After the "drop out" it is impossible to access anything on the affected partition(until reboot, then it just works again, though rebooting requires hard-reset because shutdown indefinitely waits for an unmount which never succeeds).

It is also impossible to unmount, neither regular unmount nor lazy unmount have an effect, remounting also fails. It happens to different partitions not always the same one and it also happens on both rotational disks and SSDs. It never affects my non-NTFS partitions though.

lsa -al shows an affected partition like this (C is affected here):

rwxrwxrwx 11 dennis dennis 4.0K 16. Mai 2019  .
drwxrwxrwx 18 dennis dennis 4.0K 25. Dez 11:26 ..
drwxrwxrwx  1 dennis dennis 8.0K  7. Feb 18:46 <redacted A>
drwxrwxrwx  1 dennis dennis  48K  7. Feb 18:46 <redacted B>
d?????????  ? ?      ?         ?             ? <redacted C>

mount options from fstab:

UUID=<redacted UA>	<redacted A>	ntfs-3g	uid=dennis,gid=dennis,rw,user,exec,umask=000,allow_other,big_writes 0 0
UUID=<redacted UB>	<redacted B>	ntfs-3g	uid=dennis,gid=dennis,rw,user,exec,umask=000,allow_other,big_writes 0 0
UUID=<redacted UC>	<redacted C>	ntfs-3g	uid=dennis,gid=dennis,rw,user,exec,umask=000,allow_other,big_writes 0 0

I also already tried with and without the "big_writes" mount option. It happens in both cases.

Where could I look for further info? Any ideas what could be causing this or how to fix it without rebooting?

relevant dmesg output:

[12856.391775] general protection fault: 0000 [#1] PREEMPT SMP PTI
[12856.391787] CPU: 0 PID: 446 Comm: mount.ntfs-3g Tainted: P           OE     5.5.2-1-ck-skylake #1
[12856.391791] Hardware name: Gigabyte Technology Co., Ltd. Z170-HD3P/Z170-HD3P-CF, BIOS F22g 03/09/2018
[12856.391802] RIP: 0010:__x86_indirect_thunk_rax+0x10/0x20
[12856.391808] Code: 89 ca e9 f3 7c d0 ff 48 8d 0c c8 e9 f4 7a d0 ff cc cc cc cc cc cc cc cc cc cc e8 07 00 00 00 f3 90 0f ae e8 eb f9 48 89 04 24 <c3> 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 e8 07 00 00 00 f3
[12856.391812] RSP: 0018:ffffb9df809f7c28 EFLAGS: 00010206
[12856.391817] RAX: 7b08280b02425100 RBX: ffff9838189fb0a0 RCX: 0000000000000000
[12856.391820] RDX: 0000000000000000 RSI: ffffb9df90e77b40 RDI: ffff98382ace6c00
[12856.391823] RBP: ffff9838189fb090 R08: 0000000000000000 R09: ffffb9df809f7bd0
[12856.391827] R10: 00007ffe6ea2a000 R11: fffffb4990671c40 R12: ffff98382ace6c00
[12856.391830] R13: 7b08280b02425100 R14: ffff9838189fb090 R15: 0000000000000078
[12856.391834] FS:  00007f416969f780(0000) GS:ffff98382ec00000(0000) knlGS:0000000000000000
[12856.391837] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[12856.391840] CR2: 000055c35ddabd78 CR3: 000000041aac0003 CR4: 00000000003606f0
[12856.391844] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[12856.391847] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[12856.391850] Call Trace:
[12856.391866]  ? fuse_request_end+0xae/0x1e0 [fuse]
[12856.391876]  ? fuse_dev_do_write+0x24b/0xdf0 [fuse]
[12856.391885]  ? preempt_count_add+0x49/0xa0
[12856.391895]  ? fuse_copy_one+0x3f/0x50 [fuse]
[12856.391904]  ? fuse_copy_args+0x4a/0x140 [fuse]
[12856.391913]  ? fuse_dev_write+0x53/0x90 [fuse]
[12856.391923]  ? do_iter_readv_writev+0x158/0x1d0
[12856.391931]  ? do_iter_write+0x7d/0x190
[12856.391938]  ? vfs_writev+0xa6/0xf0
[12856.391949]  ? syscall_trace_enter+0x188/0x2d0
[12856.391956]  ? do_writev+0x6b/0x110
[12856.391963]  ? do_syscall_64+0x4e/0x150
[12856.391969]  ? entry_SYSCALL_64_after_hwframe+0x44/0xa9
[12856.391977] Modules linked in: snd_hrtimer rfkill vsock xt_pkttype ipt_REJECT nf_reject_ipv4 xt_tcpudp xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c iptable_filter intel_rapl_msr intel_rapl_common snd_usb_audio snd_usbmidi_lib uvcvideo snd_rawmidi x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel iTCO_wdt iTCO_vendor_support kvm ucsi_ccg typec_ucsi mei_hdcp typec intel_wmi_thunderbolt ppdev irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel crypto_simd cryptd r8169 glue_helper intel_cstate intel_uncore realtek nvidia_drm(POE) snd_hda_codec_hdmi intel_rapl_perf nvidia_modeset(POE) i2c_i801 snd_hda_codec_realtek libphy snd_hda_codec_generic ledtrig_audio drm_kms_helper usbtv videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 snd_hda_intel videobuf2_common snd_intel_dspcfg drm videodev snd_hda_codec mc snd_hda_core agpgart joydev snd_hwdep syscopyarea mei_me sysfillrect sysimgblt input_leds mousedev snd_pcm mei fb_sys_fops i2c_nvidia_gpu wmi
[12856.392036]  parport_pc parport evdev mac_hid snd_seq_dummy snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_timer snd soundcore cuse fuse sg vhba(OE) ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 sr_mod cdrom sd_mod hid_generic usbhid hid ahci libahci crc32c_intel libata xhci_pci xhci_hcd scsi_mod nvidia(POE) ipmi_devintf ipmi_msghandler
[12856.392074] ---[ end trace 9937872c77340efb ]---
[12856.392081] RIP: 0010:__x86_indirect_thunk_rax+0x10/0x20
[12856.392087] Code: 89 ca e9 f3 7c d0 ff 48 8d 0c c8 e9 f4 7a d0 ff cc cc cc cc cc cc cc cc cc cc e8 07 00 00 00 f3 90 0f ae e8 eb f9 48 89 04 24 <c3> 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 e8 07 00 00 00 f3
[12856.392091] RSP: 0018:ffffb9df809f7c28 EFLAGS: 00010206
[12856.392095] RAX: 7b08280b02425100 RBX: ffff9838189fb0a0 RCX: 0000000000000000
[12856.392098] RDX: 0000000000000000 RSI: ffffb9df90e77b40 RDI: ffff98382ace6c00
[12856.392101] RBP: ffff9838189fb090 R08: 0000000000000000 R09: ffffb9df809f7bd0
[12856.392104] R10: 00007ffe6ea2a000 R11: fffffb4990671c40 R12: ffff98382ace6c00
[12856.392107] R13: 7b08280b02425100 R14: ffff9838189fb090 R15: 0000000000000078
[12856.392111] FS:  00007f416969f780(0000) GS:ffff98382ec00000(0000) knlGS:0000000000000000
[12856.392114] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[12856.392117] CR2: 000055c35ddabd78 CR3: 000000041aac0003 CR4: 00000000003606f0
[12856.392121] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[12856.392124] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400

Last edited by Dennis (2020-02-14 16:15:26)

Offline

#2 2020-02-10 14:53:58

V1del
Forum Moderator
Registered: 2012-10-16
Posts: 21,410

Re: [workaround] NTFS parts randomly "drop out" of running system

Can you reproduce on the standard kernel? There have been some reports with issues with the CK patchset.

General Protection fault is often also memory related, so you should check your memory with memtest and maybe also ensure your microcodes are set up

Online

#3 2020-02-10 19:15:12

Dennis
Member
Registered: 2014-11-04
Posts: 56
Website

Re: [workaround] NTFS parts randomly "drop out" of running system

Thank you for the suggestions. Microcode patches are already set up with early loading at boot. RAM is flawless. I have not tried the standard kernel yet, decided to work around the issue by converting my most frequently used partitions to ext4 now. Will test with the standard kernel and the remaining NTFS partitions later.

Offline

#4 2020-02-10 20:09:28

Buddlespit
Member
From: Chesapeake, Va.
Registered: 2014-02-07
Posts: 501

Re: [workaround] NTFS parts randomly "drop out" of running system

I've had the same problems with a shared (ntfs) harddrive. Mine was memory related on a Ryzen cpu (2700X). If I lowered my memory to 3200 and loosened the timings to 15-15-15-36 (from 3400, 15-15-15-34), I stopped having the ntfs drive drop out and the insanely long reboot times.

Offline

#5 2020-02-10 21:01:49

deadite66
Member
Registered: 2015-06-13
Posts: 31

Re: [workaround] NTFS parts randomly "drop out" of running system

I had the same problem with NTFS drives on my Haswell system but since the motherboard died and moved to skylake-x haven't had a single problem with them.
it first started with the 5.4 kernel for me.

this was the report i made at the time. https://bugzilla.kernel.org/show_bug.cgi?id=205803
though i report it was fixed it did happen a few times after i replied, at least until the mobo died.

Last edited by deadite66 (2020-02-10 21:05:56)

Offline

#6 2020-02-11 10:13:30

Dennis
Member
Registered: 2014-11-04
Posts: 56
Website

Re: [workaround] NTFS parts randomly "drop out" of running system

Thanks for the input everyone. My memory clock and timings already are at safe values 2133,15-15-15-36 and the problem persists (in CK kernel).

Left the machine running for 12h copying stuff around under the standard kernel now and so far all NTFS partitions are still up and alive, so I think it's safe to assume it's an issue with the CK patchset as V1del suggested.

So, I'm changing the thread-title and I'll ask in the CK support thread to have people have a look at this one. Meanwhile I'll continue to convert all my NTFS partitions to ext4.

Last edited by Dennis (2020-02-11 11:06:20)

Offline

#7 2020-02-11 11:08:09

Dennis
Member
Registered: 2014-11-04
Posts: 56
Website

Re: [workaround] NTFS parts randomly "drop out" of running system

I spoke too early, while I continued to copy stuff around, the problem occurred in the standard kernel as well.

dmesg output:

[46107.267970] kernel tried to execute NX-protected page - exploit attempt? (uid: 0)
[46107.267979] BUG: unable to handle page fault for address: ffff96d826e59cc0
[46107.267983] #PF: supervisor instruction fetch in kernel mode
[46107.267987] #PF: error_code(0x0011) - permissions violation
[46107.267991] PGD 245001067 P4D 245001067 PUD 28dac5063 PMD 8000000226e000e3 
[46107.268000] Oops: 0011 [#1] PREEMPT SMP PTI
[46107.268006] CPU: 0 PID: 484 Comm: mount.ntfs-3g Tainted: P           OE     5.5.2-arch1-1 #1
[46107.268009] Hardware name: Gigabyte Technology Co., Ltd. Z170-HD3P/Z170-HD3P-CF, BIOS F22g 03/09/2018
[46107.268016] RIP: 0010:0xffff96d826e59cc0
[46107.268021] Code: ff ff 10 9b e5 26 d8 96 ff ff a0 9c e5 26 d8 96 ff ff a0 9c e5 26 d8 96 ff ff 00 00 00 00 00 00 00 00 78 ab 9b 8b d8 96 ff ff <04> 00 28 00 02 00 00 00 00 00 00 00 00 00 00 00 08 f9 f1 2d da 96
[46107.268025] RSP: 0018:ffffadae802b3c28 EFLAGS: 00010286
[46107.268030] RAX: ffff96d826e59cc0 RBX: ffff96da28b80b50 RCX: 0000000000000000
[46107.268033] RDX: 0000000000000000 RSI: ffffadae83cdfbb0 RDI: ffff96da24de9600
[46107.268037] RBP: ffff96da28b80b40 R08: 0000000000000000 R09: ffffadae802b3bc8
[46107.268040] R10: 00007ffc6ad7e000 R11: ffff96da0e0a4be8 R12: ffff96da24de9600
[46107.268043] R13: ffff96d826e59cc0 R14: ffff96da28b80b40 R15: 0000000000000078
[46107.268047] FS:  00007efe6744b780(0000) GS:ffff96da2ec00000(0000) knlGS:0000000000000000
[46107.268051] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[46107.268055] CR2: ffff96d826e59cc0 CR3: 0000000410834004 CR4: 00000000003606f0
[46107.268058] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[46107.268061] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[46107.268064] Call Trace:
[46107.268081]  ? fuse_request_end+0xae/0x1e0 [fuse]
[46107.268091]  ? fuse_dev_do_write+0x25d/0xee0 [fuse]
[46107.268102]  ? set_page_dirty+0xe/0xb0
[46107.268108]  ? set_page_dirty_lock+0x35/0x50
[46107.268115]  ? preempt_count_add+0x68/0xa0
[46107.268125]  ? fuse_dev_write+0x66/0xa0 [fuse]
[46107.268134]  ? do_iter_readv_writev+0x166/0x1e0
[46107.268141]  ? do_iter_write+0x7d/0x190
[46107.268146]  ? vfs_writev+0xe0/0x130
[46107.268156]  ? do_writev+0x73/0x120
[46107.268164]  ? do_syscall_64+0x4e/0x150
[46107.268172]  ? entry_SYSCALL_64_after_hwframe+0x44/0xa9
[46107.268180] Modules linked in: snd_hrtimer vsock rfkill xt_pkttype ipt_REJECT nf_reject_ipv4 xt_tcpudp xt_conntrack nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c iptable_filter snd_usb_audio uvcvideo snd_usbmidi_lib snd_rawmidi snd_hda_codec_hdmi intel_rapl_msr intel_rapl_common x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul wacom crc32_pclmul iTCO_wdt iTCO_vendor_support ghash_clmulni_intel mei_hdcp nvidia_drm(POE) ucsi_ccg nvidia_modeset(POE) typec_ucsi typec snd_hda_codec_realtek aesni_intel intel_wmi_thunderbolt snd_hda_codec_generic crypto_simd ppdev input_leds usbtv cryptd glue_helper ledtrig_audio intel_cstate videobuf2_vmalloc snd_hda_intel drm_kms_helper videobuf2_memops intel_uncore videobuf2_v4l2 snd_intel_dspcfg snd_hda_codec videobuf2_common intel_rapl_perf cp210x i2c_i801 videodev r8169 realtek libphy snd_hda_core drm mc agpgart snd_hwdep syscopyarea joydev mousedev uas sysfillrect mei_me sysimgblt fb_sys_fops snd_pcm
[46107.268245]  i2c_nvidia_gpu wmi mei parport_pc parport evdev mac_hid snd_seq_dummy snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_timer snd soundcore cuse fuse sg vhba(OE) ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 usb_storage hid_generic usbhid hid sr_mod cdrom sd_mod ahci libahci libata xhci_pci scsi_mod crc32c_intel xhci_hcd nvidia(POE) ipmi_devintf ipmi_msghandler
[46107.268286] CR2: ffff96d826e59cc0
[46107.268292] ---[ end trace 05d8858794920e72 ]---
[46107.268298] RIP: 0010:0xffff96d826e59cc0
[46107.268303] Code: ff ff 10 9b e5 26 d8 96 ff ff a0 9c e5 26 d8 96 ff ff a0 9c e5 26 d8 96 ff ff 00 00 00 00 00 00 00 00 78 ab 9b 8b d8 96 ff ff <04> 00 28 00 02 00 00 00 00 00 00 00 00 00 00 00 08 f9 f1 2d da 96
[46107.268307] RSP: 0018:ffffadae802b3c28 EFLAGS: 00010286
[46107.268311] RAX: ffff96d826e59cc0 RBX: ffff96da28b80b50 RCX: 0000000000000000
[46107.268314] RDX: 0000000000000000 RSI: ffffadae83cdfbb0 RDI: ffff96da24de9600
[46107.268317] RBP: ffff96da28b80b40 R08: 0000000000000000 R09: ffffadae802b3bc8
[46107.268320] R10: 00007ffc6ad7e000 R11: ffff96da0e0a4be8 R12: ffff96da24de9600
[46107.268324] R13: ffff96d826e59cc0 R14: ffff96da28b80b40 R15: 0000000000000078
[46107.268328] FS:  00007efe6744b780(0000) GS:ffff96da2ec00000(0000) knlGS:0000000000000000
[46107.268332] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[46107.268335] CR2: ffff96d826e59cc0 CR3: 0000000410834004 CR4: 00000000003606f0
[46107.268338] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[46107.268341] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400

Offline

#8 2020-02-11 15:21:54

V1del
Forum Moderator
Registered: 2012-10-16
Posts: 21,410

Re: [workaround] NTFS parts randomly "drop out" of running system

hmm, I don't use big_writes maybe a regression there?

Online

#9 2020-02-11 15:30:01

Dennis
Member
Registered: 2014-11-04
Posts: 56
Website

Re: [workaround] NTFS parts randomly "drop out" of running system

I had enabled big_writes only to see if it made a difference. It happens with and without big_writes for me.

Offline

#10 2020-02-14 16:14:30

Dennis
Member
Registered: 2014-11-04
Posts: 56
Website

Re: [workaround] NTFS parts randomly "drop out" of running system

Could not find anymore information on this so I ended up converting all of my NTFS partitions to ext4 as a workaround (except for the Windows system partition and for accessing the ext4 partitions under Windows I'm using a proprietary driver from Paragon).

Offline

#11 2020-02-18 01:34:17

CodingKoopa
Wiki Maintainer
Registered: 2020-02-18
Posts: 4

Re: [workaround] NTFS parts randomly "drop out" of running system

After the "drop out" it is impossible to access anything on the affected partition(until reboot, then it just works again, though rebooting requires hard-reset because shutdown indefinitely waits for an unmount which never succeeds).

I have been getting this same behavior, on occassion - the partition becomes inaccessible, and I have to forcefully reboot the system for it to work again.

I have been experiencing issues with my NTFS partition for some times, but today I reproduced issue (one of the issues?) as soon as my system booted up. Here is what I consider to be the relevant information:

Feb 17 19:58:29 comet-observatory kernel: kernel tried to execute NX-protected page - exploit attempt? (uid: 0)
Feb 17 19:58:29 comet-observatory kernel: BUG: unable to handle page fault for address: ffffa641c30c7dc0
Feb 17 19:58:29 comet-observatory kernel: #PF: supervisor instruction fetch in kernel mode
Feb 17 19:58:29 comet-observatory kernel: #PF: error_code(0x0011) - permissions violation
Feb 17 19:58:29 comet-observatory kernel: PGD 21587e067 P4D 21587e067 PUD 21587f067 PMD 20c61d067 PTE 80000001ec3e4163
Feb 17 19:58:29 comet-observatory kernel: Oops: 0011 [#1] SMP NOPTI
Feb 17 19:58:29 comet-observatory kernel: CPU: 6 PID: 598 Comm: mount.ntfs Tainted: P        W  OE     5.5.4-15-tkg-pds #1
Feb 17 19:58:29 comet-observatory kernel: Hardware name: System manufacturer System Product Name/ROG STRIX B350-F GAMING, BIOS 5216 09/02/2019
Feb 17 19:58:29 comet-observatory kernel: RIP: 0010:0xffffa641c30c7dc0
Feb 17 19:58:29 comet-observatory kernel: Code: 00 00 00 50 0b 8d 00 00 00 00 00 50 0b 8d 36 8f ff ff f0 7e 0c c3 41 a6 ff ff d0 7e 0c c3 41 a6 ff ff 22 19 ea 87 ff ff ff ff <60> d0 62 91 36 8f ff ff 00 5b 4a 56 35 8f ff ff ed 0a 6b 8c 10 00
Feb 17 19:58:29 comet-observatory kernel: RSP: 0018:ffffa641c19ebc28 EFLAGS: 00010286
Feb 17 19:58:29 comet-observatory kernel: RAX: ffffa641c30c7dc0 RBX: ffff8f3692100760 RCX: ffff8f36921007c8
Feb 17 19:58:29 comet-observatory kernel: RDX: 0000000000000000 RSI: ffffa641c30c7c40 RDI: ffff8f368da9a000
Feb 17 19:58:29 comet-observatory kernel: RBP: ffff8f3692100750 R08: 0000000000000000 R09: ffffa641c19ebbc8
Feb 17 19:58:29 comet-observatory kernel: R10: ffff8f367cf51148 R11: fffff13587deb501 R12: ffff8f368da9a000
Feb 17 19:58:29 comet-observatory kernel: R13: ffffa641c30c7dc0 R14: ffff8f3692100750 R15: ffffa641c19ebd20
Feb 17 19:58:29 comet-observatory kernel: FS:  00007fdf5afc5180(0000) GS:ffff8f3696980000(0000) knlGS:0000000000000000
Feb 17 19:58:29 comet-observatory kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 17 19:58:29 comet-observatory kernel: CR2: ffffa641c30c7dc0 CR3: 00000001f6b70000 CR4: 00000000003406e0
Feb 17 19:58:29 comet-observatory kernel: Call Trace:
Feb 17 19:58:29 comet-observatory kernel:  ? fuse_request_end+0xb0/0x1e0 [fuse]
Feb 17 19:58:29 comet-observatory kernel:  ? fuse_dev_do_write+0x245/0xec0 [fuse]
Feb 17 19:58:29 comet-observatory kernel:  ? fuse_copy_do+0xac/0xb0 [fuse]
Feb 17 19:58:29 comet-observatory kernel:  ? fuse_copy_args+0x4b/0x140 [fuse]
Feb 17 19:58:29 comet-observatory kernel:  ? set_page_dirty+0xe/0xb0
Feb 17 19:58:29 comet-observatory kernel:  ? set_page_dirty_lock+0x3a/0x50
Feb 17 19:58:29 comet-observatory kernel:  ? fuse_dev_write+0x66/0xa0 [fuse]
Feb 17 19:58:29 comet-observatory kernel:  ? do_iter_readv_writev+0x187/0x200
Feb 17 19:58:29 comet-observatory kernel:  ? do_iter_write+0x7d/0x190
Feb 17 19:58:29 comet-observatory kernel:  ? vfs_writev+0xe0/0x130
Feb 17 19:58:29 comet-observatory kernel:  ? do_writev+0x73/0x120
Feb 17 19:58:29 comet-observatory kernel:  ? do_syscall_64+0x4e/0x150
Feb 17 19:58:29 comet-observatory kernel:  ? entry_SYSCALL_64_after_hwframe+0x44/0xa9
Feb 17 19:58:29 comet-observatory kernel: Modules linked in: ccm rfcomm uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_common videodev input_leds cmac algif_hash algif_skcipher af_alg joydev mousedev hid_generic bnep btusb btrtl btbcm btintel bluetooth snd_usb_audio snd_usbmidi_lib usbhid snd_rawmidi ecdh_generic snd_seq_device ecc hid mc fuse snd_hda_codec_hdmi nls_iso8859_1 nls_cp437 vfat ath9k fat eeepc_wmi ath9k_common snd_hda_codec_realtek asus_wmi battery edac_mce_amd sparse_keymap wmi_bmof ath9k_hw snd_hda_codec_generic kvm_amd ledtrig_audio ath kvm snd_hda_intel snd_intel_dspcfg irqbypass mac80211 snd_hda_codec snd_hda_core crct10dif_pclmul crc32_pclmul snd_hwdep ghash_clmulni_intel cfg80211 igb snd_pcm snd_timer aesni_intel snd ccp i2c_algo_bit crypto_simd rfkill cryptd glue_helper pcspkr soundcore k10temp rng_core i2c_piix4 dca libarc4 gpio_amdpt pinctrl_amd evdev wmi mac_hid acpi_cpufreq bochs_drm drm_vram_helper qxl drm_ttm_helper ttm crypto_user ip_tables x_tables ext4 crc32c_generic
Feb 17 19:58:29 comet-observatory kernel:  crc16 mbcache jbd2 sd_mod ahci libahci libata xhci_pci crc32c_intel xhci_hcd scsi_mod nvidia_drm(POE) drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm agpgart nvidia_uvm(OE) nvidia_modeset(POE) nvidia(POE) ipmi_devintf ipmi_msghandler
Feb 17 19:58:29 comet-observatory kernel: CR2: ffffa641c30c7dc0
Feb 17 19:58:29 comet-observatory kernel: ---[ end trace e64074b8a68ab162 ]---
Feb 17 19:58:29 comet-observatory kernel: RIP: 0010:0xffffa641c30c7dc0
Feb 17 19:58:29 comet-observatory kernel: Code: 00 00 00 50 0b 8d 00 00 00 00 00 50 0b 8d 36 8f ff ff f0 7e 0c c3 41 a6 ff ff d0 7e 0c c3 41 a6 ff ff 22 19 ea 87 ff ff ff ff <60> d0 62 91 36 8f ff ff 00 5b 4a 56 35 8f ff ff ed 0a 6b 8c 10 00
Feb 17 19:58:29 comet-observatory kernel: RSP: 0018:ffffa641c19ebc28 EFLAGS: 00010286
Feb 17 19:58:29 comet-observatory kernel: RAX: ffffa641c30c7dc0 RBX: ffff8f3692100760 RCX: ffff8f36921007c8
Feb 17 19:58:29 comet-observatory kernel: RDX: 0000000000000000 RSI: ffffa641c30c7c40 RDI: ffff8f368da9a000
Feb 17 19:58:29 comet-observatory kernel: RBP: ffff8f3692100750 R08: 0000000000000000 R09: ffffa641c19ebbc8
Feb 17 19:58:29 comet-observatory kernel: R10: ffff8f367cf51148 R11: fffff13587deb501 R12: ffff8f368da9a000
Feb 17 19:58:29 comet-observatory kernel: R13: ffffa641c30c7dc0 R14: ffff8f3692100750 R15: ffffa641c19ebd20
Feb 17 19:58:29 comet-observatory kernel: FS:  00007fdf5afc5180(0000) GS:ffff8f3696980000(0000) knlGS:0000000000000000
Feb 17 19:58:29 comet-observatory kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 17 19:58:29 comet-observatory kernel: CR2: ffffa641c30c7dc0 CR3: 00000001f6b70000 CR4: 00000000003406e0

I encountered another crash in this log as well, although this one seems to more so be related to Nvidia software:

Feb 17 19:57:37 comet-observatory kernel: ------------[ cut here ]------------
Feb 17 19:57:37 comet-observatory kernel: refcount_t: underflow; use-after-free.
Feb 17 19:57:37 comet-observatory kernel: WARNING: CPU: 1 PID: 652 at lib/refcount.c:28 refcount_warn_saturate+0xa6/0xf0
Feb 17 19:57:37 comet-observatory kernel: Modules linked in: btrtl btbcm btintel bluetooth snd_usb_audio(+) snd_usbmidi_lib usbhid(+) snd_rawmidi ecdh_generic snd_seq_device ecc hid mc fuse snd_hda_codec_hdmi(+) nls_iso8859_1 nls_cp437 vfat ath9k fat eeepc_wmi ath9k_common snd_hda_codec_realtek asus_wmi battery edac_mce_amd sparse_keymap wmi_bmof ath9k_hw snd_hda_codec_generic kvm_amd ledtrig_audio ath kvm snd_hda_intel snd_intel_dspcfg irqbypass mac80211 snd_hda_codec snd_hda_core crct10dif_pclmul crc32_pclmul snd_hwdep ghash_clmulni_intel cfg80211 igb snd_pcm snd_timer aesni_intel snd ccp i2c_algo_bit crypto_simd rfkill cryptd glue_helper pcspkr soundcore k10temp rng_core i2c_piix4 dca libarc4 gpio_amdpt pinctrl_amd evdev wmi mac_hid acpi_cpufreq bochs_drm drm_vram_helper qxl drm_ttm_helper ttm crypto_user ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 sd_mod ahci libahci libata xhci_pci crc32c_intel xhci_hcd scsi_mod nvidia_drm(POE) drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm agpgart
Feb 17 19:57:37 comet-observatory kernel:  nvidia_uvm(OE) nvidia_modeset(POE) nvidia(POE) ipmi_devintf ipmi_msghandler
Feb 17 19:57:37 comet-observatory kernel: CPU: 1 PID: 652 Comm: Xorg.wrap Tainted: P           OE     5.5.4-15-tkg-pds #1
Feb 17 19:57:37 comet-observatory kernel: Hardware name: System manufacturer System Product Name/ROG STRIX B350-F GAMING, BIOS 5216 09/02/2019
Feb 17 19:57:37 comet-observatory kernel: RIP: 0010:refcount_warn_saturate+0xa6/0xf0
Feb 17 19:57:37 comet-observatory kernel: Code: 05 7f 2c 0d 01 01 e8 bb 82 c4 ff 0f 0b c3 80 3d 6d 2c 0d 01 00 75 95 48 c7 c7 08 95 d3 88 c6 05 5d 2c 0d 01 01 e8 9c 82 c4 ff <0f> 0b c3 80 3d 4c 2c 0d 01 00 0f 85 72 ff ff ff 48 c7 c7 60 95 d3
Feb 17 19:57:37 comet-observatory kernel: RSP: 0018:ffffa641c1f0bd90 EFLAGS: 00010286
Feb 17 19:57:37 comet-observatory kernel: RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000007
Feb 17 19:57:37 comet-observatory kernel: RDX: 0000000000000007 RSI: 0000000000000082 RDI: ffff8f3696859a00
Feb 17 19:57:37 comet-observatory kernel: RBP: ffff8f36768174e8 R08: 00000000000004b9 R09: 0000000000000003
Feb 17 19:57:37 comet-observatory kernel: R10: 0000000000000000 R11: 0000000000000001 R12: ffff8f368b34eae8
Feb 17 19:57:37 comet-observatory kernel: R13: ffff8f368b34e800 R14: 0000000000000008 R15: 0000000000000000
Feb 17 19:57:37 comet-observatory kernel: FS:  00007fd6ba32f540(0000) GS:ffff8f3696840000(0000) knlGS:0000000000000000
Feb 17 19:57:37 comet-observatory kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 17 19:57:37 comet-observatory kernel: CR2: 00007fd6ba2f6ba0 CR3: 00000001f2218000 CR4: 00000000003406e0
Feb 17 19:57:37 comet-observatory kernel: Call Trace:
Feb 17 19:57:37 comet-observatory kernel:  nv_drm_atomic_helper_disable_all+0xec/0x290 [nvidia_drm]
Feb 17 19:57:37 comet-observatory kernel:  nv_drm_master_drop+0x22/0x60 [nvidia_drm]
Feb 17 19:57:37 comet-observatory kernel:  drm_drop_master+0x1e/0x30 [drm]
Feb 17 19:57:37 comet-observatory kernel:  drm_master_release+0x9f/0xb0 [drm]
Feb 17 19:57:37 comet-observatory kernel:  drm_file_free.part.0+0x1fe/0x260 [drm]
Feb 17 19:57:37 comet-observatory kernel:  drm_release+0x9a/0xd0 [drm]
Feb 17 19:57:37 comet-observatory kernel:  __fput+0xb9/0x250
Feb 17 19:57:37 comet-observatory kernel:  task_work_run+0x90/0xb0
Feb 17 19:57:37 comet-observatory kernel:  exit_to_usermode_loop+0xda/0x100
Feb 17 19:57:37 comet-observatory kernel:  do_syscall_64+0x11f/0x150
Feb 17 19:57:37 comet-observatory kernel:  entry_SYSCALL_64_after_hwframe+0x44/0xa9
Feb 17 19:57:37 comet-observatory kernel: RIP: 0033:0x7fd6ba258d07
Feb 17 19:57:37 comet-observatory kernel: Code: ff ff e8 3c e3 01 00 66 2e 0f 1f 84 00 00 00 00 00 66 90 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 03 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 41 c3 48 83 ec 18 89 7c 24 0c e8 93 4c f9 ff
Feb 17 19:57:37 comet-observatory kernel: RSP: 002b:00007ffdbaf0f1a8 EFLAGS: 00000246 ORIG_RAX: 0000000000000003
Feb 17 19:57:37 comet-observatory kernel: RAX: 0000000000000000 RBX: 0000000000000001 RCX: 00007fd6ba258d07
Feb 17 19:57:37 comet-observatory kernel: RDX: 00007ffdbaf0f1c0 RSI: 00000000c04064a0 RDI: 0000000000000003
Feb 17 19:57:37 comet-observatory kernel: RBP: 00007ffdbaf0f210 R08: 0000000000000000 R09: 00007ffdbaf0f030
Feb 17 19:57:37 comet-observatory kernel: R10: 000055f20671764b R11: 0000000000000246 R12: 0000000000000003
Feb 17 19:57:37 comet-observatory kernel: R13: 0000000000000001 R14: 0000000000000000 R15: 00007ffdbaf0f1c0
Feb 17 19:57:37 comet-observatory kernel: ---[ end trace e64074b8a68ab161 ]---

In the event that more output than this is needed, I have the whole journald output for that boot saved. I won't link the whole thing out of privacy concerns, but I will add the first couple of lines with kernel + command line info:

Feb 17 19:57:34 archlinux kernel: Linux version 5.5.4-15-tkg-pds (linux-tkg-pds-zen@archlinux) (gcc version 9.2.1 20200130 (Arch Linux 9.2.1+20200130-2)) #1 TKG SMP Mon, 17 Feb 2020 13:43:05 +0000
Feb 17 19:57:34 archlinux kernel: Command line: initrd=\amd-ucode.img initrd=initramfs-linux-tkg-pds-zen.img rw root=PARTUUID=91fb9373-d9b2-4e6d-a376-0388afe85bf0 nowatchdog workqueue.power_efficient=0 libahci.ignore_sss=1 nvidia-drm.modeset=1 quiet rd.udev.log_priority=3 vt.global_cursor_default=0

As far as I understand this, FUSE is crashing, preventing the partition from being mounted properly. Should a bug report be filed?

Last edited by CodingKoopa (2020-02-18 01:36:12)

Offline

#12 2020-02-18 02:42:35

loqs
Member
Registered: 2014-03-06
Posts: 17,192

Re: [workaround] NTFS parts randomly "drop out" of running system

@CodingKoopa can you reproduce the issue on an untainted unpatched kernel?

Offline

#13 2020-02-18 02:54:30

CodingKoopa
Wiki Maintainer
Registered: 2020-02-18
Posts: 4

Re: [workaround] NTFS parts randomly "drop out" of running system

That may be difficult to do because of my proprietary graphics drivers. If I can reproduce it on an untainted vanilla Arch kernel, I'll share here.

EDIT: Due to constant crashing with Nouveau, being unable to reproduce the issue, and another crash (the Nvidia one I pasted above), I likely won't be able to get untainted info so any time soon, unfortunately.

Last edited by CodingKoopa (2020-02-18 05:42:02)

Offline

#14 2020-03-13 18:32:17

offbeatwitch
Member
Registered: 2020-03-13
Posts: 3

Re: [workaround] NTFS parts randomly "drop out" of running system

I'm also experiencing random NTFS crashes - and as luck would have it, after having a poke the only tainted modules I had enabled were from VirtualBox. So I've blacklisted those, and rebooted, but of course the kernel threw another BUG right after I logged in, making my kernel technically tainted.
For now, here's the two BUG logs as pulled from dmesg: first, unrelated bug (which is Not Tainted) and NTFS bug, tainted by the previous BUG but nothing else.

I'm going for a reboot now, so I'll continue trying to hit the NTFS bug without hitting any others first.

Offline

#15 2020-03-14 19:07:46

offbeatwitch
Member
Registered: 2020-03-13
Posts: 3

Re: [workaround] NTFS parts randomly "drop out" of running system

Okay, I'm back and this time I bring an untainted log of the kernel bug!

[18950.137910] kernel tried to execute NX-protected page - exploit attempt? (uid: 0)
[18950.137913] BUG: unable to handle page fault for address: ffff8dfaed665c40
[18950.137915] #PF: supervisor instruction fetch in kernel mode
[18950.137916] #PF: error_code(0x0011) - permissions violation
[18950.137917] PGD 195001067 P4D 195001067 PUD 3ef767063 PMD 384bdc063 PTE 800000036d665063
[18950.137920] Oops: 0011 [#1] PREEMPT SMP PTI
[18950.137922] CPU: 3 PID: 527 Comm: mount.ntfs-3g Not tainted 5.5.4-arch1-1 #1
[18950.137923] Hardware name: MSI MS-7994/H110M ECO (MS-7994), BIOS 2.00 11/26/2015
[18950.137925] RIP: 0010:0xffff8dfaed665c40
[18950.137926] Code: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 <80> 40 00 00 00 00 00 00 00 00 00 00 00 00 00 00 01 00 00 00 00 00
[18950.137928] RSP: 0018:ffffb62540327c28 EFLAGS: 00010282
[18950.137929] RAX: ffff8dfaed665c40 RBX: ffff8dfbbc18b2e0 RCX: 0000000000000000
[18950.137930] RDX: 0000000000000000 RSI: ffffb6254dcafbb0 RDI: ffff8dfbd99d1c00
[18950.137931] RBP: ffff8dfbbc18b2d0 R08: 0000000000000000 R09: ffffb62540327bc8
[18950.137932] R10: 00007ffdb8806000 R11: ffff8dfbd8c0e028 R12: ffff8dfbd99d1c00
[18950.137933] R13: ffff8dfaed665c40 R14: ffff8dfbbc18b2d0 R15: 0000000000000078
[18950.137934] FS:  00007fefc2974180(0000) GS:ffff8dfbded80000(0000) knlGS:0000000000000000
[18950.137935] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[18950.137936] CR2: ffff8dfaed665c40 CR3: 000000044bbcc006 CR4: 00000000003606e0
[18950.137937] Call Trace:
[18950.137943]  ? fuse_request_end+0xae/0x1e0 [fuse]
[18950.137946]  ? fuse_dev_do_write+0x25d/0xee0 [fuse]
[18950.137949]  ? set_page_dirty+0xe/0xb0
[18950.137951]  ? set_page_dirty_lock+0x35/0x50
[18950.137953]  ? preempt_count_add+0x68/0xa0
[18950.137956]  ? fuse_dev_write+0x66/0xa0 [fuse]
[18950.137959]  ? do_iter_readv_writev+0x166/0x1e0
[18950.137961]  ? do_iter_write+0x7d/0x190
[18950.137963]  ? vfs_writev+0xe0/0x130
[18950.137965]  ? do_writev+0x73/0x120
[18950.137968]  ? do_syscall_64+0x4e/0x150
[18950.137970]  ? entry_SYSCALL_64_after_hwframe+0x44/0xa9
[18950.137972] Modules linked in: uinput rfcomm cmac algif_hash algif_skcipher af_alg hid_steam bnep btusb btrtl btbcm btintel bluetooth ecdh_generic ecc cfg80211 rfkill 8021q garp mrp stp llc nct6775 hwmon_vid uas usb_storage snd_usb_audio snd_usbmidi_lib snd_rawmidi snd_seq_device mc intel_rapl_msr intel_rapl_common x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm amdgpu irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel iTCO_wdt iTCO_vendor_support 8250_dw aesni_intel nls_iso8859_1 crypto_simd nls_cp437 vfat ppdev snd_hda_codec_realtek cryptd mxm_wmi fat glue_helper snd_hda_codec_generic snd_hda_codec_hdmi ledtrig_audio gpu_sched intel_cstate i2c_algo_bit snd_hda_intel intel_uncore intel_rapl_perf snd_intel_dspcfg e1000e ttm snd_hda_codec i2c_i801 pcspkr drm_kms_helper snd_hda_core snd_hwdep snd_pcm mousedev joydev snd_timer mei_me drm input_leds mei snd agpgart intel_lpss_pci syscopyarea sysfillrect sysimgblt fb_sys_fops parport_pc intel_lpss soundcore idma64
[18950.138025]  intel_pch_thermal parport wmi evdev mac_hid fuse sg crypto_user ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 hid_generic usbhid hid sr_mod cdrom sd_mod ahci libahci libata xhci_pci crc32c_intel xhci_hcd scsi_mod
[18950.138036] CR2: ffff8dfaed665c40
[18950.138037] ---[ end trace e21638d2dd3e770c ]---
[18950.138039] RIP: 0010:0xffff8dfaed665c40
[18950.138040] Code: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 <80> 40 00 00 00 00 00 00 00 00 00 00 00 00 00 00 01 00 00 00 00 00
[18950.138041] RSP: 0018:ffffb62540327c28 EFLAGS: 00010282
[18950.138043] RAX: ffff8dfaed665c40 RBX: ffff8dfbbc18b2e0 RCX: 0000000000000000
[18950.138043] RDX: 0000000000000000 RSI: ffffb6254dcafbb0 RDI: ffff8dfbd99d1c00
[18950.138044] RBP: ffff8dfbbc18b2d0 R08: 0000000000000000 R09: ffffb62540327bc8
[18950.138045] R10: 00007ffdb8806000 R11: ffff8dfbd8c0e028 R12: ffff8dfbd99d1c00
[18950.138046] R13: ffff8dfaed665c40 R14: ffff8dfbbc18b2d0 R15: 0000000000000078
[18950.138047] FS:  00007fefc2974180(0000) GS:ffff8dfbded80000(0000) knlGS:0000000000000000
[18950.138048] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[18950.138049] CR2: ffff8dfaed665c40 CR3: 000000044bbcc006 CR4: 00000000003606e0

It definitely seems to occur after at least some writing; but it doesn't seem to be a consistent amount, I've gone days without hitting the bug, only for it to crop back up again. It definitely seems to take a while; my current uptime is about 5 hours.

Offline

#16 2020-03-14 19:45:29

loqs
Member
Registered: 2014-03-06
Posts: 17,192

Re: [workaround] NTFS parts randomly "drop out" of running system

Offline

#17 2020-03-15 19:06:33

offbeatwitch
Member
Registered: 2020-03-13
Posts: 3

Re: [workaround] NTFS parts randomly "drop out" of running system

guess we're waiting for kernel 5.6 then!

Offline

#18 2020-03-16 15:46:10

loqs
Member
Registered: 2014-03-06
Posts: 17,192

Re: [workaround] NTFS parts randomly "drop out" of running system

Offline

#19 2020-03-16 20:53:31

CodingKoopa
Wiki Maintainer
Registered: 2020-02-18
Posts: 4

Re: [workaround] NTFS parts randomly "drop out" of running system

I'm glad to see, thanks all smile

Offline

Board footer

Powered by FluxBB