You are not logged in.
My system freezes periodically and I get the following errors in the journal:
Feb 13 13:13:10 acer-c720 kernel: ata1.00: exception Emask 0x10 SAct 0x0 SErr 0x400100 action 0x6 frozen
Feb 13 13:13:10 acer-c720 kernel: ata1.00: irq_stat 0x08000000, interface fatal error
Feb 13 13:13:10 acer-c720 kernel: ata1: SError: { UnrecovData Handshk }
Feb 13 13:13:10 acer-c720 kernel: ata1.00: failed command: WRITE DMA
Feb 13 13:13:10 acer-c720 kernel: ata1.00: cmd ca/00:18:70:b2:8b/00:00:00:00:00/e3 tag 30 dma 12288 out
res 50/00:10:20:b2:8b/00:00:00:00:00/e3 Emask 0x10 (ATA bus error)
Feb 13 13:13:10 acer-c720 kernel: ata1.00: status: { DRDY }
Feb 13 13:13:10 acer-c720 kernel: ata1: hard resetting link
Feb 13 13:13:10 acer-c720 kernel: ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Feb 13 13:13:10 acer-c720 kernel: ata1.00: configured for UDMA/133
Feb 13 13:13:10 acer-c720 kernel: ata1: EH complete
Feb 13 13:13:10 acer-c720 kernel: ata1.00: exception Emask 0x10 SAct 0x0 SErr 0x400100 action 0x6 frozen
Feb 13 13:13:10 acer-c720 kernel: ata1.00: irq_stat 0x08000000, interface fatal error
Feb 13 13:13:10 acer-c720 kernel: ata1: SError: { UnrecovData Handshk }
Feb 13 13:13:10 acer-c720 kernel: ata1.00: failed command: WRITE DMA
Feb 13 13:13:10 acer-c720 kernel: ata1.00: cmd ca/00:50:38:80:36/00:00:00:00:00/e6 tag 10 dma 40960 out
res 50/00:08:d0:7f:36/00:00:00:00:00/e6 Emask 0x10 (ATA bus error)
Feb 13 13:13:10 acer-c720 kernel: ata1.00: status: { DRDY }
Feb 13 13:13:10 acer-c720 kernel: ata1: hard resetting link
Feb 13 13:13:10 acer-c720 kernel: ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Feb 13 13:13:10 acer-c720 kernel: ata1.00: configured for UDMA/133
Feb 13 13:13:10 acer-c720 kernel: ata1: EH complete
Feb 13 13:13:11 acer-c720 kernel: ata1.00: exception Emask 0x10 SAct 0x0 SErr 0x400100 action 0x6 frozen
Feb 13 13:13:11 acer-c720 kernel: ata1.00: irq_stat 0x08000000, interface fatal error
Feb 13 13:13:11 acer-c720 kernel: ata1: SError: { UnrecovData Handshk }
Feb 13 13:13:11 acer-c720 kernel: ata1.00: failed command: WRITE DMA
Feb 13 13:13:11 acer-c720 kernel: ata1.00: cmd ca/00:50:38:80:36/00:00:00:00:00/e6 tag 12 dma 40960 out
res 50/00:00:af:c2:e7/00:00:0e:00:00/e0 Emask 0x10 (ATA bus error)
Feb 13 13:13:11 acer-c720 kernel: ata1.00: status: { DRDY }
Feb 13 13:13:11 acer-c720 kernel: ata1: hard resetting link
Feb 13 13:13:11 acer-c720 kernel: ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Feb 13 13:13:11 acer-c720 kernel: ata1.00: configured for UDMA/133
Feb 13 13:13:11 acer-c720 kernel: ata1: EH complete
Feb 13 13:13:52 acer-c720 kernel: ata1: limiting SATA link speed to 3.0 Gbps
Feb 13 13:13:52 acer-c720 kernel: ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x400001 action 0x6 frozen
Feb 13 13:13:52 acer-c720 kernel: ata1: SError: { RecovData Handshk }
Feb 13 13:13:52 acer-c720 kernel: ata1.00: failed command: READ DMA
Feb 13 13:13:52 acer-c720 kernel: ata1.00: cmd c8/00:70:a8:70:ea/00:00:00:00:00/ec tag 17 dma 57344 in
res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Feb 13 13:13:52 acer-c720 kernel: ata1.00: status: { DRDY }
Feb 13 13:13:52 acer-c720 kernel: ata1: hard resetting link
Feb 13 13:13:52 acer-c720 kernel: ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
Feb 13 13:13:52 acer-c720 kernel: ata1.00: configured for UDMA/133
Feb 13 13:13:52 acer-c720 kernel: ata1.00: device reported invalid CHS sector 0
Feb 13 13:13:52 acer-c720 kernel: ata1: EH complete
Feb 13 13:13:52 acer-c720 gnome-session[593]: Gjs-MessagI had been getting some other errors (https://bbs.archlinux.org/viewtopic.php?id=208861) but after disabling NCQ I get the above now instead.
$ lspci |grep ATA
00:1f.2 SATA controller: Intel Corporation 8 Series SATA Controller 1 [AHCI mode] (rev 04)Disk seems healthy:
$ sudo smartctl --attributes --log=selftest /dev/sda
smartctl 6.4 2015-06-04 r4109 [x86_64-linux-4.4.1-2-ARCH] (local build)
Copyright (C) 2002-15, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000a 100 100 000 Old_age Always - 0
9 Power_On_Hours 0x0012 100 100 000 Old_age Always - 2832
12 Power_Cycle_Count 0x0012 100 100 000 Old_age Always - 2460
168 Unknown_Attribute 0x0012 100 100 000 Old_age Always - 136
170 Unknown_Attribute 0x0013 100 100 010 Pre-fail Always - 35
173 Unknown_Attribute 0x0000 100 100 000 Old_age Offline - 26411045
192 Power-Off_Retract_Count 0x0012 100 100 000 Old_age Always - 84
194 Temperature_Celsius 0x0023 070 070 000 Pre-fail Always - 30
196 Reallocated_Event_Count 0x0000 100 100 000 Old_age Offline - 0
218 Unknown_Attribute 0x0000 100 100 000 Old_age Offline - 120
241 Total_LBAs_Written 0x0012 100 100 000 Old_age Always - 1850109
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Conveyance offline Completed without error 00% 2831 -
# 2 Extended offline Completed without error 00% 2831 -
# 3 Short offline Completed without error 00% 2831 -$ sudo smartctl -H /dev/sda
smartctl 6.4 2015-06-04 r4109 [x86_64-linux-4.4.1-2-ARCH] (local build)
Copyright (C) 2002-15, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSEDI have followed this https://wiki.archlinux.org/index.php/SS … NCQ_errors and added the appropriate flag and still get the above error.
$ grep GRUB_CMDLINE_LINUX_DEFAULT /etc/default/grub
GRUB_CMDLINE_LINUX_DEFAULT="quiet loglevel=3 udev.log-priority=3 cryptdevice=/dev/disk/by-uuid/a5778458-94e1-46cd-8651-aba327ed87c3:cryptboot:allow-discards cryptdevice=/dev/disk/by-uuid/bb7396f5-f246-4edf-9f1f-298c9ca560ac:cryptroot:allow-discards cryptkey:/dev/disk/by-uuid/bb7396f5-f246-4edf-9f1f-298c9ca560ac:ext4:/crypto_keyfile.bin console=tty1 modprobe.blacklist=ehci_pci i915.semaphores=1 ipv6.disable=1 reboot=bios libata.force=noncq"$ cat /etc/fstab
# /dev/mapper/cryptroot
UUID=71b4e9d6-1c26-44ec-845d-13f0f3f4ee56 / ext4 rw,relatime,noatime,data=ordered 0 1
# /dev/mapper/cryptboot
UUID=31492f56-431c-4853-a85b-cfefc6bf256a /boot ext4 rw,relatime,noatime,data=ordered 0 2
/swapfile none swap defaults 0 0$ sudo cat /etc/crypttab
cryptboot /dev/sda1 /crypto_keyfile.bin allow-discardsOffline
My system freezes periodically and I get the following errors in the journal:
Feb 13 13:13:10 acer-c720 kernel: ata1.00: exception Emask 0x10 SAct 0x0 SErr 0x400100 action 0x6 frozen Feb 13 13:13:10 acer-c720 kernel: ata1.00: irq_stat 0x08000000, interface fatal error Feb 13 13:13:10 acer-c720 kernel: ata1: SError: { UnrecovData Handshk }
I've not come across the "UnrecovData Handshk" error before, but after a bit of searching online for those terms it would appear to be a hardware issue.
Some initial results...
- the "Drive interface issue #3" section of https://lime-technology.com/wiki/index. … ive_Issues maybe of interest.
- the "storage" section towards the top of this link maybe relevant: http://www.researchut.com/blog/lenovo-yoga-2-13-debian
Other search results point to SATA port issues, but I guess from your hostname you are using an acer c720 laptop with a pcie ssd?
You could try...
- opening up the case and reseating the ssd to see if it improves matters
- forcing 3.0 Gbps speed at boot time
- checking if your BIOS is up to date
Offline