
#1 2016-02-13 19:36:05

wba072
Member
Registered: 2010-11-11
Posts: 33

ata1.00: failed command: WRITE DMA, SSD speed limited to 3.0 Gbps

My system freezes periodically and I get the following errors in the journal:

Feb 13 13:13:10 acer-c720 kernel: ata1.00: exception Emask 0x10 SAct 0x0 SErr 0x400100 action 0x6 frozen
Feb 13 13:13:10 acer-c720 kernel: ata1.00: irq_stat 0x08000000, interface fatal error
Feb 13 13:13:10 acer-c720 kernel: ata1: SError: { UnrecovData Handshk }
Feb 13 13:13:10 acer-c720 kernel: ata1.00: failed command: WRITE DMA
Feb 13 13:13:10 acer-c720 kernel: ata1.00: cmd ca/00:18:70:b2:8b/00:00:00:00:00/e3 tag 30 dma 12288 out
                                           res 50/00:10:20:b2:8b/00:00:00:00:00/e3 Emask 0x10 (ATA bus error)
Feb 13 13:13:10 acer-c720 kernel: ata1.00: status: { DRDY }
Feb 13 13:13:10 acer-c720 kernel: ata1: hard resetting link
Feb 13 13:13:10 acer-c720 kernel: ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Feb 13 13:13:10 acer-c720 kernel: ata1.00: configured for UDMA/133
Feb 13 13:13:10 acer-c720 kernel: ata1: EH complete
Feb 13 13:13:10 acer-c720 kernel: ata1.00: exception Emask 0x10 SAct 0x0 SErr 0x400100 action 0x6 frozen
Feb 13 13:13:10 acer-c720 kernel: ata1.00: irq_stat 0x08000000, interface fatal error
Feb 13 13:13:10 acer-c720 kernel: ata1: SError: { UnrecovData Handshk }
Feb 13 13:13:10 acer-c720 kernel: ata1.00: failed command: WRITE DMA
Feb 13 13:13:10 acer-c720 kernel: ata1.00: cmd ca/00:50:38:80:36/00:00:00:00:00/e6 tag 10 dma 40960 out
                                           res 50/00:08:d0:7f:36/00:00:00:00:00/e6 Emask 0x10 (ATA bus error)
Feb 13 13:13:10 acer-c720 kernel: ata1.00: status: { DRDY }
Feb 13 13:13:10 acer-c720 kernel: ata1: hard resetting link
Feb 13 13:13:10 acer-c720 kernel: ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Feb 13 13:13:10 acer-c720 kernel: ata1.00: configured for UDMA/133
Feb 13 13:13:10 acer-c720 kernel: ata1: EH complete
Feb 13 13:13:11 acer-c720 kernel: ata1.00: exception Emask 0x10 SAct 0x0 SErr 0x400100 action 0x6 frozen
Feb 13 13:13:11 acer-c720 kernel: ata1.00: irq_stat 0x08000000, interface fatal error
Feb 13 13:13:11 acer-c720 kernel: ata1: SError: { UnrecovData Handshk }
Feb 13 13:13:11 acer-c720 kernel: ata1.00: failed command: WRITE DMA
Feb 13 13:13:11 acer-c720 kernel: ata1.00: cmd ca/00:50:38:80:36/00:00:00:00:00/e6 tag 12 dma 40960 out
                                           res 50/00:00:af:c2:e7/00:00:0e:00:00/e0 Emask 0x10 (ATA bus error)
Feb 13 13:13:11 acer-c720 kernel: ata1.00: status: { DRDY }
Feb 13 13:13:11 acer-c720 kernel: ata1: hard resetting link
Feb 13 13:13:11 acer-c720 kernel: ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Feb 13 13:13:11 acer-c720 kernel: ata1.00: configured for UDMA/133
Feb 13 13:13:11 acer-c720 kernel: ata1: EH complete
Feb 13 13:13:52 acer-c720 kernel: ata1: limiting SATA link speed to 3.0 Gbps
Feb 13 13:13:52 acer-c720 kernel: ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x400001 action 0x6 frozen
Feb 13 13:13:52 acer-c720 kernel: ata1: SError: { RecovData Handshk }
Feb 13 13:13:52 acer-c720 kernel: ata1.00: failed command: READ DMA
Feb 13 13:13:52 acer-c720 kernel: ata1.00: cmd c8/00:70:a8:70:ea/00:00:00:00:00/ec tag 17 dma 57344 in
                                           res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Feb 13 13:13:52 acer-c720 kernel: ata1.00: status: { DRDY }
Feb 13 13:13:52 acer-c720 kernel: ata1: hard resetting link
Feb 13 13:13:52 acer-c720 kernel: ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
Feb 13 13:13:52 acer-c720 kernel: ata1.00: configured for UDMA/133
Feb 13 13:13:52 acer-c720 kernel: ata1.00: device reported invalid CHS sector 0
Feb 13 13:13:52 acer-c720 kernel: ata1: EH complete
Feb 13 13:13:52 acer-c720 gnome-session[593]: Gjs-Messag

I had previously been getting different errors (https://bbs.archlinux.org/viewtopic.php?id=208861), but since disabling NCQ I get the errors above instead.

$ lspci |grep ATA
00:1f.2 SATA controller: Intel Corporation 8 Series SATA Controller 1 [AHCI mode] (rev 04)

Disk seems healthy:

$ sudo smartctl --attributes --log=selftest /dev/sda
smartctl 6.4 2015-06-04 r4109 [x86_64-linux-4.4.1-2-ARCH] (local build)
Copyright (C) 2002-15, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000a   100   100   000    Old_age   Always       -       0
  9 Power_On_Hours          0x0012   100   100   000    Old_age   Always       -       2832
 12 Power_Cycle_Count       0x0012   100   100   000    Old_age   Always       -       2460
168 Unknown_Attribute       0x0012   100   100   000    Old_age   Always       -       136
170 Unknown_Attribute       0x0013   100   100   010    Pre-fail  Always       -       35
173 Unknown_Attribute       0x0000   100   100   000    Old_age   Offline      -       26411045
192 Power-Off_Retract_Count 0x0012   100   100   000    Old_age   Always       -       84
194 Temperature_Celsius     0x0023   070   070   000    Pre-fail  Always       -       30
196 Reallocated_Event_Count 0x0000   100   100   000    Old_age   Offline      -       0
218 Unknown_Attribute       0x0000   100   100   000    Old_age   Offline      -       120
241 Total_LBAs_Written      0x0012   100   100   000    Old_age   Always       -       1850109

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Conveyance offline  Completed without error       00%      2831         -
# 2  Extended offline    Completed without error       00%      2831         -
# 3  Short offline       Completed without error       00%      2831         -
$ sudo smartctl -H /dev/sda
smartctl 6.4 2015-06-04 r4109 [x86_64-linux-4.4.1-2-ARCH] (local build)
Copyright (C) 2002-15, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

I followed https://wiki.archlinux.org/index.php/SS … NCQ_errors and added the libata.force=noncq flag, but I still get the above error.

$ grep GRUB_CMDLINE_LINUX_DEFAULT /etc/default/grub 
GRUB_CMDLINE_LINUX_DEFAULT="quiet loglevel=3 udev.log-priority=3 cryptdevice=/dev/disk/by-uuid/a5778458-94e1-46cd-8651-aba327ed87c3:cryptboot:allow-discards cryptdevice=/dev/disk/by-uuid/bb7396f5-f246-4edf-9f1f-298c9ca560ac:cryptroot:allow-discards cryptkey:/dev/disk/by-uuid/bb7396f5-f246-4edf-9f1f-298c9ca560ac:ext4:/crypto_keyfile.bin console=tty1 modprobe.blacklist=ehci_pci i915.semaphores=1 ipv6.disable=1 reboot=bios libata.force=noncq"
$ cat /etc/fstab 
# /dev/mapper/cryptroot
UUID=71b4e9d6-1c26-44ec-845d-13f0f3f4ee56	/         	ext4        rw,relatime,noatime,data=ordered	0 1

# /dev/mapper/cryptboot
UUID=31492f56-431c-4853-a85b-cfefc6bf256a	/boot     	ext4        rw,relatime,noatime,data=ordered	0 2

/swapfile                                   none        swap        defaults                            0 0
$ sudo cat /etc/crypttab
cryptboot      /dev/sda1                                    /crypto_keyfile.bin     allow-discards


#2 2016-02-20 23:14:37

paulkerry
Member
From: Sheffield, UK
Registered: 2014-10-02
Posts: 611

Re: ata1.00: failed command: WRITE DMA, SSD speed limited to 3.0 Gbps

wba072 wrote:

My system freezes periodically and I get the following errors in the journal:

Feb 13 13:13:10 acer-c720 kernel: ata1.00: exception Emask 0x10 SAct 0x0 SErr 0x400100 action 0x6 frozen
Feb 13 13:13:10 acer-c720 kernel: ata1.00: irq_stat 0x08000000, interface fatal error
Feb 13 13:13:10 acer-c720 kernel: ata1: SError: { UnrecovData Handshk }

I've not come across the "UnrecovData Handshk" error before, but after a bit of searching online for those terms it would appear to be a hardware issue.
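For reading the SErr values in the log, here is a rough sketch of a decoder. It assumes the bit positions match the SERR_* constants in the kernel's libata header as I remember them (check include/linux/libata.h for your kernel), and decode_serr is just a name made up for this example:

```shell
# decode_serr (hypothetical helper): map a SATA SError value to the flag
# names libata prints in "SError: { ... }".
# Assumption: bit positions follow the kernel's SERR_* definitions.
# Plain POSIX sh, so the working variables are global.
decode_serr() {
    value=$(( $1 ))
    out=""
    for entry in \
        0:RecovData 1:RecovComm 8:UnrecovData 9:Persist 10:Proto 11:HostInt \
        16:PHYRdyChg 17:PHYInt 18:CommWake 19:10B8B 20:Dispar 21:BadCRC \
        22:Handshk 23:LinkSeq 24:TrStaTrns 25:UnrecFIS 26:DevExch
    do
        bit=${entry%%:*}     # part before the colon: bit number
        name=${entry#*:}     # part after the colon: flag name
        if [ $(( value & (1 << bit) )) -ne 0 ]; then
            out="${out:+$out }$name"
        fi
    done
    printf '%s\n' "$out"
}

decode_serr 0x400100   # prints: UnrecovData Handshk
decode_serr 0x400001   # prints: RecovData Handshk
```

Both values from the journal decode to the same names the kernel printed, so the common factor is the Handshk bit (bit 22), which is reported at the link level rather than by the drive's SMART data.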

Some initial results...
- the "Drive interface issue #3" section of https://lime-technology.com/wiki/index. … ive_Issues may be of interest.
- the "storage" section towards the top of this link may be relevant: http://www.researchut.com/blog/lenovo-yoga-2-13-debian

Other search results point to SATA port issues, but I guess from your hostname that you are using an Acer C720 laptop with a PCIe SSD?

You could try...
- opening up the case and reseating the ssd to see if it improves matters
- forcing 3.0 Gbps speed at boot time
- checking if your BIOS is up to date
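
For the second suggestion, a minimal sketch of what the kernel command line could look like. It assumes the existing libata.force=noncq entry is kept and the speed limit is appended to it as a comma-separated value; the "..." stands for the parameters already in /etc/default/grub and is not meant to be typed literally:

```shell
# /etc/default/grub (fragment)
# "..." is a placeholder for the existing parameters; only the
# libata.force value at the end changes.
GRUB_CMDLINE_LINUX_DEFAULT="... libata.force=noncq,3.0Gbps"
```

After editing, regenerate the config with grub-mkconfig -o /boot/grub/grub.cfg and reboot. The "SATA link up" line in the journal should then report 3.0 Gbps from the first boot message rather than only after the errors start.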

