You are not logged in.

#1 2020-07-14 21:53:51

kevin_arefer
Member
Registered: 2020-04-09
Posts: 3

Input output error

Hello,
I took a screenshot and when saving, the system got corrupted and every command threw I/O error.
The I/O error already told me that the problem was something related with the disk, but when viewing logs, got some "BIOS ERROR"
Can you help me detect if it is my ssd corrupted, sata cable or maybe just a motherboard controller?

̣̣Jul 14 17:08:33 archlinux kernel: ata2.00: exception Emask 0x10 SAct 0x800 SErr 0x400100 action 0x6 frozen
Jul 14 17:08:33 archlinux kernel: ata2.00: irq_stat 0x08000000, interface fatal error
Jul 14 17:08:33 archlinux kernel: ata2: SError: { UnrecovData Handshk }
Jul 14 17:08:33 archlinux kernel: ata2.00: failed command: WRITE FPDMA QUEUED
Jul 14 17:08:33 archlinux kernel: ata2.00: cmd 61/40:58:f0:38:b5/03:00:01:00:00/40 tag 11 ncq dma 425984 out
                                           res 40/00:5c:f0:38:b5/00:00:01:00:00/40 Emask 0x10 (ATA bus error)
Jul 14 17:08:33 archlinux kernel: ata2.00: status: { DRDY }
Jul 14 17:08:33 archlinux kernel: ata2: hard resetting link
Jul 14 17:08:33 archlinux kernel: ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jul 14 17:08:33 archlinux kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.SAT0.PRT1._GTF.DSSP], AE_NOT_FOUND (20200326/psargs-330)
Jul 14 17:08:33 archlinux kernel: ACPI Error: Aborting method \_SB.PCI0.SAT0.PRT1._GTF due to previous error (AE_NOT_FOUND) (20200326/psparse-529)
Jul 14 17:08:33 archlinux kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.SAT0.PRT1._GTF.DSSP], AE_NOT_FOUND (20200326/psargs-330)
Jul 14 17:08:33 archlinux kernel: ACPI Error: Aborting method \_SB.PCI0.SAT0.PRT1._GTF due to previous error (AE_NOT_FOUND) (20200326/psparse-529)
Jul 14 17:08:33 archlinux kernel: ata2.00: configured for UDMA/133
Jul 14 17:08:33 archlinux kernel: ata2: hard resetting link
Jul 14 17:08:33 archlinux kernel: ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jul 14 17:08:33 archlinux kernel: ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jul 14 17:08:33 archlinux kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.SAT0.PRT1._GTF.DSSP], AE_NOT_FOUND (20200326/psargs-330)
Jul 14 17:08:33 archlinux kernel: ACPI Error: Aborting method \_SB.PCI0.SAT0.PRT1._GTF due to previous error (AE_NOT_FOUND) (20200326/psparse-529)
Jul 14 17:08:33 archlinux kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.SAT0.PRT1._GTF.DSSP], AE_NOT_FOUND (20200326/psargs-330)
Jul 14 17:08:33 archlinux kernel: ACPI Error: Aborting method \_SB.PCI0.SAT0.PRT1._GTF due to previous error (AE_NOT_FOUND) (20200326/psparse-529)
Jul 14 17:08:33 archlinux kernel: ata2.00: configured for UDMA/133
Jul 14 17:08:33 archlinux kernel: ata2: EH complete
Jul 14 17:09:14 archlinux kernel: ata2.00: exception Emask 0x10 SAct 0x78000000 SErr 0x400100 action 0x6 frozen
Jul 14 17:09:14 archlinux kernel: ata2.00: irq_stat 0x08000000, interface fatal error
Jul 14 17:09:14 archlinux kernel: ata2: SError: { UnrecovData Handshk }
Jul 14 17:09:14 archlinux kernel: ata2.00: failed command: WRITE FPDMA QUEUED
Jul 14 17:09:14 archlinux kernel: ata2.00: cmd 61/40:d8:00:e8:62/05:00:02:00:00/40 tag 27 ncq dma 688128 out
                                           res 40/00:dc:00:e8:62/00:00:02:00:00/40 Emask 0x10 (ATA bus error)
Jul 14 17:09:14 archlinux kernel: ata2.00: status: { DRDY }
Jul 14 17:09:14 archlinux kernel: ata2.00: failed command: WRITE FPDMA QUEUED
Jul 14 17:09:14 archlinux kernel: ata2.00: cmd 61/08:e0:00:88:2e/00:00:00:00:00/40 tag 28 ncq dma 4096 out
                                           res 40/00:dc:00:e8:62/00:00:02:00:00/40 Emask 0x10 (ATA bus error)
Jul 14 17:09:14 archlinux kernel: ata2.00: status: { DRDY }
Jul 14 17:09:14 archlinux kernel: ata2.00: failed command: WRITE FPDMA QUEUED
Jul 14 17:09:14 archlinux kernel: ata2.00: cmd 61/c8:e8:40:ed:62/02:00:02:00:00/40 tag 29 ncq dma 364544 out
                                           res 40/00:dc:00:e8:62/00:00:02:00:00/40 Emask 0x10 (ATA bus error)
Jul 14 17:09:14 archlinux kernel: ata2.00: status: { DRDY }
Jul 14 17:09:14 archlinux kernel: ata2.00: failed command: WRITE FPDMA QUEUED
Jul 14 17:09:14 archlinux kernel: ata2.00: cmd 61/78:f0:08:f0:62/05:00:02:00:00/40 tag 30 ncq dma 716800 out
                                           res 40/00:dc:00:e8:62/00:00:02:00:00/40 Emask 0x10 (ATA bus error)
Jul 14 17:09:14 archlinux kernel: ata2.00: status: { DRDY }
Jul 14 17:09:14 archlinux kernel: ata2: hard resetting link
Jul 14 17:09:14 archlinux kernel: ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jul 14 17:09:14 archlinux kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.SAT0.PRT1._GTF.DSSP], AE_NOT_FOUND (20200326/psargs-330)
Jul 14 17:09:14 archlinux kernel: ACPI Error: Aborting method \_SB.PCI0.SAT0.PRT1._GTF due to previous error (AE_NOT_FOUND) (20200326/psparse-529)
Jul 14 17:09:14 archlinux kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.SAT0.PRT1._GTF.DSSP], AE_NOT_FOUND (20200326/psargs-330)
Jul 14 17:09:14 archlinux kernel: ACPI Error: Aborting method \_SB.PCI0.SAT0.PRT1._GTF due to previous error (AE_NOT_FOUND) (20200326/psparse-529)
Jul 14 17:09:14 archlinux kernel: ata2.00: configured for UDMA/133
Jul 14 17:09:14 archlinux kernel: ata2: EH complete
Jul 14 17:10:41 archlinux kernel: ata2.00: exception Emask 0x10 SAct 0x60000007 SErr 0x400100 action 0x6 frozen
Jul 14 17:10:41 archlinux kernel: ata2.00: irq_stat 0x08000000, interface fatal error
Jul 14 17:10:41 archlinux kernel: ata2: SError: { UnrecovData Handshk }
Jul 14 17:10:41 archlinux kernel: ata2.00: failed command: WRITE FPDMA QUEUED
Jul 14 17:10:41 archlinux kernel: ata2.00: cmd 61/08:00:18:36:30/00:00:01:00:00/40 tag 0 ncq dma 4096 out
                                           res 40/00:ec:c0:ec:9c/00:00:01:00:00/40 Emask 0x10 (ATA bus error)
Jul 14 17:10:41 archlinux kernel: ata2.00: status: { DRDY }
Jul 14 17:10:41 archlinux kernel: ata2.00: failed command: WRITE FPDMA QUEUED
Jul 14 17:10:41 archlinux kernel: ata2.00: cmd 61/08:08:c8:8e:30/00:00:01:00:00/40 tag 1 ncq dma 4096 out
                                           res 40/00:ec:c0:ec:9c/00:00:01:00:00/40 Emask 0x10 (ATA bus error)
Jul 14 17:10:41 archlinux kernel: ata2.00: status: { DRDY }
Jul 14 17:10:41 archlinux kernel: ata2.00: failed command: WRITE FPDMA QUEUED
Jul 14 17:10:41 archlinux kernel: ata2.00: cmd 61/08:10:20:ee:9c/00:00:01:00:00/40 tag 2 ncq dma 4096 out
                                           res 40/00:ec:c0:ec:9c/00:00:01:00:00/40 Emask 0x10 (ATA bus error)
Jul 14 17:10:41 archlinux kernel: ata2.00: status: { DRDY }
Jul 14 17:10:41 archlinux kernel: ata2.00: failed command: WRITE FPDMA QUEUED
Jul 14 17:10:41 archlinux kernel: ata2.00: cmd 61/40:e8:c0:ec:9c/00:00:01:00:00/40 tag 29 ncq dma 32768 out
                                           res 40/00:ec:c0:ec:9c/00:00:01:00:00/40 Emask 0x10 (ATA bus error)
Jul 14 17:10:41 archlinux kernel: ata2.00: status: { DRDY }
Jul 14 17:10:41 archlinux kernel: ata2.00: failed command: WRITE FPDMA QUEUED
Jul 14 17:10:41 archlinux kernel: ata2.00: cmd 61/38:f0:d8:fa:b7/03:00:01:00:00/40 tag 30 ncq dma 421888 out
                                           res 40/00:ec:c0:ec:9c/00:00:01:00:00/40 Emask 0x10 (ATA bus error)
Jul 14 17:10:41 archlinux kernel: ata2.00: status: { DRDY }
Jul 14 17:10:41 archlinux kernel: ata2: hard resetting link
Jul 14 17:10:42 archlinux kernel: ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jul 14 17:10:42 archlinux kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.SAT0.PRT1._GTF.DSSP], AE_NOT_FOUND (20200326/psargs-330)
Jul 14 17:10:42 archlinux kernel: ACPI Error: Aborting method \_SB.PCI0.SAT0.PRT1._GTF due to previous error (AE_NOT_FOUND) (20200326/psparse-529)
Jul 14 17:10:42 archlinux kernel: ACPI BIOS Error (bug): Could not resolve symbol [\_SB.PCI0.SAT0.PRT1._GTF.DSSP], AE_NOT_FOUND (20200326/psargs-330)
Jul 14 17:10:42 archlinux kernel: ACPI Error: Aborting method \_SB.PCI0.SAT0.PRT1._GTF due to previous error (AE_NOT_FOUND) (20200326/psparse-529)
Jul 14 17:10:42 archlinux kernel: ata2.00: configured for UDMA/133
Jul 14 17:10:42 archlinux kernel: ata2: EH complete
Jul 14 17:10:56 archlinux dbus-daemon[503]: [session uid=1000 pid=501] Activating service name='org.gnome.Screenshot' requested by ':1.39' (uid=1000 pid=803 comm="cinnamon --replace ")
Jul 14 17:10:57 archlinux dbus-daemon[503]: [session uid=1000 pid=501] Successfully activated service 'org.gnome.Screenshot'
Jul 14 17:10:58 archlinux gnome-screensho[7436]: Unable to select area using GNOME Shell's builtin screenshot interface, resorting to fallback X11.
Jul 14 17:11:02 archlinux gnome-screensho[7436]: Unable to use GNOME Shell's builtin screenshot interface, resorting to fallback X11.
Jul 14 17:11:02 archlinux dbus-daemon[408]: [system] Activating via systemd: service name='org.freedesktop.hostname1' unit='dbus-org.freedesktop.hostname1.service' requested by ':1.2534' (uid=1000 pid=7436 comm="/usr/bin/gnome-screenshot --gapplication-service ")
-- Reboot --

This is my disks stack:

NAME   MAJ:MIN RM   SIZE RO TYPE MOUNTPOINT
sda      8:0    0 223.6G  0 disk  # SSD WITH WINDOWS INSTALLED
├─sda1   8:1    0   529M  0 part 
├─sda2   8:2    0   100M  0 part 
├─sda3   8:3    0    16M  0 part 
└─sda4   8:4    0   223G  0 part 
sdb      8:16   0 111.8G  0 disk   # SSD WITH ARCH LINUX INSTALLED
├─sdb1   8:17   0   500M  0 part  # EFI PARTITION FOR ARCH
└─sdb2   8:18   0 111.3G  0 part /
sdc      8:32   0 931.5G  0 disk  # HDD FOR CROSS SYSTEM DATA STORAGE
├─sdc1   8:33   0   579M  0 part 
└─sdc2   8:34   0   931G  0 part /run/media/kevin/DATA

Motherboard: MSI H110M PRO VH PLUS.
I got this exact same error very frequently with a previous Arch Linux installation. Reinstalled the system and have used it with no problems for 2 months until now.
Any tips will be deeply appreciated

Offline

#2 2020-07-14 21:59:42

loqs
Member
Registered: 2014-03-06
Posts: 18,928

Re: Input output error

I would suggest running a S.M.A.R.T. self test on the device.  After the test has completed or if if the device does not support self tests post the output of:

# smartctl -a /dev/<device>

Offline

#3 2020-07-14 22:25:37

kevin_arefer
Member
Registered: 2020-04-09
Posts: 3

Re: Input output error

loqs wrote:

I would suggest running a S.M.A.R.T. self test on the device.  After the test has completed or if if the device does not support self tests post the output of:

# smartctl -a /dev/<device>

Thank you for your suggestion. I ran a long test and these are the results:

kevin@archlinux ~]$ sudo smartctl -a /dev/sdb
[sudo] password for kevin: 
smartctl 7.1 2019-12-30 r5022 [x86_64-linux-5.7.4-arch1-1] (local build)
Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Crucial/Micron BX/MX1/2/3/500, M5/600, 1100 SSDs
Device Model:     CT120BX500SSD1
Serial Number:    1943E3D1A61F
LU WWN Device Id: 0 000000 000000000
Firmware Version: M6CR013
User Capacity:    120,034,123,776 bytes [120 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Form Factor:      2.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 T13/2015-D revision 3
SATA Version is:  SATA 3.2, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue Jul 14 18:18:25 2020 -04
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
See vendor-specific Attribute list for marginal Attributes.

General SMART Values:
Offline data collection status:  (0x00)	Offline data collection activity
					was never started.
					Auto Offline Data Collection: Disabled.
Self-test execution status:      (  41)	The self-test routine was interrupted
					by the host with a hard or soft reset.
Total time to complete Offline 
data collection: 		(  120) seconds.
Offline data collection
capabilities: 			 (0x11) SMART execute Offline immediate.
					No Auto Offline data collection support.
					Suspend Offline collection upon new
					command.
					No Offline surface scan supported.
					Self-test supported.
					No Conveyance Self-test supported.
					No Selective Self-test supported.
SMART capabilities:            (0x0002)	Does not save SMART data before
					entering power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine 
recommended polling time: 	 (   2) minutes.
Extended self-test routine
recommended polling time: 	 (  10) minutes.

SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   100   100   050    Pre-fail  Always       -       0
  5 Reallocate_NAND_Blk_Cnt 0x0032   100   100   010    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   050    Old_age   Always       -       2346
 12 Power_Cycle_Count       0x0032   100   100   050    Old_age   Always       -       75
171 Program_Fail_Count      0x0032   100   100   050    Old_age   Always       -       0
172 Erase_Fail_Count        0x0032   100   100   050    Old_age   Always       -       0
173 Ave_Block-Erase_Count   0x0032   100   100   050    Old_age   Always       -       27
174 Unexpect_Power_Loss_Ct  0x0032   100   100   050    Old_age   Always       -       16
180 Unused_Reserve_NAND_Blk 0x0032   100   100   050    Old_age   Always       -       100
183 SATA_Interfac_Downshift 0x0032   100   100   050    Old_age   Always       -       24
184 Error_Correction_Count  0x0032   100   100   050    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   050    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   068   044   050    Old_age   Always   In_the_past 32 (Min/Max 23/56)
196 Reallocated_Event_Count 0x0032   100   100   050    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   100   100   050    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   100   050    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   100   100   050    Old_age   Always       -       177
202 Percent_Lifetime_Remain 0x0030   099   099   001    Old_age   Offline      -       99
206 Write_Error_Rate        0x002e   100   100   050    Old_age   Always       -       0
210 Success_RAIN_Recov_Cnt  0x0032   100   100   050    Old_age   Always       -       0
246 Total_LBAs_Written      0x0032   100   100   050    Old_age   Always       -       1842446720
247 Host_Program_Page_Count 0x0032   100   100   050    Old_age   Always       -       57576460
248 FTL_Program_Page_Count  0x0032   100   100   050    Old_age   Always       -       52235632

SMART Error Log Version: 1
Warning: ATA error count 0 inconsistent with error log pointer 4

ATA Error Count: 0
	CR = Command Register [HEX]
	FR = Features Register [HEX]
	SC = Sector Count Register [HEX]
	SN = Sector Number Register [HEX]
	CL = Cylinder Low Register [HEX]
	CH = Cylinder High Register [HEX]
	DH = Device/Head Register [HEX]
	DC = Device Command Register [HEX]
	ER = Error register [HEX]
	ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error -4 occurred at disk power-on lifetime: 0 hours (0 days + 0 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER ST SC SN CL CH DH
  -- -- -- -- -- -- --
  00 00 00 00 00 00 00   at LBA = 0x00000000 = 0

  Commands leading to the command that caused the error were:
  CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
  -- -- -- -- -- -- -- --  ----------------  --------------------
  61 10 18 08 89 b2 00 00      00:00:00.000  WRITE FPDMA QUEUED
  61 08 58 30 d0 d5 00 00      00:00:00.000  WRITE FPDMA QUEUED
  61 08 58 30 d0 d5 00 00      00:00:00.000  WRITE FPDMA QUEUED
  61 08 60 d0 62 9d 00 00      00:00:00.000  WRITE FPDMA QUEUED
  61 08 58 30 d0 d5 00 00      00:00:00.000  WRITE FPDMA QUEUED

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Interrupted (host reset)      90%      2346         -

Selective Self-tests/Logging not supported

Offline

#4 2020-07-14 22:31:35

loqs
Member
Registered: 2014-03-06
Posts: 18,928

Re: Input output error

# 1  Extended offline    Interrupted (host reset)      90%      2346         -

The self test was interrupted.  Though it is looking more likely the issue is with the controller or the connection.

Offline

Board footer

Powered by FluxBB