You are not logged in.
In kernel 4.16.2 and 4.16.3 I see these messages appearing esporadically :
Apr 20 14:20:23 arch udisksd[1345]: Error performing housekeeping for drive /org/freedesktop/UDisks2/drives/HGST_HTS721010A9E630_JR10006P0HNZ3E: Error updating SMART data: sk_disk_smart_read_data: Input/output error (udisks-error-quark, 0)
Apr 20 14:20:23 arch udisksd[1345]: Error performing housekeeping for drive /org/freedesktop/UDisks2/drives/Samsung_SSD_850_EVO_M_2e2_250GB_S33CNX0J644440Z: Error updating SMART data: sk_disk_smart_read_data: Input/output error (udisks-error-quark, 0)
And sometimes this :
Apr 20 14:10:23 arch udisksd[1345]: Error performing housekeeping for drive /org/freedesktop/UDisks2/drives/HGST_HTS721010A9E630_JR10006P0HNZ3E: Error updating SMART data: Error sending ATA command CHECK POWER MODE: Unexpected sense data returned:
0000: 01 00 1d 00 00 00 0e 09 0c 00 00 00 ff 00 00 00 ................
0010: 00 00 00 00 50 00 00 00 00 00 00 00 00 00 00 00 ....P...........
(g-io-error-quark, 0)
When I use kernel 4.15.x it doesn' t show those messages, I used smartctl extended test to check the two disks and it doesn't show any error if I'm reading it correctly:
$sudo smartctl -a /dev/sda
smartctl 6.6 2017-11-05 r4594 [x86_64-linux-4.16.3-1] (local build)
Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Family: Samsung based SSDs
Device Model: Samsung SSD 850 EVO M.2 250GB
Serial Number: S33CNX0J644440Z
LU WWN Device Id: 5 002538 d4215b1c4
Firmware Version: EMT21B6Q
User Capacity: 250,059,350,016 bytes [250 GB]
Sector Size: 512 bytes logical/physical
Rotation Rate: Solid State Device
Form Factor: M.2
Device is: In smartctl database [for details use: -P show]
ATA Version is: ACS-2, ATA8-ACS T13/1699-D revision 4c
SATA Version is: SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Fri Apr 20 14:25:39 2018 -05
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART Status not supported: Incomplete response, ATA output registers missing
SMART overall-health self-assessment test result: PASSED
Warning: This result is based on an Attribute check.
General SMART Values:
Offline data collection status: (0x00) Offline data collection activity
was never started.
Auto Offline Data Collection: Disabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 0) seconds.
Offline data collection
capabilities: (0x53) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
No Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 133) minutes.
SCT capabilities: (0x003d) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
5 Reallocated_Sector_Ct 0x0033 100 100 010 Pre-fail Always - 0
9 Power_On_Hours 0x0032 099 099 000 Old_age Always - 3391
12 Power_Cycle_Count 0x0032 099 099 000 Old_age Always - 694
177 Wear_Leveling_Count 0x0013 099 099 000 Pre-fail Always - 16
179 Used_Rsvd_Blk_Cnt_Tot 0x0013 100 100 010 Pre-fail Always - 0
181 Program_Fail_Cnt_Total 0x0032 100 100 010 Old_age Always - 0
182 Erase_Fail_Count_Total 0x0032 100 100 010 Old_age Always - 0
183 Runtime_Bad_Block 0x0013 100 100 010 Pre-fail Always - 0
187 Uncorrectable_Error_Cnt 0x0032 100 100 000 Old_age Always - 0
190 Airflow_Temperature_Cel 0x0032 065 045 000 Old_age Always - 35
195 ECC_Error_Rate 0x001a 200 200 000 Old_age Always - 0
199 CRC_Error_Count 0x003e 100 100 000 Old_age Always - 0
235 POR_Recovery_Count 0x0012 099 099 000 Old_age Always - 24
241 Total_LBAs_Written 0x0032 099 099 000 Old_age Always - 7870705350
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Short offline Completed without error 00% 3391 -
# 2 Extended offline Completed without error 00% 3391 -
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
255 0 65535 Read_scanning was never started
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
Should I be worried?, is a bug in kernel 4.16 and 4.15 is correct, or 4.16 is correct showing them?
EDIT: Or udisks2/udisksd needs to be updated for Kernel 4.16.x ...
Last edited by gnox (2018-04-20 19:50:13)
Offline
Is there any corresponding output from dmesg indicating an ATA error?
Offline
Is there any corresponding output from dmesg indicating an ATA error?
Hi, no
$dmesg | rg -i ata
...
[ 2.204327] ahci 0000:00:1f.2: AHCI 0001.0300 32 slots 6 ports 6 Gbps 0x12 impl SATA mode
[ 2.214580] ata1: DUMMY
[ 2.214584] ata2: SATA max UDMA/133 abar m2048@0xf741a000 port 0xf741a180 irq 29
[ 2.214585] ata3: DUMMY
[ 2.214586] ata4: DUMMY
[ 2.214589] ata5: SATA max UDMA/133 abar m2048@0xf741a000 port 0xf741a300 irq 29
[ 2.214590] ata6: DUMMY
[ 2.527986] ata5: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[ 2.528014] ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[ 2.529730] ata5.00: ACPI cmd ef/10:06:00:00:00:00 (SET FEATURES) succeeded
[ 2.529736] ata5.00: ACPI cmd f5/00:00:00:00:00:00 (SECURITY FREEZE LOCK) filtered out
[ 2.529741] ata5.00: ACPI cmd b1/c1:00:00:00:00:00 (DEVICE CONFIGURATION OVERLAY) filtered out
[ 2.529833] ata2.00: supports DRM functions and may not be fully accessible
[ 2.530804] ata2.00: disabling queued TRIM support
[ 2.530808] ata2.00: ATA-9: Samsung SSD 850 EVO M.2 250GB, EMT21B6Q, max UDMA/133
[ 2.530812] ata2.00: 488397168 sectors, multi 1: LBA48 NCQ (depth 31/32), AA
[ 2.531097] ata5.00: ATA-8: HGST HTS721010A9E630, JB0OA3J0, max UDMA/133
[ 2.531102] ata5.00: 1953525168 sectors, multi 16: LBA48 NCQ (depth 31/32), AA
[ 2.532928] ata5.00: ACPI cmd ef/10:06:00:00:00:00 (SET FEATURES) succeeded
[ 2.532935] ata5.00: ACPI cmd f5/00:00:00:00:00:00 (SECURITY FREEZE LOCK) filtered out
[ 2.532939] ata5.00: ACPI cmd b1/c1:00:00:00:00:00 (DEVICE CONFIGURATION OVERLAY) filtered out
[ 2.533939] ata2.00: supports DRM functions and may not be fully accessible
[ 2.534189] ata5.00: configured for UDMA/133
[ 2.534662] ata2.00: disabling queued TRIM support
[ 2.536455] ata2.00: configured for UDMA/133
[ 2.547038] scsi 1:0:0:0: Direct-Access ATA Samsung SSD 850 1B6Q PQ: 0 ANSI: 5
[ 2.547645] scsi 4:0:0:0: Direct-Access ATA HGST HTS721010A9 A3J0 PQ: 0 ANSI: 5
..
$dmesg -l err
[ 0.297199] ACPI Error: No handler for Region [EC__] (00000000465b2692) [EmbeddedControl] (20180105/evregion-166)
[ 0.297202] ACPI Error: Region EmbeddedControl (ID=3) has no handler (20180105/exfldio-299)
[ 0.297206] ACPI Error: Method parse/execution failed \_SB.PCI0.LPCB.EC._REG, AE_NOT_EXIST (20180105/psparse-550)
[ 0.783308] ACPI BIOS Error (bug): Failure looking up [\_SB.PCI0.GFX0.DD02._BCL], AE_NOT_FOUND (20180105/psargs-364)
[ 0.783314] ACPI Error: Method parse/execution failed \_SB.PCI0.PEG0.PEGP.DD02._BCL, AE_NOT_FOUND (20180105/psparse-550)
These last messages always appear since many kernel versions before, they doesnt affect anything afaik...
Only an aditional new message with the 4.16.x kernel that I see, but is unrelated, is with nvidia :
Apr 20 14:56:06 arch kernel: Bad or missing usercopy whitelist? Kernel memory exposure attempt detected from SLUB object 'nvidia_stack_cache' (offset 11440, size 3)!
Apr 20 14:56:06 arch kernel: WARNING: CPU: 5 PID: 13258 at mm/usercopy.c:81 usercopy_warn+0x7e/0xa0
Offline
are you using the BFQ I/O scheduler?
Offline