Questions tagged [smart]
Self-Monitoring, Analysis and Reporting Technology
215 questions
0
votes
0
answers
138
views
How to monitor health of USB SSD that does not fully support SMART?
I have USB SSD that does not fully support SMART. I would like to get warning that I should buy replacement before it's too late. What are my options?
Here is the output of smartctl -a -d scsi /dev/...
0
votes
1
answer
312
views
Seagate Exos X18 shows excessive Total_LBAs_Read value in S.M.A.R.T on FW SN02
I have Grafana set up to monitor S.M.A.R.T of my drives in my NAS using prometheus. What caught my eye was that some of the drives show they have over 5 PB (Peta-bytes) read, which is very weird, ...
0
votes
1
answer
727
views
Can one use "Percentage Used" reliably on Samsung EVO NVMe disks?
There are 2 disks in the server, both are Samsung SSD 970 EVO Plus 2TB and with 2B2QEXM7 firmware.
# smartctl -A /dev/nvme0
smartctl 7.1 2019-12-30 r5022 [x86_64-linux-5.4.0-187-generic] (local build)
...
0
votes
2
answers
330
views
How to detect smartd errors from logs
I want to monitor multiple servers, and want to know when a drive fails.
I have a script which monitors /var/log/syslog. What i want is a regex that matches only when a drive has a big problem. I do ...
0
votes
1
answer
241
views
Can a faulty cable (or other issue) cause SMART errors with a perfectly good drive?
On a lot of my drives I get WRITE|READ FPDMA QUEUED errors, like so:
Error 23 occurred at disk power-on lifetime: 43915 hours (1829 days + 19 hours)
When the command that caused the error occurred, ...
0
votes
2
answers
320
views
HDD failed, but can PCB replacement fix it?
My WD RED WD80EFAX HDD suddenly died last week: I shut down my Proxmox server, booted it up again and the drive started "clicking". It was clicking for a while, until it stopped and no ...
1
vote
1
answer
236
views
RAID1 drive vanished, but could be re-added after reboot. Should I be worried?
I am running Debian 12 with an MD RAID1 array (2 drives) for my personal data (no system files on array).
Today I got a mail from mdadm about a DegradedArray event, during a time where my drives are ...
0
votes
1
answer
427
views
Scheduling SMART, scrub, resilvering considerations, conflicting?
I have set up regular scrub on my ZFS pools and am beginning to look at setting up smart check every now and then, but I have a distinct memory of an entry I am unable to find that one should not ...
0
votes
0
answers
242
views
Many drives in two RAID6 arrays simultaneously failed, seem to be working after reboot except SMART long test
in my storage server, I operate three RAID6 Linux software arrays. Everything was working fine until it was not.
There are two RAID6 arrays and one RAID5 array, all consisting of SATA drives, all ...
1
vote
0
answers
183
views
Backup drive on Linux system keeps getting corrupt. How to troubleshoot?
My webserver (Debian BullsEye) has an internal 2TB harddrive which sole purpose is to hold a backup of the server's main HD. Every day a cronjob is started that creates a backup using rsync. But every ...
0
votes
2
answers
1k
views
Interpret smartctl -a output
Please help me understand this
root@bdb16e4bb2e3:/opt/scrutiny# smartctl --all /dev/sdb
smartctl 7.2 2020-12-30 r5155 [aarch64-linux-5.15.0-1024-raspi] (local build)
Copyright (C) 2002-20, Bruce Allen,...
1
vote
0
answers
140
views
APC change to smart slot / card interface
I bought a new APC SMT1000C 1000VA UPS, which has the smart card interface on the back. I was planning to reuse my old AP9606 network card, but I discovered that APC has modified the slot to block ...
2
votes
0
answers
481
views
How to add to excludes alerts on smartmontool
I faced with problem and hope for your help.
Started getting notifications from smart on Debian 10 server:
Device: /dev/nvme1, Critical Warning (0x04): Reliability
Found that this alert causing ...
1
vote
0
answers
2k
views
How to interpret smartctl output
I ran a SMART scan of the ssd of my server, and I'm having difficulties to understand the output. Any insights please ?
Thanks
smartctl 7.0 2018-12-30 r4883 [x86_64-linux-3.10.0-1160.71.1.el7.x86_64] (...
1
vote
1
answer
3k
views
btrfs - failing disk generated checksum errors, disk replaced, errors remain
I had a pair of 3TB disks in a btrfs raid1 array.
One of these disks started failing (smartd shows bad sectors), and so I bought a pair of new 8TB drives to replace both disks in the array.
I replaced ...