Chris | 27 Feb 14:49 2015
Picon

smartctl-timeouts_v1.0 scripts (scterc timeout fix)


Hi there,

here are some "smartctl-timeouts" scripts that fix commonly
mismatching defaults with drives that have no error recovery timeout
configured, which has often lead to data loss. (See the README for
further information.)

The scripts are the result of discussions on the linux-raid
mailinglist, but it may be best to ship the scripts with smartctl
already (possibly together with a default "non-redundant-disk" udev rule
to ensure proper default timeouts), and let redundancy controlling
packages like mdadm, lvm, btrfs etc. ship their own udev rules to call
the scripts appropriately.

Please respond whether you'd like to ship the scripts together with
smartctl.

Cheers,
Chris

In case the .zip file does not get through the mailinglist, it is
available in this listarchive:
http://comments.gmane.org/gmane.linux.raid/48193

------------------------------------------------------------------------------
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
(Continue reading)

radiatejava | 27 Feb 10:43 2015
Picon

smartmontools query

I have few doubts with smartmontools, can someone clarify ?

1. Shall I rely on smartctl -H <dev> to see if the device is in good
health ? Or do I need to do further selftest, short or long test ? My
aim is just to find if the disk is fine or not for read and write. We
have a high availability solution and we want to use this utility to
failover to standby node in case of any issue with the disk.

2. Health check with -H option or selftest or short test – are they
handled by the device driver independently ? Or they consume some CPU
cycles ? Any data read or write is involved to run these tests that
takes CPU times ?

3. What exactly -H checks ?

Appreciate reply, thanks!

------------------------------------------------------------------------------
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Smartmontools-support mailing list
Smartmontools-support <at> lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/smartmontools-support
Philippe Frossard | 19 Feb 09:02 2015
Picon

Fwd: Super Talent with SM223TM (Silicon Motion)


No news ?!

-------- Message transféré --------
Sujet : 	Super Talent with SM223TM (Silicon Motion)
Date : 	Thu, 12 Feb 2015 11:14:00 +0100
De : 	Philippe Frossard <philippe.frossard <at> free.fr>
Répondre à : 	philippe.frossard <at> free.fr
Pour : 	smartmontools-support <at> lists.sourceforge.net

Hi,
The controller SM223TM is not in smartctl database (see below),
Do you have an alternate program ?
Thanks.

smartctl -i /dev/hdc

smartctl version 5.38 [i686-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model:     SUPER TALENT
Serial Number:    $$$$$$$$
Firmware Version: 20080820
User Capacity:    16,039,018,496 bytes
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   4
ATA Standard is:  Exact ATA specification draft version not indicated
Local Time is:    Thu Feb 12 11:03:41 2015 CET
SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 85-87 don't show
(Continue reading)

Vladimir Stackov | 21 Feb 17:00 2015
Picon

JMB394 and S.M.A.R.T. passthrough

Greetings,

I have got an interesting SATA DAS device CFI-B4043JDGG [1] that was
based on JMB394 [2] PMP.
The problem is that I can't get SMART data from HDD through JMB394
with smartmontools [3].
The only way to get data is to use some proprietary tool named
"jmicron hw raid manager".
Seems like it uses some proprietary ATA extensions that wasn't
supported but smartmontools.

Could you advise me a way to sniff somehow that ATA commands?
If I can get them then I could try to implement something like
pmpjmicron (by analogy with usbjmicron).

[1] http://www.cfi-rus.ru/cfib4043jdgg-text (only russian)
[2] http://www.jmicron.com/PDF/brief/jmb394.pdf
[3] https://gist.github.com/Am1GO/f81029f3f2f6771e4281

--

-- 
Kind regards,
Vladimir.

------------------------------------------------------------------------------
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration & more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=190641631&iu=/4140/ostg.clktrk
(Continue reading)

Justin Pryzby | 12 Feb 18:48 2015

Re: FW: [hpsa] regular freezes with smartctl -d cciss, 0

[Re-adding smartmontools]

Just noticed: it looks like the freeze happened when my nagios check queried
SMART on the same drive as my shell loop.

On Thu, Feb 12, 2015 at 11:45:10AM -0600, Justin Pryzby wrote:
> Hi All,
> 
> Thanks for your response, and sorry for the delay; the server was inaccessible
> for the last 45 days (wouldn't boot, perhaps due to storage issues; and, IPMI
> ended up with ARP disabled, somehow..).
> 
> You're right: there are 9 drives, which I take to mean a 2 drive RAID1 (160GB),
> plus spare, and 6 drive RAID5 (3.7TB), plus spare.
> 
> All the kernel messages I gave before are also visible on 3.13.0-45-generic
> (ubuntu).
> 
> I found the following using commands like this:
> sudo smartctl -i -d cciss,10 /dev/sg0
> 
> cciss,10: SRC 8x6G
> cciss,9: DL18xG6BP
> cciss,8: Hitachi Ultrastar A7K2000 (HUA722010CLA330)
> cciss,7: GB1000EAMYC
> cciss,6: GB1000EAMYC
> cciss,5: Hitachi Ultrastar A7K2000 (HUA722010CLA330)
> cciss,4: Hitachi Ultrastar A7K2000 (HUA722010CLA330)
> cciss,3: Hitachi Ultrastar A7K2000 (HUA722010CLA330)
> cciss,2: GB0160EAFJE
(Continue reading)

Larry Rosenman | 11 Feb 18:56 2015

Pending Sectors, during LONG test

The last 2 times (If I recall correctly), I've gotten email from SmartD 
(FreeBSD 11) with complaints about the following 2 counters:

197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always 
       -       8
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   
Offline      -       8

Once the test completes, they seem(!) to go away.

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  
LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Self-test routine in progress 90%     20538     
     -

Do I have an issue?

Full smartctl -a:
borg.lerctr.org /home/ler $ sudo smartctl -a /dev/ada4
smartctl 6.3 2014-07-26 r3976 [FreeBSD 11.0-CURRENT amd64] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, 
www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda Green (AF)
Device Model:     ST2000DL003-9VT166
(Continue reading)

Lorne Wanamaker | 11 Feb 15:15 2015
Picon

E-mail Notification Issue

Hello,

I am having a hard time getting e-mail notifications from smartd. here is is my smart.conf

/dev/sda -a -d megaraid,4 -a -s (S/../.././02|L/../../6/03)
/dev/sda -a -d megaraid,5 -a -s (S/../.././02|L/../../6/05)
/dev/sda -a -d megaraid,6 -a -s (S/../.././02|L/../../6/07)
/dev/sda -a -d sat+megaraid,7 -a -s S/../.././02
/dev/sda -a -d megaraid,4 -H -m feldan1 <at> gmail.com.com -M exec /usr/local/bin/smartdnotify

Here is smartdnotify contents:

#! /bin/sh

# Send mail
echo "$SMARTD_MESSAGE" | mail -s "$SMARTD_FAILTYPE" "$SMARTD_ADDRESS"

# Notify user
wall "$SMARTD_MESSAGE"

I tested sendmail and it works fine. Do I need to add the bottom line for each device? Or is there a way to make one line send for warnings on all devices?

Kindest regards,
Lorne
------------------------------------------------------------------------------
Dive into the World of Parallel Programming. The Go Parallel Website,
sponsored by Intel and developed in partnership with Slashdot Media, is your
hub for all things parallel software development, from weekly thought
leadership blogs to news, videos, case studies, tutorials and more. Take a
look and join the conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Smartmontools-support mailing list
Smartmontools-support <at> lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/smartmontools-support
Tmore1 | 9 Feb 05:58 2015
Picon

1058:0830 WD My Passport Ultra external USB 1TB WDBZFP

This one isn't on the list, but it works with '-d sat' on Debian Wheezy.

<tmore1 <at> gmx.com>

------------------------------------------------------------------------------
Dive into the World of Parallel Programming. The Go Parallel Website,
sponsored by Intel and developed in partnership with Slashdot Media, is your
hub for all things parallel software development, from weekly thought
leadership blogs to news, videos, case studies, tutorials and more. Take a
look and join the conversation now. http://goparallel.sourceforge.net/
Rubén Merino Pesquera | 6 Feb 23:29 2015
Picon

Question about SMART 173 parameter...

Good afternoon,

I own a two year-old MBPr with an SSD SM256E, I was always trying to guess the units used in the smart parameter 173 reported by smartctl. For example:


ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x001a   200   200   000    Old_age   Always       -       0
  5 Reallocated_Sector_Ct   0x0033   100   100   000    Pre-fail  Always       -       0
  9 Power_On_Hours          0x0032   099   099   000    Old_age   Always       -       4801
 12 Power_Cycle_Count       0x0032   098   098   000    Old_age   Always       -       1701
169 Unknown_Attribute       0x0013   253   253   010    Pre-fail  Always       -       1005450172672
173 Wear_Leveling_Count     0x0032   194   194   100    Old_age   Always       -       17193435208

I always assumed it was measured in KiB (at least it was what made the most sense to me) but after reading this post in reddit of a user with a similar computer, (and believing that it can't possibly wrote more than 10 PB to his ssd), I think I am clearly mistaken.

Could you clarify it for me, please?

Kind regards and thanks in advance.
------------------------------------------------------------------------------
Dive into the World of Parallel Programming. The Go Parallel Website,
sponsored by Intel and developed in partnership with Slashdot Media, is your
hub for all things parallel software development, from weekly thought
leadership blogs to news, videos, case studies, tutorials and more. Take a
look and join the conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Smartmontools-support mailing list
Smartmontools-support <at> lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/smartmontools-support
Markus Naeher | 5 Feb 15:20 2015
Picon

Seagate STBV5000200 refuses (nearly) any smatrctl action

Hi,

I have bought a external USB3 harddisk Seagate Expansion Desktop 5TB (STBV5000200) and I tried to
self-test ist before using it.
Without the -T permissive options, I got this error message:
"A mandatory SMART command failed: exiting."

Then, I tried a few parameter combinations (all with -T verypermissive), but all of them also failed (see attachment).
I assume you need more data. Just tell me how I can provide it to you and I'll do.

Thanks and Regards,
Markus
root <at> TuxServer:~# smartctl -d sat,auto -T verypermissive -i -H -c /dev/sdn
smartctl 6.3 2014-07-26 r3976 [x86_64-linux-3.18.4-1-ARCH] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org

/dev/sdn [SCSI]: Device open changed type from 'sat,auto' to 'scsi'
=== START OF INFORMATION SECTION ===
Vendor:               Seagate
Product:              Expansion Desk
Revision:             0739
Compliance:           SPC-4
User Capacity:        5.000.981.073.920 bytes [5,00 TB]
Logical block size:   4096 bytes
scsiModePageOffset: response length too short, resp_len=12 offset=12 bd_len=8
scsiModePageOffset: response length too short, resp_len=12 offset=12 bd_len=8
>> Terminate command early due to bad response to IEC mode page

=== START OF READ SMART DATA SECTION ===

root <at> TuxServer:~# smartctl -d sat,12 -T verypermissive -i -H -c /dev/sdn
smartctl 6.3 2014-07-26 r3976 [x86_64-linux-3.18.4-1-ARCH] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org

Read Device Identity failed: scsi error unsupported field in scsi command

=== START OF INFORMATION SECTION ===
Device Model:     [No Information Found]
Serial Number:    [No Information Found]
Firmware Version: [No Information Found]
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   [No Information Found]
Local Time is:    Thu Feb  5 14:13:09 2015 CET
SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 82-83 don't show if SMART supported.
SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 85-87 don't show if SMART is enabled.
                  Checking to be sure by trying SMART RETURN STATUS command.
SMART support is: Unknown - Try option -s with argument 'on' to enable it.
Read SMART Data failed: scsi error unsupported field in scsi command

=== START OF READ SMART DATA SECTION ===
SMART Status command failed: scsi error unsupported field in scsi command
SMART overall-health self-assessment test result: UNKNOWN!
SMART Status, Attributes and Thresholds cannot be read.

root <at> TuxServer:~# smartctl -d sat,16 -T verypermissive -i -H -c /dev/sdn
smartctl 6.3 2014-07-26 r3976 [x86_64-linux-3.18.4-1-ARCH] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org

Read Device Identity failed: scsi error unsupported field in scsi command

=== START OF INFORMATION SECTION ===
Device Model:     [No Information Found]
Serial Number:    [No Information Found]
Firmware Version: [No Information Found]
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   [No Information Found]
Local Time is:    Thu Feb  5 14:13:13 2015 CET
SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 82-83 don't show if SMART supported.
SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 85-87 don't show if SMART is enabled.
                  Checking to be sure by trying SMART RETURN STATUS command.
SMART support is: Unknown - Try option -s with argument 'on' to enable it.
Read SMART Data failed: scsi error unsupported field in scsi command

=== START OF READ SMART DATA SECTION ===
SMART Status command failed: scsi error unsupported field in scsi command
SMART overall-health self-assessment test result: UNKNOWN!
SMART Status, Attributes and Thresholds cannot be read.

root <at> TuxServer:~# smartctl -d sat,12 -T verypermissive -s on /dev/sdn
smartctl 6.3 2014-07-26 r3976 [x86_64-linux-3.18.4-1-ARCH] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org

Read Device Identity failed: scsi error unsupported field in scsi command

SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 82-83 don't show if SMART supported.
SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 85-87 don't show if SMART is enabled.
                  Checking to be sure by trying SMART RETURN STATUS command.
SMART support is: Unknown - Try option -s with argument 'on' to enable it.=== START OF ENABLE/DISABLE
COMMANDS SECTION ===
SMART Enable failed: scsi error unsupported field in scsi command

root <at> TuxServer:~# smartctl -d sat,12 -T verypermissive -t long /dev/sdn
smartctl 6.3 2014-07-26 r3976 [x86_64-linux-3.18.4-1-ARCH] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org

Read Device Identity failed: scsi error unsupported field in scsi command

SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 82-83 don't show if SMART supported.
SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 85-87 don't show if SMART is enabled.
                  Checking to be sure by trying SMART RETURN STATUS command.
SMART support is: Unknown - Try option -s with argument 'on' to enable it.Read SMART Data failed: scsi error
unsupported field in scsi command

root <at> TuxServer:~# smartctl -d sat,16 -T verypermissive -s on /dev/sdn
smartctl 6.3 2014-07-26 r3976 [x86_64-linux-3.18.4-1-ARCH] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org

Read Device Identity failed: scsi error unsupported field in scsi command

SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 82-83 don't show if SMART supported.
SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 85-87 don't show if SMART is enabled.
                  Checking to be sure by trying SMART RETURN STATUS command.
SMART support is: Unknown - Try option -s with argument 'on' to enable it.=== START OF ENABLE/DISABLE
COMMANDS SECTION ===
SMART Enable failed: scsi error unsupported field in scsi command

root <at> TuxServer:~# smartctl -d sat,16 -T verypermissive -t long /dev/sdn
smartctl 6.3 2014-07-26 r3976 [x86_64-linux-3.18.4-1-ARCH] (local build)
Copyright (C) 2002-14, Bruce Allen, Christian Franke, www.smartmontools.org

Read Device Identity failed: scsi error unsupported field in scsi command

SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 82-83 don't show if SMART supported.
SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 85-87 don't show if SMART is enabled.
                  Checking to be sure by trying SMART RETURN STATUS command.
SMART support is: Unknown - Try option -s with argument 'on' to enable it.Read SMART Data failed: scsi error
unsupported field in scsi command

------------------------------------------------------------------------------
Dive into the World of Parallel Programming. The Go Parallel Website,
sponsored by Intel and developed in partnership with Slashdot Media, is your
hub for all things parallel software development, from weekly thought
leadership blogs to news, videos, case studies, tutorials and more. Take a
look and join the conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Smartmontools-support mailing list
Smartmontools-support <at> lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/smartmontools-support

Non-medium errors

Problem statement:

- High count of Non-medium errors noticed on on txslbds29071
     Errors Corrected by           Total   Correction     Gigabytes    Total
               ECC          rereads/    errors   algorithm      processed    uncorrected
           fast | delayed   rewrites  corrected  invocations   [10^9 bytes]  errors
read:          0     5283         0      5283      20419       6413.661           0
write:         0  5020619         0   5020619    4083105      55636.866           0
verify:        0        0         0         0        205          0.000           0

Non-medium error count:    24213

1.	What is the threshold for such non-medium errors
2.	The reason behind many non-medium errors?

------------------------------------------------------------------------------
Dive into the World of Parallel Programming. The Go Parallel Website,
sponsored by Intel and developed in partnership with Slashdot Media, is your
hub for all things parallel software development, from weekly thought
leadership blogs to news, videos, case studies, tutorials and more. Take a
look and join the conversation now. http://goparallel.sourceforge.net/

Gmane