rouble | 1 Jul 2005 06:25
Picon

Re: Starting S.M.A.R.T. daemon: smartd (failed)

HI JLC,

Thats sort of what I was looking for. I'd be willing to help - only I
have no php skills.

Let me know, maybe I can help test it out. 

Were you planning on just hacking something up - or were you going to
actually add it to the phpsysinfo source code ?

Cheers
rouble

On 6/30/05, Jean-Luc Cooke <jlcooke <at> jlcooke.ca> wrote:
> I'll be making changes to phpsysinfo soon to do just this soon.
> 
> I want to track hd temps and link out to smartctl -a $dev.
> 
> Can you wait a week or two?
> 
> JLC
> 
> On Thu, Jun 30, 2005 at 04:02:42PM -0400, rouble wrote:
> > Thanks. That helped.
> >
> > One more question, Is there a way I can remotely monitor the smartd
> > process and status of the hard disks (preferably in a browser) ?
> >
> > Something along the lines of phpsysinfo
> > (http://phpsysinfo.sourceforge.net/phpsysinfo/?template=classic) which
(Continue reading)

cc | 4 Jul 2005 15:21

smartd and script call when errors

Hi,

I've got some problems to deal with smartd. This is my smartd.conf:

/dev/hdc -d ata -o on -S on -H -l selftest -m root <at> localhost -M exec
/root/diskfail.sh -s O/../.././..

and my /root/diskfail.sh script is:

#!/bin/bash
#generate report for disk failure
echo "$SMART_MESSAGE" > /root/report 
smartctl -l selftest /dev/hdc >> /root/report
lftp -e "mput report;exit" -u diskfailure,zopzop 192.168.1.1 > /dev/null 2>&1
rm -f /root/report

In my syslog I've :Jul  4 15:03:27 localhost smartd[4277]: Configuration file
/etc/smartd.conf parsed. 
Jul  4 15:03:27 localhost smartd[4277]: Device: /dev/hdc, opened 
Jul  4 15:03:27 localhost smartd[4277]: Device: /dev/hdc, found in smartd
database. 
Jul  4 15:03:28 localhost smartd[4277]: Device: /dev/hdc, enabled SMART
Attribute Autosave. 
Jul  4 15:03:28 localhost smartd[4277]: Device: /dev/hdc, enabled SMART
Automatic Offline Testing. 
Jul  4 15:03:28 localhost smartd[4277]: Device: /dev/hdc, is SMART capable.
Adding to "monitor" list. 
Jul  4 15:03:28 localhost smartd[4277]: Monitoring 1 ATA and 0 SCSI devices 
Jul  4 15:03:28 localhost smartd[4277]: Device: /dev/hdc, 330 Currently
unreadable (pending) sectors 
(Continue reading)

Andrew Yates | 4 Jul 2005 15:33
Picon

what does this error mean?

Last night I received an email from smartctl saying:
The following warning/error was logged by the smartd daemon:
Device: /dev/hda, ATA error count increased from 0 to 3

I ran two "smartctl -t long /dev/hda" checks within the 12 hours or so
since then, the output of 'smartctl -a /dev/hda' is listed below, at
the end of the email.

Does anyone know why this would happen? Should I buy a new drive and
get rid of this one, or is it just a false alarm?

# smartctl -a /dev/hda
smartctl version 5.33 [i386-pc-linux-gnu] Copyright (C) 2002-4 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF INFORMATION SECTION ===
Device Model:     ST3200822A
Serial Number:    3LJ1SEC0
Firmware Version: 3.01
User Capacity:    200,049,647,616 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   6
ATA Standard is:  ATA/ATAPI-6 T13 1410D revision 2
Local Time is:    Sun Jul  3 17:20:41 2005 EDT
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

(Continue reading)

Roderick Johnstone | 8 Jul 2005 14:40
Picon
Picon
Favicon

Fixing disk errors underneath linux software raid 5

Hi

I've got a linux software raid system set up. It is raid 5 across 4 disks.

We recently started using smartmontools and I have found that two disks 
in the raidset have failed an extended offline test.

/dev/hdg
# 1  Extended offline    Completed: read failure       10%     12581 
      490225453

/dev/hdi
# 1  Extended offline    Completed: read failure       10%     12518 
      490216008

hdg has one error logged but from ages ago, while hdi has no errors logged.

/dev/hdg reports:
Current_Pending_Sector 4
Offline_Uncorrectable  5

/dev/hdi reports:
Current_Pending_Sector 2
Offline_Uncorrectable  3

Unless I'm very unlucky the unreadable sectors on both disks aren't in 
the same file so the raid system can reconstruct the data correctly at 
the moment.

However, I'd like some advice on finding any files affected, and on 
(Continue reading)

Francesco Turco | 9 Jul 2005 12:48
Picon

selftest log is no more updated

hello,

my os is gentoo linux; smartmontools version is 5.33.
my hd is a Maxtor DiamondMax Plus 9 160GB.

i've got a problem. when i run a test (for example smartctl -t offline
/dev/hda or smartctl -t short /dev/hda), i wait for it to be finished,
but after i can't see a new entry in smartctl -l selftest /dev/hda. what
could it be? maybe i disabled something (i recently tried this command: 
smartctl --smart=on --offlineauto=on --saveauto=on /dev/hda)?

thanks,

fctk

-------------------------------------------------------
This SF.Net email is sponsored by the 'Do More With Dual!' webinar happening
July 14 at 8am PDT/11am EDT. We invite you to explore the latest in dual
core and dual graphics technology at this free one hour event hosted by HP,
AMD, and NVIDIA.  To register visit http://www.hp.com/go/dualwebinar
Helmut Manck | 9 Jul 2005 15:12
Picon

SNMP-Interface

Hi,

a time ago, I had the idea of making SMART-data available via SNMP. I
like this idea because there are lots of tools for monitoring and
graphing SNMP-data ( MRTG, OpenNMS, Cacti etc).
Attached is a patch which makes some of this SMART infomation available
via SNMP.
The patch modifies smartd in a way that in addition to doing its normal
jobs, smartd also serves the agentX-interface of net-snmp. The
agentX-interface works via a UNIX-socket-interface, so its not security
prob.

The patch was developed against net-snmp-5.2.1-3, You will need
net-snmp-devel, also, ( from net-snmp.sourceforge.net) to make it
compile. Anyway, it was tested on RedHat 3.0 und FC2 ( you have to
enable agentx in snmpd and make the tree readable, also).
For the moment, only ATA is supported. The MIB-tree I took is
not-official, some work to do here.

Usage example:

[root <at> tesla root]# snmpwalk -c public -v 2c localhost enterprises.99999
SMARTCTL-MIB::smartctlDevice.1 = STRING: "/dev/hda"
SMARTCTL-MIB::smartctlDevice.2 = STRING: "/dev/hdb"
SMARTCTL-MIB::smartctlType.1 = INTEGER: 1
SMARTCTL-MIB::smartctlType.2 = INTEGER: 1
SMARTCTL-MIB::smartctlPort.1 = INTEGER: 0
SMARTCTL-MIB::smartctlPort.2 = INTEGER: 0
SMARTCTL-MIB::smartctlModel.1 = STRING: "Maxtor 6Y120P0"
SMARTCTL-MIB::smartctlModel.2 = STRING: "Maxtor 6Y120P0"
(Continue reading)

Helmut Manck | 9 Jul 2005 15:29
Picon

3ware on ES 3.0

Hi,

had anyone success in using "smartctl -a -d 3ware,0 /dev/sda" on RedHat
ES3.0? I am getting the following result:

[root <at> server001 smartmontools-5.34]# ./smartctl -a -d 3ware,1 /dev/sda
smartctl version 5.34 [i686-pc-linux-gnu] Copyright (C) 2002-5 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

WARNING - NO DEVICE FOUND ON 3WARE CONTROLLER (disk 1)
Smartctl: Device Read Identity Failed (not an ATA/ATAPI device)

A mandatory SMART command failed: exiting. To continue, add one or more
'-T permissive' options.

Versions
======
Kernel: 2.4.21
Smartctl: 5.34 (cvs)

Best regards, Helmut Manck

-------------------------------------------------------
This SF.Net email is sponsored by the 'Do More With Dual!' webinar happening
July 14 at 8am PDT/11am EDT. We invite you to explore the latest in dual
core and dual graphics technology at this free one hour event hosted by HP,
AMD, and NVIDIA.  To register visit http://www.hp.com/go/dualwebinar
Boniforti Flavio | 9 Jul 2005 17:06
Picon
Favicon

What should I do?

I've discovered following messages from SMART:

Device: /dev/hdb, 1 Currently unreadable (pending) sectors
Device: /dev/hdb, 1 Offline uncorrectable sectors

What should I be doing to correct this behaviour?

Thanks.

--

-- 
Registered Linux User #251752
  --- VB LUG Co-Founder ---
Command, n.: Statement presented by a human and accepted by a computer 
in such a manner as to make the human feel as if he is in control.

-------------------------------------------------------
This SF.Net email is sponsored by the 'Do More With Dual!' webinar happening
July 14 at 8am PDT/11am EDT. We invite you to explore the latest in dual
core and dual graphics technology at this free one hour event hosted by HP,
AMD, and NVIDIA.  To register visit http://www.hp.com/go/dualwebinar
Andrea Riela | 11 Jul 2005 13:16
Picon

about self-test failed


Hi folks,

I've installed smartd on freebsd 5.4, but when I try a self-test I see:

Jul  9 02:23:48 asclepius smartd[494]: Device: /dev/da0, execute Short 
Self-Test failed (err: -1)
Jul  9 03:23:48 asclepius smartd[494]: Device: /dev/da0, execute Long 
Self-Test failed (err: -1)
Jul 10 02:23:48 asclepius smartd[494]: Device: /dev/da0, execute Short 
Self-Test failed (err: -1)
Jul 11 02:23:49 asclepius smartd[494]: Device: /dev/da0, execute Short 
Self-Test failed (err: -1)

My smartd.conf:

/dev/da0 -H -m admin <at> domain.dom
/dev/da0 -a -o on -S on -s (S/../.././02|L/../../6/03)

The system uses an Adaptec 2110S with two scsi disks in mirroring

Is it a correct configuration?
Any advice will be appreciated
Regards
Andrea
TiciĆ  | 11 Jul 2005 23:34
Picon

Re: What should I do?

On 7/9/05, Boniforti Flavio <bonnyfused <at> tiscali.it> wrote:
> I've discovered following messages from SMART:
> 
> Device: /dev/hdb, 1 Currently unreadable (pending) sectors
> Device: /dev/hdb, 1 Offline uncorrectable sectors
> 
> What should I be doing to correct this behaviour?
> 
> Thanks.

Hello.

dd if=/dev/hdb of=/dev/null, this read the entire hard disk and (may
be) force to reallocate the bad sectors.

good luck.

-------------------------------------------------------
This SF.Net email is sponsored by the 'Do More With Dual!' webinar happening
July 14 at 8am PDT/11am EDT. We invite you to explore the latest in dual
core and dual graphics technology at this free one hour event hosted by HP,
AMD, and NVIDIA.  To register visit http://www.hp.com/go/dualwebinar

Gmane