Willi Fehler | 16 Apr 14:52 2014
Picon

[Check_mk (english)] Check_MK/megacli


Hello,

we are using Check_MK/megacli to monitor the Raid/BBU for our hosts.
Since we are using "MegaCLI SAS RAID Management Tool Ver 8.07.14 Dec
16, 2013" we have the problem that the BBU state is Optimal instead of
Operational.

CRIT - Battery State is Optimal, but should be Operational(!!), Charge
is 89 %

ii  check-mk-agent                     1.2.2p2-2
ii  check-mk-agent-logwatch            1.2.2p2-2

I found on http://mathias-kettner.de/check_mk_changelog_stable.html
that version 1.2.4b4 fixes some false alarms:

0441 FIX: megaraid_bbu: Fix several false alarms and cases where
inventory failed

Should we upgrade to the latest version?

Regards - Willi

--

-- 
Willi Fehler
Engineering

www.home24.de

(Continue reading)

José Miguel Almagro | 16 Apr 13:31 2014

[Check_mk (english)] Problem with Custom notification

Hello to all!

I'm trying to send SMS notifications.

In another nagios installation, I have already a script working.

But it seems that check_mk calls to the script without passing any 
variables I put in the text boxes.

So the result is an empty file.

This is the script, called sms2:

                 #!/bin/bash
                 # SMS (using SMS Server)
                 # Notification via sms using the Vector sms tools

                 if [ $# != 2 ]; then
                         echo 'Sintaxis: ./sms2 telefono "TEXTO A ENVIAR"';
                         exit 1;
                 fi

                 TMPFILE=`mktemp /tmp/sms.XXXXXXXXXX` || exit 1
                 BATCHFILE=`mktemp /tmp/bfile.XXXXXXXXXX` || exit 1
                 chmod a+rwx $TMPFILE
                 echo "To: 34$1" >> $TMPFILE
                 echo "" >> $TMPFILE
                 echo "$2" >> $TMPFILE

                 echo "put $TMPFILE" >> $BATCHFILE
(Continue reading)

Joel Stockley | 16 Apr 01:32 2014

[Check_mk (english)] Parents and unreachable child config

Hi there,

 

We are attempting to set up a parent/child relationship with a few of our servers to the gateway VM, and the gateway VM to our switch. I have set up the relationship, but cannot for the life of me find the correct way to remove the unreachable tag from the child hosts.

 

In one folder (New Zealand) I have all our NZ hosts. My current parent set up is 5 hyperV VMs > HyperV host. Hyper V host and a physical server > Gateway. Gateway > switch. What I would like to do is if the switch goes down, have only the switch notify us via our email alerts. If the Gateway goes down, only the gateway notify us. If the Hyper V host and all its VMs go down, only the HyperV host alert us.

 

In /omd/sites/xxxxxx/etc/check_mk/conf.d/wato/NewZealand/hosts.mk I have the following configured:

 

 

# Written by WATO

# encoding: utf-8

 

all_hosts += [

  "NZ_Printer|lan|prod|ping|wato|/" + FOLDER_PATH + "/",

  "NZ_Switch|lan|prod|snmp|snmp-v1|wato|/" + FOLDER_PATH + "/",

  "Gateway|lan|critical|snmp|snmp-v1|wato|/" + FOLDER_PATH + "/",

  "PhysicalSvr|cmk-agent|prod|lan|tcp|wato|/" + FOLDER_PATH + "/",

  "VM1|cmk-agent|prod|lan|tcp|wato|/" + FOLDER_PATH + "/",

  "Mail|cmk-agent|prod|lan|tcp|wato|/" + FOLDER_PATH + "/",

  "Lync|cmk-agent|prod|lan|tcp|wato|/" + FOLDER_PATH + "/",

  "HyperV|cmk-agent|prod|lan|tcp|wato|/" + FOLDER_PATH + "/",

  "TSServer|cmk-agent|prod|lan|tcp|wato|/" + FOLDER_PATH + "/",

  "WSUS|cmk-agent|prod|lan|tcp|wato|/" + FOLDER_PATH + "/",

]

 

# Explicit IP addresses

ipaddresses.update({'NZ_Printer': 'xxx.xxx.xxx.31',

'NZ_Switch': u'xxx.xxx.xxx.51',

'Gateway': u'xxx.xxx.xxx.1',

'PhysicalSvr': u'xxx.xxx.xxx.253',

'VM1': u'xxx.xxx.xxx.21',

'Mail': u'xxx.xxx.xxx.13',

'Lync': u'xxx.xxx.xxx.20',

'HyperV': u'xxx.xxx.xxx.4',

'TSServer': u'xxx.xxx.xxx.19',

'WSUS': u'xxx.xxx.xxx.14'})

 

# Settings for parents

extra_host_conf.setdefault('parents', []).extend(

  [('Gateway', ['PhysicalSvr']),

('HyperV,Gateway', ['VM1']),

('HyperV,Gateway', ['Lync']),

('Gateway', ['NZ_Switch']),

('HyperV,Gateway', ['Mail']),

('HyperV,Gateway', ['TSServer']),

('Gateway,NZ_Switch', ['NZ_Printer']),

('HyperV,Gateway', ['WSUS']),

('Gateway', ['HyperV'])])

 

host_contactgroups.append(

  ( ['IT'], [ '/' + FOLDER_PATH + '/' ], ALL_HOSTS ))

 

# Host attributes (needed for WATO)

host_attributes.update(

{'NZ_Printer': {'ipaddress': 'xxx.xxx.xxx.31',

                'parents': ['Gateway', 'NZ_Switch'],

                'tag_agent': 'ping'},

'NZ_Switch': {'ipaddress': u'xxx.xxx.xxx.51',

               'parents': ['Gateway'],

               'tag_agent': 'snmp-v1'},

'Gateway': {'ipaddress': u'xxx.xxx.xxx.1',

                'tag_agent': 'snmp-v1',

 

 

I believe that I need to remove the ‘u’ from the explicit IP addresses section for each of the child hosts to have that host not notify if they go down as well as their host. This would be our VM1, Mail, Lync, TSServer, and WSUS servers under the HyperV server afaik. I am unsure about the PhysicalSvr under the Gateway and hwo that would work, or the gateway to the switch.

 

Is this correct? How will this work with the proposed set of notifications above?

 

Thank you in advance for the assistance!

 

Regards

Joel

 

 

Joel Stockley


IT Support Engineer | Pure-IP

+64 9 984 1909 direct | 

Joel.Stockley <at> pure-ip.com

Offices: Auckland | Sydney | San Francisco | London
 

 

_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Dardo Valdez | 15 Apr 23:51 2014
Picon

[Check_mk (english)] check_hostgroup

Hello,

I got help from Jim in the last attempt to get this done, thanks.

Once again I tried to nail the detection of hosts status in a hostgroup using Livestatus, I'm pretty sure it's fully working (I've tested it using the fake status in check_mk and worked well), and now the variable reflect the right names of the information retrieved from livestatus.

Now I'm looking to add a filter to prevent the hosts in downtime to trigger alerts. And then probably coding it all as a rightful check for check_mk (perfdata and all).
There you go:

#!/bin/sh
# This check checks the overall state of a host group.
# It fetches the current state of all hosts of a group
# via livestatus (the path to the livestatus socket is
# provided as a third, optional parameter).
#
# Provide the name of the host group as first
# parameter.
#
# If you specify a number NUM as second parameter,
# then the state of this check is not considered to
# be the worst host state of the group but the
# NUM'th worst state. E.g. if there is one up
# and one unreachable host, then specifying 2 will
# result in a unreachable state and 3 in a Up state.
#
# WARN is considered to be worse than UNKNOWN.

GROUPNAME=$1
if [ -z "$GROUPNAME" ] ; then
    echo "Usage: $0 HOST_GROUP [NUM [SOCKET] ]"
    exit 3
fi
NUM=${2:-1}
SOCKET=${3:-$OMD_ROOT/tmp/run/live}

check ()
{
  IFS=\; read COUNT UP DOWN UNREACH
  if [ $DOWN -ge $NUM ]
  then
      status=DOWN
      exitcode=2
  elif [ $((UP + UNREACH)) -ge $NUM ] ; then
      status=UNREACH
      exitcode=1
  else
      status=OK
      exitcode=0
  fi
  echo "$status - $COUNT hosts: $DOWN Down, $UNREACH Unreachable, $UP Up"
  exit $exitcode
}

cat <<EOF | unixcat $SOCKET | check
GET hosts
Filter: groups >= $GROUPNAME
Stats: last_hard_state >= 0
Stats: last_hard_state = 0
Stats: last_hard_state = 1
Stats: last_hard_state = 2
EOF


Thanks for your time.

Libres Consultores
Dardo Valdez - ar.linkedin.com/in/dardovaldez/
Cel. 379-4316500
Skype: dardo_valdez
_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Picon

Re: [Check_mk (english)] zfsget problems since upgrade - possible bug

Solution found:

I replaced the zfsget in the checks folder with the recent one from git

 

 

--

 

 

Hi list,

 

i’ve upgrade check_mk to 1.2.5i2p1 today. Now I am getting those problems on all my solaris boxes

 

17:06:28 prod <at> nagios2 ~ cmk -nv --debug EDVSUN06                                                                                                                                                           1

Check_mk version 1.2.5i2p1

Connecting via TCP to 141.80.180.174:6556.

CPU load             OK - 15min load 0.01 at 4 CPUs

Number of threads    OK - 69 threads

TCP Connections      OK - ESTABLISHED: 11, TIME_WAIT: 2, BOUND: 3

Uptime               OK - up since Tue Apr 12 06:50:27 2011 (1099d 10:16:44)

fs_/                 OK - 64.4% used (7.53 of 11.7 GB), (levels at 75.22/87.61%), trend: +20.04kB / 24 hours

Traceback (most recent call last):

  File "/omd/sites/prod/local/share/check_mk/modules/check_mk.py", line 6050, in <module>

    do_check(hostname, ipaddress, check_types)

  File "/omd/sites/prod/local/share/check_mk/modules/check_mk_base.py", line 956, in do_check

    agent_version, num_success, error_sections, problems = do_all_checks_on_host(hostname, ipaddress, only_check_types)

  File "/omd/sites/prod/local/share/check_mk/modules/check_mk_base.py", line 1155, in do_all_checks_on_host

    result = check_function(item, params, info)

  File "/omd/sites/prod/local/share/check_mk/checks/zfsget", line 162, in check_zfsget

    entries = parse_zfsget(info)

  File "/omd/sites/prod/local/share/check_mk/checks/zfsget", line 101, in parse_zfsget

    zfsget_parse_df_info(parsed, info[start_of_df:])

  File "/omd/sites/prod/local/share/check_mk/checks/zfsget", line 109, in zfsget_parse_df_info

    for device, fs_type, kbytes, used, avail, percent, mountpoint in info:

ValueError: need more than 6 values to unpack

 

 

The info-object contains something like this

[['pool/coolstack', '52428800', '2106490', '50322309', '5%', '/opt/coolstack'], [….], [….], [….]]

which does not if the server is waiting for 7 arguments

 

looks like a bug to me.

 

best

 

--

Mathias Decker

Attachment (smime.p7s): application/pkcs7-signature, 7883 bytes
_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Craig Cook | 15 Apr 18:57 2014
Picon

Re: [Check_mk (english)] Memory and Interface percentage utilization PNP graphs

>For Memory and Interface utilization (for both servers and network
>devices), we need the PNP graphs in percentage in addition to absolute
>values.
>
>Has anyone done this before OR does anyone have any suggestions to achieve this?

This may be overkill for you, but we use graphios to send the check_mk metrics to graphite.  Graphite has the raw numbers and is an excellent trending tool.

Craig
_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Picon

[Check_mk (english)] zfsget problems since upgrade - possible bug

Hi list,

 

i’ve upgrade check_mk to 1.2.5i2p1 today. Now I am getting those problems on all my solaris boxes

 

17:06:28 prod <at> nagios2 ~ cmk -nv --debug EDVSUN06                                                                                                                                                           1

Check_mk version 1.2.5i2p1

Connecting via TCP to 141.80.180.174:6556.

CPU load             OK - 15min load 0.01 at 4 CPUs

Number of threads    OK - 69 threads

TCP Connections      OK - ESTABLISHED: 11, TIME_WAIT: 2, BOUND: 3

Uptime               OK - up since Tue Apr 12 06:50:27 2011 (1099d 10:16:44)

fs_/                 OK - 64.4% used (7.53 of 11.7 GB), (levels at 75.22/87.61%), trend: +20.04kB / 24 hours

Traceback (most recent call last):

  File "/omd/sites/prod/local/share/check_mk/modules/check_mk.py", line 6050, in <module>

    do_check(hostname, ipaddress, check_types)

  File "/omd/sites/prod/local/share/check_mk/modules/check_mk_base.py", line 956, in do_check

    agent_version, num_success, error_sections, problems = do_all_checks_on_host(hostname, ipaddress, only_check_types)

  File "/omd/sites/prod/local/share/check_mk/modules/check_mk_base.py", line 1155, in do_all_checks_on_host

    result = check_function(item, params, info)

  File "/omd/sites/prod/local/share/check_mk/checks/zfsget", line 162, in check_zfsget

    entries = parse_zfsget(info)

  File "/omd/sites/prod/local/share/check_mk/checks/zfsget", line 101, in parse_zfsget

    zfsget_parse_df_info(parsed, info[start_of_df:])

  File "/omd/sites/prod/local/share/check_mk/checks/zfsget", line 109, in zfsget_parse_df_info

    for device, fs_type, kbytes, used, avail, percent, mountpoint in info:

ValueError: need more than 6 values to unpack

 

 

The info-object contains something like this

[['pool/coolstack', '52428800', '2106490', '50322309', '5%', '/opt/coolstack'], [….], [….], [….]]

which does not if the server is waiting for 7 arguments

 

looks like a bug to me.

 

best

 

--

Mathias Decker

Attachment (smime.p7s): application/pkcs7-signature, 7883 bytes
_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Dhawal Doshy | 15 Apr 13:53 2014
Picon

[Check_mk (english)] Memory and Interface percentage utilization PNP graphs

Hello List,

For Memory and Interface utilization (for both servers and network
devices), we need the PNP graphs in percentage in addition to absolute
values.

Has anyone done this before OR does anyone have any suggestions to achieve this?

Thanks in advance.

- dhawal
Renan Guerra Nannetti | 14 Apr 13:06 2014
Picon

[Check_mk (english)] bagios + check_mk not receiving snmp traps

Folks,

I'm not getting the SNMP Traps on my Event Console...
I've enabled MKEVENTD and also modified the etc/init.d/mkeventd start line including :

${DAEMON}_open514 --syslog --syslog-fd 3 --snmptrap --snmptrap-fd 5

I've also, on my monitored host, included the below lines in the snmpd.conf :

trapsink <nagios_server_ip> <community>
trap2sink <nagios_server_ip> <community>

However, it's not working on Nagios+Check_mk...
The host is does sending snmp traps, once I'm getting those traps displayed in the monitor system that I'm migrating from (ZenOss Core).

Using OMD 1.11 (Check_MK 1.2.4).

Would appreciate any help on this subject.

Best,


renan.
_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Henry Huang | 14 Apr 12:01 2014
Picon

[Check_mk (english)] Share HTTP Response code metric and monitoring

Hi,

I wrote a custom bash script that will collect Apache http response code and graph them on PNP4Nagios.



Thanks,

Henry
_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Henri Wahl | 14 Apr 11:35 2014
Picon

[Check_mk (english)] Adding SNMP MIBs to Check_MK

Hi list,
we need to add some extra MIBs to our Check_MK. Does anybod have a hint
where they have to be added if it is possible at all?
Regards
Henri Wahl

-- 
Henri Wahl

IT Department
Leibniz-Institut fuer Festkoerper- u.
Werkstoffforschung Dresden

tel: (03 51) 46 59 - 797
email: h.wahl@...
http://www.ifw-dresden.de

Nagios status monitor Nagstamon:
http://nagstamon.ifw-dresden.de

DHCPv6 server dhcpy6d:
http://dhcpy6d.ifw-dresden.de

IFW Dresden e.V., Helmholtzstrasse 20, D-01069 Dresden
VR Dresden Nr. 1369
Vorstand: Prof. Dr. Juergen Eckert, Dr. h.c. Dipl.-Finw. Rolf Pfrengle

_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

Gmane