Bruno Henrique Barbosa | 17 Jul 21:27 2014
Picon

[Check_mk (english)] Host notifications issue

Hi everyone,

I'm having some trouble with host notifications by mail. Environment is CentOS 6.5, Nagios 4.0.7 with Check_MK version 1.2.5i3.

I'm testing sending notifications, so I setup one host with WATO and created a ruleset to receive this host notifications. I've supressed service notifications for this it so I could only check for host notifications (which don't flap), but even though I'm forcing the host_subject parameter on my ruleset, I still receive the message like this:

Subject: Check_MK: MYHOST/$ OK -> $    July 17 2014 16:11     
From:        Nagios  
To:    bruno-barbosa-Fyimt0/DmrQ39yzSjRtAkw@public.gmane.org
Host    MYHOST (myhost)
Service    $
Event    OK → $
Address    154.0.0.11
Date / Time    Thu Jul 17 16:11:37 BRT 2014
Plugin Output    $
Additional Output    $
Performance Data    $

In order to test, I've set on my user's notification ruleset, under Notification Method, the field 'Subject for Host Notifications' as this: $HOSTNAME$ is $HOSTSHORTSTATE$

Input for debug (/var/lib/check_mk/notify/notify.log):

2014-07-17 16:11:38 ----------------------------------------------------------------------
2014-07-17 16:11:38 Got raw notification context with 48 variables
2014-07-17 16:11:38 Raw notification context:
                    CONTACTEMAIL=
                    CONTACTNAME=check-mk-notify
                    CONTACTPAGER=
                    DATE=07-17-2014
                    HOSTADDRESS=154.0.0.11
                    HOSTALIAS=myhost
                    HOSTATTEMPT=1
                    HOSTCHECKCOMMAND=check-mk-host-ping
                    HOSTDOWNTIME=0
                    HOSTNAME=MYHOST
                    HOSTNOTIFICATIONNUMBER=1
                    HOSTOUTPUT=CRITICAL - 154.0.0.11: rta nan, lost 100%
                    HOSTPERFDATA=rta=0,000ms;200,000;500,000;0; pl=100%;40;80;; rtmax=0,000ms;;;; rtmin=0,000ms;;;;
                    HOSTPROBLEMID=30
                    HOSTSTATE=DOWN
                    HOSTSTATEID=1
                    HOSTTAGS=lan prod ping wato /wato/cameras/
                    HOST_EC_CONTACT=$
                    HOST_SL=$
                    LASTHOSTSTATE=DOWN
                    LASTHOSTSTATECHANGE=1405624297
                    LASTHOSTSTATEID=1
                    LASTHOSTUP=1405624278
                    LASTSERVICEOK=$
                    LASTSERVICESTATE=$
                    LASTSERVICESTATECHANGE=$
                    LASTSERVICESTATEID=$
                    LONGDATETIME=Thu Jul 17 16:11:37 BRT 2014
                    LONGHOSTOUTPUT=
                    LONGSERVICEOUTPUT=$
                    NOTIFICATIONAUTHOR=
                    NOTIFICATIONAUTHORALIAS=
                    NOTIFICATIONAUTHORNAME=
                    NOTIFICATIONCOMMENT=
                    NOTIFICATIONTYPE=PROBLEM
                    SERVICEATTEMPT=$
                    SERVICECHECKCOMMAND=$
                    SERVICEDESC=$
                    SERVICENOTIFICATIONNUMBER=$
                    SERVICEOUTPUT=$
                    SERVICEPERFDATA=$
                    SERVICEPROBLEMID=$
                    SERVICESTATE=$
                    SERVICESTATEID=$
                    SERVICE_EC_CONTACT=$
                    SERVICE_SL=$
                    SHORTDATETIME=07-17-2014 16:11:37
                    SVC_SL=$
2014-07-17 16:11:38 Computed variables:
                    CONTACTS=
                    HOSTSHORTSTATE=DOWN
                    HOSTURL=/check_mk/index.py?start_url=view.py%3Fview_name%3Dhoststatus%26host%3DMYHOST
                    LASTHOSTSHORTSTATE=DOWN
                    LASTHOSTSTATECHANGE_REL=0d 00:00:01
                    LASTHOSTUP_REL=0d 00:00:20
                    LASTSERVICEOK_REL=16268d 19:11:38
                    LASTSERVICESHORTSTATE=$
                    LASTSERVICESTATECHANGE_REL=16268d 19:11:38
                    LOGDIR=/var/lib/check_mk/notify
                    MAIL_COMMAND=mail -s '$SUBJECT$' '$CONTACTEMAIL$'
                    MONITORING_HOST=nagios
                    PREVIOUSHOSTHARDSHORTSTATE=UP
                    PREVIOUSHOSTHARDSTATE=UP
                    PREVIOUSSERVICEHARDSHORTSTATE=OK
                    PREVIOUSSERVICEHARDSTATE=OK
                    SERVICESHORTSTATE=$
                    SERVICEURL=/check_mk/index.py?start_url=view.py%3Fview_name%3Dservice%26host%3DMYHOST%26service%3D%24
                    WHAT=SERVICE
2014-07-17 16:11:38 Preparing rule based notifications
2014-07-17 16:11:38 Found 1 user specific rules
2014-07-17 16:11:38 Global rule 'Notify all contacts of a host/service via HTML email'...
2014-07-17 16:11:38  -> does not match: This rule is disabled
2014-07-17 16:11:38 User pr208223's rule 'teste'...
2014-07-17 16:11:38  -> matches!
2014-07-17 16:11:38    - adding notification of pr208223 via mail
2014-07-17 16:11:38 Executing 1 notifications:
2014-07-17 16:11:38   * notifying pr208223 via mail, parameters: host_subject, bulk: no
2014-07-17 16:11:38      executing /usr/share/check_mk/notifications/mail

_____________________

So I suppose the message should come instead with the subject: "MYHOST is DOWN" (or "MYHOST is UP") whenever the state changes. Still, it appears like Check_MK: MYHOST/$ OK -> $    {date} as above.

Service notifications work fine for me, but I'd like to receive host notifications normally (without those $), because most of my devices are going to be monitored like: "is it alive/down? send mail".

Hope I was clear on my doubt and will be glad if someone can help. Sorry for the bad English and wall of text :P
Thanks!
_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Mike Hanby | 17 Jul 19:42 2014
Picon

[Check_mk (english)] Git Error in WATO

Howdy,

Versions:
  * check_mk 1.5.2i4p2
  * omd 1.10

Lately most times when I make changes via WATO to hosts, I get the 
following error (I have git enabled in WATO):

"Error executing GIT command cd '/omd/sites/mysite/etc/check_mk' && git 
add *.d/wato 2>&1: The following paths are ignored by one of your 
.gitignore files: mkeventd.d/wato Use -f if you really want to add them. 
fatal: no files added"

Here are the .gitignore files under ~/etc/check_mk. Why is it trying to 
add mkeventd.d/wato?:

:~/etc/check_mk$ find . -name .gitignore -print -exec cat {} \;
./conf.d/.gitignore
*
!wato
!wato/*

./.gitignore
*
!*.d
!.gitignore

./multisite.d/.gitignore
*
!wato
!wato/*

I can manually run the git add command with -f, but I suspect that WATO 
shouldn't be trying to add files under mkeventd.d/wato to the git repo.

I have another server running the same versions of CheckMK and OMD with 
the same configurations and I've never gotten a git error.

Any thoughts what might be triggering this or where I can look to debug?

Thanks,

Mike
Sketch | 17 Jul 19:38 2014

[Check_mk (english)] NagVis multisite auth with LDAP

I spent a little time getting this working, mostly due to some of it not 
being very well documented, and because I confused authmodule and 
authorisationmodule.  Someone on IRC suggesed I post it for reference. I 
get the impression it is basically supposed to work out of the box in OMD, 
but for those not running OMD (like me), here it is.

This is with NagVis 1.8b4.  I haven't tried previous versions.

In CMK, the only thing required is to enabled "wato_write_nagvis_auth = 
True" in /etc/check_mk/multisite.mk

In nagvis.ini.php I have:

[global]
logonmodule="LogonMultisite"
logon_multisite_serials="/etc/nagios/auth.serials"
logon_multisite_secret="/etc/nagios/auth.secret"
logon_multisite_createuser=1
logon_multisite_createrole="Guests"

authorisationmodule="CoreAuthorisationModMultisite"
authorisation_multisite_file="/var/lib/check_mk/wato/auth/auth.php"
authorisation_group_backends="live_1"

That's it.  If you are using htpasswd authentication instead of LDAP, you 
probably have to add 'logon_multisite_htpasswd="/path/to/htpasswd"', as 
well, but I haven't tested that.

--
Sketch
Jason Humes | 16 Jul 19:54 2014
Picon

Re: [Check_mk (english)] check_mk unresponsive during change activations

All servers were using their own certificates.  I think part of the issue was how we redirect people who hit
the root URL and we force them into the multisite...so if they hit www.monitoringdemo.com they get
redirected to monitoringdemo.com/omdsitename/check_mk

Thanks


Jason 

-----Original Message-----
From: Jim Welch [mailto:jim.welch <at> oit.gatech.edu] 
Sent: Wednesday, July 16, 2014 8:14 AM
To: Jason Humes
Subject: Re: [Check_mk (english)] check_mk unresponsive during change activations

How are your SSL certs configured? Do all servers use their own cert or are they using a shared cert?

----------------------------

----- Original Message -----
From: Jason Humes <JHumes <at> acs.on.ca>
To: Dhawal Doshy <dhawal.doshy <at> gmail.com>, Andreas Döhler <andreas.doehler <at> gmail.com>
Cc: checkmk-en <at> lists.mathias-kettner.de
Sent: Wed, 16 Jul 2014 08:05:14 -0400 (EDT)
Subject: Re: [Check_mk (english)] check_mk unresponsive during change activations

How did you get it working?  We were able to get everything talking but with HTTPS enabled we had to keep
logging in between sites...I'm not very good with apache so I'm sure it was some failure on my part with
configuring apache.

Jason 

-----Original Message-----
From: Dhawal Doshy [mailto:dhawal.doshy <at> gmail.com]
Sent: Tuesday, July 15, 2014 11:26 AM
To: Andreas Döhler
Cc: Jason Humes; checkmk-en <at> lists.mathias-kettner.de
Subject: Re: [Check_mk (english)] check_mk unresponsive during change activations

Works fine with https here. We have about 25+ sites on HTTPS + 4 read-only masters on HTTP.

On Mon, Jul 14, 2014 at 10:58 PM, Andreas Döhler <andreas.doehler <at> gmail.com> wrote:
> OK all sites with HTTPS i don't tried yet :) I will have a look inside 
> my testing environment.
>
> br
> Andreas
>
> 2014-07-14 17:20 GMT+02:00, Jason Humes <JHumes <at> acs.on.ca>:
>> Hi
>> We tried to do this too but was unable to figure out how to get 
>> multiple sites working with HTTPS and not having to login multiple 
>> times…we were told it was not possible, thus we run everything on one huge server.
>>
>> Thanks
>>
>>
>> Jason
>>
>> From: Andreas Döhler [mailto:andreas.doehler <at> gmail.com]
>> Sent: Sunday, July 13, 2014 4:24 PM
>> To: Jason Humes
>> Cc: checkmk-en <at> lists.mathias-kettner.de
>> Subject: Re: [Check_mk (english)] check_mk unresponsive during change 
>> activations
>>
>> Like Dhawal already mentioned we use in our bigger installations one 
>> server only for web frontend. No checks running on this machine.
>> From there we configure all the checking machines as slaves. This is 
>> working without such problems.
>>
>> We experienced this problem first in installations with many users of 
>> the frontend (multisite + nagvis). Then it was decided to make a 
>> standalone frontend server and nearly all problems are gone.
>>
>> br
>> Andreas
>>
>> 2014-07-11 17:20 GMT+02:00 Jason Humes
>> <JHumes <at> acs.on.ca<mailto:JHumes <at> acs.on.ca>>:
>> We used to be able to get multiple hosts scanning for services in 
>> parallel and then save them all one at a time and activate all the 
>> changes at once…now we have to wait for one host to finish scanning 
>> before we can start the next as there is zero response during scanning, activating, etc.
>>
>> Thanks
>>
>>
>> Jason
>>
>> From: Pinkoski, David
>> [mailto:dPinkoski <at> dnps.com<mailto:dPinkoski <at> dnps.com>]
>> Sent: Friday, July 11, 2014 11:12 AM
>> To: Jason Humes; Jim Welch
>> Cc:
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>>
>> Subject: RE: [Check_mk (english)] check_mk unresponsive during change 
>> activations
>>
>> Could it be “retain_state_information=1 “ 
>> “state_retention_file=/usr/local/nagios/var/retention.dat” ?
>>
>> Perhaps changing WATO restart mode for Nagios from “restart” to “reload”
>> will help.
>>
>>
>> From:
>> checkmk-en-bounces <at> lists.mathias-kettner.de<mailto:checkmk-en-bounces
>>  <at> lists.mathias-kettner.de>
>> [mailto:checkmk-en-bounces <at> lists.mathias-kettner.de] On Behalf Of 
>> Jason Humes
>> Sent: Friday, July 11, 2014 10:55 AM
>> To: Jim Welch
>> Cc:
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> Subject: Re: [Check_mk (english)] check_mk unresponsive during change 
>> activations
>>
>> Hi
>> There are actually a number of events that can cause the system to 
>> become unresponsive until the task completes…the easiest ones to talk 
>> about being change activation and host service scanning.  For the 
>> full scan, yes just of a single host and it depends on the type of 
>> host, connection speed to the host, number of services, etc…but it 
>> will hang the system until the scan completes and returns the list of services to the GUI.
>>
>> The activation delay is from the time I click activate changes till 
>> the progress bar completes filling…there is no delay of the progress 
>> bar appearing on screen, it shows up instantly but hangs the system 
>> until it is done activation…approx. 150 seconds.
>>
>> Thanks
>>
>>
>> Jason
>>
>> From: Jim Welch [mailto:jim.welch <at> oit.gatech.edu]
>> Sent: Friday, July 11, 2014 10:28 AM
>> To: Jason Humes
>> Subject: Re: [Check_mk (english)] check_mk unresponsive during change 
>> activations
>>
>> What are you specific symptoms? Is there a long delay between the 
>> time you click the submit button and the time the progress bar appears?
>> (apparently that is when the snapshot is built) During the progress 
>> bar is when (AFAIK) nagios and apache are restarted, that takes up to
>> 75-90 seconds for me. (sometimes less...not sure if it's due to the 
>> load on the system, number of users or what)
>>
>> As for the full scan, is that doing a full scan of just one host? If 
>> so, how long is the delay?
>> ________________________________
>> From: "Jason Humes" <JHumes <at> acs.on.ca<mailto:JHumes <at> acs.on.ca>>
>> To: "Andreas Döhler"
>> <andreas.doehler <at> gmail.com<mailto:andreas.doehler <at> gmail.com>>, "Jim Welch"
>> <jim.welch <at> oit.gatech.edu<mailto:jim.welch <at> oit.gatech.edu>>
>> Cc:
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> Sent: Friday, July 11, 2014 10:19:37 AM
>> Subject: RE: [Check_mk (english)] check_mk unresponsive during change 
>> activations Sadly the problem still exists for me…wondering if the 
>> issue was that in my older installations I had updated check_mk 
>> inside of OMD manually from a version built from source…perhaps it 
>> was doing something different back then.
>>
>> Thanks anyways ☺
>>
>>
>> Jason
>>
>> From:
>> checkmk-en-bounces <at> lists.mathias-kettner.de<mailto:checkmk-en-bounces
>>  <at> lists.mathias-kettner.de>
>> [mailto:checkmk-en-bounces <at> lists.mathias-kettner.de] On Behalf Of 
>> Andreas Döhler
>> Sent: Thursday, July 10, 2014 12:28 PM
>> To: Jim Welch
>> Cc:
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> Subject: Re: [Check_mk (english)] check_mk unresponsive during change 
>> activations
>>
>> Nice to hear that it is solved. I was also not thinking about the 
>> snapshot files :)
>>
>> br
>> Andreas
>>
>> 2014-07-10 11:34 GMT+02:00 Jim Welch
>> <jim.welch <at> oit.gatech.edu<mailto:jim.welch <at> oit.gatech.edu>>:
>> Thanks! That was the clue I needed. I took the last snapshot and 
>> unzipped it, then untared files to see what was taking up all the space (and time).
>> The culprit was in the usersettings.tar file. (ldap-debug.log just 
>> like your case) Turns out I'd forgotten I'd turned on ldap debugging 
>> and the ldap-debug.log file had grown to 2.2G! I disabled logging in 
>> the global settings, deleted the existing log file and the snapshot 
>> portion of the process dropped to almost nothing. The rest of the 
>> process is about the same amount of time as running cmk -O manually 
>> so I'm fine with using the GUT, but I'll keep a copy of your script 
>> in case we need it in the future. I didn't know how to remove the 
>> 'pending' changes from the GUI so that is good information.
>> Thanks again,
>> <Jim>
>>
>>
>> ----- Original Message -----
>> From: "Ryan Moore" <ryan.moore <at> cpcc.edu<mailto:ryan.moore <at> cpcc.edu>>
>> To:
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> Sent: Wednesday, July 9, 2014 6:48:10 PM
>> Subject: Re: [Check_mk (english)] check_mk unresponsive during change 
>> activations
>>
>> That being said, I forgot to mention that using the script won't take 
>> WATO snapshots that can be used for backup/restore, so I still do 
>> occasionally use the GUI button after hours for that purpose. I 
>> haven't dug through the code that creates the snapshot, but I'm sure 
>> it'd be trivial to reproduce. That first delay is the snapshot 
>> getting created, check the size of your ~/etc directory as I believe 
>> that is what is in the snapshot primarily. I had a few very large log 
>> files in there (ldap debug I think) that was causing some very long 
>> delays as well since it was tar/gzip'ing a 700mb file every time.
>>
>>
>> Ryan Moore
>> Infrastructure Systems Analyst
>> Central Piedmont Community College
>> Information Technology Services
>>
>>
>> -------- Original Message  --------
>> Subject: Re: [Check_mk (english)] check_mk unresponsive during change 
>> activations
>> From: Ryan Moore <ryan.moore <at> cpcc.edu<mailto:ryan.moore <at> cpcc.edu>>
>> To:
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> <checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-
>> kettner.de>>
>> Date: 07/09/2014 06:42 PM
>>
>>> I've run into the same problem, and I believe it due to mod_python 
>>> being single threaded and the rebuild process is tied up during the 
>>> activation process. What I've done is I'm often on the console via 
>>> ssh during the day, and I created a very basic script to do the work:
>>>
>>> ---
>>> #!/bin/bash
>>> #
>>> # Simple rebuild script for Check_MK #
>>>
>>> # Rebuild the config and reload Nagios cmk -O STATUS=$?
>>>
>>> # if rebuild was good clear the pending log for wato (so the 'apply 
>>> changes' button in the GUI goes away) [ $STATUS -eq 0 ] && { echo 
>>> "Rebuild looks good, clearing pending log"; rm -f 
>>> ~/var/check_mk/wato/log/pending.log;} || echo "Uh oh, there was an 
>>> error!"
>>>
>>> --
>>>
>>> This allows me to still make changes via WATO, but activate them on 
>>> the console and not cause any interruptions to users. Perhaps one 
>>> day Multisite will work via WSGI or something else more modern than 
>>> mod_python.
>>>
>>>
>>> Ryan Moore
>>> Infrastructure Systems Analyst
>>> Central Piedmont Community College
>>> Information Technology Services
>>>
>>>
>>> -------- Original Message  --------
>>> Subject: Re: [Check_mk (english)] check_mk unresponsive during 
>>> change activations
>>> From: Jim Welch
>>> <jim.welch <at> oit.gatech.edu<mailto:jim.welch <at> oit.gatech.edu>>
>>> To: Andreas Döhler
>>> <andreas.doehler <at> gmail.com<mailto:andreas.doehler <at> gmail.com>>
>>> Cc:
>>> "checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-kettner.de>"
>>> <checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias
>>> -kettner.de>>
>>> Date: 07/09/2014 04:45 PM
>>>
>>>> Thanks for the tip. I've done that (Delay precompiling), but it 
>>>> still takes 2-4 minutes to commit a change. I've verified that the 
>>>> timestamps on the host check files do not change so it doesn't seem 
>>>> to be precompiling the checks. I'm not sure exactly what it's doing 
>>>> in the time between when I hit the button and before the progress 
>>>> bar appears. (1-3 minutes)
>>>>
>>>> -------------------------------------------------------------------
>>>> -----
>>>> *From: *"Andreas Döhler"
>>>> <andreas.doehler <at> gmail.com<mailto:andreas.doehler <at> gmail.com>>
>>>> *To: *"Jason Humes" <JHumes <at> acs.on.ca<mailto:JHumes <at> acs.on.ca>>
>>>> *Cc: *"Jim Welch"
>>>> <jim.welch <at> oit.gatech.edu<mailto:jim.welch <at> oit.gatech.edu>>,
>>>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias
>>>> -kettner.de>
>>>> *Sent: *Wednesday, July 9, 2014 4:14:35 PM
>>>> *Subject: *Re: [Check_mk (english)] check_mk unresponsive during 
>>>> change activations
>>>>
>>>> Hi,
>>>>
>>>> there is one option you can test inside your OMD setup.
>>>> To speedup the generation of configuration you can take a look at 
>>>> the option "Delay precompiling of host checks." found under "Global 
>>>> configuration settings"
>>>>
>>>> In bigger installations this will reduce the commit time by a big 
>>>> amount of time.
>>>>
>>>> br
>>>> Andreas
>>>>
>>>>
>>>> 2014-07-08 20:12 GMT+02:00 Jason Humes 
>>>> <JHumes <at> acs.on.ca<mailto:JHumes <at> acs.on.ca>
>>>> <mailto:JHumes <at> acs.on.ca<mailto:JHumes <at> acs.on.ca>>>:
>>>>
>>>>      Hi
>>>>      Sorry for the delay in responding...had a small leave from work.
>>>> We
>>>>      have 1000 hosts and about 30000 services and use just internal
>>>>      authentication.  It is such a pain having to wait to do changes at
>>>>      the end of the day so as to not interrupt the use of the 
>>>> system :(
>>>>
>>>>
>>>>      Jason
>>>>
>>>>
>>>>      -----Original Message-----
>>>>      From:
>>>> checkmk-en-bounces <at> lists.mathias-kettner.de<mailto:checkmk-en-bounc
>>>> es <at> lists.mathias-kettner.de>
>>>>
>>>> <mailto:checkmk-en-bounces <at> lists.mathias-kettner.de<mailto:checkmk-
>>>> en-bounces <at> lists.mathias-kettner.de>>
>>>>
>>>> [mailto:checkmk-en-bounces <at> lists.mathias-kettner.de<mailto:checkmk-
>>>> en-bounces <at> lists.mathias-kettner.de>
>>>>
>>>> <mailto:checkmk-en-bounces <at> lists.mathias-kettner.de<mailto:checkmk-
>>>> en-bounces <at> lists.mathias-kettner.de>>]
>>>> On Behalf Of
>>>>      Jim Welch
>>>>      Sent: Thursday, June 19, 2014 4:36 PM
>>>>      To:
>>>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias
>>>> -kettner.de>
>>>>
>>>> <mailto:checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-kettner.de>>
>>>>      Subject: Re: [Check_mk (english)] check_mk unresponsive during
>>>>      change activations
>>>>
>>>>      Yes, we see that behaviour on omd 1.10 (rhel6). I had to extend the
>>>>      apache request timeouts since the site may be locked up for 2-3
>>>>      minutes while activating changes. How many hosts/services are on
>>>>      your system? (~700/16000) What type of authentication do you use?
>>>>      (we enabled ldap authentication)
>>>>
>>>>
>>>>      ----- Original Message -----
>>>>      From: "Jason Humes" <JHumes <at> acs.on.ca<mailto:JHumes <at> acs.on.ca>
>>>> <mailto:JHumes <at> acs.on.ca<mailto:JHumes <at> acs.on.ca>>>
>>>>      To:
>>>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias
>>>> -kettner.de>
>>>>
>>>> <mailto:checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-kettner.de>>
>>>>      Sent: Thursday, June 19, 2014 4:25:08 PM
>>>>      Subject: [Check_mk (english)] check_mk unresponsive during change
>>>>      activations
>>>>
>>>>      Hi
>>>>      We've been running check_mk/omd for about two years now, since omd
>>>>      0.54 and currently at omd 1.1.  At some point through the upgrade
>>>>      life the system changed from how it was responding during change
>>>>      activations within WATO/multisite...it used to be that if an admin
>>>>      was activating changes, other users could still browse the 
>>>> multisite
>>>>      view...but currently it seems that the system becomes totally
>>>>      unresponsive during change activations.  Does anyone else 
>>>> experience
>>>>      this?  Is it expected?
>>>>
>>>>      Thanks
>>>>
>>>>
>>>>      Jason
>>>>
>>>>      _______________________________________________
>>>>      checkmk-en mailing list
>>>>
>>>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias
>>>> -kettner.de>
>>>>
>>>> <mailto:checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-kettner.de>>
>>>>      http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

>>>>      _______________________________________________
>>>>      checkmk-en mailing list
>>>>
>>>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias
>>>> -kettner.de>
>>>>
>>>> <mailto:checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-kettner.de>>
>>>>      http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

>>>>      _______________________________________________
>>>>      checkmk-en mailing list
>>>>
>>>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias
>>>> -kettner.de>
>>>>
>>>> <mailto:checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-kettner.de>>
>>>>      http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

>>>>
>>>>
>>>>
>> _______________________________________________
>> checkmk-en mailing list
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

>> _______________________________________________
>> checkmk-en mailing list
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

>>
>>
>>
>> _______________________________________________
>> checkmk-en mailing list
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

>>
>>
> _______________________________________________
> checkmk-en mailing list
> checkmk-en <at> lists.mathias-kettner.de
> http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

_______________________________________________
checkmk-en mailing list
checkmk-en <at> lists.mathias-kettner.de
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en


_______________________________________________
checkmk-en mailing list
checkmk-en <at> lists.mathias-kettner.de
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Andreas Döhler | 16 Jul 18:02 2014
Picon

Re: [Check_mk (english)] R: Restore from backup

For your last question there is one short answer.
Create a new test site and go to  the etc/check_mk/multisite.d/wato/ there you find a file hosttags.mk with all the standard tags.

Then all the tagging should work as expected.

br
Andreas


2014-07-16 17:23 GMT+02:00 Andrea Corazzari <ac-ou70kdGnqRc57InD3i3y51zrSV/HdtiB@public.gmane.org>:

Thank you all for the quick answer!

 

 

1.       Was this restore done with restoring a snapshot from the other WATO?

2.       If not you must control all user rights on the restored files and directories

 

1)      Yes, snapshot made form old WATO/Machine and restored on new installatio/machine

2)      Perhaps did you mean if so instead of if not? Anyway I’ll check everithing. Site name and user are the same, maibi UID issue?

 

Last (by now ;) ) question: In configuration -> new folder or new host the “builtin” (if I remember well) host tag Operating System (Windows, Linux, Network Device (SNMP) and Ping Only) is disappered.

I tried to recreate it manually and seemed OK buT all new host I create are treated querying TCP/6556.

 

Maybe there is some file that sould be copied manually. (I tried also to perform backup&restore via CLI as described in http://mathias-kettner.com/checkmk_backup.html)

 

Thank you all

 

All the best

Andrea

 

Da: Andreas Döhler [mailto:andreas.doehler-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org]
Inviato: mercoledì 16 luglio 2014 17:10
A: Andrea Corazzari
Cc: checkmk-en-qhrM8SXbD5JpaB0eVFyvwnWFp+d4uDoM@public.gmane.org
Oggetto: Re: [Check_mk (english)] R: Restore from backup

 

Was this restore done with restoring a snapshot from the other WATO?

If not you must control all user rights on the restored files and directories.

 

br

Andreas

 

2014-07-16 16:48 GMT+02:00 Andrea Corazzari <ac <at> piazzasandomenico.it>:

Some more details:
changes are effectively activated, and in conifg page Host and folder the Activate Button turn back to blue, but both in other wato pages and in pending.log all these changes are seen as pending.
As I said before changes are activated, but what could I do to have normal behaviour?
We are in a restore after a crash and data are taken from an old OVA and imported on a fresh installation, in few words we are in a hurry.

TIA
Regards
Andrea

-----Messaggio originale-----
Da: checkmk-en-bounces-qhrM8SXbD5JpaB0eVFyvwnWFp+d4uDoM@public.gmane.org [mailto:checkmk-en-bounces-qhrM8SXbD5JpaB0eVFyvwnWFp+d4uDoM@public.gmane.org] Per conto di Andrea Corazzari
Inviato: mercoledì 16 luglio 2014 14:20
A: checkmk-en-qhrM8SXbD5JpaB0eVFyvwnWFp+d4uDoM@public.gmane.org
Oggetto: [Check_mk (english)] Restore from backup


Hi all,
I’ve just restored a backup taken from my old installation to a new and fresh one.

OS (CentOS  6.4) and OMD (1.10) version are the same, but if I try to create a new host the default tag “Operating System” dos not appear, manually defined tags are present, moreover if I make some changes and activate them changes seem to be activate but the button Activate changes remains orange.

Are there some things in config files that should be copied by hand? Which are log files to check first in these cases?

Every hint will be very valued.

Thanks in advance

Regards
Andrea



_______________________________________________
checkmk-en mailing list
checkmk-en-qhrM8SXbD5JpaB0eVFyvwnWFp+d4uDoM@public.gmane.org
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en



_______________________________________________
checkmk-en mailing list
checkmk-en-qhrM8SXbD5JpaB0eVFyvwnWFp+d4uDoM@public.gmane.org
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

 


_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Andrea Corazzari | 16 Jul 14:19 2014
Picon

[Check_mk (english)] Restore from backup

Hi all,
I’ve just restored a backup taken from my old installation to a new and fresh one.

OS (CentOS  6.4) and OMD (1.10) version are the same, but if I try to create a new host the default tag
“Operating System” dos not appear, manually defined tags are present, moreover if I make some
changes and activate them changes seem to be activate but the button Activate changes remains orange.

Are there some things in config files that should be copied by hand? Which are log files to check first in
these cases?

Every hint will be very valued.

Thanks in advance

Regards
Andrea

_______________________________________________
checkmk-en mailing list
checkmk-en <at> lists.mathias-kettner.de
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Pawel Grzesik | 16 Jul 11:56 2014
Picon

[Check_mk (english)] Problem with percentage value in notification.

Hi All,

I have found some strange thing at my notification.

This is what I get from one of my server:
WARN - 80.0% used (700.62 of 876.0 GB), (levels at 70.0/80.0%), trend: -20.06GB / 24 hours

And then from the same server:
CRIT - 80.0% used (700.89 of 876.0 GB), (levels at 70.0/80.0%), trend: -19.88GB / 24 hours

It looks like something is wrong here.

First value should be I guess:
irb(main):002:0> 70062.0 / 876.00
=> 79.97945205479452

And the second:
irb(main):004:0> 70089.0 / 876.00
=> 80.01027397260275

It’s because it’s a integer with some rounded value? 

Thanks,
Pawel
Henri Wahl | 15 Jul 21:18 2014
Picon

[Check_mk (english)] Bulk host edit does not work in 1.2.4p5

Hi list,
am I doing it wrong or is this a newly introduced bug? If I want to edit
multiple hosts at once I get an error message claiming:

"Please select some hosts before doing bulk operations on hosts."

With 1.2.4p3 this worked. Can anyone confirm this behaviour?

Best regards
Henri

-- 
Henri Wahl

IT Department
Leibniz-Institut für Festkoerper- u.
Werkstoffforschung Dresden

tel: (03 51) 46 59 - 797
email: h.wahl@...
http://www.ifw-dresden.de

Nagios status monitor Nagstamon:
https://nagstamon.ifw-dresden.de

DHCPv6 server dhcpy6d:
https://dhcpy6d.ifw-dresden.de

IFW Dresden e.V., Helmholtzstraße 20, D-01069 Dresden
VR Dresden Nr. 1369
Vorstand: Prof. Dr. Manfred Hennecke, Dr. h.c. Dipl.-Finw. Rolf Pfrengle

_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Dhawal Doshy | 15 Jul 17:12 2014
Picon

[Check_mk (english)] Cisco - Ignore port up link down in inventory

Hello List, we have some customers who insist on leaving all ports
"administratively up", but there is nothing attached to these ports.
As a result, we see a lot of down interfaces in our list of services.

Is there a way to not inventory these ports in the first place? If
yes, would appreciate being pointed in the right direction.

- dhawal
Gozzini Davide | 15 Jul 16:41 2014
Picon

[Check_mk (english)] no tx buffer credits

Hi All,

 

i’m using check_mk 1.2.2p2 and I'm getting a lot of WARNINGs / CRITICALs about ratio of "no tx buffer credits" on ports on Brocade switches.

 

I have found these configuration files :

 

find -iname *fcpo*

./opt/omd/versions/1.00/share/check_mk/checks/brocade_fcport

./opt/omd/versions/1.00/share/check_mk/checks/mcdata_fcport

./opt/omd/versions/1.00/share/check_mk/pnp-templates/check_mk-brocade_fcport.php

./opt/omd/versions/1.00/share/check_mk/pnp-templates/check_mk-mcdata_fcport.php

./opt/omd/versions/1.00/share/check_mk/checkman/brocade_fcport

./opt/omd/versions/1.00/share/check_mk/checkman/mcdata_fcport

 

But even changing settings inside these files and rebooting check_mk server, nothing changes.

Maybe these files must be somewhere else or maybe i'm doing something wrong, is there someone that can help me about this issue please ?

 

Regards

 

-dg

Recordati Industria Chimica e farmaceutica S.p.A. Sede legale: via M. Civitali 1- 20148 Milano, Italia Capitale sociale: Euro 26.140.644,5 i.v. Reg. Imp. Milano n. 00748210150 DISCLAIMER: This e-mail and any file transmitted with it may contain material that is confidential, proprietary or legally privileged and is for the sole use of the intended recipient. If you are not the intended recipient of this e-mail, please do not read this e-mail and notify us immediately by reply e-mail or by telephone and then delete this message and any file attached from your system. You should not copy or use it for any purpose, disclose the contents of the same to any other person or forward it without express permission. Considering the means of transmission, we do not undertake any liability with respect to the secrecy and confidentiality of the information contained in this e-mail and in its attachments. Il presente messaggio di posta elettronica e ogni eventuale documento a quest'ultimo allegato potrebbe avere carattere riservato ed essere tutelato dal segreto professionale ed e' ad esclusivo utilizzo del destinatario indicato in indirizzo. Qualora non foste il destinatario del presente messaggio Vi preghiamo di volerci avvertire immediatamente tramite posta elettronica o telefonicamente e di cancellare il presente messaggio e ogni documento ad esso allegato dal Vostro sistema. E' vietata la duplicazione o l'utilizzo per qualunque fine del presente messaggio e di ogni documento ad esso allegato cosi' come la relativa divulgazione, distribuzione o inoltro a terzi senza l'espressa autorizzazione del mittente. Il mittente, in ragione del mezzo di trasmissione utilizzato, non assume alcuna responsabilita' in merito alla segretezza e riservatezza delle informazioni contenute nel presente messaggio e nei relativi allegati.
_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Troels Arvin | 15 Jul 16:14 2014
Picon

[Check_mk (english)] Dell KVM switch check

Hello,

At http://troels.arvin.dk/code/check_mk/dell_kvmswitch/ I've put the code 
for checking the operating state of a Dell KVM switch (via SNMP).

I propose the code for inclusion into check_mk.

--

-- 
Regards,
Troels Arvin <troels@...>
http://troels.arvin.dk/

Gmane