Tony Adams | 21 Jul 19:30 2014
Picon

[Check_mk (english)] Custom Active HTTP check assignment

Hello all,

I'm migrating from vanilla Nagios to Check_MK (subscription OMD) I'm struggling to find the most efficient and scalable method of assigning custom Active HTTP checks to hosts.  I would be grateful for any insight you might be able to share.  Here are my constraints and objectives:

I have many dozens of web applications to monitor across various single instance nodes and load balanced farms (in which case I need to test the nodes and the VIP).  Many hosts and VIPs are not dedicated to a single web app and instead host multiple virtual servers.

I need to check for a web application's return string, but I can't enforce a standard page to call or string to expect.  It could be "index.cfm" with "alive" on one farm and "menu.maf" and "Edwards" on another farm, etc.   At this point I've decided to create a separate active HTTP check for each web app, i.e. "HTTP-BLACKSMITH", "HTTP-E1", "HTTP-VLTRADER", etc.   This is how the services were named and defined in Nagios, and the assignment of services to hosts was via hostgroup.

In OMD I'm experimenting with a single on/off tag group for each service.  After a creating my first dozen of these I'm seeing that it may not scale well to dozens in the GUI.    I'd prefer to have a single tag group with a tag for each possible web app, but that would prevent me from assigning multiple tags to a single host.  I'm beginning to wonder if auxiliary tags would help, but I think that might get a bit messy too.   

Does anyone have a similar environment and a solution they are happy with?  Thank you very much in advance.

Tony


_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Raphael Thoma | 21 Jul 11:17 2014
Picon

[Check_mk (english)] Do not show WARN states

Hey guys

We want, that check_mk only displays checks in CRIT state for certain hosts (e.g. with a separate tag).
Currently the checks in the WARN-state pop up in check_mk as well. Is it possible to ignore the WARN state
for certain hosts?

For the notifications this can be done easily - however I couldn't figure out a way to do the same for the view
in check_mk.

Cheers
Raphi
Attachment (smime.p7s): application/pkcs7-signature, 5451 bytes
_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Niklas M | 21 Jul 03:00 2014
Picon

[Check_mk (english)] system upgrade does not acknowledge already acknowledged alerts

Hello,

I was wondering if anyone else noticed that:

1. when upgrading say from: 1.2.4p2 -> 1.2.4p5
* You need to downgrade to 1.2.4 and then up to 1.2.4p5. Is this intentional?
2. when the upgrade is done the already acknowledged alerts is removed and you have to reacknowledge them.

Just want to know if this happens to someone else?

Regards,
-- 
Niklas M

_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Luis Oliveira | 19 Jul 22:28 2014
Picon

[Check_mk (english)] Configure Slave to Send to Master (OMD installation)

But the problem is that Nagios servers are not on the same network (the master 
has a public IP) while the slave is behind a firewall and security rules can 
not change (no public IP) 

How I can say that the Master should run the checks in Slave? (and show the 
Status of Hosts and Services in Master) 

I appreciate your help very ... but I am Rookie in this world of Nagios  :)

If you can give me a example i appreciate a lot !!! (pleaseeeeeeee...this is 
very important) :]
Luis Oliveira | 19 Jul 00:38 2014
Picon

[Check_mk (english)] Configure Slave to Send to Master (OMD installation)

Already the few days I've been trying to set up a system of Master <= Slaves 
(Slaves of the information is centralized in the Master) but so far without 
success. 

The setup I have currently done (through WATO) is: 

In the "main" site (master)  (IP 62.28.102.29)

Site 
ID..........................................................................
..........................		remote1
Alias.......................................................................
.............................		Site_Local
Connection..................................................................
..................................	Connect to local site
URL 
prefix......................................................................
..............................	http://localhost/remote1/

Replication 
method......................................................................
..............................	No replication with this site

A slave of sites (I'm still only configure one) have the following setup

Site 
ID..........................................................................
..........................		prod
Alias.......................................................................
.............................		produção
Connection..................................................................
..................................	Connect to local site
URL 
prefix......................................................................
..............................	http://localhost/prod/
Replication 
method......................................................................
..............................	No replication with this site

Site 
ID..........................................................................
..........................		remote1
Alias.......................................................................
.............................		Site_Central_Dados
Connection..................................................................
..................................	Connect via TCP	HOST 62.28.102.29 
PORT 6557    (this IP is a Public IP Address)
URL 
prefix......................................................................
..............................	http://localhost/prod/

Replication 
method......................................................................
..............................	Slave
Multisite-URL of remote 
site........................................................................
................	http://62.28.102.29/remote1/check_mk/

When i try to add a host in remote1 (and select Monitored on site remote1) i 
get the message "Cannot get data from TCP Port 192.168.11.53:6556... Whyyyy 
? (when i select the host to be monitored in prod i can see the services and 
host states without problem..)

And now my main question :)

What i need to do to send the data os hosts (services and host) from slave 
nagios to master nagios...

Can anyone help me ?

LOliveira

_______________________________________________
checkmk-en mailing list
checkmk-en <at> lists.mathias-kettner.de
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Bernd Stroessenreuther | 18 Jul 11:27 2014
Picon

Re: [Check_mk (english)] Check_MK Werk 0180: sap: It is now possible to add multiple sap instances to the sap.cfg file

Hallo Bastian,

hast Du den Change für Roland Steinmeier (Klinikum Nürnberg) gemacht
oder für einen anderen Kunden?

Bernd

On 16.07.2014 11:57, Bastian Kuhn wrote:
> ID:          0180
> Title:       sap: It is now possible to add multiple sap instances to the sap.cfg file
> Component:   Checks & Agents
> Level:       1
> Class:       New Feature
> Version:     1.2.5i5
> 
> The cfg format has changed from dict to a list of dicts. See the sap.cfg for a example. The old config format
is still working.
> 
> 
> _______________________________________________________________
> Check_MK Werks Mailinglist - http://mathias-kettner.de/check_mk
> http://lists.mathias-kettner.de/mailman/listinfo/checkmk-werks-lvl1
> 
Bruno Henrique Barbosa | 17 Jul 21:27 2014
Picon

[Check_mk (english)] Host notifications issue

Hi everyone,

I'm having some trouble with host notifications by mail. Environment is CentOS 6.5, Nagios 4.0.7 with Check_MK version 1.2.5i3.

I'm testing sending notifications, so I setup one host with WATO and created a ruleset to receive this host notifications. I've supressed service notifications for this it so I could only check for host notifications (which don't flap), but even though I'm forcing the host_subject parameter on my ruleset, I still receive the message like this:

Subject: Check_MK: MYHOST/$ OK -> $    July 17 2014 16:11     
From:        Nagios  
To:    bruno-barbosa-Fyimt0/DmrQ39yzSjRtAkw@public.gmane.org
Host    MYHOST (myhost)
Service    $
Event    OK → $
Address    154.0.0.11
Date / Time    Thu Jul 17 16:11:37 BRT 2014
Plugin Output    $
Additional Output    $
Performance Data    $

In order to test, I've set on my user's notification ruleset, under Notification Method, the field 'Subject for Host Notifications' as this: $HOSTNAME$ is $HOSTSHORTSTATE$

Input for debug (/var/lib/check_mk/notify/notify.log):

2014-07-17 16:11:38 ----------------------------------------------------------------------
2014-07-17 16:11:38 Got raw notification context with 48 variables
2014-07-17 16:11:38 Raw notification context:
                    CONTACTEMAIL=
                    CONTACTNAME=check-mk-notify
                    CONTACTPAGER=
                    DATE=07-17-2014
                    HOSTADDRESS=154.0.0.11
                    HOSTALIAS=myhost
                    HOSTATTEMPT=1
                    HOSTCHECKCOMMAND=check-mk-host-ping
                    HOSTDOWNTIME=0
                    HOSTNAME=MYHOST
                    HOSTNOTIFICATIONNUMBER=1
                    HOSTOUTPUT=CRITICAL - 154.0.0.11: rta nan, lost 100%
                    HOSTPERFDATA=rta=0,000ms;200,000;500,000;0; pl=100%;40;80;; rtmax=0,000ms;;;; rtmin=0,000ms;;;;
                    HOSTPROBLEMID=30
                    HOSTSTATE=DOWN
                    HOSTSTATEID=1
                    HOSTTAGS=lan prod ping wato /wato/cameras/
                    HOST_EC_CONTACT=$
                    HOST_SL=$
                    LASTHOSTSTATE=DOWN
                    LASTHOSTSTATECHANGE=1405624297
                    LASTHOSTSTATEID=1
                    LASTHOSTUP=1405624278
                    LASTSERVICEOK=$
                    LASTSERVICESTATE=$
                    LASTSERVICESTATECHANGE=$
                    LASTSERVICESTATEID=$
                    LONGDATETIME=Thu Jul 17 16:11:37 BRT 2014
                    LONGHOSTOUTPUT=
                    LONGSERVICEOUTPUT=$
                    NOTIFICATIONAUTHOR=
                    NOTIFICATIONAUTHORALIAS=
                    NOTIFICATIONAUTHORNAME=
                    NOTIFICATIONCOMMENT=
                    NOTIFICATIONTYPE=PROBLEM
                    SERVICEATTEMPT=$
                    SERVICECHECKCOMMAND=$
                    SERVICEDESC=$
                    SERVICENOTIFICATIONNUMBER=$
                    SERVICEOUTPUT=$
                    SERVICEPERFDATA=$
                    SERVICEPROBLEMID=$
                    SERVICESTATE=$
                    SERVICESTATEID=$
                    SERVICE_EC_CONTACT=$
                    SERVICE_SL=$
                    SHORTDATETIME=07-17-2014 16:11:37
                    SVC_SL=$
2014-07-17 16:11:38 Computed variables:
                    CONTACTS=
                    HOSTSHORTSTATE=DOWN
                    HOSTURL=/check_mk/index.py?start_url=view.py%3Fview_name%3Dhoststatus%26host%3DMYHOST
                    LASTHOSTSHORTSTATE=DOWN
                    LASTHOSTSTATECHANGE_REL=0d 00:00:01
                    LASTHOSTUP_REL=0d 00:00:20
                    LASTSERVICEOK_REL=16268d 19:11:38
                    LASTSERVICESHORTSTATE=$
                    LASTSERVICESTATECHANGE_REL=16268d 19:11:38
                    LOGDIR=/var/lib/check_mk/notify
                    MAIL_COMMAND=mail -s '$SUBJECT$' '$CONTACTEMAIL$'
                    MONITORING_HOST=nagios
                    PREVIOUSHOSTHARDSHORTSTATE=UP
                    PREVIOUSHOSTHARDSTATE=UP
                    PREVIOUSSERVICEHARDSHORTSTATE=OK
                    PREVIOUSSERVICEHARDSTATE=OK
                    SERVICESHORTSTATE=$
                    SERVICEURL=/check_mk/index.py?start_url=view.py%3Fview_name%3Dservice%26host%3DMYHOST%26service%3D%24
                    WHAT=SERVICE
2014-07-17 16:11:38 Preparing rule based notifications
2014-07-17 16:11:38 Found 1 user specific rules
2014-07-17 16:11:38 Global rule 'Notify all contacts of a host/service via HTML email'...
2014-07-17 16:11:38  -> does not match: This rule is disabled
2014-07-17 16:11:38 User pr208223's rule 'teste'...
2014-07-17 16:11:38  -> matches!
2014-07-17 16:11:38    - adding notification of pr208223 via mail
2014-07-17 16:11:38 Executing 1 notifications:
2014-07-17 16:11:38   * notifying pr208223 via mail, parameters: host_subject, bulk: no
2014-07-17 16:11:38      executing /usr/share/check_mk/notifications/mail

_____________________

So I suppose the message should come instead with the subject: "MYHOST is DOWN" (or "MYHOST is UP") whenever the state changes. Still, it appears like Check_MK: MYHOST/$ OK -> $    {date} as above.

Service notifications work fine for me, but I'd like to receive host notifications normally (without those $), because most of my devices are going to be monitored like: "is it alive/down? send mail".

Hope I was clear on my doubt and will be glad if someone can help. Sorry for the bad English and wall of text :P
Thanks!
_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Mike Hanby | 17 Jul 19:42 2014
Picon

[Check_mk (english)] Git Error in WATO

Howdy,

Versions:
  * check_mk 1.5.2i4p2
  * omd 1.10

Lately most times when I make changes via WATO to hosts, I get the 
following error (I have git enabled in WATO):

"Error executing GIT command cd '/omd/sites/mysite/etc/check_mk' && git 
add *.d/wato 2>&1: The following paths are ignored by one of your 
.gitignore files: mkeventd.d/wato Use -f if you really want to add them. 
fatal: no files added"

Here are the .gitignore files under ~/etc/check_mk. Why is it trying to 
add mkeventd.d/wato?:

:~/etc/check_mk$ find . -name .gitignore -print -exec cat {} \;
./conf.d/.gitignore
*
!wato
!wato/*

./.gitignore
*
!*.d
!.gitignore

./multisite.d/.gitignore
*
!wato
!wato/*

I can manually run the git add command with -f, but I suspect that WATO 
shouldn't be trying to add files under mkeventd.d/wato to the git repo.

I have another server running the same versions of CheckMK and OMD with 
the same configurations and I've never gotten a git error.

Any thoughts what might be triggering this or where I can look to debug?

Thanks,

Mike
Sketch | 17 Jul 19:38 2014

[Check_mk (english)] NagVis multisite auth with LDAP

I spent a little time getting this working, mostly due to some of it not 
being very well documented, and because I confused authmodule and 
authorisationmodule.  Someone on IRC suggesed I post it for reference. I 
get the impression it is basically supposed to work out of the box in OMD, 
but for those not running OMD (like me), here it is.

This is with NagVis 1.8b4.  I haven't tried previous versions.

In CMK, the only thing required is to enabled "wato_write_nagvis_auth = 
True" in /etc/check_mk/multisite.mk

In nagvis.ini.php I have:

[global]
logonmodule="LogonMultisite"
logon_multisite_serials="/etc/nagios/auth.serials"
logon_multisite_secret="/etc/nagios/auth.secret"
logon_multisite_createuser=1
logon_multisite_createrole="Guests"

authorisationmodule="CoreAuthorisationModMultisite"
authorisation_multisite_file="/var/lib/check_mk/wato/auth/auth.php"
authorisation_group_backends="live_1"

That's it.  If you are using htpasswd authentication instead of LDAP, you 
probably have to add 'logon_multisite_htpasswd="/path/to/htpasswd"', as 
well, but I haven't tested that.

--
Sketch
Jason Humes | 16 Jul 19:54 2014
Picon

Re: [Check_mk (english)] check_mk unresponsive during change activations

All servers were using their own certificates.  I think part of the issue was how we redirect people who hit
the root URL and we force them into the multisite...so if they hit www.monitoringdemo.com they get
redirected to monitoringdemo.com/omdsitename/check_mk

Thanks


Jason 

-----Original Message-----
From: Jim Welch [mailto:jim.welch <at> oit.gatech.edu] 
Sent: Wednesday, July 16, 2014 8:14 AM
To: Jason Humes
Subject: Re: [Check_mk (english)] check_mk unresponsive during change activations

How are your SSL certs configured? Do all servers use their own cert or are they using a shared cert?

----------------------------

----- Original Message -----
From: Jason Humes <JHumes <at> acs.on.ca>
To: Dhawal Doshy <dhawal.doshy <at> gmail.com>, Andreas Döhler <andreas.doehler <at> gmail.com>
Cc: checkmk-en <at> lists.mathias-kettner.de
Sent: Wed, 16 Jul 2014 08:05:14 -0400 (EDT)
Subject: Re: [Check_mk (english)] check_mk unresponsive during change activations

How did you get it working?  We were able to get everything talking but with HTTPS enabled we had to keep
logging in between sites...I'm not very good with apache so I'm sure it was some failure on my part with
configuring apache.

Jason 

-----Original Message-----
From: Dhawal Doshy [mailto:dhawal.doshy <at> gmail.com]
Sent: Tuesday, July 15, 2014 11:26 AM
To: Andreas Döhler
Cc: Jason Humes; checkmk-en <at> lists.mathias-kettner.de
Subject: Re: [Check_mk (english)] check_mk unresponsive during change activations

Works fine with https here. We have about 25+ sites on HTTPS + 4 read-only masters on HTTP.

On Mon, Jul 14, 2014 at 10:58 PM, Andreas Döhler <andreas.doehler <at> gmail.com> wrote:
> OK all sites with HTTPS i don't tried yet :) I will have a look inside 
> my testing environment.
>
> br
> Andreas
>
> 2014-07-14 17:20 GMT+02:00, Jason Humes <JHumes <at> acs.on.ca>:
>> Hi
>> We tried to do this too but was unable to figure out how to get 
>> multiple sites working with HTTPS and not having to login multiple 
>> times…we were told it was not possible, thus we run everything on one huge server.
>>
>> Thanks
>>
>>
>> Jason
>>
>> From: Andreas Döhler [mailto:andreas.doehler <at> gmail.com]
>> Sent: Sunday, July 13, 2014 4:24 PM
>> To: Jason Humes
>> Cc: checkmk-en <at> lists.mathias-kettner.de
>> Subject: Re: [Check_mk (english)] check_mk unresponsive during change 
>> activations
>>
>> Like Dhawal already mentioned we use in our bigger installations one 
>> server only for web frontend. No checks running on this machine.
>> From there we configure all the checking machines as slaves. This is 
>> working without such problems.
>>
>> We experienced this problem first in installations with many users of 
>> the frontend (multisite + nagvis). Then it was decided to make a 
>> standalone frontend server and nearly all problems are gone.
>>
>> br
>> Andreas
>>
>> 2014-07-11 17:20 GMT+02:00 Jason Humes
>> <JHumes <at> acs.on.ca<mailto:JHumes <at> acs.on.ca>>:
>> We used to be able to get multiple hosts scanning for services in 
>> parallel and then save them all one at a time and activate all the 
>> changes at once…now we have to wait for one host to finish scanning 
>> before we can start the next as there is zero response during scanning, activating, etc.
>>
>> Thanks
>>
>>
>> Jason
>>
>> From: Pinkoski, David
>> [mailto:dPinkoski <at> dnps.com<mailto:dPinkoski <at> dnps.com>]
>> Sent: Friday, July 11, 2014 11:12 AM
>> To: Jason Humes; Jim Welch
>> Cc:
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>>
>> Subject: RE: [Check_mk (english)] check_mk unresponsive during change 
>> activations
>>
>> Could it be “retain_state_information=1 “ 
>> “state_retention_file=/usr/local/nagios/var/retention.dat” ?
>>
>> Perhaps changing WATO restart mode for Nagios from “restart” to “reload”
>> will help.
>>
>>
>> From:
>> checkmk-en-bounces <at> lists.mathias-kettner.de<mailto:checkmk-en-bounces
>>  <at> lists.mathias-kettner.de>
>> [mailto:checkmk-en-bounces <at> lists.mathias-kettner.de] On Behalf Of 
>> Jason Humes
>> Sent: Friday, July 11, 2014 10:55 AM
>> To: Jim Welch
>> Cc:
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> Subject: Re: [Check_mk (english)] check_mk unresponsive during change 
>> activations
>>
>> Hi
>> There are actually a number of events that can cause the system to 
>> become unresponsive until the task completes…the easiest ones to talk 
>> about being change activation and host service scanning.  For the 
>> full scan, yes just of a single host and it depends on the type of 
>> host, connection speed to the host, number of services, etc…but it 
>> will hang the system until the scan completes and returns the list of services to the GUI.
>>
>> The activation delay is from the time I click activate changes till 
>> the progress bar completes filling…there is no delay of the progress 
>> bar appearing on screen, it shows up instantly but hangs the system 
>> until it is done activation…approx. 150 seconds.
>>
>> Thanks
>>
>>
>> Jason
>>
>> From: Jim Welch [mailto:jim.welch <at> oit.gatech.edu]
>> Sent: Friday, July 11, 2014 10:28 AM
>> To: Jason Humes
>> Subject: Re: [Check_mk (english)] check_mk unresponsive during change 
>> activations
>>
>> What are you specific symptoms? Is there a long delay between the 
>> time you click the submit button and the time the progress bar appears?
>> (apparently that is when the snapshot is built) During the progress 
>> bar is when (AFAIK) nagios and apache are restarted, that takes up to
>> 75-90 seconds for me. (sometimes less...not sure if it's due to the 
>> load on the system, number of users or what)
>>
>> As for the full scan, is that doing a full scan of just one host? If 
>> so, how long is the delay?
>> ________________________________
>> From: "Jason Humes" <JHumes <at> acs.on.ca<mailto:JHumes <at> acs.on.ca>>
>> To: "Andreas Döhler"
>> <andreas.doehler <at> gmail.com<mailto:andreas.doehler <at> gmail.com>>, "Jim Welch"
>> <jim.welch <at> oit.gatech.edu<mailto:jim.welch <at> oit.gatech.edu>>
>> Cc:
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> Sent: Friday, July 11, 2014 10:19:37 AM
>> Subject: RE: [Check_mk (english)] check_mk unresponsive during change 
>> activations Sadly the problem still exists for me…wondering if the 
>> issue was that in my older installations I had updated check_mk 
>> inside of OMD manually from a version built from source…perhaps it 
>> was doing something different back then.
>>
>> Thanks anyways ☺
>>
>>
>> Jason
>>
>> From:
>> checkmk-en-bounces <at> lists.mathias-kettner.de<mailto:checkmk-en-bounces
>>  <at> lists.mathias-kettner.de>
>> [mailto:checkmk-en-bounces <at> lists.mathias-kettner.de] On Behalf Of 
>> Andreas Döhler
>> Sent: Thursday, July 10, 2014 12:28 PM
>> To: Jim Welch
>> Cc:
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> Subject: Re: [Check_mk (english)] check_mk unresponsive during change 
>> activations
>>
>> Nice to hear that it is solved. I was also not thinking about the 
>> snapshot files :)
>>
>> br
>> Andreas
>>
>> 2014-07-10 11:34 GMT+02:00 Jim Welch
>> <jim.welch <at> oit.gatech.edu<mailto:jim.welch <at> oit.gatech.edu>>:
>> Thanks! That was the clue I needed. I took the last snapshot and 
>> unzipped it, then untared files to see what was taking up all the space (and time).
>> The culprit was in the usersettings.tar file. (ldap-debug.log just 
>> like your case) Turns out I'd forgotten I'd turned on ldap debugging 
>> and the ldap-debug.log file had grown to 2.2G! I disabled logging in 
>> the global settings, deleted the existing log file and the snapshot 
>> portion of the process dropped to almost nothing. The rest of the 
>> process is about the same amount of time as running cmk -O manually 
>> so I'm fine with using the GUT, but I'll keep a copy of your script 
>> in case we need it in the future. I didn't know how to remove the 
>> 'pending' changes from the GUI so that is good information.
>> Thanks again,
>> <Jim>
>>
>>
>> ----- Original Message -----
>> From: "Ryan Moore" <ryan.moore <at> cpcc.edu<mailto:ryan.moore <at> cpcc.edu>>
>> To:
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> Sent: Wednesday, July 9, 2014 6:48:10 PM
>> Subject: Re: [Check_mk (english)] check_mk unresponsive during change 
>> activations
>>
>> That being said, I forgot to mention that using the script won't take 
>> WATO snapshots that can be used for backup/restore, so I still do 
>> occasionally use the GUI button after hours for that purpose. I 
>> haven't dug through the code that creates the snapshot, but I'm sure 
>> it'd be trivial to reproduce. That first delay is the snapshot 
>> getting created, check the size of your ~/etc directory as I believe 
>> that is what is in the snapshot primarily. I had a few very large log 
>> files in there (ldap debug I think) that was causing some very long 
>> delays as well since it was tar/gzip'ing a 700mb file every time.
>>
>>
>> Ryan Moore
>> Infrastructure Systems Analyst
>> Central Piedmont Community College
>> Information Technology Services
>>
>>
>> -------- Original Message  --------
>> Subject: Re: [Check_mk (english)] check_mk unresponsive during change 
>> activations
>> From: Ryan Moore <ryan.moore <at> cpcc.edu<mailto:ryan.moore <at> cpcc.edu>>
>> To:
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> <checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-
>> kettner.de>>
>> Date: 07/09/2014 06:42 PM
>>
>>> I've run into the same problem, and I believe it due to mod_python 
>>> being single threaded and the rebuild process is tied up during the 
>>> activation process. What I've done is I'm often on the console via 
>>> ssh during the day, and I created a very basic script to do the work:
>>>
>>> ---
>>> #!/bin/bash
>>> #
>>> # Simple rebuild script for Check_MK #
>>>
>>> # Rebuild the config and reload Nagios cmk -O STATUS=$?
>>>
>>> # if rebuild was good clear the pending log for wato (so the 'apply 
>>> changes' button in the GUI goes away) [ $STATUS -eq 0 ] && { echo 
>>> "Rebuild looks good, clearing pending log"; rm -f 
>>> ~/var/check_mk/wato/log/pending.log;} || echo "Uh oh, there was an 
>>> error!"
>>>
>>> --
>>>
>>> This allows me to still make changes via WATO, but activate them on 
>>> the console and not cause any interruptions to users. Perhaps one 
>>> day Multisite will work via WSGI or something else more modern than 
>>> mod_python.
>>>
>>>
>>> Ryan Moore
>>> Infrastructure Systems Analyst
>>> Central Piedmont Community College
>>> Information Technology Services
>>>
>>>
>>> -------- Original Message  --------
>>> Subject: Re: [Check_mk (english)] check_mk unresponsive during 
>>> change activations
>>> From: Jim Welch
>>> <jim.welch <at> oit.gatech.edu<mailto:jim.welch <at> oit.gatech.edu>>
>>> To: Andreas Döhler
>>> <andreas.doehler <at> gmail.com<mailto:andreas.doehler <at> gmail.com>>
>>> Cc:
>>> "checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-kettner.de>"
>>> <checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias
>>> -kettner.de>>
>>> Date: 07/09/2014 04:45 PM
>>>
>>>> Thanks for the tip. I've done that (Delay precompiling), but it 
>>>> still takes 2-4 minutes to commit a change. I've verified that the 
>>>> timestamps on the host check files do not change so it doesn't seem 
>>>> to be precompiling the checks. I'm not sure exactly what it's doing 
>>>> in the time between when I hit the button and before the progress 
>>>> bar appears. (1-3 minutes)
>>>>
>>>> -------------------------------------------------------------------
>>>> -----
>>>> *From: *"Andreas Döhler"
>>>> <andreas.doehler <at> gmail.com<mailto:andreas.doehler <at> gmail.com>>
>>>> *To: *"Jason Humes" <JHumes <at> acs.on.ca<mailto:JHumes <at> acs.on.ca>>
>>>> *Cc: *"Jim Welch"
>>>> <jim.welch <at> oit.gatech.edu<mailto:jim.welch <at> oit.gatech.edu>>,
>>>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias
>>>> -kettner.de>
>>>> *Sent: *Wednesday, July 9, 2014 4:14:35 PM
>>>> *Subject: *Re: [Check_mk (english)] check_mk unresponsive during 
>>>> change activations
>>>>
>>>> Hi,
>>>>
>>>> there is one option you can test inside your OMD setup.
>>>> To speedup the generation of configuration you can take a look at 
>>>> the option "Delay precompiling of host checks." found under "Global 
>>>> configuration settings"
>>>>
>>>> In bigger installations this will reduce the commit time by a big 
>>>> amount of time.
>>>>
>>>> br
>>>> Andreas
>>>>
>>>>
>>>> 2014-07-08 20:12 GMT+02:00 Jason Humes 
>>>> <JHumes <at> acs.on.ca<mailto:JHumes <at> acs.on.ca>
>>>> <mailto:JHumes <at> acs.on.ca<mailto:JHumes <at> acs.on.ca>>>:
>>>>
>>>>      Hi
>>>>      Sorry for the delay in responding...had a small leave from work.
>>>> We
>>>>      have 1000 hosts and about 30000 services and use just internal
>>>>      authentication.  It is such a pain having to wait to do changes at
>>>>      the end of the day so as to not interrupt the use of the 
>>>> system :(
>>>>
>>>>
>>>>      Jason
>>>>
>>>>
>>>>      -----Original Message-----
>>>>      From:
>>>> checkmk-en-bounces <at> lists.mathias-kettner.de<mailto:checkmk-en-bounc
>>>> es <at> lists.mathias-kettner.de>
>>>>
>>>> <mailto:checkmk-en-bounces <at> lists.mathias-kettner.de<mailto:checkmk-
>>>> en-bounces <at> lists.mathias-kettner.de>>
>>>>
>>>> [mailto:checkmk-en-bounces <at> lists.mathias-kettner.de<mailto:checkmk-
>>>> en-bounces <at> lists.mathias-kettner.de>
>>>>
>>>> <mailto:checkmk-en-bounces <at> lists.mathias-kettner.de<mailto:checkmk-
>>>> en-bounces <at> lists.mathias-kettner.de>>]
>>>> On Behalf Of
>>>>      Jim Welch
>>>>      Sent: Thursday, June 19, 2014 4:36 PM
>>>>      To:
>>>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias
>>>> -kettner.de>
>>>>
>>>> <mailto:checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-kettner.de>>
>>>>      Subject: Re: [Check_mk (english)] check_mk unresponsive during
>>>>      change activations
>>>>
>>>>      Yes, we see that behaviour on omd 1.10 (rhel6). I had to extend the
>>>>      apache request timeouts since the site may be locked up for 2-3
>>>>      minutes while activating changes. How many hosts/services are on
>>>>      your system? (~700/16000) What type of authentication do you use?
>>>>      (we enabled ldap authentication)
>>>>
>>>>
>>>>      ----- Original Message -----
>>>>      From: "Jason Humes" <JHumes <at> acs.on.ca<mailto:JHumes <at> acs.on.ca>
>>>> <mailto:JHumes <at> acs.on.ca<mailto:JHumes <at> acs.on.ca>>>
>>>>      To:
>>>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias
>>>> -kettner.de>
>>>>
>>>> <mailto:checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-kettner.de>>
>>>>      Sent: Thursday, June 19, 2014 4:25:08 PM
>>>>      Subject: [Check_mk (english)] check_mk unresponsive during change
>>>>      activations
>>>>
>>>>      Hi
>>>>      We've been running check_mk/omd for about two years now, since omd
>>>>      0.54 and currently at omd 1.1.  At some point through the upgrade
>>>>      life the system changed from how it was responding during change
>>>>      activations within WATO/multisite...it used to be that if an admin
>>>>      was activating changes, other users could still browse the 
>>>> multisite
>>>>      view...but currently it seems that the system becomes totally
>>>>      unresponsive during change activations.  Does anyone else 
>>>> experience
>>>>      this?  Is it expected?
>>>>
>>>>      Thanks
>>>>
>>>>
>>>>      Jason
>>>>
>>>>      _______________________________________________
>>>>      checkmk-en mailing list
>>>>
>>>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias
>>>> -kettner.de>
>>>>
>>>> <mailto:checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-kettner.de>>
>>>>      http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

>>>>      _______________________________________________
>>>>      checkmk-en mailing list
>>>>
>>>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias
>>>> -kettner.de>
>>>>
>>>> <mailto:checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-kettner.de>>
>>>>      http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

>>>>      _______________________________________________
>>>>      checkmk-en mailing list
>>>>
>>>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias
>>>> -kettner.de>
>>>>
>>>> <mailto:checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-kettner.de>>
>>>>      http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

>>>>
>>>>
>>>>
>> _______________________________________________
>> checkmk-en mailing list
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

>> _______________________________________________
>> checkmk-en mailing list
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

>>
>>
>>
>> _______________________________________________
>> checkmk-en mailing list
>> checkmk-en <at> lists.mathias-kettner.de<mailto:checkmk-en <at> lists.mathias-k
>> ettner.de>
>> http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

>>
>>
> _______________________________________________
> checkmk-en mailing list
> checkmk-en <at> lists.mathias-kettner.de
> http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

_______________________________________________
checkmk-en mailing list
checkmk-en <at> lists.mathias-kettner.de
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en


_______________________________________________
checkmk-en mailing list
checkmk-en <at> lists.mathias-kettner.de
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en
Andreas Döhler | 16 Jul 18:02 2014
Picon

Re: [Check_mk (english)] R: Restore from backup

For your last question there is one short answer.
Create a new test site and go to  the etc/check_mk/multisite.d/wato/ there you find a file hosttags.mk with all the standard tags.

Then all the tagging should work as expected.

br
Andreas


2014-07-16 17:23 GMT+02:00 Andrea Corazzari <ac-ou70kdGnqRc57InD3i3y51zrSV/HdtiB@public.gmane.org>:

Thank you all for the quick answer!

 

 

1.       Was this restore done with restoring a snapshot from the other WATO?

2.       If not you must control all user rights on the restored files and directories

 

1)      Yes, snapshot made form old WATO/Machine and restored on new installatio/machine

2)      Perhaps did you mean if so instead of if not? Anyway I’ll check everithing. Site name and user are the same, maibi UID issue?

 

Last (by now ;) ) question: In configuration -> new folder or new host the “builtin” (if I remember well) host tag Operating System (Windows, Linux, Network Device (SNMP) and Ping Only) is disappered.

I tried to recreate it manually and seemed OK buT all new host I create are treated querying TCP/6556.

 

Maybe there is some file that sould be copied manually. (I tried also to perform backup&restore via CLI as described in http://mathias-kettner.com/checkmk_backup.html)

 

Thank you all

 

All the best

Andrea

 

Da: Andreas Döhler [mailto:andreas.doehler-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org]
Inviato: mercoledì 16 luglio 2014 17:10
A: Andrea Corazzari
Cc: checkmk-en-qhrM8SXbD5JpaB0eVFyvwnWFp+d4uDoM@public.gmane.org
Oggetto: Re: [Check_mk (english)] R: Restore from backup

 

Was this restore done with restoring a snapshot from the other WATO?

If not you must control all user rights on the restored files and directories.

 

br

Andreas

 

2014-07-16 16:48 GMT+02:00 Andrea Corazzari <ac <at> piazzasandomenico.it>:

Some more details:
changes are effectively activated, and in conifg page Host and folder the Activate Button turn back to blue, but both in other wato pages and in pending.log all these changes are seen as pending.
As I said before changes are activated, but what could I do to have normal behaviour?
We are in a restore after a crash and data are taken from an old OVA and imported on a fresh installation, in few words we are in a hurry.

TIA
Regards
Andrea

-----Messaggio originale-----
Da: checkmk-en-bounces-qhrM8SXbD5JpaB0eVFyvwnWFp+d4uDoM@public.gmane.org [mailto:checkmk-en-bounces-qhrM8SXbD5JpaB0eVFyvwnWFp+d4uDoM@public.gmane.org] Per conto di Andrea Corazzari
Inviato: mercoledì 16 luglio 2014 14:20
A: checkmk-en-qhrM8SXbD5JpaB0eVFyvwnWFp+d4uDoM@public.gmane.org
Oggetto: [Check_mk (english)] Restore from backup


Hi all,
I’ve just restored a backup taken from my old installation to a new and fresh one.

OS (CentOS  6.4) and OMD (1.10) version are the same, but if I try to create a new host the default tag “Operating System” dos not appear, manually defined tags are present, moreover if I make some changes and activate them changes seem to be activate but the button Activate changes remains orange.

Are there some things in config files that should be copied by hand? Which are log files to check first in these cases?

Every hint will be very valued.

Thanks in advance

Regards
Andrea



_______________________________________________
checkmk-en mailing list
checkmk-en-qhrM8SXbD5JpaB0eVFyvwnWFp+d4uDoM@public.gmane.org
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en



_______________________________________________
checkmk-en mailing list
checkmk-en-qhrM8SXbD5JpaB0eVFyvwnWFp+d4uDoM@public.gmane.org
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

 


_______________________________________________
checkmk-en mailing list
checkmk-en@...
http://lists.mathias-kettner.de/mailman/listinfo/checkmk-en

Gmane