Josh Lupresto | 2 Apr 2012 06:35

Smokeping noob question

I've had SmokePing up and running with no problems for almost a year now, with certain bigloss criteria on all my sites. Then, about 2 months ago, out of nowhere, I began intermittently missing the notifications that SmokePing sends via Postfix when a site goes down. The graphs always show the down sites; I just don't get the notifications. I am now doing some packet captures with ngrep on port 25. Assuming that when a site goes down I see no attempt to hand the message to Postfix on port 25, what else can I do to narrow down the issue? Suggestions?

Thanks,

Josh Lupresto
Systems Engineer
Telecom Recovery
www.telecomrecovery.com
Support 877-777-3525
Direct 877-663-8700
24x7 Outage Support 877-777-3526

_______________________________________________
smokeping-users mailing list
smokeping-users <at> lists.oetiker.ch
https://lists.oetiker.ch/cgi-bin/listinfo/smokeping-users
G.W. Haywood | 2 Apr 2012 11:53

Re: Smokeping noob question

Hi there,

On Sun, 1 Apr 2012, Josh Lupresto wrote:

> I've had SmokePing up and running with no problems for almost a year now,
> with certain bigloss criteria on all my sites. Then, about 2 months ago,
> out of nowhere, I began intermittently missing the notifications that
> SmokePing sends via Postfix when a site goes down. The graphs always show
> the down sites; I just don't get the notifications. I am now doing some
> packet captures with ngrep on port 25. Assuming that when a site goes down
> I see no attempt to hand the message to Postfix on port 25, what else can
> I do to narrow down the issue? Suggestions?

If this were happening on one of my systems, I would

1. Check the logs
2. Check the configuration files (e.g. dates of last modification :)
3. Check for any firewall silliness
4. Put some debugging in sub check_alerts in Smokeping.pm

Get back to us if it gets you nowhere.
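
Steps 1 and 2 above could be sketched roughly as follows. This is only an illustration: the helper names, the config directory (/etc/smokeping) and the mail log path (/var/log/maillog) are assumptions, not details taken from Josh's system.

```shell
#!/bin/bash
# Hypothetical helpers for the first two checks; all paths below are assumptions.

# Step 1: count log lines that mention a pattern (case-insensitive).
count_log_mentions() {
    local pattern="$1" logfile="$2"
    grep -ci -- "$pattern" "$logfile"
}

# Step 2: list files under a directory modified within the last N days.
changed_within_days() {
    local dir="$1" days="$2"
    find "$dir" -type f -mtime "-$days"
}

# Example calls (adjust the paths to your install):
#   count_log_mentions smokeping /var/log/maillog
#   changed_within_days /etc/smokeping 90
```

Since the problem started about two months ago, a window of 60 to 90 days seems a reasonable starting point for the second check.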

Smokeping slt | 9 Apr 2012 13:27

Migrate RRD data to separate server

Hi all,

I'm experiencing heavy I/O on my SmokePing server. Because of that, I need to migrate my RRD folder to a separate server and have SmokePing write its data there.
My config file is attached.

Here are the steps I followed.
1) Created a remote mount point on the SmokePing server by adding this line to /etc/fstab:

 //<remote-ip>/smokeping   /mnt/smokeping cifs username=xxx,password=xxx 0 0

2) I think all my RRD files are stored in /usr/local/smokeping/var/data. I copied all the RRD data to the new folder on the remote server.

3) Created a soft link:

 cd /usr/local/smokeping/var
 ln -s data /mnt/smokeping

I can access the remote mount point successfully, but while this setup was in place the graphs on my web server were not updated (graph attached).

http://smokeping/smokeping/smokeping.cgi/test-node

I then reverted to my previous config.


Please suggest a way to migrate all the RRD files on my SmokePing server to a remote one. I cannot understand why my method did not work.

What is the actual location of my RRD files? Am I accessing the wrong data? Please help.

Thanks
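
A note on step 3, offered as a sketch rather than a diagnosis: ln -s takes the existing target first and the name of the new link second, so "ln -s data /mnt/smokeping" asks for a link inside /mnt/smokeping instead of turning the local data directory into a link to the mount. The helper name link_data_dir and the ".local" suffix below are made up for illustration; the paths in the usage comment are the ones from the post.

```shell
#!/bin/bash
# Hypothetical helper: make LINK a symlink pointing at TARGET.
link_data_dir() {
    local target="$1"   # directory that should hold the RRD files (e.g. the mount point)
    local link="$2"     # path that SmokePing's datadir refers to

    # If a real directory is in the way, move it aside instead of overwriting it.
    if [ -e "$link" ] && [ ! -L "$link" ]; then
        mv "$link" "$link.local" || return 1
    fi

    # -n stops ln from descending into an existing link; -f replaces an old link.
    ln -sfn "$target" "$link"
}

# Example (stop SmokePing first):
#   link_data_dir /mnt/smokeping /usr/local/smokeping/var/data
```

Alternatively, pointing datadir in the SmokePing config directly at the mount avoids the symlink altogether. Whether RRD updates behave well over CIFS is a separate question.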




Attachment (config.dist): application/octet-stream, 946 bytes
Smokeping slt | 10 Apr 2012 06:35

Re: Migrate RRD data to separate server

Hi,

Thanks for your reply. I have attached my config as well. The image directory is /var/www/html/smokeping-ms/cache. Here is the content of that directory:
drwxr-xr-x 6 apache apache  4096 Dec 31  2009 .
drwxr-xr-x 3 apache apache  4096 Dec 31  2009 ..
drwxr-xr-x 2 apache apache  4096 Jan 18 09:12 __chartscache
drwxr-xr-x 2 apache apache  4096 Apr 10 09:31 __navcache
drwxr-xr-x 7 apache apache  4096 Dec 31  2009 network
-rw-r--r-- 1 apache apache  3908 Aug  8  2009 rrdtool.png
-rw-r--r-- 1 apache apache  4382 Aug  8  2009 smokeping.png
drwxr-xr-x 3 apache apache 69632 Feb 22 16:22 Test

Is it OK to migrate these to my remote server? Or do I need to migrate both this directory and the RRD directory (/usr/local/smokeping/var/data)?

Please suggest a way to solve this. I really appreciate your kind help.

Thanks



Date: Mon, 9 Apr 2012 12:55:18 -0400
Subject: Re: [smokeping-users] Migrate RRD data to separate server
From: rb14060 <at> gmail.com
To: smokeping123 <at> hotmail.com

Don't hold me to it, but I think you're accessing the wrong data. The data is usually stored where Apache's httpdocs are, in a folder called cache. In my case that's /var/www/html/cache. Note that I do have a custom setup, but if you like, when I get home I can attach the part of my SmokePing config that handles that.

Attachment (config.dist): application/octet-stream, 946 bytes
Matt Almgren | 10 Apr 2012 06:37

Re: How to increase RRD history?

Hey guys, finally getting around to poking around with this...

Here's my database section:

*** Database ***

step     = 30
pings    = 10

# consfn mrhb steps total

AVERAGE  0.5   1  1008
AVERAGE  0.5  12  4320
    MIN  0.5  12  4320
    MAX  0.5  12  4320
AVERAGE  0.5 144   720
    MAX  0.5 144   720
    MIN  0.5 144   720


I'm not too interested in seeing more than the default amount of detailed information. What I am interested in is seeing up to 6 months of non-detailed data, just to get trending information. I'm still a bit confused by the above values. Care to give me some numbers to punch in to a) keep the default detailed samples, but b) keep up to 6 months of (non-detailed) archival data?

Thanks!

 -- Matt


On Mon, Mar 5, 2012 at 8:57 AM, Gregory Sloop <gregs <at> sloop.net> wrote:
GS> These aren't files - it's more that if you have hundreds or thousands
GS> of devices you're sampling, there's a lot of them.

Sorry Peter, I accidentally replied direct to you, as well as the
list...

Also, a typo above.

These aren't *huge* files - it's more that if you have hundreds or thousands
of devices you're sampling, there's a lot of them.

And I'll clarify about I/O - with that much disk activity, - writing
to thousands of files very often, your disk may not keep up. But you
would have enough space to store everything, if it could...

HTH

-Greg


Matt Almgren | 10 Apr 2012 06:40

Re: How to increase RRD history?

More info...

Based on earlier calculations, I come up with

86400 (sec/day) X 180 days / 30 (step value) = 518400, but I'm not sure where to plug in this value.

As for # of targets, right now it's around 50 in each location, so I'm not too worried about space at the moment.

Thanks!

 -- Matt


Gregory Sloop | 10 Apr 2012 08:20

Re: How to increase RRD history?

I'm using your database section as my starting point. Let's reproduce it here:
---
*** Database ***
step     = 30
pings    = 10

# consfn mrhb steps total
AVERAGE  0.5   1  1008
AVERAGE  0.5  12  4320
    MIN  0.5  12  4320
    MAX  0.5  12  4320
AVERAGE  0.5 144   720
    MAX  0.5 144   720
    MIN  0.5 144   720

---
So, row 1: the "total" column is how many full-resolution samples you want to keep.
[E.g. 2880 is 24 hours of full-res data (2 samples/min * 60 min * 24 hours).]
1008 is 504 minutes of data, or just over 8 hours. [1008 samples / 2 samples per minute / 60 = 8.4 hours.]

The next three rows are the second-tier data. Here, "steps" full-resolution samples are consolidated (as average/min/max) into one. So if you leave "steps" at 12, you get a 6-minute average [30 secs per sample, 12:1 ratio = 1 sample every 6 minutes].

To keep six months of six-minute data, set the total column to 43200 [10 samples per hour * 24 hours * 30 days * 6 months = 43200].

The last three rows are even lower-resolution data. They consolidate 144 full-res steps into 1 [i.e. 72-minute data]. You can keep as much as you'd like here; just work out how many minutes of history you want. [10000 in the total column would be 720000 minutes, or 500 days' worth.]

(But you don't have to use 144 as the steps value. If you want your third-tier data to be hourly, choose accordingly.)
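
The arithmetic above can be checked with a couple of small shell functions. This is a sketch only: the function names are made up for illustration, and the step of 30 seconds is the value from Matt's config.

```shell
#!/bin/bash
# Minutes of history held by one row set: step seconds * steps * total rows / 60.
rra_minutes() {
    local step="$1" steps="$2" total="$3"
    echo $(( step * steps * total / 60 ))
}

# Rows ("total") needed to cover a number of days at a given consolidation.
rra_rows_for_days() {
    local step="$1" steps="$2" days="$3"
    echo $(( days * 86400 / (step * steps) ))
}

rra_minutes 30 1 1008        # 504    (about 8.4 hours of full-resolution data)
rra_minutes 30 12 4320       # 25920  (18 days of 6-minute data with the current value)
rra_rows_for_days 30 12 180  # 43200  (six months of 6-minute data)
rra_minutes 30 144 10000     # 720000 (500 days of 72-minute data)
```

Matt's figure of 518400 is what rra_rows_for_days 30 1 180 returns, i.e. the row count for six months at full resolution. For trending only, changing 4320 to 43200 in the three steps=12 rows is the smaller change. As far as I know, editing these values does not resize RRD files that already exist, so check the SmokePing documentation before restarting.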

HTH

-Greg


-- 
Gregory Sloop, Principal: Sloop Network & Computer Consulting
Voice: 503.251.0452 x82
EMail: gregs <at> sloop.net
http://www.sloop.net
Gregory Sloop | 10 Apr 2012 08:29

Re: Migrate RRD data to separate server

See your config file for

datadir  = /var/lib/smokeping/rrd

The datadir is where the RRD files are located. [That line above is my working config on one of my boxes...]

(I just pulled yours up and see: datadir  = /usr/local/smokeping/var/data - so your assumption was right.)

I know nothing about remote mount points, and I'm not sure how that's going to work.
[My guess is the RRD daemon won't be able to successfully read/write to a remote CIFS mounted drive. But really, I dunno.]

Perhaps:
Have you checked with the rrdtool utilities to see whether they can open the data files on the remote mount?
You might get a useful error message or log entry from trying that.
If rrdtool can open the files for reading and writing, then SP ought to work too.
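
That check can be tried with a small loop like the one below. It is a sketch under assumptions: the function name check_rrd_dir is hypothetical, it only uses "rrdtool info" (a read-only call) plus a plain writability test, and it does not attempt a real "rrdtool update".

```shell
#!/bin/bash
# Hypothetical helper: try to read every .rrd file under a directory.
check_rrd_dir() {
    local dir="$1" list f rc=0
    list=$(find "$dir" -type f -name '*.rrd')
    if [ -z "$list" ]; then
        echo "no .rrd files found under $dir"
        return 2
    fi
    while IFS= read -r f; do
        # rrdtool info parses the RRD header; failure suggests an unreadable or damaged file.
        if ! rrdtool info "$f" >/dev/null 2>&1; then
            echo "rrdtool cannot read: $f"
            rc=1
        fi
        # SmokePing also needs write access to update the file.
        if [ ! -w "$f" ]; then
            echo "not writable: $f"
            rc=1
        fi
    done <<EOF
$list
EOF
    return "$rc"
}

# Example (path from the post): check_rrd_dir /mnt/smokeping
```

Running it as the user that SmokePing runs as gives the most meaningful result for the writability test.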

Bigger, more basic question:
Why not move the whole thing, rather than just the data?
If you want to split things up, then reading up on master/slave servers would be a good plan, and you could migrate the config/install in that direction.

-Greg



Matt Almgren | 11 Apr 2012 05:51

Re: How to increase RRD history?

Great!  Thanks for the explanation.  The math now makes perfect sense.

If I could trouble you for one more thing: is there a service script for this release? I've found a couple, but they don't work well.

This one starts and stops, but the 'status' doesn't work and the restart/reload do the same thing.

#!/bin/bash
#
# chkconfig: 2345 80 05
# Description: Smokeping init.d script
# Hacked by : How2CentOS - http://www.how2centos.com

# Get function from functions library
. /etc/init.d/functions

# Start the service Smokeping
start() {
        echo -n "Starting Smokeping: "
        /opt/smokeping/bin/smokeping >/dev/null 2>&1
        ### Create the lock file ###
        touch /var/lock/subsys/smokeping
        success $"Smokeping startup"
        echo
}

# Restart the service Smokeping
stop() {
        echo -n "Stopping Smokeping: "
        kill -9 `ps ax | grep "/opt/smokeping/bin/smokeping" | grep -v grep | awk '{ print $1 }'` >/dev/null 2>&1 && killall speedy_backend
        ### Now, delete the lock file ###
        rm -f /var/lock/subsys/smokeping
        success $"Smokeping shutdown"
        echo
}

### main logic ###
case "$1" in
  start)
        start
        ;;
  stop)
        stop
        ;;
  status)
        status Smokeping
        ;;
  restart|reload|condrestart)
        stop
        start
        ;;
  *)
        echo $"Usage: $0 {start|stop|restart|reload|status}"
        exit 1
esac

exit 0

Is there a start/stop script that comes with the smokeping package that works with CentOS 6.2?
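
One way a status action could be made to work is to check a PID file rather than rely on a process name. The sketch below is an illustration under assumptions, not the script shipped with SmokePing: the function name pidfile_status and the PID file path in the usage comment are hypothetical, and the return codes follow the usual LSB init-script convention (0 running, 1 dead with PID file present, 3 not running).

```shell
#!/bin/bash
# Hypothetical status helper based on a PID file.
pidfile_status() {
    local pidfile="$1" pid
    if [ ! -r "$pidfile" ]; then
        echo "stopped (no PID file)"
        return 3
    fi
    pid=$(cat "$pidfile")
    # kill -0 sends no signal; it only tests whether the process exists.
    if [ -n "$pid" ] && kill -0 "$pid" 2>/dev/null; then
        echo "running (pid $pid)"
        return 0
    fi
    echo "dead but PID file exists"
    return 1
}

# Example (the path is an assumption; use wherever your install writes its PID file):
#   pidfile_status /opt/smokeping/var/smokeping.pid
```

In the case statement, the status branch would then call this function in place of "status Smokeping".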

Thanks!

-- Matt



Mon-Loi Perez | 12 Apr 2012 01:38

Epp probe

Hi all,

Does anyone know whether someone has created, or is creating, an EPP probe? Details on EPP can be found at the link below.

http://en.m.wikipedia.org/wiki/Extensible_Provisioning_Protocol

Thanks,
Mon