Re: Consolidate and poller2 bug
LIMA David <
DLIMA@...>
2007-10-07 08:07:43 GMT
>> Here are the results, i repeat that i only appears using poller2.
> Hmm, thats strange in itself, but let's see.
I think you are right, I notice it with poller2 only because I have a huge number of hosts now and the problem
only appears when the interface flaps (UP/DOWN) within the poll cycle, when the interface stay down the
duration is OK.
> OK, here is our problem. Can you run
> SELECT *, (date_stop - date_start) AS duration FROM alarms WHERE triggered = 1 AND interface=3108;
For sure, the duration calculation into mysql is bad
mysql> SELECT *, (date_stop) - (date_start) AS duration FROM alarms WHERE triggered = 1 AND interface=3108;
+------+---------------------+---------------------+-----------+------+--------+---------------+--------------+-----------+-------------+
| id | date_start | date_stop | interface | type | active | referer_start | referer_stop | triggered |
duration |
+------+---------------------+---------------------+-----------+------+--------+---------------+--------------+-----------+-------------+
| 1598 | 2007-10-04 21:57:56 | 2007-10-04 22:06:20 | 3108 | 40 | 12 | 2147 | 2162 | 1 | 4864.000000 |
| 1652 | 2007-10-04 23:01:18 | 2007-10-04 23:09:22 | 3108 | 40 | 12 | 2220 | 2223 | 1 | 804.000000 |
---snip-----
23 rows in set (0.00 sec)
mysql> SELECT *, TIME_TO_SEC(TIMEDIFF((date_stop),(date_start))) AS duration FROM alarms WHERE
triggered = 1 AND interface=3108;
+------+---------------------+---------------------+-----------+------+--------+---------------+--------------+-----------+----------+
| id | date_start | date_stop | interface | type | active | referer_start | referer_stop | triggered |
duration |
+------+---------------------+---------------------+-----------+------+--------+---------------+--------------+-----------+----------+
| 1598 | 2007-10-04 21:57:56 | 2007-10-04 22:06:20 | 3108 | 40 | 12 | 2147 | 2162 | 1 | 504 |
| 1652 | 2007-10-04 23:01:18 | 2007-10-04 23:09:22 | 3108 | 40 | 12 | 2220 | 2223 | 1 | 484 |
---snip---
23 rows in set (0.00 sec)
I attached the patch for the event api.
Other thing, I don't know the impact of having time between 2 poll cycles > 5 minutes (for RRD ?) I tried to
increase the number of poller threads but I notice that with thread > 40 the system is overloaded (IO > 15
Mb/s and cpu increasing to 100% and load average jumping to 60), with poller thread = 39 I have a correct
value for loadaverage, CPU, and IO rate) but my poll cycle is > 5 Minutes ... I plan to add up to 10 000
interface on this system. Migrating the database to another box does not solve the overload problem, I
don't know if I'm facing a limit, a misconfiguration or a BUG somewhere.
Any advice will be greatly appreciated.
_______________
David LIMA
Professional Services
www.scc.com
-----Message d'origine-----
De : jffnms-users-bounces@...
[mailto:jffnms-users-bounces@...] De la part de
Craig Small
Envoyé : samedi 6 octobre 2007 00:07
À : jffnms-users@...
Objet : Re: [jffnms-users] Consolidate and poller2 bug
On Fri, Oct 05, 2007 at 07:42:33PM +0200, LIMA David wrote:
> Here are the results, i repeat that i only appears using poller2.
Hmm, thats strange in itself, but let's see.
> mysql> SELECT (date_stop - date_start) AS duration FROM alarms WHERE triggered = 1 AND interface=3108;
> +-------------+
> | duration |
> +-------------+
> | 4864.000000 |
OK, here is our problem. Can you run
SELECT *, (date_stop - date_start) AS duration FROM alarms WHERE triggered = 1 AND interface=3108;
> | 2147 | 2007-10-04 21:57:56 | 40 | 306 | Reachability Test | unreachable | status | 80% Packet Loss | 0 | 1 | 1 |
> | 2162 | 2007-10-04 22:06:20 | 40 | 306 | Reachability Test | reachable | status | 0% Packet Loss | 0 | 1 | 1 |
So they're the right times as before.
- Craig
--
Craig Small GnuPG:1C1B D893 1418 2AF4 45EE 95CB C76C E5AC 12CA DFA5
http://www.enc.com.au/ csmall at : enc.com.au
http://www.debian.org/ Debian GNU/Linux, software should be Free
-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems? Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
_______________________________________________
jffnms-users mailing list
jffnms-users@...
https://lists.sourceforge.net/lists/listinfo/jffnms-users
______________________________________________________________________
Ce message contient des informations dont le contenu est susceptible d'etre confidentiel.
Il est destine au(x) destinataire(s) indique(s) exclusivement.
A moins que vous ne fassiez partie de la liste des destinataires, ou que vous soyez habilite a recevoir le
mail a leur place, il vous est interdit de le copier, de l'utiliser ou de devoiler son contenu a un tiers.
Si vous avez recu cet email par erreur, merci de prendre contact avec l'emetteur.
Les opinions exprimees dans cet e-mail sont celles de l'emetteur et ne refletent pas necessairement
celles de l'entreprise.
Ce e-mail peut contenir des pieces jointes dont certaines pourraient contenir des virus qui pourraient
endommager votre systeme informatique.
La compagnie a pris toutes dispositions afin de minimiser ce risque et decline toute responsabilite pour
toute perte ou dommage resultant directement ou indirectement de l'utilisation de cet email ou de son contenu.
Il vous appartient d'effectuer vos propres controles anti-virus avant d'ouvrir la ou les pieces jointes.
______________________________________________________________________
-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems? Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
_______________________________________________
jffnms-users mailing list
jffnms-users@...
https://lists.sourceforge.net/lists/listinfo/jffnms-users