Romi Verma | 2 Mar 2009 04:27

Re: quorum server

Thanks Andrew and Lars,
I would like to understand this a bit more. Is there any distance parameter here? I mean, suppose I have two nodes in one room and one node in an adjacent room; can't I form a three-node openais/pacemaker cluster here?

I was under the impression that as long as multicast IP works, nothing stops us from forming a cluster.

Regards,


On Sat, Feb 28, 2009 at 12:04 AM, Lars Marowsky-Bree <lmb@...> wrote:
> On 2009-02-27T13:38:02, Romi Verma <romi3rdfeb@...> wrote:
>
> > Thanks for the reply, Andrew.
> > OK, then how does an openais-pacemaker cluster handle a cluster whose
> > nodes span multiple sites?
>
> This is not currently handled.
>
> But deploying a split-site setup with quorumd never worked either; it is
> not integrated with fencing, and it is impossible to build a working
> split-site cluster with it. This was, I'm afraid to say, only a 40%
> implementation of the solution even in heartbeat/CCM land.
>
> Having proper split-site support is on the roadmap, as soon as the
> openAIS integration stabilizes.
>
> Regards,
>    Lars
>
> --
> Teamlead Kernel, SuSE Labs, Research and Development
> SUSE LINUX Products GmbH, GF: Markus Rex, HRB 16746 (AG Nürnberg)
> "Experience is the name everyone gives to their mistakes." -- Oscar Wilde


_______________________________________________
Pacemaker mailing list
Pacemaker@...
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
Priyanka Ranjan | 2 Mar 2009 06:00

samba resource agent

Hi All,
Do we have a Samba resource agent available?

Thanks.

Junko IKEDA | 2 Mar 2009 07:28

pingd.c needs glib header

Hi,

Here is a small patch for pingd.c.
The compiler might complain about g_timeout_add_seconds(),
because there is no "#include <glib.h>".

Best Regards,
Junko Ikeda

NTT DATA INTELLILINK CORPORATION
Attachment (pingd.patch): application/octet-stream, 292 bytes
Andrew Beekhof | 2 Mar 2009 08:13

Re: pingd.c needs glib header

weird - it compiles here.

applied

On Mon, Mar 2, 2009 at 07:28, Junko IKEDA
<ikedaj@...> wrote:
> Hi,
>
> Here is a small patch for pingd.c.
> The compiler might complain about g_timeout_add_seconds(),
> because there is no "#include <glib.h>".
>
> Best Regards,
> Junko Ikeda
>
> NTT DATA INTELLILINK CORPORATION
Andrew Beekhof | 2 Mar 2009 08:14

Re: samba resource agent

We get all our RAs from Heartbeat and it doesn't look like they have one yet.
They're pretty easy to write though
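
As a rough illustration of the "easy to write" point, here is a minimal sketch of what an OCF-style agent for smbd might look like. It is not an existing agent: the function names, the default pidfile path, and the parameter names (OCF_RESKEY_smbd, OCF_RESKEY_pidfile) are assumptions made for this example, and a real agent would also need meta-data output and parameter validation. Only the exit codes follow the OCF convention (0 success, 1 generic error, 3 unimplemented action, 7 not running).

```shell
#!/bin/sh
# Hypothetical sketch of an OCF-style resource agent for smbd.
# Paths and parameter names below are assumptions; adjust for your system.

OCF_SUCCESS=0
OCF_ERR_GENERIC=1
OCF_ERR_UNIMPLEMENTED=3
OCF_NOT_RUNNING=7

SMBD="${OCF_RESKEY_smbd:-/usr/sbin/smbd}"
PIDFILE="${OCF_RESKEY_pidfile:-/var/run/samba/smbd.pid}"

# monitor: the service counts as running only if the pidfile names a live process.
samba_monitor() {
    [ -f "$PIDFILE" ] || return $OCF_NOT_RUNNING
    pid=$(cat "$PIDFILE")
    kill -0 "$pid" 2>/dev/null || return $OCF_NOT_RUNNING
    return $OCF_SUCCESS
}

# start: idempotent; starting an already running service is a success.
# A real agent would wait for the pidfile to appear before reporting.
samba_start() {
    samba_monitor && return $OCF_SUCCESS
    "$SMBD" -D || return $OCF_ERR_GENERIC
    samba_monitor
}

# stop: idempotent; stopping an already stopped service is a success.
samba_stop() {
    samba_monitor || return $OCF_SUCCESS
    kill "$(cat "$PIDFILE")" || return $OCF_ERR_GENERIC
    return $OCF_SUCCESS
}

# Map the requested action to its handler.
samba_main() {
    case "$1" in
        start)   samba_start ;;
        stop)    samba_stop ;;
        monitor) samba_monitor ;;
        *)       return $OCF_ERR_UNIMPLEMENTED ;;
    esac
}

# Dispatch only when an action was passed on the command line.
if [ $# -gt 0 ]; then
    samba_main "$@"
    exit $?
fi
```

The key design point is that start and stop are idempotent and that monitor distinguishes "not running" (7) from a failure, since the cluster relies on those return codes to decide what to do next.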

On Mon, Mar 2, 2009 at 06:00, Priyanka Ranjan
<priyanka3rdfeb@...> wrote:
> Hi All,
> Do we have a Samba resource agent available?
>
> Thanks.
Andrew Beekhof | 2 Mar 2009 08:17

Re: quorum server

On Mon, Mar 2, 2009 at 04:27, Romi Verma <romi3rdfeb@...> wrote:
> Thanks Andrew and Lars,
> I would like to understand this a bit more. Is there any distance parameter
> here? I mean, suppose I have two nodes in one room and one node in an
> adjacent room; can't I form a three-node openais/pacemaker cluster here?

Of course.
I believe the point is that as the distance increases, so does the
number of switches/telcos involved, and therefore the probability of a
comms failure between the two sites becomes quite high,
as does the latency.
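
The two-rooms-plus-one layout asked about above can be reasoned about with the usual majority rule: a partition keeps quorum only if it can still see more than half of the configured nodes. The sketch below is illustrative only; has_quorum and partition_status are hypothetical helper names, not part of openais or Pacemaker, and the sketch ignores fencing and any configured no-quorum policy.

```shell
#!/bin/sh
# Illustrative only: strict-majority quorum for a partitioned cluster.

# has_quorum TOTAL VISIBLE -> exit status 0 if VISIBLE is more than half of TOTAL.
has_quorum() {
    total=$1
    visible=$2
    [ $((visible * 2)) -gt "$total" ]
}

# partition_status TOTAL VISIBLE -> prints "quorate" or "inquorate".
partition_status() {
    if has_quorum "$1" "$2"; then
        echo "quorate"
    else
        echo "inquorate"
    fi
}

# Three nodes, and the link between the two rooms fails:
echo "room with two nodes: $(partition_status 3 2)"
echo "room with one node:  $(partition_status 3 1)"
# Two sites with two nodes each: neither half has a majority.
echo "even 2+2 split:      $(partition_status 4 2)"
```

With the 2+1 layout the room holding two nodes stays quorate after a link failure, while an even split leaves neither side with a majority, which is one reason a split-site setup needs some additional tie-breaker.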

> I was under the impression that as long as multicast IP works, nothing
> stops us from forming a cluster.
>
> Regards,
>
>
> On Sat, Feb 28, 2009 at 12:04 AM, Lars Marowsky-Bree <lmb@...> wrote:
>>
>> On 2009-02-27T13:38:02, Romi Verma <romi3rdfeb@...> wrote:
>>
>> > Thanks for the reply, Andrew.
>> > OK, then how does an openais-pacemaker cluster handle a cluster whose
>> > nodes span multiple sites?
>>
>> This is not currently handled.
>>
>> But deploying a split-site setup with quorumd never worked either; it is
>> not integrated with fencing, and it is impossible to build a working
>> split-site cluster with it. This was, I'm afraid to say, only a 40%
>> implementation of the solution even in heartbeat/CCM land.
>>
>>
>> Having proper split-site support is on the roadmap, as soon as the
>> openAIS integration stabilizes.
>>
>>
>> Regards,
>>    Lars
>>
>> --
>> Teamlead Kernel, SuSE Labs, Research and Development
>> SUSE LINUX Products GmbH, GF: Markus Rex, HRB 16746 (AG Nürnberg)
>> "Experience is the name everyone gives to their mistakes." -- Oscar Wilde
Junko IKEDA | 2 Mar 2009 08:32

Re: handling snmp trap with crm_mon

> > I tried this.
> > # crm_mon -S 192.168.16.28 -VVVVV
> >
> > but there is no log message in ha-debug, syslog, or stderr.
> > Is there an installation failure?
> 
> No. You'll get a big error message if you use -S and snmp support
> wasn't compiled in.

I got the logs.
The log says that "ret = snmp_send(session, trap_pdu)" completed successfully,
but the SNMP manager did not receive any traps.

Thanks,
Junko

Mar  2 16:02:12 node-a syslogd 1.4.1: restart (remote reception).
Mar  2 16:02:22 node-a crm_mon: [8334]: info: main: Starting crm_mon
Mar  2 16:02:22 node-a crm_mon: [8334]: WARN: cluster_status: We do not have quorum - fencing and resource management disabled
Mar  2 16:02:22 node-a crm_mon: [8334]: WARN: unpack_resources: No STONITH resources have been defined
Mar  2 16:02:22 node-a crm_mon: [8334]: info: G_main_add_SignalHandler: Added signal handler for signal 15
Mar  2 16:02:22 node-a crm_mon: [8334]: info: G_main_add_SignalHandler: Added signal handler for signal 2
Mar  2 16:02:22 node-a crm_mon: [8334]: info: G_main_add_TriggerHandler: Added signal manual handler
Mar  2 16:02:37 node-a crm_mon: [8334]: WARN: cluster_status: We do not have quorum - fencing and resource management disabled
Mar  2 16:02:37 node-a crm_mon: [8334]: WARN: unpack_resources: No STONITH resources have been defined
Mar  2 16:02:37 node-a crm_mon: [8334]: WARN: cluster_status: We do not have quorum - fencing and resource management disabled
Mar  2 16:02:37 node-a crm_mon: [8334]: WARN: unpack_resources: No STONITH resources have been defined
Mar  2 16:02:44 node-a crm_mon: [8334]: WARN: cluster_status: We do not have quorum - fencing and resource management disabled
Mar  2 16:02:44 node-a crm_mon: [8334]: WARN: unpack_resources: No STONITH resources have been defined
Mar  2 16:02:44 node-a crm_mon: [8334]: WARN: cluster_status: We do not have quorum - fencing and resource management disabled
Mar  2 16:02:44 node-a crm_mon: [8334]: WARN: unpack_resources: No STONITH resources have been defined
Mar  2 16:02:45 node-a crm_mon: [8334]: WARN: cluster_status: We do not have quorum - fencing and resource management disabled
Mar  2 16:02:45 node-a crm_mon: [8334]: WARN: unpack_resources: No STONITH resources have been defined
Mar  2 16:02:45 node-a crm_mon: [8334]: info: determine_online_status: Node node-b is online
Mar  2 16:02:45 node-a crm_mon: [8334]: WARN: cluster_status: We do not have quorum - fencing and resource management disabled
Mar  2 16:02:45 node-a crm_mon: [8334]: WARN: unpack_resources: No STONITH resources have been defined
Mar  2 16:02:45 node-a crm_mon: [8334]: info: determine_online_status: Node node-b is online
Mar  2 16:02:45 node-a crm_mon: [8334]: info: determine_online_status: Node node-a is online
Mar  2 16:02:46 node-a crm_mon: [8334]: WARN: unpack_resources: No STONITH resources have been defined
Mar  2 16:02:46 node-a crm_mon: [8334]: info: determine_online_status: Node node-b is online
Mar  2 16:02:46 node-a crm_mon: [8334]: info: determine_online_status: Node node-a is online
Mar  2 16:02:54 node-a crm_mon: [8334]: info: cib_process_diff: Diff 0.5.3 -> 0.12.1 not applied to 0.5.1: current "num_updates" is less than required
Mar  2 16:02:54 node-a crm_mon: [8334]: notice: unpack_config: On loss of CCM Quorum: Ignore
Mar  2 16:02:54 node-a crm_mon: [8334]: info: determine_online_status: Node node-b is online
Mar  2 16:02:54 node-a crm_mon: [8334]: info: determine_online_status: Node node-a is online
Mar  2 16:02:54 node-a crm_mon: [8334]: WARN: cib_process_diff: Diff 0.12.1 -> 0.12.2 not applied to 0.12.2: current "num_updates" is greater than required
Mar  2 16:02:54 node-a crm_mon: [8334]: notice: handle_rsc_op: monitor of dummy on 59295d90-5459-490d-a1e0-d48810cf2fb3 completed: ok
Mar  2 16:02:54 node-a crm_mon: [8334]: notice: unpack_config: On loss of CCM Quorum: Ignore
Mar  2 16:02:54 node-a crm_mon: [8334]: info: determine_online_status: Node node-b is online
Mar  2 16:02:54 node-a crm_mon: [8334]: info: determine_online_status: Node node-a is online
Mar  2 16:02:55 node-a crm_mon: [8334]: notice: handle_rsc_op: monitor of dummy on b3852a23-c10b-440a-a8e0-263b0185d657 completed: ok
Mar  2 16:02:55 node-a crm_mon: [8334]: notice: unpack_config: On loss of CCM Quorum: Ignore
Mar  2 16:02:55 node-a crm_mon: [8334]: info: determine_online_status: Node node-b is online
Mar  2 16:02:55 node-a crm_mon: [8334]: info: determine_online_status: Node node-a is online
Mar  2 16:02:56 node-a crm_mon: [8334]: notice: handle_rsc_op: start of dummy on b3852a23-c10b-440a-a8e0-263b0185d657 completed: ok
Mar  2 16:02:56 node-a crm_mon: [8334]: info: send_snmp_trap: ret = 175129956
Mar  2 16:02:56 node-a crm_mon: [8334]: notice: unpack_config: On loss of CCM Quorum: Ignore
Mar  2 16:02:56 node-a crm_mon: [8334]: info: determine_online_status: Node node-b is online
Mar  2 16:02:56 node-a crm_mon: [8334]: info: determine_online_status: Node node-a is online
Mar  2 16:02:58 node-a crm_mon: [8334]: notice: handle_rsc_op: monitor of dummy on b3852a23-c10b-440a-a8e0-263b0185d657 completed: ok
Mar  2 16:02:58 node-a crm_mon: [8334]: info: send_snmp_trap: ret = 175129957
Mar  2 16:02:58 node-a crm_mon: [8334]: notice: unpack_config: On loss of CCM Quorum: Ignore
Mar  2 16:02:58 node-a crm_mon: [8334]: info: determine_online_status: Node node-b is online
Mar  2 16:02:58 node-a crm_mon: [8334]: info: determine_online_status: Node node-a is online
Mar  2 16:03:27 node-a crm_mon: [8334]: info: main: Exiting crm_mon
Mar  2 16:03:35 node-a cibadmin: [8456]: info: Invoked: cibadmin -Ql -o nodes
Mar  2 16:03:37 node-a cibadmin: [8564]: info: Invoked: cibadmin -Ql
Mar  2 16:03:37 node-a ccm_tool: [8567]: info: mem_handle_event: Got an event OC_EV_MS_NEW_MEMBERSHIP from ccm
Mar  2 16:03:37 node-a ccm_tool: [8567]: info: mem_handle_event: instance=2, nodes=2, new=2, lost=0, n_idx=0, new_idx=0, old_idx=4
Mar  2 16:03:37 node-a crm_shadow: [8581]: info: Invoked: crm_shadow
Mar  2 16:03:37 node-a crm_shadow: [8595]: info: Invoked: crm_shadow
Mar  2 16:03:37 node-a cibadmin: [8596]: info: Invoked: cibadmin -Ql
Mar  2 16:03:37 node-a crmd: [8746]: info: main: CRM Hg Version: 0de73ec89e0257f89d37fb43d437e6f54eedb3d6
Attachment (snmp.tar.bz2): application/octet-stream, 31 KiB
Andrew Beekhof | 2 Mar 2009 08:47

Re: handling snmp trap with crm_mon

I just tried again, and it's working here.
How are you checking for traps?

I get the following:

c001n05:~ # echo "disableAuthorization yes"  > /etc/snmp/snmptrapd.conf
c001n05:~ # sudo snmptrapd -f -a -e -Le -d -Drun:shell,snmptrapd:auth
registered debug token run:shell, 1
registered debug token snmptrapd:auth, 1
NET-SNMP version 5.4.2.1

Received 208 bytes from UDP: [10.10.222.159]->[10.10.222.157]:-29306
snmptrapd:auth: authorization turned off
snmptrapd:auth: authorization turned off: not checking
snmptrapd:auth: authorization turned off
2009-03-02 08:40:21 c001n03.suse.de [UDP:
[10.10.222.159]->[10.10.222.157]:-29306]:
DISMAN-EVENT-MIB::sysUpTimeInstance = Timeticks: (1235979384) 143
days, 1:16:33.84	SNMPv2-MIB::snmpTrapOID.0 = OID:
SNMPv2-SMI::enterprises.32723.1	SNMPv2-SMI::enterprises.32723.1.2 =
STRING: "rsc_c001n01"	SNMPv2-SMI::enterprises.32723.1.1 = STRING:
"c001n01"	SNMPv2-SMI::enterprises.32723.1.3 = STRING:
"stop"	SNMPv2-SMI::enterprises.32723.1.4 = STRING:
"ok"	SNMPv2-SMI::enterprises.32723.1.6 = INTEGER:
0	SNMPv2-SMI::enterprises.32723.1.7 = INTEGER:
0	SNMPv2-SMI::enterprises.32723.1.5 = INTEGER: 0
snmptrapd:auth: authorization turned off

Received 205 bytes from UDP: [10.10.222.159]->[10.10.222.157]:-29306
snmptrapd:auth: authorization turned off
snmptrapd:auth: authorization turned off: not checking
snmptrapd:auth: authorization turned off
2009-03-02 08:40:21 c001n03.suse.de [UDP:
[10.10.222.159]->[10.10.222.157]:-29306]:
DISMAN-EVENT-MIB::sysUpTimeInstance = Timeticks: (1235979384) 143
days, 1:16:33.84	SNMPv2-MIB::snmpTrapOID.0 = OID:
SNMPv2-SMI::enterprises.32723.1	SNMPv2-SMI::enterprises.32723.1.2 =
STRING: "migrator"	SNMPv2-SMI::enterprises.32723.1.1 = STRING:
"c001n02"	SNMPv2-SMI::enterprises.32723.1.3 = STRING:
"stop"	SNMPv2-SMI::enterprises.32723.1.4 = STRING:
"ok"	SNMPv2-SMI::enterprises.32723.1.6 = INTEGER:
0	SNMPv2-SMI::enterprises.32723.1.7 = INTEGER:
0	SNMPv2-SMI::enterprises.32723.1.5 = INTEGER: 0
snmptrapd:auth: authorization turned off

etc etc

On Mon, Mar 2, 2009 at 08:32, Junko IKEDA
<ikedaj@...> wrote:
>> > I tried this.
>> > # crm_mon -S 192.168.16.28 -VVVVV
>> >
>> > but there is no log message in ha-debug, syslog, or stderr.
>> > Is there an installation failure?
>>
>> No. You'll get a big error message if you use -S and snmp support
>> wasn't compiled in.
>
> I got the logs.
> The log says that "ret = snmp_send(session, trap_pdu)" completed successfully,
> but the SNMP manager did not receive any traps.
>
> Thanks,
> Junko
Romi Verma | 2 Mar 2009 08:52

Re: quorum server

OK, so let me summarize what I understood.
We can form an openais/pacemaker cluster whose nodes span multiple sites. These sites can be as close as adjacent rooms or as far apart as different continents. The only problem is that openais/pacemaker does not yet have a solution for a communication failure between the sites. That is why a local cluster (nodes sharing the same rack) is recommended.

Am I right?

Thanks & Regards,

On Mon, Mar 2, 2009 at 12:47 PM, Andrew Beekhof <beekhof@...> wrote:
> On Mon, Mar 2, 2009 at 04:27, Romi Verma <romi3rdfeb@...> wrote:
>> Thanks Andrew and Lars,
>> I would like to understand this a bit more. Is there any distance
>> parameter here? I mean, suppose I have two nodes in one room and one node
>> in an adjacent room; can't I form a three-node openais/pacemaker cluster
>> here?
>
> Of course.
> I believe the point is that as the distance increases, so does the
> number of switches/telcos involved, and therefore the probability of a
> comms failure between the two sites becomes quite high,
> as does the latency.
>
>> I was under the impression that as long as multicast IP works, nothing
>> stops us from forming a cluster.
>>
>> Regards,
Glory Smith | 2 Mar 2009 08:55

anyone tried sbd??


Hi,

I am following http://www.linux-ha.org/SBD_Fencing and trying the following command:

 /usr/sbin/sbd -d /dev/sbd -4 20 -1 10 create

but I get the error message "/usr/sbin/sbd: invalid option -- '4'".

Does anyone know what's wrong here?

Regards,

