N, Ravikiran | 1 Aug 10:47 2014
Picon

Application level HA using heartbeat.. ??

Hi,

I was trying to understand how Heartbeat actually works. I found out that Heartbeat provides HA at a system
level rather than application level, meaning the Virtual IP is moved to backup system only on a
system-down-scenario and not when application monitored goes down. I verified this. I had configured
httpd in my haresources file and I manually stopped httpd using  "service httpd stop". Although this stops
httpd service, Heartbeat doesn't recognize this. ..!!
But when I stop heartbeat or bring down my machine, Backup comes up and starts httpd. Is this the correct
scenario.. if so, please let me know why I should provide a script in resources.d/ to start, stop and find
status of the application. Also, how can I achieve application level HA using heartbeat..!
Thanks in advance.. :)

Regards,
Ravikiran N

_______________________________________________
Linux-HA mailing list
Linux-HA <at> lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

jarek | 1 Aug 09:39 2014
Picon

Virtual address for slave

Hello!

	I'd like to have two virtual adresses: vip-master and vip-slave.
vip-master should be bound to master mode, vip-slave should be bound to
slave node.
	How can I do it ?

Best regards
Jarek

_______________________________________________
Linux-HA mailing list
Linux-HA <at> lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Ulrich Windl | 30 Jul 16:05 2014
Picon

Antw: Attention: Problematic Update for SLES11 (kernel)

Hello!

An update: The problem is known at SUSE and there is a temporary fix (PTF.876616) for this issue.
Unfortunately the kernel with the defect is newer than the PTF, i.e. the PTF is not included in the latest kernel.

Regards,
Ulrich

>>> Ulrich Windl schrieb am 30.07.2014 um 08:47 in Nachricht <53D894E7.ECA : 161 :
60728>:
> Hi!
> 
> I wanted to notify you that one of the recent updates for SLES11 SP3 may 
> cause trouble when using cLVM: On an updated node, cLVM won't start any more, 
> and the kernel will flood your syslog with messages like:
> 
> Jul 30 08:17:09 h05 kernel: [  563.700629] dlm: Trying to connect to 172.20
> .16.1
> Jul 30 08:17:09 h05 kernel: [  563.700836] dlm: Can't start SCTP association 
> - retrying
> Jul 30 08:17:09 h05 kernel: [  563.700843] dlm: Retry sending 48 bytes to 
> node id 17831084
> Jul 30 08:17:09 h05 kernel: [  563.700852] dlm: Retrying SCTP association 
> init for node 17831084
> 
> The issue will be investigated, but be prepared for trouble if you update 
> just one node in your cluster.
> 
> Regards,
> Ulrich
(Continue reading)

Ulrich Windl | 30 Jul 08:47 2014
Picon

Attention: Problematic Update for SLES11

Hi!

I wanted to notify you that one of the recent updates for SLES11 SP3 may cause trouble when using cLVM: On an
updated node, cLVM won't start any more, and the kernel will flood your syslog with messages like:

Jul 30 08:17:09 h05 kernel: [  563.700629] dlm: Trying to connect to 172.20
.16.1
Jul 30 08:17:09 h05 kernel: [  563.700836] dlm: Can't start SCTP association - retrying
Jul 30 08:17:09 h05 kernel: [  563.700843] dlm: Retry sending 48 bytes to node id 17831084
Jul 30 08:17:09 h05 kernel: [  563.700852] dlm: Retrying SCTP association init for node 17831084

The issue will be investigated, but be prepared for trouble if you update just one node in your cluster.

Regards,
Ulrich

_______________________________________________
Linux-HA mailing list
Linux-HA <at> lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Ulrich Windl | 28 Jul 12:15 2014
Picon

crm: "INFO: object cli-ban-... cannot be represented in the CLI notation"

Hi!

In SLES11 SP3 I feel that the message in new in crm (I haven't noticed it in the past):
INFO: object cli-ban-grp_c11_db-on-h07 cannot be represented in the CLI notation

The constraint seems to be created by crm migrate itself! That makes it interesting!

crm(live)configure# show cli-ban-grp_c11_db-on-h07
xml <rsc_location id="cli-ban-grp_c11_db-on-h07" rsc="grp_c11_db" role="Started"> \
  <rule id="cli-ban-grp_c11_db-on-h07-rule" score="-INFINITY"> \
    <expression id="cli-ban-grp_c11_db-on-h07-expr" attribute="#uname" operation="eq"
value="h07"/> \
    <date_expression id="cli-ban-grp_c11_db-on-h07-lifetime" operation="lt" end="2014-07-11
18:29:12Z"/> \
  </rule> \
</rsc_location>

crmsh-1.2.6-0.33.1

Regards,
Ulrich

_______________________________________________
Linux-HA mailing list
Linux-HA <at> lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Dang Zhiqiang | 28 Jul 11:18 2014

How to modify crm configure by command line

hi,
I want to modify op start timeout value through command line, but I search on internet find nothing.
I try crm_resource  comand, but I only modify params and meta.

root <at> host2:~# crm configure show test-ip
primitive test-ip ocf:openindiana:IPaddr \
        params ip="192.168.1.253" nic="igb0" cidr_netmask="24" \
        op start interval="0s" timeout="60s" on-fail="restart" \
        op monitor interval="10s" timeout="60s" on-fail="restart" \
        op stop interval="0s" timeout="60s" on-fail="stop" \
        meta target-role="Stopped"

_______________________________________________
Linux-HA mailing list
Linux-HA <at> lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Dejan Muhamedagic | 21 Jul 16:49 2014
Picon

glue 1.0.12 released

Hello,

The current glue repository has been tagged as 1.0.12.

It's been a while since the release candidate 1.0.12-rc1. There
were a few minor fixes and additions in the meantime, mostly for
hb_report.

Please upgrade at the earliest possible opportunity.

You can get the 1.0.12 tarball here:

	http://hg.linux-ha.org/glue/archive/glue-1.0.12.tar.bz2

The ChangeLog is available here:

http://hg.linux-ha.org/glue/file/glue-1.0.12/ChangeLog

A set of rpms is also available at the openSUSE Build Service:*)

http://download.opensuse.org/repositories/network:/ha-clustering:/Stable/

The packages at the openSUSE Build Service will not work with
pacemaker versions earlier than v1.1.8 because the LRM bits are
not compiled.

Many thanks to all contributors. Without you this release would
not have been possible.

Enjoy!
(Continue reading)

Nirmal Fernando | 21 Jul 13:48 2014

Errors when starting heartbeat on CentOS

Hi All,

I was trying to configure heart beat on 2 AWS EC2 instances (CentOS) and
currently facing following error [1].

Also the kernel version;

*[root <at> node01 stratos]# rpm -qa |egrep 'heartbeat|kernel-2.6'*
kernel-2.6.32-431.5.1.el6.x86_64
kernel-2.6.32-279.1.1.el6.x86_64
heartbeat-3.0.4-2.el6.x86_64
kernel-2.6.32-431.11.2.el6.x86_64
heartbeat-libs-3.0.4-2.el6.x86_64
kernel-2.6.32-431.17.1.el6.x86_64
kernel-2.6.32-431.20.3.el6.x86_64

Any help is appreciated.

[1]
Jul 21 10:22:25 node01 heartbeat: [3083]: info: **************************
Jul 21 10:22:25 node01 heartbeat: [3083]: info: Configuration validated.
Starting heartbeat 3.0.4
Jul 21 10:22:25 node01 heartbeat: [3084]: info: heartbeat: version 3.0.4
Jul 21 10:22:25 node01 heartbeat: [3084]: info: Heartbeat generation:
1405925294
Jul 21 10:22:25 node01 heartbeat: [3084]: info: glib: ucast: write socket
priority set to IPTOS_LOWDELAY on eth0
Jul 21 10:22:25 node01 heartbeat: [3084]: info: glib: ucast: bound send
socket to device: eth0

(Continue reading)

Charles Taylor | 17 Jul 17:24 2014
Picon

Managed Failovers w/ NFS HA Cluster

I feel like this is something that must have been covered extensively already but I've done a lot of
googling, looked at a lot of cluster configs, but have not found the solution.

I have an HA NFS cluster (corosync+pacemaker).  The relevant rpms are listed below but I'm not sure they are
that important to the question which is this...

When performing managed failovers of the NFS-exported file system resource from one node to the other (crm
resource move), any active NFS clients experience an I/O error when the file system is unexported.  In
other words, you must unexport it to unmount it.  As soon as it is unexported, clients are no longer able to
write to it and experience an I/O error (rather than just blocking).

In a failure scenario this is not a problem becuase the file system is never unexported on the primary
server.  Rather the server just goes down, the secondary takes over the resources and client I/O blocks
until the process is complete and then goes about its business.   We would like this same behavior for a
*managed* failover but have not found a mount or export option/scenario that works.   Is it possible?  What
am I missing?

I realize this is more of an nfs/exportfs question but I would think that those implementing NFS HA clusters
would be familiar with the scenario I'm describing.

Regards,

Charlie Taylor

pacemaker-cluster-libs-1.1.7-6.el6.x86_64
pacemaker-cli-1.1.7-6.el6.x86_64
pacemaker-1.1.7-6.el6.x86_64
pacemaker-libs-1.1.7-6.el6.x86_64
resource-agents-3.9.2-40.el6.x86_64
fence-agents-3.1.5-35.el6.x86_64
(Continue reading)

willi.fehler@t-online.de | 18 Jul 13:05 2014
Picon

DRBD on CentOS7

Hello,

I'm trying to use DRBD on CentOS7. It looks like RedHat hasn't compiled DRBD into the Kernel.
So I downloaded the source rpm from Fedora 19 and created my own rpm.

[root <at> centos7 ~]# rpm -qa | grep drbd
drbd-utils-8.4.3-2.el7.centos.x86_64
drbd-8.4.3-2.el7.centos.x86_64
drbd-udev-8.4.3-2.el7.centos.x86_64

But I cannot load the drbd kernel module:

[root <at> centos7 ~]# modprobe drbd
modprobe: FATAL: Module drbd not found.

Regards - Willi

_______________________________________________
Linux-HA mailing list
Linux-HA <at> lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Felipe Lima | 15 Jul 14:33 2014
Picon

Ha.cf with IPv6 address

Dear,

I am using the following configuration on my server:

ubuntu 12.04 + heartbeat 1:3.0.5-3ubuntu

I wonder if it is possible to configure an IPv6 address on file ha.cf? 
Well done proper setting there is the following error message:

Starting High-Availability services: ipv6addr [3390]: INFO: Resource is 
stopped
Heartbeat failure [rc = 6]. Failed.

heartbeat [3436]: 2014/07/14_15: 59:32 ERROR: glib: ucast: can not 
resolve hostname
heartbeat [3436]: 2014/07/14_15: 59:32 ERROR: glib: ucast: Interface 
[eth0] does not exist
heartbeat [3436]: 2014/07/14_15: 59:32 ERROR: Heartbeat not started: 
configuration error.
heartbeat [3436]: 2014/07/14_15: 59:32 ERROR: Configuration error, 
heartbeat not started.

Thank you in advance for your help.

Best Regards,
Felipe Lima
_______________________________________________
Linux-HA mailing list
Linux-HA <at> lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
(Continue reading)


Gmane