Pasi Kärkkäinen | 1 Mar 2010 14:53
Picon
Picon
Favicon

Re: Xen on RHEL cluster

On Mon, Mar 01, 2010 at 12:26:15AM +0100, Jakov Sosic wrote:
> On 02/26/2010 06:29 PM, Jakov Sosic wrote:
> > Hi.
> > 
> > I'm using Xen on RHEL cluster, and I have strange problems. I gave raw
> > volumes from storage to Xen virtual machines. With windows, I have a
> > problem that nodes don't see the volume as same one.... for example:
> > 
> > clusternode1# clusvcadm -d vm:winxp
> > clusternode1# dd if=/dev/mapper/winxp of=/node1winxp
> > clusternode2# dd if=/dev/mapper/winxp of=/node2winxp
> > clusternode3# dd if=/dev/mapper/winxp of=/node3winxp
> > 
> > When I download these files and diff them, they all three differ.
> > 
> > Also, sometimes very strange things happen. For example I download some
> > file into winxp, shut it down, then start it on another node, and file
> > is missing?!?!?!?!
> > 
> > Should I use CLVM and not raw volumes from storage? Why is this happening?
> 
> It seems that CLVM is solution (migrated whole cluster to clvm today),
> and that the bug is double caching (both domU and dom0 caching), and
> that bugs like this occur when using raw LUNs:
> https://bugzilla.redhat.com/show_bug.cgi?id=466681
> 

Oh yeah, this bug can definitely affect you if you're accessing the same 
disk images from both domU and from dom0.

(Continue reading)

Lon Hohberger | 1 Mar 2010 19:15
Picon
Favicon

Re: Service not migrating

On Fri, 2010-02-26 at 00:09 +0000, Joseph L. Casale wrote:
> Hi,
> Still testing around and learning the ins and outs of rhcs, I have an apache
> service with a mount etc configured that starts fine, migrates as well. If
> I drop the Ethernet interface on the node with the service active, the remaining
> node never starts the service, but if I gracefully take the running node out by
> rebooting it etc, the service migrates.
> 
> Any ideas what I am missing?

<ip monitor_link="1" address=... />

?

-- Lon

Lon Hohberger | 1 Mar 2010 19:19
Picon
Favicon

Re: Virtual machine fence fail question

On Wed, 2010-02-24 at 10:03 +0800, Bernard Chew wrote:
> > On Wed, Feb 24, 2010 at 4:05 AM, Lon Hohberger <lhh <at> redhat.com> wrote:
> > On Tue, 2010-02-23 at 11:05 +0800, Bernard Chew wrote:
> >> Hi,
> >>
> >> Given I have 2 Red Hat Clusters; 1 cluster consisting of physical
> >> hosts and another consisting of virtual guests which are hosted in the
> >> physical hosts. The physical host cluster uses DRAC fencing while the
> >> virtual guest cluster uses virtual machine fencing.
> >>
> >> If a physical host goes down, I saw that DRAC fencing takes place
> >> successfully but fencing fail for the virtual guests on the physical
> >> host which go down (together). Does the virtual machine fencing fails
> >> because the virtual guests are no longer available? How can I
> >> configure fencing so that both physical hosts and virtual guests are
> >> fenced correctly?
> >
> > Are you using fence_xvm/fence_xvmd or fence_virsh ?
> >
> > -- Lon
> >
> > --
> > Linux-cluster mailing list
> > Linux-cluster <at> redhat.com
> > https://www.redhat.com/mailman/listinfo/linux-cluster
> >
> 
> Hi Lon,
> 
> Thank you for looking into this. I am currently using fence_xvm/fence_xvmd.
(Continue reading)

Joseph L. Casale | 1 Mar 2010 19:34

Re: fence_ifmib problem

>Hi
>
>I didn't have the same error message but I made fence_ifmib to work by 
>installing  dev-python/pysnmp-2.0.9 on the OS.
>You could check which python modules it uses.Maybe this hint could help you.

There's an srpm available that I couldn't get built for that, but I installed
the egg anyway given I am on a test box.

That's not the issue anyway, its complaining about the exit handler, which
exists in the stock rh python provided. Something awry with the script itself.

Would you mind pastebin'ing your script or posting it here?

Thanks!
jlc

Joseph L. Casale | 1 Mar 2010 19:30

Re: Service not migrating

><ip monitor_link="1" address=... />
>
>?

Hey Lon,
It was a fencing error I was missing, its working as expected now!
jlc

Bernard Chew | 2 Mar 2010 03:37
Picon

Re: Virtual machine fence fail question

> On Tue, Mar 2, 2010 at 2:19 AM, Lon Hohberger <lhh <at> redhat.com> wrote:
> On Wed, 2010-02-24 at 10:03 +0800, Bernard Chew wrote:
>> > On Wed, Feb 24, 2010 at 4:05 AM, Lon Hohberger <lhh <at> redhat.com> wrote:
>> > On Tue, 2010-02-23 at 11:05 +0800, Bernard Chew wrote:
>> >> Hi,
>> >>
>> >> Given I have 2 Red Hat Clusters; 1 cluster consisting of physical
>> >> hosts and another consisting of virtual guests which are hosted in the
>> >> physical hosts. The physical host cluster uses DRAC fencing while the
>> >> virtual guest cluster uses virtual machine fencing.
>> >>
>> >> If a physical host goes down, I saw that DRAC fencing takes place
>> >> successfully but fencing fail for the virtual guests on the physical
>> >> host which go down (together). Does the virtual machine fencing fails
>> >> because the virtual guests are no longer available? How can I
>> >> configure fencing so that both physical hosts and virtual guests are
>> >> fenced correctly?
>> >
>> > Are you using fence_xvm/fence_xvmd or fence_virsh ?
>> >
>> > -- Lon
>> >
>> > --
>> > Linux-cluster mailing list
>> > Linux-cluster <at> redhat.com
>> > https://www.redhat.com/mailman/listinfo/linux-cluster
>> >
>>
>> Hi Lon,
>>
(Continue reading)

Digimer | 2 Mar 2010 05:17

cluster.conf arguments sent to fence agents

Hi all,

   I've built a new fence device and am now trying to sort out how and 
what arguments CMAN (?) sends to the fence agent. Is there a document 
explaining this interaction?

   From looking at other fence agents, it seems like CMAN sends either a 
series of single-dashed arguments followed by their values or a list of 
arguments, one argument per new line. What I don't know is how CMAN 
interprets the arguments in the <fence>...</fence> statement to 
arguments sent to the actual fence agent.

Thanks for any help!

Digi

Joseph L. Casale | 2 Mar 2010 05:40

Re: cluster.conf arguments sent to fence agents

>   I've built a new fence device and am now trying to sort out how and 
>what arguments CMAN (?) sends to the fence agent. Is there a document 
>explaining this interaction?
>
>   From looking at other fence agents, it seems like CMAN sends either a 
>series of single-dashed arguments followed by their values or a list of 
>arguments, one argument per new line. What I don't know is how CMAN 
>interprets the arguments in the <fence>...</fence> statement to 
>arguments sent to the actual fence agent.
>
>Thanks for any help!

Funny, I am hammering away on this now trying to write a Perl fence_ifmib
replacement:)

http://sources.redhat.com/cluster/wiki/FenceAgentAPI

I also made a fencing agent that simply takes stdin and write it to a file
to make sure I know what's being passed.

It will push through whatever you put in there!

jlc

Digimer | 2 Mar 2010 06:18

Re: cluster.conf arguments sent to fence agents

On 10-03-01 11:40 PM, Joseph L. Casale wrote:
>>    I've built a new fence device and am now trying to sort out how and
>> what arguments CMAN (?) sends to the fence agent. Is there a document
>> explaining this interaction?
>>
>>    From looking at other fence agents, it seems like CMAN sends either a
>> series of single-dashed arguments followed by their values or a list of
>> arguments, one argument per new line. What I don't know is how CMAN
>> interprets the arguments in the<fence>...</fence>  statement to
>> arguments sent to the actual fence agent.
>>
>> Thanks for any help!
>
> Funny, I am hammering away on this now trying to write a Perl fence_ifmib
> replacement:)
>
> http://sources.redhat.com/cluster/wiki/FenceAgentAPI
>
> I also made a fencing agent that simply takes stdin and write it to a file
> to make sure I know what's being passed.
>
> It will push through whatever you put in there!
>
> jlc

Awesome, thanks!

Queued that up for light reading in the morning. :)

Digi
(Continue reading)

Steven Whitehouse | 2 Mar 2010 10:06
Picon
Favicon

Re: GFS2 and D state HTTPD processes

Hi,

On Fri, 2010-02-26 at 16:52 +0100, Emilio Arjona wrote:
> Hi,
> 
> we are experiencing some problems commented in an old thread:
> 
> http://www.mail-archive.com/linux-cluster <at> redhat.com/msg07091.html
> 
> We have 3 clustered servers under Red Hat 5.4 accessing a GFS2 resource.
> 
> fstab options:
> /dev/vg_cluster/lv_cluster /opt/datacluster gfs2
> defaults,noatime,nodiratime,noquota 0 0
> 
> GFS options:
> plock_rate_limit="0"
> plock_ownership=1
> 
> httpd processes run into D status sometimes and the only solution is
> hard reset the affected server.
> 
> Can anyone give me some hints to diagnose the problem?
> 
> Thanks :)
> 
Can you give me a rough idea of what the actual workload is and how it
is distributed amoung the director(y/ies) ?

This is often down to contention on glocks (one per inode) and maybe
(Continue reading)


Gmane