arnaud.pillac | 6 Feb 11:39 2014
Picon
Picon

[jgroups-users] Cluster not reconnect node

We actually encounter an error in production environnement. We have two nodes
(oppodindex1&2) in our JGroups cluster. Sometimes in load usage, cluster
member doesn't see each other and we have the following error in our logs:

node1:
22:04:19,971 | INFO  | ppodindex1-12124 | IndexReceiver                    |
275 - fr.bull.telco.orange.pod.index-manager - 0.1.4.SNAPSHOT | Cluster does
BLOCK
22:04:19,975 | INFO  | ppodindex1-12124 | IndexReceiver                    |
275 - fr.bull.telco.orange.pod.index-manager - 0.1.4.SNAPSHOT | Cluster new
VIEW = MergeView::[oppodindex2-65413|3] (2) [oppodindex2-65413,
oppodindex1-12124], 2 subgroups: [oppodindex2-65413|2] (1)
[oppodindex2-65413], [oppodindex1-12124|1] (1) [oppodindex1-12124]
22:04:21,975 | INFO  | ppodindex1-12124 | IndexReceiver                    |
275 - fr.bull.telco.orange.pod.index-manager - 0.1.4.SNAPSHOT | Cluster does
UNBLOCK
22:04:21,975 | WARN  | ppodindex1-12124 | GMS                              |
253 - org.jgroups - 3.4.0.Final | oppodindex1-12124: failed to collect all
ACKs (expected=2) for view [oppodindex2-65413|3] after 2000ms, missing 2
ACKs from oppodindex2-65413, oppodindex1-12124
22:07:02,325 | INFO  | ppodindex1-12124 | IndexReceiver                    |
275 - fr.bull.telco.orange.pod.index-manager - 0.1.4.SNAPSHOT | Cluster does
BLOCK
22:07:02,627 | INFO  | ppodindex1-12124 | IndexReceiver                    |
275 - fr.bull.telco.orange.pod.index-manager - 0.1.4.SNAPSHOT | Cluster new
VIEW = [oppodindex1-12124|4] (1) [oppodindex1-12124]
22:07:02,627 | INFO  | ppodindex1-12124 | IndexReceiver                    |
275 - fr.bull.telco.orange.pod.index-manager - 0.1.4.SNAPSHOT | Cluster does
UNBLOCK
22:07:58,088 | INFO  | ppodindex1-12124 | IndexReceiver                    |
(Continue reading)

Marilen Corciovei | 5 Feb 11:47 2014
Picon

[jgroups-users] JGroups communication stops working after a few minutes

Hello everybody,

I am using jgroups as a base for ehcache replication and in certain 
conditions I am facing a very strange condition.

JGroups is configured for udp, for a 2 nodes cluster. JGroups starts, 
connects, works for 3-4 minutes then stops working without any error. I 
have several clusters and this problem might be related to the 
virtualization platform for the clusters (kvm not ok, virtualbox ok) 
however what I cannot understand is why it works perfectly ok at the 
beginning.

I have spent the last days turning on TRACE for jgroups and I can see 
that at some point some sockets are closed:

2014-02-04 21:48:38,907 TRACE 
[INT-1,EH_CACHE,linux1-26104-org.jgroups.protocols.UNICAST3] 
linux1-26104: removed receive connection for linux2-63479
2014-02-04 21:48:38,907 TRACE 
[INT-1,EH_CACHE,linux1-26104-org.jgroups.protocols.UNICAST3] 
linux1-26104: removed receive connection for linux2-63479
2014-02-04 21:48:39,009 DEBUG 
[Timer-5,EH_CACHE,linux1-26104-org.jgroups.protocols.UNICAST3] 
linux1-26104: removing expired connection for linux1-26104 (60035 ms 
old) from send_table
2014-02-04 21:48:39,009 DEBUG 
[Timer-5,EH_CACHE,linux1-26104-org.jgroups.protocols.UNICAST3] 
linux1-26104: removing expired connection for linux1-26104 (60035 ms 
old) from send_table

(Continue reading)

Chris Lecompte | 5 Feb 07:43 2014
Picon

[jgroups-users] Long Delay in Receiving Messages

First let me say that I’m using a rather old version of JGroups (2.11.1), I’ll be upgrading but can’t
in this particular instance.  I’m using the following protocols, UDP, PING, MERGE2, FD_ALL,
VERIFY_SUSPECT, NAKACK and GMS.  I am seeing an issue for a 24 node cluster where messages are experiencing
serious delays but only to/from certain nodes.  For instance I have a ping operation established to test
communication in the cluster.  The protocol using UDP sends a message from a single node to the group using
channel.send(null, message) and then expects a reply message from each node (including itself).  When
experiencing the problem, a node in question does not appear to receive any messages via multicast.  For
instance, if I run the ping operation on a node that is not experiencing the problem I receive a 23 of 24
replies.  If I run the same operation on the node experiencing the issue then I receive 0 of 24 replies
implying that the node could not send/receive any of the messages within the 10 second timeout that the
operation will wait.  After some time ~30 minutes the messages are received and then from that point on the
issue no longer exists (until it crops up again).  Other nodes on the same host do not necessarily exhibit
the problem.  Is there any particular diagnostic information that I could inspect from the Probe command
or otherwise that might indicate if this is a network related issue?  It seems to be but the delay seems
rather huge in this case.  

Chris
------------------------------------------------------------------------------
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231&iu=/4140/ostg.clktrk
Bela Ban | 24 Jan 14:36 2014
Picon

[jgroups-users] JGroups status presentation

FYI,
[1] http://belaban.blogspot.ch/2014/01/jgroups-status-and-outlook.html

--

-- 
Bela Ban, JGroups lead (http://www.jgroups.org)

------------------------------------------------------------------------------
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments & Everything In Between.
Get a Quote or Start a Free Trial Today. 
http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk
sirlordt | 14 Jan 21:20 2014
Picon

[jgroups-users] Hide warning messages from SO incorrect setting

Hi how hide this messages? any config?

WARNING: JGRP000015: the send buffer of socket DatagramSocket was set to
640KB, but the OS only allocated 131,07KB. This might lead to performance
problems. Please set your max send buffer in the OS correctly (e.g.
net.core.wmem_max on Linux)
ene 14, 2014 3:28:50 PM org.jgroups.logging.JDKLogImpl warn
WARNING: JGRP000015: the receive buffer of socket DatagramSocket was set to
5MB, but the OS only allocated 131,07KB. This might lead to performance
problems. Please set your max receive buffer in the OS correctly (e.g.
net.core.rmem_max on Linux)
ene 14, 2014 3:28:50 PM org.jgroups.logging.JDKLogImpl warn
WARNING: JGRP000015: the send buffer of socket MulticastSocket was set to
640KB, but the OS only allocated 131,07KB. This might lead to performance
problems. Please set your max send buffer in the OS correctly (e.g.
net.core.wmem_max on Linux)
ene 14, 2014 3:28:50 PM org.jgroups.logging.JDKLogImpl warn
WARNING: JGRP000015: the receive buffer of socket MulticastSocket was set to
5MB, but the OS only allocated 131,07KB. This might lead to performance
problems. Please set your max receive buffer in the OS correctly (e.g.
net.core.rmem_max on Linux)

--
View this message in context: http://jgroups.1086181.n5.nabble.com/Hide-warning-messages-from-SO-incorrect-setting-tp10038.html
Sent from the JGroups - General mailing list archive at Nabble.com.

------------------------------------------------------------------------------
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments & Everything In Between.
(Continue reading)

Kresimir Simatovic | 13 Jan 17:09 2014
Picon

[jgroups-users] Garbage collection and suspect event

Hello!

I've observed following:

1) Node in cluster has long garbage collection pause (45 sec)
2) It was suspected by neighbor node 
3) Node was excluded by coordinator some 10 sec after GC started 
4) When GC finished node sent heartbeat:  ignoring the SUSPECT message and
sending back a HEARTBEAT_ACK

As a result, node has obsolete view which was installed before it was
suspected. It could send messages to other nodes (and receive responses
back) but it was invisible to other members. Coordinator on it and other
nodes was same. After disconnect/connect everything worked fined.

Is there way node could detect such situation and recover from it ?

Protocol stack is distro tcp.xml.

Thanks!

--
View this message in context: http://jgroups.1086181.n5.nabble.com/Garbage-collection-and-suspect-event-tp10037.html
Sent from the JGroups - General mailing list archive at Nabble.com.

------------------------------------------------------------------------------
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments & Everything In Between.
Get a Quote or Start a Free Trial Today. 
(Continue reading)

Rajni Sharma | 10 Jan 08:59 2014
Picon

[jgroups-users] Return IPv6 in Java

Hi

I'm trying to print ipv6 address of my system using following java program
but it returns only ipv4 address.
When I do ipconfig it shows both ipv4 and ipv6 address.

    public static void main(String args[]) throws UnknownHostException {  

        System.out.println(System.getProperty("java.home"));  
        System.setProperty("java.net.preferIPv6Addresses", "true");  

System.out.println(System.getProperty("java.net.preferIPv6Addresses"));  
        InetAddress[] addr = InetAddress.getAllByName(hostname);  
        for (InetAddress address : addr) {  
            if (address instanceof Inet6Address) {  
                System.out.println("ipv6 address is " +
address.getHostAddress());  
            }  
            else  
                System.out.println("ipv4 address is " +
address.getHostAddress());  
        }  
    }  

Also on debugging i found though impl refers to Inet6Address but
findNative() method of ClassLoader.class calls init() method of
Inet4Address.class in place of calling init() method of Inet6Address.class
Anyone have any idea of why its doing this?

--
(Continue reading)

Bela Ban | 4 Jan 14:12 2014
Picon

[jgroups-users] JGroups 3.4.2.Final released

FYI, the release notes are below.
Cheers,

         Release Notes - JGroups - Version 3.4.2

<h2>        Bug
</h2>
<ul>
<li>[<a href='https://issues.jboss.org/browse/JGRP-1715'>JGRP-1715</a>] 
-         NullPointerException in MessageDispatcher.handleUpEvent
</li>
<li>[<a href='https://issues.jboss.org/browse/JGRP-1744'>JGRP-1744</a>] 
-         Race condition allows NullPointerException in Executing protocol
</li>
<li>[<a href='https://issues.jboss.org/browse/JGRP-1752'>JGRP-1752</a>] 
-         Concurrent message headers modification causes that message is 
never sent
</li>
<li>[<a href='https://issues.jboss.org/browse/JGRP-1753'>JGRP-1753</a>] 
-         BlockingInputStream: reading beyond the array&#39;s capacity
</li>
<li>[<a href='https://issues.jboss.org/browse/JGRP-1755'>JGRP-1755</a>] 
-         TP: dropping message to wrong destination in a shared transport
</li>
<li>[<a href='https://issues.jboss.org/browse/JGRP-1756'>JGRP-1756</a>] 
-         ConcurrentModificationException in Executing.handleView
</li>
<li>[<a href='https://issues.jboss.org/browse/JGRP-1757'>JGRP-1757</a>] 
-         Logging: exceptions are not logged correctly
</li>
(Continue reading)

Bela Ban | 23 Dec 17:32 2013
Picon

[jgroups-users] Change in overriding of bind_addr etc via system props

FYI,

I've changed the way system properties override protocol properties [1]. 
The use case was a customer trying to create different clusters inside 
of the same JBoss EAP instance.

They assumed that using <UDP mcast_addr="${app1.mcast_addr:235.5.5.5}"/> 
for one app and {UDP mcast_addr="${app2.mcast_addr:236.5.5.5}"/> for the 
other and setting the relevant system properties app1.mcast_addr and 
app2.mcast_addr to *different* values would create 2 separate clusters...

Wrong ! JBoss EAP sets jgroups.udp.mcast_addr, which overwrites whatever 
was set in the XML config or via custom sysprops, so both clusters 
joined the same mcast_addr. Turns out they also used the same mcast_port 
(not set) and cluster name ("ISPN" set by Infinispan), so their nodes 
all joined the same cluster !

[1] https://issues.jboss.org/browse/JGRP-1749

--

-- 
Bela Ban, JGroups lead (http://www.jgroups.org)

------------------------------------------------------------------------------
Rapidly troubleshoot problems before they affect your business. Most IT 
organizations don't have a clear picture of how application performance 
affects their revenue. With AppDynamics, you get 100% visibility into your 
Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics Pro!
http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk
Tomás Rafael Moreno Poggio | 15 Dec 20:31 2013
Picon

[jgroups-users] Problems to use the demos .java

Hello I have problems running the demo below https://github.com/belaban/JGroups/blob/master/src/org/jgroups/demos/ViewDemo.java I get an error that says the following: 

Exception in thread "main" java.lang.ExceptionInInitializerError 
at org.jgroups.conf.XmlConfigurator.parse (XmlConfigurator.java: 115) 
at org.jgroups.conf.XmlConfigurator.getInstance (XmlConfigurator.java: 64) 
at org.jgroups.conf.XmlConfigurator.getInstance (XmlConfigurator.java: 47) 
at org.jgroups.conf.ConfiguratorFactory.getXmlConfigurator (ConfiguratorFactory.java: 216) 
at org.jgroups.conf.ConfiguratorFactory.getStackConfigurator (ConfiguratorFactory.java: 93) 
. at org.jgroups.JChannel <init> (JChannel.java: 138) 
at Main.ViewDemo.start (ViewDemo.java: 30) 
at Main.ViewDemo.main (ViewDemo.java: 62) 
Caused by: java.lang.NullPointerException 
at java.util.ResourceBundle.getBundle (ResourceBundle.java: 1026) 
. org.jgroups.util.Util at <clinit> (Util.java: 98) 

I viewed the source code and the problem line originates 

            String tmp = Util.getProperty (new String [] {Global.XML_VALIDATION}, null, null, false, null); 

Line 115 XmlConfigurator.java file. 

I'm using eclipse Kepler Service Release 1 Build id: 20130919-0819.

JGroups-3.4.1.Final.jar added as library in eclipse 

Java(TM) SE Runtime Environment (build 1.7.0_45-b18)
Java HotSpot(TM) 64-Bit Server VM (build 24.45-b08, mixed mode)

Linux Mint 14

The copy of the file udp.xml https://github.com/belaban/JGroups/tree/master/conf/udp.xml and place it in the root of the bin folder generated by eclipse. 

I tried with other examples and I get the same error. 

Someone could help me please.
------------------------------------------------------------------------------
Rapidly troubleshoot problems before they affect your business. Most IT 
organizations don't have a clear picture of how application performance 
affects their revenue. With AppDynamics, you get 100% visibility into your 
Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics Pro!
http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk
_______________________________________________
javagroups-users mailing list
javagroups-users <at> lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/javagroups-users
sarath pillai | 12 Dec 22:11 2013
Picon

[jgroups-users] (no subject)

Hi,

Am trying to get s3ping discovery method working in jboss as 7. I have deployed a sample web app which is clustered. As of now i have a single node. But in the near future i ill be adding more nodes in the cluster..

I have modified the file standalone-ha.xml with the required s3 credentials and bucket details.

 <stack name="s3ping">
                <transport type="TCP" socket-binding="jgroups-tcp" diagnostics-socket-binding="jgroups-diagnostics"/>
                <protocol type="S3_PING">
                    <property name="access_key">
                        XXXXXXXXXXXXXXX
                    </property>
                    <property name="secret_access_key">
                        XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
                    </property>
                    <property name="prefix">
                        MyjbossBucket
                    </property>
                    <property name="timeout">
                        6000
                    </property>
                </protocol>
                <protocol type="MERGE2"/>
                <protocol type="FD_SOCK" socket-binding="jgroups-tcp-fd"/>
                <protocol type="FD"/>
                <protocol type="VERIFY_SUSPECT"/>
                <protocol type="BARRIER"/>
                <protocol type="pbcast.NAKACK"/>
                <protocol type="UNICAST2"/>
                <protocol type="pbcast.STABLE"/>
                <protocol type="pbcast.GMS"/>
                <protocol type="UFC"/>
                <protocol type="MFC"/>
                <protocol type="FRAG2"/>
            </stack>


And i start this jboss instance by using the below command.

./standalone.sh -b 10.1.137.250 -bmanagement=10.1.137.250 -c standalone-ha.xml -Djboss.default.jgroups.stack=s3ping -Djgroups.bind.address=10.1.137.250 -Djboss.node.name=node1

Its getting started successfully, but am unable to see any node information file created inside the s3 bucket. Kindly please guide me through the correct method to get this done...Or am i doing some mistake in the configs...

Regards
------------------------------------------------------------------------------
Rapidly troubleshoot problems before they affect your business. Most IT 
organizations don't have a clear picture of how application performance 
affects their revenue. With AppDynamics, you get 100% visibility into your 
Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics Pro!
http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk
_______________________________________________
javagroups-users mailing list
javagroups-users <at> lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/javagroups-users

Gmane