Randy Stewart | 1 Jun 2012 01:34

Re: A query on SCTP client’s (running under RHEL4) performance in terms of aggregate throughput rate

Chinmaya:

If you are using a connected socket aka one-2-one model, then you can also use
send() and that should work.

The sctp_sendmsg() is also used by the one-2-many model socket, and there you need a
socket address. Some implementations allow you to place a NULL in the socket address
of a connected socket for sctp_sendmsg(), not sure if linux does. But this is all
implementation specific stuff.

R
On May 31, 2012, at 8:29 AM, Chinmaya Dwibedy wrote:

> 
> 
> Hi.
>  
> We have SCTP client program (C++) running under RHEL4 (using kernel version: 2.6.9-55ELsmp and
lksctp-tools-1.0.8-1). We are using one-to-one socket (also called TCP-style sockets) using connect
() system call to setup an association with a server. Then it calls sctp_sendmsg () library function to
send a message from a socket while using the following advanced features of SCTP i.e., SCTP_UNORDERED for
un-ordered delivery of the message. The sctp_sendmsg () API takes a pointer to "sockaddr" as a parameter
indicating the address of the destination. Since our client application knows well that, it will be
sending the multiple messages to the same configured destination address, it would become more
efficient if we will connect the socket explicitly.  In summary, we can say that a SCTP client can call
connect only if that process uses the SCTP socket to communicate with exactly one peer. I mean, calling
connect () on the SCTP socket once simply records the specified IP address and port number as being the
desired communications partner and send/write call. Anything written to a connected SCTP socket is
automatically sent to the protocol address (e.g., IP address and port) specified by connect.
> After connect() system call,  there will be absolutely no need of again passing socket address structure
(Continue reading)

Randy Stewart | 1 Jun 2012 01:35

Re: When SCTP server is restarted, DATA chunks start to get dropped or lost after sometime at SCTP client’s end.

Chinmaya:

I would use that has a starting point. Don't put a time-to-expire on it and see if
you no longer see this behavior. If you do, then it sounds like a bug in the linux kernel
implementation

R
On May 31, 2012, at 3:20 PM, Chinmaya Dwibedy wrote:

> Hi Chris,
> Thanks for your response.
> Our application consumes the message fast enough. Note that, we are using single Intel Xeon Quad-Core
L5518 processor, 8 logical CPUs (speed of the processor is 2.13GHz) and 12GB RAM. Moreover there are no
other apps running on besides our application. This problem appears after SCTP server gets restarted.
There is absolutely no packets drop if we won’t restart the server.
>  
> Do you recommend us to try with 0 ms timetolive which indicates no timeout should occur on this message and
is treated as reliable?
>  
> Regards,
> Chinmaya
> 
> From: Chris Benson <cbenson <at> adax.com>
> To: Chinmaya Dwibedy <ckdwibedy <at> yahoo.com> 
> Cc: "tsvwg <at> ietf.org" <tsvwg <at> ietf.org> 
> Sent: Friday, June 1, 2012 12:13 AM
> Subject: Re: [tsvwg] When SCTP server is restarted, DATA chunks start to get dropped or lost after
sometime at SCTP client’s end.
> 
> Chinmaya,
(Continue reading)

internet-drafts | 1 Jun 2012 08:34
Picon
Favicon

I-D Action: draft-ietf-tsvwg-port-use-00.txt


A New Internet-Draft is available from the on-line Internet-Drafts directories. This draft is a work item
of the Transport Area Working Group Working Group of the IETF.

	Title           : Recommendations for Transport Port Uses
	Author(s)       : Joe Touch
	Filename        : draft-ietf-tsvwg-port-use-00.txt
	Pages           : 11
	Date            : 2012-05-30

   This document provides recommendations to application and service
   designers on how to use the transport protocol port number space to
   help in its preservation. **NOTE THAT THIS CURRENT VERSION IS
   LARGELY AN OUTLINE OF ISSUES**.

A URL for this Internet-Draft is:
http://www.ietf.org/internet-drafts/draft-ietf-tsvwg-port-use-00.txt

Internet-Drafts are also available by anonymous FTP at:
ftp://ftp.ietf.org/internet-drafts/

This Internet-Draft can be retrieved at:
ftp://ftp.ietf.org/internet-drafts/draft-ietf-tsvwg-port-use-00.txt

The IETF datatracker page for this Internet-Draft is:
https://datatracker.ietf.org/doc/draft-ietf-tsvwg-port-use/

Chinmaya Dwibedy | 1 Jun 2012 16:26
Picon
Favicon

Source IP address of the HEARTBEAT_ACK in multi-homing setup

Hi,

We have a SCTP multi-homing setup with host and peer having two network interfaces with IP addresses belong to two different subnets.

 Host       IP eth5 (192.168.16.125 – primary IP),

  IP eth6 (192.168.17.125 – secondary IP): Multi-Homed endpoint

 Peer-end IP eth5 (192.168.16.129 – primary IP),

 The network interfaces of host and peer are connected with crossover cable.

We have received one issue saying that, Host (SCTP endpoint) should use the source IP (i.e., 192.168.17.125 – secondary IP) on which it received the Heart_beat_req on while sending back the response (Heart_beat_ack)”.

Here goes my analysis. Can anyone please validate and feel free to correct me if I am wrong? Thanks in advance for your response.

As per section 3.3.6 of rfc-2960/rfc-4960, a HEARTBEAT ACK is always sent to the source IP address of the IP datagram containing the HEARTBEAT chunk to which this ACK is responding. It says nothing about whether the source of the response has to be the same as the destination of HEART_BEAT_REQ. The RFC/standard does not mandate to keep the source IP address of the HEARTBEAT_ACK has to be the same as the destination IP address of HEART_BEAT_REQ. I think in this case, it is quite expected behavior in compliance with rfc-2960/rfc-4960. It would be wrong to say that, it is an issue in RHEL4 (kernel version: 2.6.9-55.ELsmp).
Even if there are two paths, SCTP does not have any way to determine whether two paths share links and routers when traversing the network. The paths used by the IP   packets across the network might be different depending on the destination IP address.  If a message fails to reach its destination, SCTP may retransmit the message using a different destination IP address. If somewhere along the path a link or/and router fails then SCTP can detect the failure of the path via a heartbeat message. These actions done by SCTP are independent of the actions performed by the lower layers for failure detection and restoration. Under the assumption that every IP address will have a different, separate path towards the remote endpoint, (this is the responsibility of the routing protocols or of manual static configuration), if the transport to one of the IP addresses (= 1 particular path) fails then the traffic can migrate to the other remaining IP address (= other paths) within the SCTP association.

 

Regards,

ChinmayaD

Michael Tuexen | 1 Jun 2012 16:42
Picon

Re: Source IP address of the HEARTBEAT_ACK in multi-homing setup

On Jun 1, 2012, at 4:26 PM, Chinmaya Dwibedy wrote:

> Hi,
> We have a SCTP multi-homing setup with host and peer having two network interfaces with IP addresses
belong to two different subnets.
>  Host       IP eth5 (192.168.16.125 – primary IP),
>   IP eth6 (192.168.17.125 – secondary IP): Multi-Homed endpoint
>  Peer-end IP eth5 (192.168.16.129 – primary IP),
>  The network interfaces of host and peer are connected with crossover cable.
> We have received one issue saying that, Host (SCTP endpoint) should use the source IP (i.e.,
192.168.17.125 – secondary IP) on which it received the Heart_beat_req on while sending back the
response (Heart_beat_ack)”.
> Here goes my analysis. Can anyone please validate and feel free to correct me if I am wrong? Thanks in
advance for your response.
> As per section 3.3.6 of rfc-2960/rfc-4960, a HEARTBEAT ACK is always sent to the source IP address of the IP
datagram containing the HEARTBEAT chunk to which this ACK is responding. It says nothing about whether
the source of the response has to be the same as the destination of HEART_BEAT_REQ. The RFC/standard does
not mandate to keep the source IP address of the HEARTBEAT_ACK has to be the same as the destination IP
address of HEART_BEAT_REQ. I think in this case, it is quite expected 
> behavior in compliance with rfc-2960/rfc-4960. It would be wrong to say that, it is an issue in RHEL4
(kernel version: 2.6.9-55.ELsmp).
RFC 4960 does require you to use any specific source address. The only constraint is,
that you use one of the addresses known by the peer (which means listed in the INIT or
INIT-ACK or the source address of the INIT or INIT-ACK).

Best regards
Michael
> Even if there are two paths, SCTP does not have any way to determine whether two paths share links and
routers when traversing the network. The paths used by the IP   packets across the network might be
different depending on the destination IP address.  If a message fails to reach its destination, SCTP may
retransmit the message using a different destination IP address. If somewhere along the path a link
or/and router fails then SCTP can detect the failure of the path via a heartbeat message. These actions
done by SCTP are independent of the actions performed by the lower layers for failure detection and
restoration. Under the assumption that every IP address will have a different, separate path towards the
remote endpoint, (this is the responsibility of the routing protocols or of manual static
configuration), if the transport to one of the IP addresses (= 1 particular path) fails then the traffic
can migrate to the other remaining IP address (= other paths) within the SCTP association.
>  
> Regards,
> ChinmayaD

Joe Touch | 5 Jun 2012 02:08
Picon
Favicon

Re: I-D Action: draft-ietf-tsvwg-port-use-00.txt

Hi, all,

I've resubmitted this as a WG doc. The major changes were to include 
full references for previous placeholders and a few minor notes.

It would be useful to get feedback on whether there are any areas that 
should be addressed that are not listed before diving into the two key 
issues I hope we can discuss as part of this doc:

	- whether to deprecate the distinction between system and
	user ports (or to do nothing at this time)

	- whether to make stronger recommendations on the use of
	security on port services (i.e., whether security "should"
	be supported for new services, or to make no specific
	recommendation at this time)

Any gap material notes would be most useful now, though.

Joe

On 5/31/2012 11:34 PM, internet-drafts <at> ietf.org wrote:
>
> A New Internet-Draft is available from the on-line Internet-Drafts directories. This draft is a work
item of the Transport Area Working Group Working Group of the IETF.
>
> 	Title           : Recommendations for Transport Port Uses
> 	Author(s)       : Joe Touch
> 	Filename        : draft-ietf-tsvwg-port-use-00.txt
> 	Pages           : 11
> 	Date            : 2012-05-30
>
>     This document provides recommendations to application and service
>     designers on how to use the transport protocol port number space to
>     help in its preservation. **NOTE THAT THIS CURRENT VERSION IS
>     LARGELY AN OUTLINE OF ISSUES**.
>
>
> A URL for this Internet-Draft is:
> http://www.ietf.org/internet-drafts/draft-ietf-tsvwg-port-use-00.txt
>
> Internet-Drafts are also available by anonymous FTP at:
> ftp://ftp.ietf.org/internet-drafts/
>
> This Internet-Draft can be retrieved at:
> ftp://ftp.ietf.org/internet-drafts/draft-ietf-tsvwg-port-use-00.txt
>
> The IETF datatracker page for this Internet-Draft is:
> https://datatracker.ietf.org/doc/draft-ietf-tsvwg-port-use/
>

Randy Stewart | 5 Jun 2012 16:10

Re: Source IP address of the HEARTBEAT_ACK in multi-homing setup


On Jun 1, 2012, at 10:42 AM, Michael Tuexen wrote:

> On Jun 1, 2012, at 4:26 PM, Chinmaya Dwibedy wrote:
> 
>> Hi,
>> We have a SCTP multi-homing setup with host and peer having two network interfaces with IP addresses
belong to two different subnets.
>> Host       IP eth5 (192.168.16.125 – primary IP),
>>  IP eth6 (192.168.17.125 – secondary IP): Multi-Homed endpoint
>> Peer-end IP eth5 (192.168.16.129 – primary IP),
>> The network interfaces of host and peer are connected with crossover cable.
>> We have received one issue saying that, Host (SCTP endpoint) should use the source IP (i.e.,
192.168.17.125 – secondary IP) on which it received the Heart_beat_req on while sending back the
response (Heart_beat_ack)”.
>> Here goes my analysis. Can anyone please validate and feel free to correct me if I am wrong? Thanks in
advance for your response.
>> As per section 3.3.6 of rfc-2960/rfc-4960, a HEARTBEAT ACK is always sent to the source IP address of the
IP datagram containing the HEARTBEAT chunk to which this ACK is responding. It says nothing about whether
the source of the response has to be the same as the destination of HEART_BEAT_REQ. The RFC/standard does
not mandate to keep the source IP address of the HEARTBEAT_ACK has to be the same as the destination IP
address of HEART_BEAT_REQ. I think in this case, it is quite expected 
>> behavior in compliance with rfc-2960/rfc-4960. It would be wrong to say that, it is an issue in RHEL4
(kernel version: 2.6.9-55.ELsmp).
> RFC 4960 does require you to use any specific source address. The only constraint is,
> that you use one of the addresses known by the peer (which means listed in the INIT or
> INIT-ACK or the source address of the INIT or INIT-ACK).
The HB section also says that you *SHOULD* use the source address the packet was received on.

There is a known bug in the linux implementation that has been since fixed where it would
incorrectly reply from a different address…

SHOULD in general means you really really really should do this unless you have some other
purposeful reason not to. A bug is not a purposeful reason IMO. So yes, you are allowed to
respond with any valid source. But when responding with a HB you SHOULD use the src address. Which
basically means you *should* unless you have a specific reason (for example you are a user
land stack and you cannot control the src address). Not responding due to a bug, is not being
conformant to the RFC IMO…  ;-)

In fact in general, if I remember correctly you SHOULD respond whenever possible with
the source address that sent you the packet. There are times when thats not possible
(A sack for example acking a packet that came in from two different source addresses, or
 a situation where you suspect the reverse path is broke), but in general that SHOULD
also applies here as well. 

R

> 
> Best regards
> Michael
>> Even if there are two paths, SCTP does not have any way to determine whether two paths share links and
routers when traversing the network. The paths used by the IP   packets across the network might be
different depending on the destination IP address.  If a message fails to reach its destination, SCTP may
retransmit the message using a different destination IP address. If somewhere along the path a link
or/and router fails then SCTP can detect the failure of the path via a heartbeat message. These actions
done by SCTP are independent of the actions performed by the lower layers for failure detection and
restoration. Under the assumption that every IP address will have a different, separate path towards the
remote endpoint, (this is the responsibility of the routing protocols or of manual static
configuration), if the transport to one of the IP addresses (= 1 particular path) fails then the traffic
can migrate to the other remaining IP address (= other paths) within the SCTP association.
>> 
>> Regards,
>> ChinmayaD
> 
> 

-----
Randall Stewart
randall <at> lakerest.net

Jose Saldana | 6 Jun 2012 09:20
Picon
Favicon

Re: draft-saldana-tsvwg-tcmtf in the World IPv6 Launch day: 70% bandwidth saving

Hi all,

 

Today we are in the World IPv6 Launch day: http://www.worldipv6launch.org/

 

So I would like to share with the group some results I have deployed with TCMTF, using IPv6. As everyone knows, the header of IPv6 is twice as big as the one of IPv4. This can be seen as a problem for real-time services using small packets, since the overhead is increased. Thus, header compressing mechanisms are expected to become more and more interesting. One of the three layers of TCMTF is header compression, and the savings achieved with IPv6 are bigger than the ones obtained with IPv4. I attach a graph of the bandwidth savings obtained when multiplexing and compressing client-to-server flows of World of Warcraft, the most popular MMORPG in the World. Up to 70 percent of the bandwidth can be saved, while adding a small delay.

 

Best regards,

 

Jose

 

De: tsvwg-bounces <at> ietf.org [mailto:tsvwg-bounces <at> ietf.org] En nombre de Jose Saldana
Enviado el: martes, 29 de mayo de 2012 12:19
Para: tsvwg <at> ietf.org
Asunto: [tsvwg] draft-saldana-tsvwg-tcmtf for World of Warcraft: 60% bandwidth saving

 

Hi all,

 

Our research group has just got accepted a paper for a conference next July. It presents the tests of TCMTF (the “tunnels draft”) for the traffic of World of Warcraft (an online game with 10 million players). This game uses TCP/IP, and client-to-server headers can be compressed from 40 to 8.72 bytes average, so this traffic can get a 60% reduction. The cause of these good results is that the game also generates a high number of TCP ACK packets without payload (56%). For them, the saving is very high: the compression is applied to headers, and these packets are “only header”. Two other MMORPG games are also analyzed, and the savings are 40% and 45%. The saving is lower for server-to-client traffic (8.5%, 22%, 20%).

 

The number of packets per second can also be reduced: if the traffic of 100 players is multiplexed,  pps can be reduced from 900 to 12.

 

We have also used a MOS estimator (based on delay and jitter) in order to grant that subjective quality is not harmed.

 

I attach two (small) PDF graphs showing the savings.

 

Since it has not already been published, I have put the paper in my web page, just in case somebody wants to have a look:

 

http://diec.unizar.es/~jsaldana/personal/sergei_SPECTS_2012_in_proc.pdf

 

 

Best regards,

 

Jose Saldana

Eliot Lear | 6 Jun 2012 11:27
Picon
Favicon

Re: I-D Action: draft-ietf-tsvwg-port-use-00.txt

Joe,

I support the development of this draft, and would be willing to provide you with both content and editorial assistance.  I would ask that (at least) one additional important question be considered by this group and the security area:
  • Is a request for a static port assignment in order to accommodate firewalls still an appropriate reason?

I ask this question because there have been applications for as many as six ports for related or very similar services that could all be dynamically discovered from one.  Some come back and say that they are concerned that firewalls will prevent the service from functioning properly.  What is the IETF's point of view on this?

Eliot

On 6/5/12 2:08 AM, Joe Touch wrote:
Hi, all,

I've resubmitted this as a WG doc. The major changes were to include full references for previous placeholders and a few minor notes.

It would be useful to get feedback on whether there are any areas that should be addressed that are not listed before diving into the two key issues I hope we can discuss as part of this doc:

    - whether to deprecate the distinction between system and
    user ports (or to do nothing at this time)

    - whether to make stronger recommendations on the use of
    security on port services (i.e., whether security "should"
    be supported for new services, or to make no specific
    recommendation at this time)

Any gap material notes would be most useful now, though.

Joe

On 5/31/2012 11:34 PM, internet-drafts <at> ietf.org wrote:

A New Internet-Draft is available from the on-line Internet-Drafts directories. This draft is a work item of the Transport Area Working Group Working Group of the IETF.

    Title           : Recommendations for Transport Port Uses
    Author(s)       : Joe Touch
    Filename        : draft-ietf-tsvwg-port-use-00.txt
    Pages           : 11
    Date            : 2012-05-30

    This document provides recommendations to application and service
    designers on how to use the transport protocol port number space to
    help in its preservation. **NOTE THAT THIS CURRENT VERSION IS
    LARGELY AN OUTLINE OF ISSUES**.


A URL for this Internet-Draft is:
http://www.ietf.org/internet-drafts/draft-ietf-tsvwg-port-use-00.txt

Internet-Drafts are also available by anonymous FTP at:
ftp://ftp.ietf.org/internet-drafts/

This Internet-Draft can be retrieved at:
ftp://ftp.ietf.org/internet-drafts/draft-ietf-tsvwg-port-use-00.txt

The IETF datatracker page for this Internet-Draft is:
https://datatracker.ietf.org/doc/draft-ietf-tsvwg-port-use/



Chinmaya Dwibedy | 7 Jun 2012 20:24
Picon
Favicon

Re: When SCTP server is restarted, DATA chunks start to get dropped or lost after sometime at SCTP client’s end.

Hi Mr. Randy Stewart,

Thanks a lot for your valuable suggestion. We turned off Partial Reliability and set the 0 ms timetolive in sctp_sendmsg () API. Thereafter there was no loss in DATA chunks, upon restarting the peer (i.e., SCTP server).

But when the peer (i.e., SCTP server) gets restarted,

1)     All the SCTP clients receive INIT messages (with same address to the association) from SCTP server. The INIT ACK is being sent with the new InitTag and Verification Tag set to the InitTag received in INIT for each association.  The four-way handshake gets completed.

2)    Though there is no loss but we encountered another issue i.e., for one or two associations, it performs chunk bundling by coalescing of multiple DATA chunks into a single SCTP packet and unnecessary delays are introduced. Unexpectedly it takes more than 20 seconds to put the data on the wire, which was never the case before restart.

Notes:

a)    Our SCTP client application has requested kernel to not to bundle by making SCTP_NODELAY on. Of course the kernel does   not in itself stop all bundling from occurring (i.e., in case of congestion or retransmission.

b)    Peer has sent a_rwnd of 131072 bytes sent in the INIT when it restarts. It indicates that the peer has sufficient buffer space.

c)    Our client program (C++) running under RHEL4 (using kernel version: 2.6.9-55ELsmp and lksctp-tools-1.0.8-1).

What I understand, the excess delay most likely due to a stall caused by congestion or peer flow control. Does it reset ssthresh to the correct value (peer's a_rwnd) if the association has been restarted or it consists of SCTP_DEFAULT_MAXWINDOW (i.e., 65535)?

When restarting association,

a)    Does the Linux kernel reset the transport congestion variables?

b)    Does it reset the cwnd (Congestion window values for destination address)?

What I think, if it will be consisting the old values (i.e.,  cwnd, ssthresh) after occurrence of restart then, it must not be transmitting new data to a peer since it will have wrong impression of old cwnd bytes of data outstanding to peer. Can anyone please validate my understanding and correct me if I am wrong.  Please point me to right direction to fix this issue. Thanking you in advance for your response.

Regards,

Chinmaya



From: Randy Stewart <randall <at> lakerest.net>
To: Chinmaya Dwibedy <ckdwibedy <at> yahoo.com>
Cc: Chris Benson <cbenson <at> adax.com>; "tsvwg <at> ietf.org" <tsvwg <at> ietf.org>
Sent: Friday, June 1, 2012 5:05 AM
Subject: Re: [tsvwg] When SCTP server is restarted, DATA chunks start to get dropped or lost after sometime at SCTP client’s end.

Chinmaya:

I would use that has a starting point. Don't put a time-to-expire on it and see if
you no longer see this behavior. If you do, then it sounds like a bug in the linux kernel
implementation

R
On May 31, 2012, at 3:20 PM, Chinmaya Dwibedy wrote:

> Hi Chris,
> Thanks for your response.
> Our application consumes the message fast enough. Note that, we are using single Intel Xeon Quad-Core L5518 processor, 8 logical CPUs (speed of the processor is 2.13GHz) and 12GB RAM. Moreover there are no other apps running on besides our application. This problem appears after SCTP server gets restarted. There is absolutely no packets drop if we won’t restart the server.

> Do you recommend us to try with 0 ms timetolive which indicates no timeout should occur on this message and is treated as reliable?

> Regards,
> Chinmaya
>
> From: Chris Benson <cbenson <at> adax.com>
> To: Chinmaya Dwibedy <ckdwibedy <at> yahoo.com>
> Cc: "tsvwg <at> ietf.org" <tsvwg <at> ietf.org>
> Sent: Friday, June 1, 2012 12:13 AM
> Subject: Re: [tsvwg] When SCTP server is restarted, DATA chunks start to get dropped or lost after sometime at SCTP client’s end.
>
> Chinmaya,
>
> Please forgive me if this is too basic and obvious a question.
>
> Are you sure that your "application" (SCTP-user) at the client
> side is consuming the incoming data AT LEAST AS FAST as it arrives?
>
> If not, then you will see something like the behaviour you
> describe. With thanks, from Chris
>
> On Thu, 31 May 2012, Chinmaya Dwibedy wrote:
>
> >>  Date: Thu, 31 May 2012 11:35:17 -0700 (PDT)
> >>  From: Chinmaya Dwibedy <ckdwibedy <at> yahoo.com>
> >>  To: "tsvwg <at> ietf.org" <tsvwg <at> ietf.org>
> >>  Subject: [tsvwg] When SCTP server is restarted,
> >>      DATA chunks start to get dropped or lost after sometime at SCTP client
> >>      end.
> >> 
> >>  Hi,
> We have SCTP client program (C++) running under RHEL4 (using kernel version: 2.6.9-55ELsmp and lksctp-tools-1.0.8-1). We are using one-to-one style socket using connect () system call to setup an association with a SCTP server (i.e., SUT of vendor). Then it calls sctp_sendmsg () library function to send a message from a socket while using the following advanced features of SCTP (i.e., SCTP_UNORDERED for un-ordered delivery of the message) and timetolive (TTL) is set to 1000 milliseconds for a given message. We have typical defaults values for these RTO.Initial = 3000, RTO.Min = 1000, RTO.Max = 60000 milliseconds.The PR-SCTP extension is enabled in the kernel (by default).
> We are using lot (4000) SCTP associations on a single system, all are in ESTABLISHED state and data communication goes on fine without any drop/loss of message. But when the peer (i.e., SCTP server) gets restarted,
> a)    All the SCTP clients receive INIT messages (with same address to the association) from SCTP server. The INIT ACK is being sent with the new InitTag and Verification Tag set to the InitTag received in INIT for each association.
> b)    The four-way handshake gets completed. I believe the existing association, including its current state, and the corresponding TCB does not get changed.
> c)    Afterward, all SCTP clients use DATA chunks to exchange information with SCTP server.
> The trouble is that, after sometime (30-45 minutes), DATA chunks start to get dropped or lost and that goes on increasing as the time advances.The data was not transmitted to the peer at least once. Note that, the time after which this problem is being encountered is also not consistent. I mean to say, sometime we noticed after 30 minutes, 45 minutes and an hour. But when we lessen the number of SCTP associations to 2000, such problem does not appear.   
>
>
> a)    Is there any chance that, that data messages time-out within the RTO although both the values are same? As a result, the data will be skipped and no longer transmitted.
> b)    Does it indicate that the message could never be transmitted to the peer (e.g., flow control prevented the message from being sent before its lifetime expired), so the peer never received it? Please note that, we are using 1 Gig Intel NIC.
> The preliminary  analysis says that, timetolive (in sctp_sendmsg() API) can be set on a message independent of Partial Reliability.The timer (i.e., sinfo_timetolive) only runs while the message is in the kernel send buffer and has not yet been put on the wire (for the first time). As soon as the message is put on the wire the first time, the timer is dropped. It is used to indicate that a time based lifetime is being applied to the data.  It is then a number of milliseconds for which the data is attempted to be transmitted. If that many milliseconds elapsed and the data has not been transmitted, As a result, they move the abandoned list and are never retransmitted. Please feel free to correct me if I am wrong.
> Should we try with 0 ms timetolive which indicates no timeout should occur on this message and is treated as reliable? Please suggest and also let me know if you need any additional information.
> Thanking you in advance for your response.
> Regards,
> Chinmaya
>

-----
Randall Stewart
randall <at> lakerest.net







Gmane