[Spread-users] Spread message delays

Yair Amir yairamir at cs.jhu.edu
Wed Mar 28 12:21:27 EDT 2012


Dear Erez,

A new version of Spread that, among other things, solves this bug,
is just days away.

If you want a solution with the current version, any of John's suggestions
should work for you.

With the current version and with the bug, even reliable messages will
experience delay because the delay is not caused at the delivery point, but
rather at the sending point. These messages are delayed before being sent.

Cheers,

	:) Yair.  http://www.cs.jhu.edu/~yairamir


On 3/28/12 10:34 AM, Erez Theodorou wrote:
> Thanks! I am using a 65b machine! I will give your fix a try!
> 
> Also, I was able to recreate the delays on demand, when I introduce packet loss to even a single machine on the network.
> I then changed the message types to RELIABLE_MESS (instead of the SAFE_MESS that I had before). According to the spread docs (section 1.3.2), only the SAFE messages need to be received on all spread daemons before they can be delivered. But now I am still seeing the same delay to all the receives even when messages are sent from one node to another on the same machine.
> 
> Any ideas?
> 
> Thanks!
> --Erez
> 
> -----Original Message-----
> From: John Schultz [mailto:jschultz at spreadconcepts.com] 
> Sent: Wednesday, March 28, 2012 3:11 PM
> To: Erez Theodorou
> Cc: spread-users at lists.spread.org
> Subject: Re: [Spread-users] Spread message delays
> 
> If you can't fix the loss in your network, then you can lower the delay significantly by lowering the Hurry_timeout in membership.c where it is set around lines 169 and 180.
> 
> On a LAN, you can typically safely lower it down to the order of 100's of ms.  On a WAN, depending on how just how wide your WAN is you probably don't want to go below half a second or a second.
> 
> Cheers!
> 
> -----
> John Lane Schultz
> Spread Concepts LLC
> Phn: 301 830 8100
> Cell: 443 838 2200
> 
> On Mar 28, 2012, at 8:15 AM, Erez Theodorou wrote:
> 
> Hi,
>  
> Sometimes messages are only received at the recipient after 1 or 2 seconds.
> I have a cluster of nodes and what I see is that all the nodes on the cluster stop receiving messages for the exact same period. Even nodes that send messages to other nodes (processes) on the same machine.
> Any ideas what may be the possible causes?
>  
> Thanks,
> -Erez
>  
> _______________________________________________
> Spread-users mailing list
> Spread-users at lists.spread.org
> http://lists.spread.org/mailman/listinfo/spread-users
> 
> 
> _______________________________________________
> Spread-users mailing list
> Spread-users at lists.spread.org
> http://lists.spread.org/mailman/listinfo/spread-users
> 



More information about the Spread-users mailing list