[Spread-users] reliable?

John Schultz jschultz at d-fusion.net
Fri Apr 11 15:14:17 EDT 2003


One last note: you can often leverage the power of a SAFE msg onto lower 
service level msgs that you sent earlier.

For CAUSAL and AGREED msgs it is very simple. Say that msgs m and m' are 
both delivered at process p in view v, where m is a CAUSAL or AGREED msg 
and m' is a SAFE msg. If m' is delivered before a transitional signal, 
then m has also become safe in v and you can make all the same 
reliability assertions about m that you can about m'. If m' is delivered 
after a transitional signal at p, then m has become safe w.r.t. at least 
the members of p's transitional membership set of the following view.

Leveraging safety onto lower level msgs is much more difficult, but can 
be done. In order to do this, you begin to implement some of the 
functionality that is contained within the Flush Spread layer (the VS 
library).

Here is an example of an algorithm that would let you leverage safety 
onto FIFO msgs:

After every view event each of your processes sends a CAUSAL (or higher 
service lvl) msg m announcing it has installed its most recent view v. 
Any FIFO msgs that your app sends later are blocked from being delivered 
anywhere in views earlier than v by this announcement msg.

Now, any FIFO msg sent by process q that is delivered in view v AFTER 
q's installation announcement about view v can be "made" safe in the 
exact same way as CAUSAL/AGREED msgs.

You could probably extend this reliability knowledge even more by 
tracking VS sets and marking FIFO msgs with the ids of the views in 
which they were sent -- but that gets complex really fast (i.e. Flush 
layer).

Please note that if you used this algorithm you would DRASTICALLY 
increase the cost of joins and leaves as every membership event would 
cause n multicasts. So, you really need to have a good reason to 
implement such a thing. However, a simple optimization to this algorithm 
would be not to "ack" a view event until the application tries to send a 
msg.

-- 
John Schultz
Co-Founder, Lead Engineer
D-Fusion, Inc. (http://www.d-fusion.net)
Phn: 443-838-2200 Fax: 707-885-1055

Cédric Coulon wrote:

> Hello,
> 
> If a node falls during a reliable message is sent (FIFO_MESS for example),
> are the other nodes received the message after all?
> 
> Thanks.
> 
> ----
> Cédric Coulon                                          Tel:   02 51 12 59 63
> IRIN, 2, rue de la Houssiniere, BP 92208    Fax: 02 51 12 58 12
> 44322 NANTES cedex 3
> FRANCE                     mailto:Cedric.Coulon at irin.univ-nantes.fr
> 






More information about the Spread-users mailing list