[Spread-users] Was there ever any resolution for "token too long for packet!"?

Rick Cobb rick_cobb at ieee.org
Thu Jun 18 02:50:26 EDT 2009


I'm referring to an old report of a problem on 3.17.3:
http://lists.spread.org/pipermail/spread-users/2005-September/002708.html


We're getting these messages on a very big ring using 4.0.1 (tip revision of
about four months ago) on Linux: we have 14 segments configured, adding up
to about 65 daemons at the moment.  The spread daemons in the two largest
segments (10 and 13 nodes respectively) are failing continuously.


Unlike the original poster, we have no holes at all. On the other hand,
we're crashing with the same message -- and that poster only had about 20
daemons.  I haven't found any answer to that person's problem.


In the absence of any other evidence, it looks like our ring is just too
big.  OTOH, if we *can* run with this big a ring, it would be a big
convenience for us.  Can anyone say with some level of certainty that this
is caused by our scale?  Or is there another way to cause the problem?


Thanks! Address-sanitized log attached --

-- ReC



[Thu 18 Jun 2009 06:03:02] =========== Form Token ==========

[Thu 18 Jun 2009 06:03:02] FORM 1 Token, sent by 10.101.36.120. Seq: 3366

[Thu 18 Jun 2009 06:03:02] Configuration hash: 265724240 (local hash 265724240)

[Thu 18 Jun 2009 06:03:02] ProcID: 10.101.36.120         ARU: -1, ARU LastID: 0.0.0.6

[Thu 18 Jun 2009 06:03:02] FlowControl: 0       RTR Len: 1448

[Thu 18 Jun 2009 06:03:02] Form Token members list -- Active (48) Pending (10)

        0: 10.101.0.110         1: 10.101.0.114         2: 10.101.0.113

        3: 10.101.0.111         4: 10.101.0.112         5: 10.101.4.110

        6: 10.101.4.114         7: 10.101.4.112         8: 10.101.4.113

        9: 10.101.4.111         10: 10.101.8.110        11: 10.101.8.113

        12: 10.101.8.111        13: 10.101.8.114        14: 10.101.8.112

        15: 10.101.12.110       16: 10.101.12.113       17: 10.101.12.111

        18: 10.101.12.114       19: 10.101.12.112       20: 10.101.16.110

        21: 10.101.16.111       22: 10.101.16.114       23: 10.101.16.113

        24: 10.101.16.112       25: 10.101.20.110       26: 10.101.20.112

        27: 10.101.20.114       28: 10.101.20.113       29: 10.101.20.111

        30: 10.101.24.110       31: 10.101.24.112       32: 10.101.24.111

        33: 10.101.24.114       34: 10.101.24.113       35: 10.101.28.110

        36: 10.101.28.115       37: 10.101.28.114       38: 10.101.28.113

        39: 10.101.28.111       40: 10.101.28.112       41: 10.101.32.110

        42: 10.101.32.112       43: 10.101.32.111       44: 10.101.32.113

        45: 10.101.32.114       46: 10.101.36.122       47: 10.101.36.120


Pending Members:

        48: 10.101.36.118       49: 10.101.36.119       50: 10.101.36.113

        51: 10.101.36.112       52: 10.101.36.111       53: 10.101.36.117

        54: 10.101.36.114       55: 10.101.36.115       56: 10.101.36.116

        57: 10.101.36.121

[Thu 18 Jun 2009 06:03:02] Form Token reps list -- Count (12) index (10)

        0: 10.101.0.110 (T 1 SegInd 2)  1: 10.101.4.110 (T 1 SegInd 3)  2: 10.101.8.110 (T 1 SegInd 4)

        3: 10.101.12.110 (T 1 SegInd 5)         4: 10.101.16.110 (T 1 SegInd 6)         5: 10.101.20.110 (T 1 SegInd 7)

        6: 10.101.24.110 (T 1 SegInd 8)         7: 10.101.28.110 (T 1 SegInd 9)         8: 10.101.32.110 (T 1 SegInd 10)

        9: 10.101.36.122 (T 1 SegInd 11)        10: 10.101.40.118 (T 1 SegInd 12)       11: 10.101.44.110 (T 1 SegInd 13)


[Thu 18 Jun 2009 06:03:02] Form Token RING list -- Count (19)

[Thu 18 Jun 2009 06:03:02] Ring 0: MembID 10.101.12.110 - 1245302486 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 5  HighSeq: 5      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 15   NumTrans: 15

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.12.110        1: 10.101.12.113        2: 10.101.12.111

        3: 10.101.12.114        4: 10.101.12.112        5: 10.101.16.110

        6: 10.101.16.111        7: 10.101.16.114        8: 10.101.16.113

        9: 10.101.16.112        10: 10.101.20.110       11: 10.101.20.112

        12: 10.101.20.114       13: 10.101.20.113       14: 10.101.20.111


[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 1: MembID 10.101.24.110 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.24.110

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 2: MembID 10.101.24.112 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.24.112

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 3: MembID 10.101.24.111 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.24.111

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 4: MembID 10.101.24.114 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.24.114

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 5: MembID 10.101.24.113 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.24.113

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 6: MembID 10.101.28.110 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.28.110

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 7: MembID 10.101.28.115 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.28.115

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 8: MembID 10.101.28.114 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.28.114

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 9: MembID 10.101.28.113 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.28.113

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 10: MembID 10.101.28.111 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.28.111

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 11: MembID 10.101.28.112 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.28.112

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 12: MembID 10.101.32.110 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.32.110

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 13: MembID 10.101.32.112 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.32.112

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 14: MembID 10.101.32.111 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.32.111

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 15: MembID 10.101.32.113 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.32.113

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 16: MembID 10.101.0.110 - 1245290616 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 24 HighSeq: 33     NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 16   NumTrans: 16

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.0.110         1: 10.101.0.114         2: 10.101.0.113

        3: 10.101.0.111         4: 10.101.0.112         5: 10.101.4.110

        6: 10.101.4.114         7: 10.101.4.112         8: 10.101.4.113

        9: 10.101.4.111         10: 10.101.8.110        11: 10.101.8.113

        12: 10.101.8.111        13: 10.101.8.114        14: 10.101.8.112

        15: 10.101.32.114

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 17: MembID 10.101.36.122 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.36.122

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] Ring 18: MembID 10.101.36.120 - 4294967295 TransTime 0

[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0

[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1

[Thu 18 Jun 2009 06:03:02]      Message Holes:

[Thu 18 Jun 2009 06:03:02]      Trans List:     0: 10.101.36.120

[Thu 18 Jun 2009 06:03:02]      Commit List:

[Thu 18 Jun 2009 06:03:02] ====================================================

[Thu 18 Jun 2009 06:03:02] Net_ucast_token: Token too long for packet!

Exit caused by Alarm(EXIT)
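
For anyone comparing dumps like the one above across daemons, here is a
rough Python sketch that tallies the variable-length parts of a Form Token
dump (members, reps, rings, and the per-ring trans/commit/hole counts). It
only reads the text of the log as printed above; the function name and the
SAMPLE excerpt are illustrative, and it assumes nothing about Spread's
actual wire format or packet size limit, so it gives counts, not bytes.

```python
import re

def summarize_form_token(log_text):
    """Tally the list sizes reported in one Form Token dump (text only)."""
    def first_ints(pattern):
        # Return the captured integers of the first match, or None if absent.
        m = re.search(pattern, log_text)
        return tuple(int(g) for g in m.groups()) if m else None

    def total(field):
        # Sum every "<field>: N" value, i.e. one value per ring entry.
        return sum(int(n) for n in re.findall(field + r":\s*(\d+)", log_text))

    members = first_ints(r"Active \((\d+)\) Pending \((\d+)\)")
    reps = first_ints(r"reps list -- Count \((\d+)\)")
    rings = first_ints(r"RING list -- Count \((\d+)\)")

    return {
        "active": members[0] if members else None,
        "pending": members[1] if members else None,
        "reps": reps[0] if reps else None,
        "rings": rings[0] if rings else None,
        "trans_entries": total("NumTrans"),
        "commit_entries": total("NumCommit"),
        "holes": total("NumHoles"),
    }

# Short excerpt in the same format as the dump above (only two ring entries).
SAMPLE = """\
[Thu 18 Jun 2009 06:03:02] Form Token members list -- Active (48) Pending (10)
[Thu 18 Jun 2009 06:03:02] Form Token reps list -- Count (12) index (10)
[Thu 18 Jun 2009 06:03:02] Form Token RING list -- Count (19)
[Thu 18 Jun 2009 06:03:02]      ARU: 5  HighSeq: 5      NumHoles: 0
[Thu 18 Jun 2009 06:03:02]      NumCommit: 15   NumTrans: 15
[Thu 18 Jun 2009 06:03:02]      ARU: 0  HighSeq: 0      NumHoles: 0
[Thu 18 Jun 2009 06:03:02]      NumCommit: 1    NumTrans: 1
"""

if __name__ == "__main__":
    print(summarize_form_token(SAMPLE))
```

In the full dump above, the per-ring NumTrans values add up to 48 (15 + 16
+ 17 single-member rings), which matches Active (48); the ring list carries
19 separate entries for those 48 daemons.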