[Spread-users] Was there ever any resolution for "token too long for packet!"??
Rick Cobb
rick_cobb at ieee.org
Thu Jun 18 02:50:26 EDT 2009
I'm referring to an old report of a problem on 3.17.3:
http://lists.spread.org/pipermail/spread-users/2005-September/002708.html
We're getting these messages on a very big ring using 4.0.1 (tip revision of
about four months ago) on Linux: we have 14 segments configured, adding up
to about 65 daemons at the moment. The spread daemons in the two largest
segments (10 and 13 nodes respectively) are failing continuously.
Unlike the original poster, we have no holes at all. Even so, we're
crashing with the same message, and that poster only had about 20
daemons. I haven't found any answer to that person's problem.
In the absence of any other evidence, it looks like our ring is just too
big. OTOH, if we *can* run with this big a ring, it would be a big
convenience for us. Can anyone say with some level of certainty that this
is caused by our scale? Or is there another way to cause the problem?
Thanks! An address-sanitized log follows.
-- ReC
[Thu 18 Jun 2009 06:03:02] =========== Form Token ==========
[Thu 18 Jun 2009 06:03:02] FORM 1 Token, sent by 10.101.36.120. Seq: 3366
[Thu 18 Jun 2009 06:03:02] Configuration hash: 265724240 (local hash 265724240)
[Thu 18 Jun 2009 06:03:02] ProcID: 10.101.36.120 ARU: -1, ARU LastID: 0.0.0.6
[Thu 18 Jun 2009 06:03:02] FlowControl: 0 RTR Len: 1448
[Thu 18 Jun 2009 06:03:02] Form Token members list -- Active (48) Pending (10)
0: 10.101.0.110 1: 10.101.0.114 2: 10.101.0.113
3: 10.101.0.111 4: 10.101.0.112 5: 10.101.4.110
6: 10.101.4.114 7: 10.101.4.112 8: 10.101.4.113
9: 10.101.4.111 10: 10.101.8.110 11: 10.101.8.113
12: 10.101.8.111 13: 10.101.8.114 14: 10.101.8.112
15: 10.101.12.110 16: 10.101.12.113 17: 10.101.12.111
18: 10.101.12.114 19: 10.101.12.112 20: 10.101.16.110
21: 10.101.16.111 22: 10.101.16.114 23: 10.101.16.113
24: 10.101.16.112 25: 10.101.20.110 26: 10.101.20.112
27: 10.101.20.114 28: 10.101.20.113 29: 10.101.20.111
30: 10.101.24.110 31: 10.101.24.112 32: 10.101.24.111
33: 10.101.24.114 34: 10.101.24.113 35: 10.101.28.110
36: 10.101.28.115 37: 10.101.28.114 38: 10.101.28.113
39: 10.101.28.111 40: 10.101.28.112 41: 10.101.32.110
42: 10.101.32.112 43: 10.101.32.111 44: 10.101.32.113
45: 10.101.32.114 46: 10.101.36.122 47: 10.101.36.120
Pending Members:
48: 10.101.36.118 49: 10.101.36.119 50: 10.101.36.113
51: 10.101.36.112 52: 10.101.36.111 53: 10.101.36.117
54: 10.101.36.114 55: 10.101.36.115 56: 10.101.36.116
57: 10.101.36.121
[Thu 18 Jun 2009 06:03:02] Form Token reps list -- Count (12) index (10)
0: 10.101.0.110 (T 1 SegInd 2) 1: 10.101.4.110 (T 1 SegInd 3) 2: 10.101.8.110 (T 1 SegInd 4)
3: 10.101.12.110 (T 1 SegInd 5) 4: 10.101.16.110 (T 1 SegInd 6) 5: 10.101.20.110 (T 1 SegInd 7)
6: 10.101.24.110 (T 1 SegInd 8) 7: 10.101.28.110 (T 1 SegInd 9) 8: 10.101.32.110 (T 1 SegInd 10)
9: 10.101.36.122 (T 1 SegInd 11) 10: 10.101.40.118 (T 1 SegInd 12) 11: 10.101.44.110 (T 1 SegInd 13)
[Thu 18 Jun 2009 06:03:02] Form Token RING list -- Count (19)
[Thu 18 Jun 2009 06:03:02] Ring 0: MembID 10.101.12.110 - 1245302486
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 5 HighSeq: 5 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 15 NumTrans: 15
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List:
0: 10.101.12.110 1: 10.101.12.113 2: 10.101.12.111
3: 10.101.12.114 4: 10.101.12.112 5: 10.101.16.110
6: 10.101.16.111 7: 10.101.16.114 8: 10.101.16.113
9: 10.101.16.112 10: 10.101.20.110 11: 10.101.20.112
12: 10.101.20.114 13: 10.101.20.113 14: 10.101.20.111
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 1: MembID 10.101.24.110 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.24.110
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 2: MembID 10.101.24.112 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.24.112
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 3: MembID 10.101.24.111 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.24.111
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 4: MembID 10.101.24.114 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.24.114
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 5: MembID 10.101.24.113 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.24.113
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 6: MembID 10.101.28.110 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.28.110
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 7: MembID 10.101.28.115 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.28.115
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 8: MembID 10.101.28.114 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.28.114
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 9: MembID 10.101.28.113 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.28.113
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 10: MembID 10.101.28.111 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.28.111
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 11: MembID 10.101.28.112 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.28.112
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 12: MembID 10.101.32.110 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.32.110
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 13: MembID 10.101.32.112 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.32.112
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 14: MembID 10.101.32.111 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.32.111
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 15: MembID 10.101.32.113 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.32.113
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 16: MembID 10.101.0.110 - 1245290616
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 24 HighSeq: 33 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 16 NumTrans: 16
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List:
0: 10.101.0.110 1: 10.101.0.114 2: 10.101.0.113
3: 10.101.0.111 4: 10.101.0.112 5: 10.101.4.110
6: 10.101.4.114 7: 10.101.4.112 8: 10.101.4.113
9: 10.101.4.111 10: 10.101.8.110 11: 10.101.8.113
12: 10.101.8.111 13: 10.101.8.114 14: 10.101.8.112
15: 10.101.32.114
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 17: MembID 10.101.36.122 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.36.122
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] Ring 18: MembID 10.101.36.120 - 4294967295
TransTime 0
[Thu 18 Jun 2009 06:03:02] ARU: 0 HighSeq: 0 NumHoles: 0
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
[Thu 18 Jun 2009 06:03:02] Message Holes:
[Thu 18 Jun 2009 06:03:02] Trans List: 0: 10.101.36.120
[Thu 18 Jun 2009 06:03:02] Commit List:
[Thu 18 Jun 2009 06:03:02] ====================================================
[Thu 18 Jun 2009 06:03:02] Net_ucast_token: Token too long for packet!
Exit caused by Alarm(EXIT)
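
To make the scale question easier to check across several crash dumps, here is a minimal Python sketch that tallies a Form Token dump like the one above: how many rings the token carries, how many members those rings list in total, and the reported RTR Len. The function name `summarize_form_token` and the regular expressions are my own assumptions, based only on the line shapes visible in this log; this is not a Spread tool or API. The packet size limit is not printed in the log, so the script only counts and does not judge whether the token fits.

```python
import re


def summarize_form_token(log_text):
    """Tally rings, listed members, and RTR Len from a Form Token dump.

    Hypothetical helper: the patterns are inferred from the log lines in
    this post, not from the Spread source.
    """
    # One "Ring N: MembID ..." line per ring entry carried in the token.
    ring_ids = re.findall(r"Ring (\d+): MembID", log_text)
    # Each ring entry reports how many members its Trans List holds.
    num_trans = [int(n) for n in re.findall(r"NumTrans:\s*(\d+)", log_text)]
    # The token header line reports a single RTR Len value.
    rtr = re.search(r"RTR Len:\s*(\d+)", log_text)
    return {
        "rings": len(ring_ids),
        "trans_members": sum(num_trans),
        "rtr_len": int(rtr.group(1)) if rtr else None,
    }


# A short excerpt in the same shape as the log above.
sample = """\
[Thu 18 Jun 2009 06:03:02] FlowControl: 0 RTR Len: 1448
[Thu 18 Jun 2009 06:03:02] Ring 0: MembID 10.101.12.110 - 1245302486
[Thu 18 Jun 2009 06:03:02] NumCommit: 15 NumTrans: 15
[Thu 18 Jun 2009 06:03:02] Ring 1: MembID 10.101.24.110 - 4294967295
[Thu 18 Jun 2009 06:03:02] NumCommit: 1 NumTrans: 1
"""

print(summarize_form_token(sample))
# prints {'rings': 2, 'trans_members': 16, 'rtr_len': 1448}
```

Counting the full dump by hand gives 19 ring entries whose NumTrans values add up to 48, which matches the Active (48) count in the members list. Most of those entries are single-daemon rings, so the number of ring entries in the token grows with the number of daemons forming a membership at once.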