[Spread-users] Send_new_packets: created packet 203 already exist 2

Matt Garman matthew.garman at gmail.com
Wed Feb 22 13:40:10 EST 2012


Hi,

I asked about this back in May, 2008 [1], but never really came to any
resolution.

As a refresher, we're getting regular spread daemon crashes (it went
away for a while, but has recently become a very regular occurrence,
as in several times/day).  We're using spread version 4.00.00,
self-compiled on CentOS 5.6.

The log leading up to the crash looks like this:

[Wed 22 Feb 2012 12:04:58] Prot_handle_token: BUG WORKAROUND: Too many
rounds in EVS state; swallowing token; state:
[Wed 22 Feb 2012 12:04:58]      Aru:              241
[Wed 22 Feb 2012 12:04:58]      My_aru:           241
[Wed 22 Feb 2012 12:04:58]      Highest_seq:      200
[Wed 22 Feb 2012 12:04:58]      Highest_fifo_seq: 103
[Wed 22 Feb 2012 12:04:58]      Last_discarded:   0
[Wed 22 Feb 2012 12:04:58]      Last_delivered:   241
[Wed 22 Feb 2012 12:04:58]      Last_seq:         3533
[Wed 22 Feb 2012 12:04:58]      Token_rounds:     501
[Wed 22 Feb 2012 12:04:58] Last Token:
[Wed 22 Feb 2012 12:04:58]      type:             0x80040080
[Wed 22 Feb 2012 12:04:58]      transmiter_id:    -1407973572
[Wed 22 Feb 2012 12:04:58]      seq:              0
[Wed 22 Feb 2012 12:04:58]      proc_id:          -1407973572
[Wed 22 Feb 2012 12:04:58]      aru:              241
[Wed 22 Feb 2012 12:04:58]      aru_last_id:      -1407973572
[Wed 22 Feb 2012 12:04:58]      flow_control:     0
[Wed 22 Feb 2012 12:04:58]      rtr_len:          0
[Wed 22 Feb 2012 12:04:58]      conf_hash:        1007608523
Membership id is ( -1407973572, 1329934005)
[Wed 22 Feb 2012 12:04:58] --------------------
[Wed 22 Feb 2012 12:04:58] Configuration at lnxsvr1 is:
[Wed 22 Feb 2012 12:04:58] Num Segments 1
[Wed 22 Feb 2012 12:04:58]      4       172.20.7.63       4803
[Wed 22 Feb 2012 12:04:58]              lnxsvr1                 172.20.7.60
[Wed 22 Feb 2012 12:04:58]              lnxsvr2                 172.20.7.61
[Wed 22 Feb 2012 12:04:58]              lnxsvr6                 172.20.7.62
[Wed 22 Feb 2012 12:04:58]              lnxsvr5                 172.20.7.58
[Wed 22 Feb 2012 12:04:58] ====================
[Wed 22 Feb 2012 12:04:58] Send_new_packets: created packet 203 already exist 2
Exit caused by Alarm(EXIT)

Any thoughts?

Thanks,
Matt


[1] http://lists.spread.org/pipermail/spread-users/2008-May/003824.html



More information about the Spread-users mailing list