[Spread-users] Cluster locked up again

Tom Mornini tom at quios.net
Sun Jan 13 02:52:27 EST 2002


Well, making certain to drain the mailboxes didn't prevent Spread from 
hanging again.

I've already implemented the connect level group_membership change, but 
that code hasn't been pushed yet.

Here's an spmonitor dump. Does anything seem out of order? The huge 
retrans number on obi (the group leader) looks rather suspicious...

Question: Is there any problem with running a spread daemon on a system 
that nobody is connecting to? We run jabba as a hot spare, but nothing 
is actually running there. I notice that it's recv pack number is twice 
as high as the next highest...and that doesn't seem right!

============================
Status at boba V 3.16. 1 (state 1, gstate 1) after 248709 seconds :
Membership  :  5  procs in 1 segments, leader is obi
rounds   : 76806053     tok_hurry :  353432     memb change:       1
sent pack:       2      recv pack :  752333     retrans    :     226
u retrans:       9      s retrans :     217     b retrans  :       0
My_aru   :  768294      Aru       :  766693     Highest seq:  768294
Sessions :       3      Groups    :       3     Window     :      60
Deliver M:  767663      Deliver Pk:  768294     Pers Window:      15
Delta Mes:       0      Delta Pack:       0     Delta sec  :      11
==================================

Monitor>
============================
Status at lando V 3.16. 1 (state 1, gstate 1) after 248741 seconds :
Membership  :  5  procs in 1 segments, leader is obi
rounds   : 76810910     tok_hurry :  353473     memb change:       2
sent pack:  364288      recv pack :  387839     retrans    :     510
u retrans:      12      s retrans :     498     b retrans  :       0
My_aru   :  768294      Aru       :  766693     Highest seq:  768294
Sessions :      84      Groups    :       3     Window     :      60
Deliver M:  767737      Deliver Pk:  768384     Pers Window:      15
Delta Mes:      74      Delta Pack:       0     Delta sec  :      32
==================================

Monitor>
============================
Status at greedo V 3.16. 1 (state 1, gstate 1) after 248736 seconds :
Membership  :  5  procs in 1 segments, leader is obi
rounds   : 76810909     tok_hurry :  353473     memb change:       2
sent pack:  387600      recv pack :  365029     retrans    :       0
u retrans:       0      s retrans :       0     b retrans  :       0
My_aru   :  768294      Aru       :  766693     Highest seq:  768294
Sessions :      84      Groups    :       3     Window     :      60
Deliver M:  767737      Deliver Pk:  768384     Pers Window:      15
Delta Mes:       0      Delta Pack:       0     Delta sec  :      -5
==================================

Monitor>
============================
Status at jabba V 3.16. 1 (state 1, gstate 1) after 248742 seconds :
Membership  :  5  procs in 1 segments, leader is obi
rounds   : 76810909     tok_hurry :  353473     memb change:       2
sent pack:      18      recv pack : 1711493     retrans    :       4
u retrans:       4      s retrans :       0     b retrans  :       0
My_aru   :  768294      Aru       :  766693     Highest seq:  768294
Sessions :       0      Groups    :       3     Window     :      60
Deliver M:  767737      Deliver Pk:  768384     Pers Window:      15
Delta Mes:       0      Delta Pack:       0     Delta sec  :       6
==================================

Monitor>
============================
Status at obi V 3.16. 1 (state 1, gstate 1) after 248710 seconds :
Membership  :  5  procs in 1 segments, leader is obi
rounds   : 76806052     tok_hurry :  354419     memb change:       1
sent pack:       2      recv pack :  752558     retrans    :  206268
u retrans:  206268      s retrans :       0     b retrans  :       0
My_aru   :  768294      Aru       :  766693     Highest seq:  768294
Sessions :       3      Groups    :       3     Window     :      60
Deliver M:  767663      Deliver Pk:  768294     Pers Window:      15
Delta Mes:     -74      Delta Pack:       0     Delta sec  :     -32
==================================

--
-- Tom Mornini
-- eWingz Systems, Inc.
--
-- ICQ: 113526784, AOL: tmornini, Yahoo: tmornini, MSN: tmornini






More information about the Spread-users mailing list