[Spread-users] RE: Spread on Linux question

Crystal, Mayer mayer.crystal at gs.com
Wed Jul 6 20:33:22 EDT 2005


OK, it took a little while to generate this output (I am still not sure what
the root cause is), but I ran my setup under valgrind and received the
following errors in the middle of execution. The daemons are still running,
but I have a feeling that this is not the desired behavior. Sorry for the
long post, but I hope the extra information will be helpful. Has anyone seen
anything like this? Is this behavior intended or a known issue, and is there
a patch if it is not intended?

TIA,
Mayer



Log and Valgrind output:
------------------------
[Wed 06 Jul 2005 10:41:26] Memb_handle_message: handling join message from -1710970210, State is 4
[Wed 06 Jul 2005 10:41:27] Memb_handle_message: handling join message from -1710970210, State is 4
[Wed 06 Jul 2005 10:41:27] Send_join: State is 4
[Wed 06 Jul 2005 10:41:28] Send_join: State is 4
[Wed 06 Jul 2005 10:41:28] Memb_handle_message: handling join message from -1710970210, State is 4
[Wed 06 Jul 2005 10:41:29] Memb_handle_message: handling join message from -1710970210, State is 4
[Wed 06 Jul 2005 10:41:29] Send_join: State is 4
[Wed 06 Jul 2005 10:41:30] Send_join: State is 4
[Wed 06 Jul 2005 10:41:30] Memb_handle_message: handling join message from -1710970210, State is 4
[Wed 06 Jul 2005 10:41:31] Memb_handle_message: handling join message from -1710970210, State is 4
[Wed 06 Jul 2005 10:41:31] Send_join: State is 4
[Wed 06 Jul 2005 10:41:32] Memb_handle_message: handling join message from -1710970210, State is 4
[Wed 06 Jul 2005 10:41:32] Send_join: State is 4
[Wed 06 Jul 2005 10:41:33] Send_join: State is 4
[Wed 06 Jul 2005 10:41:33] Memb_handle_message: handling join message from -1710970210, State is 4
[Wed 06 Jul 2005 10:41:34] Memb_handle_token: handling form1 token
[Wed 06 Jul 2005 10:41:34] Handle_form1 in GATHER
[Wed 06 Jul 2005 10:41:34] Memb_handle_token: handling form1 token
[Wed 06 Jul 2005 10:41:34] Handle_form1 in FORM
[Wed 06 Jul 2005 10:41:34] Memb_handle_token: handling form2 token
[Wed 06 Jul 2005 10:41:34] Handle_form2 in FORM
[Wed 06 Jul 2005 10:41:34] Memb_handle_token: handling form2 token
[Wed 06 Jul 2005 10:41:34] Handle_form2 in EVS
[Wed 06 Jul 2005 10:41:34] Memb_transitional
[Wed 06 Jul 2005 10:41:34] Memb_regular
Membership id is ( -1806277334, 1120660895)
[Wed 06 Jul 2005 10:41:34] Configuration at condor-qa-ndmz-1 is:
[Wed 06 Jul 2005 10:41:34] Num Segments 4
[Wed 06 Jul 2005 10:41:34]      1       154.4.170.157     12790
[Wed 06 Jul 2005 10:41:34]              condor-qa-ndmz-1 
==7347==
==7347== Invalid read of size 4
==7347==    at 0x805EB70: sl_remove_all (skiplist.c:543)
==7347==    by 0x805EC35: sl_destruct (skiplist.c:564)
==7347==    by 0x804F0A1: G_handle_reg_memb (groups.c:421)
==7347==    by 0x804E6CD: Sess_deliver_reg_memb (session.c:1908)
==7347==    by 0x804B5E9: Discard_packets (protocol.c:1148)
==7347==    by 0x804A6DB: Prot_handle_token (protocol.c:639)
==7347==    by 0x80537AC: E_handle_events (events.c:673)
==7347==    by 0x80497B0: main (spread.c:193)
==7347==  Address 0x1C954020 is 24 bytes inside a block of size 40 free'd
==7347==    at 0x1B903A5D: free (vg_replace_malloc.c:152)
==7347==    by 0x805EC3E: sl_destruct (skiplist.c:565)
==7347==    by 0x804F1B5: G_handle_reg_memb (groups.c:344)
==7347==    by 0x804E6CD: Sess_deliver_reg_memb (session.c:1908)
==7347==    by 0x804B5E9: Discard_packets (protocol.c:1148)
==7347==    by 0x804A6DB: Prot_handle_token (protocol.c:639)
==7347==    by 0x80537AC: E_handle_events (events.c:673)
==7347==    by 0x80497B0: main (spread.c:193)
==7347==
==7347== Invalid write of size 4
==7347==    at 0x805EBAE: sl_remove_all (skiplist.c:554)
==7347==    by 0x805EC35: sl_destruct (skiplist.c:564)
==7347==    by 0x804F0A1: G_handle_reg_memb (groups.c:421)
==7347==    by 0x804E6CD: Sess_deliver_reg_memb (session.c:1908)
==7347==    by 0x804B5E9: Discard_packets (protocol.c:1148)
==7347==    by 0x804A6DB: Prot_handle_token (protocol.c:639)
==7347==    by 0x80537AC: E_handle_events (events.c:673)
==7347==    by 0x80497B0: main (spread.c:193)
==7347==  Address 0x1C954020 is 24 bytes inside a block of size 40 free'd
==7347==    at 0x1B903A5D: free (vg_replace_malloc.c:152)
==7347==    by 0x805EC3E: sl_destruct (skiplist.c:565)
==7347==    by 0x804F1B5: G_handle_reg_memb (groups.c:344)
==7347==    by 0x804E6CD: Sess_deliver_reg_memb (session.c:1908)
==7347==    by 0x804B5E9: Discard_packets (protocol.c:1148)
==7347==    by 0x804A6DB: Prot_handle_token (protocol.c:639)
==7347==    by 0x80537AC: E_handle_events (events.c:673)
==7347==    by 0x80497B0: main (spread.c:193)
==7347==
==7347== Invalid write of size 4
==7347==    at 0x805EBB5: sl_remove_all (skiplist.c:554)
==7347==    by 0x805EC35: sl_destruct (skiplist.c:564)
==7347==    by 0x804F0A1: G_handle_reg_memb (groups.c:421)
==7347==    by 0x804E6CD: Sess_deliver_reg_memb (session.c:1908)
==7347==    by 0x804B5E9: Discard_packets (protocol.c:1148)
==7347==    by 0x804A6DB: Prot_handle_token (protocol.c:639)
==7347==    by 0x80537AC: E_handle_events (events.c:673)
==7347==    by 0x80497B0: main (spread.c:193)
==7347==  Address 0x1C95401C is 20 bytes inside a block of size 40 free'd
==7347==    at 0x1B903A5D: free (vg_replace_malloc.c:152)
==7347==    by 0x805EC3E: sl_destruct (skiplist.c:565)
==7347==    by 0x804F1B5: G_handle_reg_memb (groups.c:344)
==7347==    by 0x804E6CD: Sess_deliver_reg_memb (session.c:1908)
==7347==    by 0x804B5E9: Discard_packets (protocol.c:1148)
==7347==    by 0x804A6DB: Prot_handle_token (protocol.c:639)
==7347==    by 0x80537AC: E_handle_events (events.c:673)
==7347==    by 0x80497B0: main (spread.c:193)
==7347==
==7347== Invalid write of size 4
==7347==    at 0x805EBBC: sl_remove_all (skiplist.c:555)
==7347==    by 0x805EC35: sl_destruct (skiplist.c:564)
==7347==    by 0x804F0A1: G_handle_reg_memb (groups.c:421)
==7347==    by 0x804E6CD: Sess_deliver_reg_memb (session.c:1908)
==7347==    by 0x804B5E9: Discard_packets (protocol.c:1148)
==7347==    by 0x804A6DB: Prot_handle_token (protocol.c:639)
==7347==    by 0x80537AC: E_handle_events (events.c:673)
==7347==    by 0x80497B0: main (spread.c:193)
==7347==  Address 0x1C954010 is 8 bytes inside a block of size 40 free'd
==7347==    at 0x1B903A5D: free (vg_replace_malloc.c:152)
==7347==    by 0x805EC3E: sl_destruct (skiplist.c:565)
==7347==    by 0x804F1B5: G_handle_reg_memb (groups.c:344)
==7347==    by 0x804E6CD: Sess_deliver_reg_memb (session.c:1908)
==7347==    by 0x804B5E9: Discard_packets (protocol.c:1148)
==7347==    by 0x804A6DB: Prot_handle_token (protocol.c:639)
==7347==    by 0x80537AC: E_handle_events (events.c:673)
==7347==    by 0x80497B0: main (spread.c:193)
==7347==
==7347== Invalid write of size 4
==7347==    at 0x805EBC3: sl_remove_all (skiplist.c:556)
==7347==    by 0x805EC35: sl_destruct (skiplist.c:564)
==7347==    by 0x804F0A1: G_handle_reg_memb (groups.c:421)
==7347==    by 0x804E6CD: Sess_deliver_reg_memb (session.c:1908)
==7347==    by 0x804B5E9: Discard_packets (protocol.c:1148)
==7347==    by 0x804A6DB: Prot_handle_token (protocol.c:639)
==7347==    by 0x80537AC: E_handle_events (events.c:673)
==7347==    by 0x80497B0: main (spread.c:193)
==7347==  Address 0x1C954018 is 16 bytes inside a block of size 40 free'd
==7347==    at 0x1B903A5D: free (vg_replace_malloc.c:152)
==7347==    by 0x805EC3E: sl_destruct (skiplist.c:565)
==7347==    by 0x804F1B5: G_handle_reg_memb (groups.c:344)
==7347==    by 0x804E6CD: Sess_deliver_reg_memb (session.c:1908)
==7347==    by 0x804B5E9: Discard_packets (protocol.c:1148)
==7347==    by 0x804A6DB: Prot_handle_token (protocol.c:639)
==7347==    by 0x80537AC: E_handle_events (events.c:673)
==7347==    by 0x80497B0: main (spread.c:193)
==7347==
==7347== Invalid free() / delete / delete[]
==7347==    at 0x1B903A5D: free (vg_replace_malloc.c:152)
==7347==    by 0x805EC3E: sl_destruct (skiplist.c:565)
==7347==    by 0x804F0A1: G_handle_reg_memb (groups.c:421)
==7347==    by 0x804E6CD: Sess_deliver_reg_memb (session.c:1908)
==7347==    by 0x804B5E9: Discard_packets (protocol.c:1148)
==7347==    by 0x804A6DB: Prot_handle_token (protocol.c:639)
==7347==    by 0x80537AC: E_handle_events (events.c:673)
==7347==    by 0x80497B0: main (spread.c:193)
==7347==  Address 0x1C954008 is 0 bytes inside a block of size 40 free'd
==7347==    at 0x1B903A5D: free (vg_replace_malloc.c:152)
==7347==    by 0x805EC3E: sl_destruct (skiplist.c:565)
==7347==    by 0x804F1B5: G_handle_reg_memb (groups.c:344)
==7347==    by 0x804E6CD: Sess_deliver_reg_memb (session.c:1908)
==7347==    by 0x804B5E9: Discard_packets (protocol.c:1148)
==7347==    by 0x804A6DB: Prot_handle_token (protocol.c:639)
==7347==    by 0x80537AC: E_handle_events (events.c:673)
==7347==    by 0x80497B0: main (spread.c:193)
==7347==
==7347== Invalid read of size 4
==7347==    at 0x805E857: sl_concat (skiplist.c:391)
==7347==    by 0x805132A: G_compute_and_notify (groups.c:1614)
==7347==    by 0x8050D7C: G_handle_groups (groups.c:1466)
==7347==    by 0x804E541: Sess_deliver_message (session.c:1862)
==7347==    by 0x804B2A1: Deliver_agreed_packets (protocol.c:1010)
==7347==    by 0x804A21F: Prot_handle_bcast (protocol.c:396)
==7347==    by 0x80537AC: E_handle_events (events.c:673)
==7347==    by 0x80497B0: main (spread.c:193)
==7347==  Address 0x1C268E40 is 8 bytes inside a block of size 32 free'd
==7347==    at 0x1B903A5D: free (vg_replace_malloc.c:152)
==7347==    by 0x805EB9B: sl_remove_all (skiplist.c:549)
==7347==    by 0x8050F56: G_compute_and_notify (groups.c:1582)
==7347==    by 0x8050D7C: G_handle_groups (groups.c:1466)
==7347==    by 0x804E541: Sess_deliver_message (session.c:1862)
==7347==    by 0x804B2A1: Deliver_agreed_packets (protocol.c:1010)
==7347==    by 0x804A21F: Prot_handle_bcast (protocol.c:396)
==7347==    by 0x80537AC: E_handle_events (events.c:673)
==7347==    by 0x80497B0: main (spread.c:193)

.... (goes on to list the remaining segments and the groups)
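
Reading the valgrind trace above: the 40-byte block is first freed by
sl_destruct (skiplist.c:565) when called from G_handle_reg_memb at
groups.c:344, and is then read, written and freed again by a second
sl_destruct call from groups.c:421. That pattern is consistent with the same
list being destroyed twice, although the root cause in Spread has not been
confirmed here. The sketch below is NOT Spread's skiplist code; it is a
minimal, hypothetical singly linked list (the names node, list, list_create,
list_insert and list_destroy are all made up for illustration) showing one
common defensive idiom: the destructor takes a pointer to the caller's
pointer and sets it to NULL, so an accidental second call becomes a no-op
instead of a use-after-free.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical stand-ins for illustration only; not Spread's types. */
typedef struct node {
    struct node *next;
    int key;
} node;

typedef struct list {
    node *head;
    size_t size;
} list;

/* Allocate an empty list; returns NULL if allocation fails. */
list *list_create(void)
{
    return calloc(1, sizeof(list));
}

/* Push a key at the head; returns 0 on success, -1 on failure. */
int list_insert(list *l, int key)
{
    node *n;

    if (l == NULL)
        return -1;
    n = malloc(sizeof(*n));
    if (n == NULL)
        return -1;
    n->key = key;
    n->next = l->head;
    l->head = n;
    l->size++;
    return 0;
}

/*
 * Free every node and the list header, then clear the caller's pointer.
 * A second call on the same variable sees NULL and does nothing, so it
 * cannot touch the freed block.
 */
void list_destroy(list **lp)
{
    node *n;

    if (lp == NULL || *lp == NULL)
        return;
    n = (*lp)->head;
    while (n != NULL) {
        node *next = n->next;   /* save the link before freeing the node */
        free(n);
        n = next;
    }
    free(*lp);
    *lp = NULL;
}
```

With this shape, calling list_destroy(&l) twice in a row on the same
variable is harmless. It only protects against a double destroy through the
same pointer variable; if two different variables alias the same list, the
second destroy would still be a use-after-free, and the ownership of the
list would need to be sorted out instead.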




