[Spread-users] Need debugging advice

David Avraamides David.Avraamides at SevernRiverCapital.com
Fri Oct 29 10:08:51 EDT 2004


I'm trying to diagnose a problem that just came up on our spread-based
messaging layer. For months we have had applications running fine in
production and yesterday I noticed some problems. It seems I can only
see messaging traffic when the client and server are both running on the
same box. This was never a problem before and I can't think of anything
that changed (no new software, no new config file, etc.). I've written
my own application-level "sniff" tool, but its not helpful since its not
seeing any cross-machine traffic. I was wondering if there are any
spread-level sniffing/debugging tools that could help me understand what
might be wrong.

Thanks,
-Dave

--

The relevant part of the config file I use is:

Spread_Segment  10.10.1.255:4803 {
        ct-srvwebin-01
        ct-srvmon-01
        ct-srvapp-06
        ct-devbuild-01
}

Spread_Segment  10.10.2.255:4803 {
        ct-dev-01
        ct-dev-02
        ct-dev-04
}

And here is the log when I start up a daemon:

ip_init: using file: spread.access_ip
Conf_init: using file: spread.conf
Successfully configured Segment 0 [10.10.1.255:4803] with 4 procs:
              ct-srvwebin-01: 10.10.1.28
                ct-srvmon-01: 10.10.1.37
                ct-srvapp-06: 10.10.1.117
              ct-devbuild-01: 10.10.1.110
Successfully configured Segment 1 [10.10.2.255:4803] with 3 procs:
                   ct-dev-01: 10.10.2.20
                   ct-dev-02: 10.10.2.41
                   ct-dev-04: 10.10.2.50
Finished configuration file.
Conf_init: My name: ct-dev-01, id: 10.10.2.20, port: 4803
Membership id is ( 168427804, 1099058653)
--------------------
Configuration at ct-dev-01 is:
Num Segments 2
        4       10.10.1.255       4803
                ct-srvwebin-01          10.10.1.28
                ct-srvmon-01            10.10.1.37
                ct-srvapp-06            10.10.1.117
                ct-devbuild-01          10.10.1.110
        1       10.10.2.255       4803
                ct-dev-01               10.10.2.20
====================
++++++++++++++++++++++
Num of groups: 3
[1] group data with 4 members:
        [1] #r5694-216#ct-devbuild-01
        [2] #r7467-132#ct-srvwebin-01
        [3] #r8958-1920#ct-srvmon-01
        [4] #r9140-144#ct-srvwebin-01
----------------------
[2] group mail with 3 members:
        [1] #r5694-216#ct-devbuild-01
        [2] #r8958-1920#ct-srvmon-01
        [3] #r9140-144#ct-srvwebin-01
----------------------
[3] group xbtest with 3 members:
        [1] #r5694-216#ct-devbuild-01
        [2] #r8958-1920#ct-srvmon-01
        [3] #r9140-144#ct-srvwebin-01
----------------------


-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.spread.org/pipermail/spread-users/attachments/20041029/af968ab2/attachment.html 


More information about the Spread-users mailing list