[Spread-users] Spread communication between Linux and Windows

Wim Pessemier wim.pessemier at ster.kuleuven.be
Fri Mar 26 05:21:25 EDT 2010


Thank you very much for pointing us in the right direction! When adding that 
line, it said 

----------
Sess_read: Message has type field 0x80000080
Sess_validate_read_header: Message has illegal type field 0x80000080
Sess_kill: killing session user ( mailbox 10 )
----------

So I guessed it had to do with the fact that we recompiled spread on both 
machines with 
MAX_GROUP_NAME          54
MAX_PRIVATE_NAME        32

I did some tests now with the precompiled releases from the spread website and 
everything works perfectly now. Clearly there must be some compatibility 
problem between both our compiled versions, so it won't be a problem anymore 
to figure it out.

We use spread in a distributed control system for an astronomical telescope, 
and I keep on being very happy about that! Since we installed it a half year 
ago spread has proven to be very reliable and scalable.

Many thanks,

Wim





On Thursday 25 March 2010 07:48:01 pm Jonathan Stanton wrote:
> If you add the line:
> 
> DebugFlags = {SESSION PRINT EXIT}
> 
> to your spread.conf file then the daemon should print out some information
>  about each client connection that is attempted. This might show what the
>  error is if the server receives anything from the client, but I'm
>  suspecious that no data is getting through as the error your output shows
>  "recv_nointr_timeout: Timed out" means that the client tried to connect
>  and read the first data from the server and it never got anything.
> 
> I would also try using the linux client and connecting to the local linux
>  daemon and see if that works. The -8 error is a general one for any time
>  the server closed the tcp connection without informing the client about
>  what was wrong. Since your connection doesn't even get established I think
>  it's some sort of connectivity issue.
> 
> I'd also make sure you don't have a firewall blocking connections. Since by
>  default the spread client uses localhost (which usually isn't blocked) I'd
>  expect it to work, but more aggressive or host-based firewalls may still
>  cause problems.
> 
> Cheers,
> 
> Jonathan
> 
> On Thu, Mar 25, 2010 at 05:24:17PM +0100, Wim Pessemier wrote:
> > I just realized that the order of the hosts in the spread.conf was not
> > the same for the Windows and the Linux computers.
> >
> > Now the Windows and Linux spread daemons "see" each other (sptmonitor
> > shows info of both of them) but the problem is that I cannot connect
> > to them.
> >
> > -----------
> > ~> /software/spread/bin/spuser
> > Spread library version is 4.0.0
> > recv_nointr_timeout: Timed out
> > SP_error: (-8) Connection closed by spread
> >
> > Bye
> > -----------
> >
> > Anything I should try?
> >
> > Thank you,
> >
> > Wim
> >
> > On Thursday 25 March 2010 05:05:16 pm Wim Pessemier wrote:
> > > Hi,
> > >
> > > I'm trying to run spread on a mixed network of Linux and Windows XP
> > > computers, but so far I cannot get it to work. When I start a spread
> > > server on a Windows computer (pollux) and then I start another one on
> > >
> > > the Linux computer (ankaa), it says:
> > > | Version 4.00.00 Built 29/November/2006
> > >
> > > \======================================================================
> > >==== =/ Conf_load_conf_file: using file: spread.conf
> > > Successfully configured Segment 0 [10.33.178.255:4803] with 2 procs:
> > >                        pollux: 10.33.178.208
> > >                         ankaa: 10.33.178.81
> > > Setting SO_REUSEADDR to always on -- make sure Spread daemon host is
> > >  secured! Finished configuration file.
> > > Hash value for this configuration is: 3547312074
> > > Conf_load_conf_file: My name: ankaa, id: 10.33.178.81, port: 4803
> > > Membership id is ( 169980497, 1269531216)
> > > --------------------
> > > Configuration at ankaa is:
> > > Num Segments 1
> > >          1       10.33.178.255     4803
> > >                  ankaa                   10.33.178.81
> > > ====================
> > >
> > > So in the end only 'ankaa' (Linux) is found. If I connect with "spuser
> > > -s 4803 at pollux -r", then I get SP_error -8 (Connection closed by
> > > spread) as soon as a message is sent. When I do the same thing in the
> > > reverse order (first start Linux, then Windows) the same errors occur.
> > >
> > > Both ankaa and pollux are present in the hosts files on Windows and
> > > Linux. In both cases v4.00.00 Built 29/November/2006 is used.
> > >
> > > Is there anything special I need to do to get this to work?
> > > I would be very grateful for all help!
> > >
> > > Kind regards,
> > >
> > > Wim
> > >
> > >
> > >
> > >
> > > _______________________________________________
> > > Spread-users mailing list
> > > Spread-users at lists.spread.org
> > > http://lists.spread.org/mailman/listinfo/spread-users
> >
> > On Thursday 25 March 2010 05:05:16 pm Wim Pessemier wrote:
> > > Hi,
> > >
> > > I'm trying to run spread on a mixed network of Linux and Windows XP
> > > computers, but so far I cannot get it to work. When I start a spread
> > > server on a Windows computer (pollux) and then I start another one on
> > >
> > > the Linux computer (ankaa), it says:
> > > | Version 4.00.00 Built 29/November/2006
> > >
> > > \======================================================================
> > >==== =/ Conf_load_conf_file: using file: spread.conf
> > > Successfully configured Segment 0 [10.33.178.255:4803] with 2 procs:
> > >                        pollux: 10.33.178.208
> > >                         ankaa: 10.33.178.81
> > > Setting SO_REUSEADDR to always on -- make sure Spread daemon host is
> > >  secured! Finished configuration file.
> > > Hash value for this configuration is: 3547312074
> > > Conf_load_conf_file: My name: ankaa, id: 10.33.178.81, port: 4803
> > > Membership id is ( 169980497, 1269531216)
> > > --------------------
> > > Configuration at ankaa is:
> > > Num Segments 1
> > >          1       10.33.178.255     4803
> > >                  ankaa                   10.33.178.81
> > > ====================
> > >
> > > So in the end only 'ankaa' (Linux) is found. If I connect with "spuser
> > > -s 4803 at pollux -r", then I get SP_error -8 (Connection closed by
> > > spread) as soon as a message is sent. When I do the same thing in the
> > > reverse order (first start Linux, then Windows) the same errors occur.
> > >
> > > Both ankaa and pollux are present in the hosts files on Windows and
> > > Linux. In both cases v4.00.00 Built 29/November/2006 is used.
> > >
> > > Is there anything special I need to do to get this to work?
> > > I would be very grateful for all help!
> > >
> > > Kind regards,
> > >
> > > Wim
> > >
> > >
> > >
> > >
> > > _______________________________________________
> > > Spread-users mailing list
> > > Spread-users at lists.spread.org
> > > http://lists.spread.org/mailman/listinfo/spread-users
> >
> > _______________________________________________
> > Spread-users mailing list
> > Spread-users at lists.spread.org
> > http://lists.spread.org/mailman/listinfo/spread-users
> 




More information about the Spread-users mailing list