[Spread-users] Spread Crash
Rodrick Brown
rodrick.brown at gmail.com
Sat Oct 4 17:32:03 EDT 2008
On Sat, Oct 4, 2008 at 12:00 PM, Adrian Revill <adrian.revill at shazamteam.com> wrote:
> Hi,
>
> We are running spread 4.00.00 on a single segment with redhat el5, and have
> been experiencing a problem where one server crashes (OS), and all the
> spread daemons on all the other servers die.
>
> We have enabled logging on spread but do not see any reason logged,
> probably because spread uses a buffered logger and the buffer is not being
> flushed.
> We are also not getting any core dumps.
>
> Has anyone else seen this problem, or have any idea how to find the fault?
>
> Also is there a newer version of spread?
>
> Our configuration is:
>
> Spread_Segment 172.20.255.255
> {
> purple9 172.20.0.134
> purple10 172.20.0.135
> purple11 172.20.0.136
> purple12 172.20.0.137
> purple13 172.20.0.140
> purple14 172.20.0.144
> purple15 172.20.0.141
> purple16 172.20.0.143
> app11 172.20.0.130
> app12 172.20.0.131
> app13 172.20.0.142
> wombat14 172.20.0.145
> app15 172.20.0.132
> app16 172.20.0.133
> webportal11 172.20.0.138
> webportal12 172.20.0.139
> webportal13 172.20.0.157
> app20 172.20.0.170
> app21 172.20.0.171
> purple20 172.20.0.172
> purple21 172.20.0.173
> app30 172.20.0.160
> app31 172.20.0.161
> purple30 172.20.0.162
> purple31 172.20.0.163
> pws30 172.20.0.164
> pws31 172.20.0.165
> }
>
> DaemonUser = nobody
> DaemonGroup = nobody
> RuntimeDir = /usr/spread
>
> # EXIT PRINT DEBUG DATA_LINK NETWORK PROTOCOL SESSION
> # CONFIGURATION MEMBERSHIP FLOW_CONTROL STATUS EVENTS
> # GROUPS MEMORY SKIPLIST ALL NONE
>
> #DebugFlags = { ALL !DATA_LINK !MEMORY !DEBUG !EVENTS }
> DebugFlags = { ALL !DATA_LINK !MEMORY !DEBUG !EVENTS }
>
>
Can you paste the last 10 or so lines of the log from before the crash?
I've seen odd memory behavior with Spread when members belong to too many
groups and send too many messages at once: the daemon consumes all the
memory on the host and then crashes with a malloc error.
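
If the log is large, something like the following pulls out just the tail
so it can be pasted into a reply. This is only a sketch: `last_lines` is a
helper name made up for illustration, it is not part of Spread, and the log
path is whatever file your daemon was started with.

```python
from collections import deque


def last_lines(path, n=10):
    """Return the last n lines of a text file, without trailing newlines.

    `path` is a placeholder for wherever your daemon writes its log.
    """
    # A bounded deque keeps only the most recent n lines while the file is
    # read once from start to end, so the whole log never sits in memory.
    with open(path, "r", errors="replace") as fh:
        return [line.rstrip("\n") for line in deque(fh, maxlen=n)]
```

Calling `last_lines(log_path, 10)` with your own log path returns the final
ten lines as a list of strings, which you can print or paste as-is.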
--
[ Rodrick R. Brown ]
http://www.rodrickbrown.com http://www.linkedin.com/in/rodrickbrown