Forum OpenACS Q&A: Server crashes!

Collapse
Posted by Nima Mazloumi on
Hi all,

I have no idea but for some reason our server is crashing all the time. All I can see in the log is this:

[20/Dec/2005:16:58:05][12038.1891892144][-sched:21-] Notice: dbdrv: opening database 'postgres:134.155.100.72::unima2'
[20/Dec/2005:16:58:05][12038.1891892144][-sched:21-] Notice: Opening unima2 on 134.155.100.72, port
[20/Dec/2005:16:58:05][12038.1891892144][-sched:21-] Fatal: received fatal signal 11

Any idea what that means? I restart it and it works. But some minutes later or hours or days it crashs again.

Greetings,
Nima

Collapse
2: Re: Server crashes! (response to 1)
Posted by Vinod Kurup on
Hi Nima,

From the logs, I'd wonder whether the 'signal 11' is coming from AOLserver or from Postgres. There was a similar report here earlier which was solved by changing libraries:

https://openacs.org/forums/message-view?message_id=318659

If not, I'd look in the PG logs and see if it's reporting anything. There are a bunch of google hits for "fatal signal 11 postgresql", but I didn't investigate them much further.

Collapse
3: Re: Server crashes! (response to 1)
Posted by Patrick Giagnocavo on
Are you 100% sure that the RAM in that system is perfect?

Signal 11 can often mean RAM errors... they appear to be random because if one DIMM is bad but the others are good, it is only when memory is written to the bad part of the DIMM that the problem occurs.