We've been running nsd8x (AOLserver 3.2+ad12 + hollyjerry.org patch)
since late March. Jerry helped us set up multiple AOLserver
instances, and it was only in the past 3 weeks that we moved
a site with moderate volume to this server (Red Hat 7.0).
We also upgraded to Postgres 7.1.2.
Last week, AOLserver stopped serving pages and the log ended with
this entry:
...[-conn910-] Notice: dbinit: sql(localhost::tgndata): ' select user_id, token, secure_token, last_ip, last_hit from sec_sessions where session_id = 882531 '
nsthread(13272) error: ns_realloc: could not allocate 1455184 bytes
I just restarted all the AOLservers (main and 2 virtual) and
everything started working again. I didn't look any further to
answer the WHY question.
Yesterday, while running top, I noticed that the busy site's
server had consumed 70% of memory. I restarted them
and it was back down to 3%, but the busy site's
AOLserver continues to consume memory. Here it is about
24 hours later and it is now up to 31%. Here is the most recent top:
7:59pm up 125 days, 19:22, 1 user, load average: 0.24, 0.08, 0.02
114 processes: 112 sleeping, 1 running, 1 zombie, 0 stopped
CPU states: 0.3% user, 3.2% system, 0.0% nice, 96.3% idle
Mem: 516140K av, 513048K used, 3092K free, 62796K shrd, 140192K buff
Swap: 265032K av, 4320K used, 260712K free 135572K cached
PID USER PRI NI SIZE RSS SHARE STAT %CPU %MEM TIME COMMAND
1724 root 13 0 1064 1064 816 R 3.4 0.2 0:04 top
1491 nsamain 0 0 3872 3872 1536 S 0.1 0.7 0:00 nsamain
20984 nsamain 0 0 3872 3872 1536 S 0.0 0.7 0:00 nsamain
20988 nsamain 0 0 3872 3872 1536 S 0.0 0.7 0:00 nsamain
20989 nsamain 0 0 3872 3872 1536 S 0.0 0.7 0:00 nsamain
20990 nsamain 0 0 3872 3872 1536 S 0.0 0.7 0:00 nsamain
20991 nsamain 0 0 3872 3872 1536 S 0.0 0.7 0:00 nsamain
20992 nsamain 0 0 3872 3872 1536 S 0.0 0.7 0:05 nsamain
20997 nsatgn 0 0 159M 159M 1888 S 0.0 31.5 0:01 nsd8x
21001 nsatgn 6 0 159M 159M 1888 S 0.0 31.5 0:00 nsd8x
21002 nsatgn 0 0 159M 159M 1888 S 0.0 31.5 0:00 nsd8x
21003 nsatgn 0 0 159M 159M 1888 S 0.0 31.5 0:00 nsd8x
21004 nsatgn 0 0 159M 159M 1888 S 0.0 31.5 0:00 nsd8x
21005 nsatgn 0 0 159M 159M 1888 S 0.0 31.5 0:00 nsd8x
21006 nsatgn 0 0 159M 159M 1888 S 0.0 31.5 0:03 nsd8x
21013 nsatgn 0 0 159M 159M 1888 S 0.0 31.5 0:00 nsd8x
21021 nsaerc 0 0 24892 24M 1856 S 0.0 4.8 0:01 nsaerc
21025 nsaerc 6 0 24892 24M 1856 S 0.0 4.8 0:00 nsaerc
21026 nsaerc 0 0 24892 24M 1856 S 0.0 4.8 0:00 nsaerc
21027 nsaerc 0 0 24892 24M 1856 S 0.0 4.8 0:00 nsaerc
21028 nsaerc 0 0 24892 24M 1856 S 0.0 4.8 0:00 nsaerc
21030 nsaerc 0 0 24892 24M 1856 S 0.0 4.8 0:00 nsaerc
As you probably guessed, nsatgn is the busy site! I'm assuming that
the ns_realloc error above occurred because memory was exhausted.
So, could there be something in *my* OpenACS code that is causing
the memory consumption, and if so, what is it or how do I find it?
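For a rough sense of scale, here is a back-of-the-envelope sketch in
Python. It only does arithmetic on the numbers quoted above, and it
assumes the growth is roughly linear over time, which I have not
verified. The function and variable names are just mine for
illustration.

```python
# Figures copied from the top output and timeline in this post.
TOTAL_MEM_KB = 516140    # "Mem: 516140K av"
START_PCT = 3.0          # %MEM of the busy server right after the restart
NOW_PCT = 31.5           # %MEM of nsd8x (nsatgn) in the latest top
HOURS_ELAPSED = 24.0     # "about 24 hours later" (approximate)
OBSERVED_PCT = 70.0      # %MEM seen yesterday, before the restart


def growth_kb_per_hour(total_kb, start_pct, now_pct, hours):
    """Average growth in KB per hour. ASSUMPTION: growth is linear."""
    return total_kb * (now_pct - start_pct) / 100.0 / hours


def hours_until(target_pct, start_pct, now_pct, hours):
    """Hours until %MEM reaches target_pct at the same average rate."""
    pct_per_hour = (now_pct - start_pct) / hours
    return (target_pct - now_pct) / pct_per_hour


rate = growth_kb_per_hour(TOTAL_MEM_KB, START_PCT, NOW_PCT, HOURS_ELAPSED)
eta = hours_until(OBSERVED_PCT, START_PCT, NOW_PCT, HOURS_ELAPSED)

print(f"approx growth: {rate:.0f} KB/hour")
print(f"approx hours until {OBSERVED_PCT:.0f}%: {eta:.1f}")
```

If those assumptions hold, the busy server is growing by roughly
6 MB per hour and would be back at the 70% mark in a bit over a day.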
THANK YOU.
-Bob