Forum OpenACS Q&A: Multiple openacs in a box problem!

Hi all! I am getting problems with two openacs installations in the same box with the same database server. I have RedHat6.2 and PG7.0.2, for the first server everything is going well, but then I did another database, openacs and start the aolserver in other port (the .tcl file is ok). I can't connect (the most of the times), the problems seems to be that a handle can't be allocated, I don't know why? here's part of my log file with the second openacs:

[31/Jan/2001:13:25:48][17438.4101][-conn0-] Notice: Opening 2open on
localhost
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Notice:
Ns_PgOpenDb(postgres):  Openned connection to localhost::2open.
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Error: dbinit:
error(localhost::2open,pqReadData() -- backend closed the channel
unexpectedly.
        This probably means the backend terminated abnormally
        before or while processing the request.
): 'update users
set last_visit = sysdate()
where user_id = 1'
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Error:
ad_maintain_last_visits filter got an error:  Database operation "dml"
failed
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Notice: dbdrv: opening
database 'postgres:localhost::2open'
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Notice: Opening 2open on
localhost
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Notice:
Ns_PgOpenDb(postgres):  Openned connection to localhost::2open.
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Notice: Querying '
        select user_id, token, secure_token,
               last_ip, last_hit from sec_sessions
        where session_id = 2318;'
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Notice: dbinit:
sql(localhost::2open): '
        select user_id, token, secure_token,
               last_ip, last_hit from sec_sessions
        where session_id = 2318
    '
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Notice: Querying 'select
ad_group_member_p(1, system_administrator_group_id()) from dual;'
NOTICE:  Message from PostgreSQL backend:
        The Postmaster has informed me that some other backend died
abnormally and possibly corrupted shared memory.
        I have rolled back the current transaction and am going to
terminate your database system connection and exit.
        Please reconnect to the database system and repeat your query.
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Notice: Ns_PgExec: Trying
to reopen database connection
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Notice: Opening 2open on
localhost
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Notice:
Ns_PgOpenDb(postgres):  Openned connection to localhost::2open.
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Notice: Ns_PgExec:
Retrying query
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Notice: dbinit:
sql(localhost::2open): 'select ad_group_member_p(1,
system_administrator_group_id()) from dual'
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Error: dbinit: db handle
limit exceeded: thread already owns 1 handle from pool 'main'
[31/Jan/2001:13:25:48][17438.4101][-conn0-] Error: could not allocate
1 handle from pool "main"
could not allocate 1 handle from pool "main"
    while executing
"ns_db gethandle"
    invoked from within
"set db [ns_db gethandle]"
    (file "/web/2open/www/admin/users/action-choose.tcl" line 46)
    invoked from within
"source $script"
    (procedure "ns_sourceproc" line 6)
    invoked from within
"ns_sourceproc cns0 {}"
[31/Jan/2001:13:25:49][17438.2051][-sched-] Notice: Running scheduled
proc flag_interrupted_spams...
[31/Jan/2001:13:25:49][17438.2051][-sched-] Notice: dbdrv: opening
database 'postgres:localhost::2open'
[31/Jan/2001:13:25:49][17438.2051][-sched-] Notice: Opening 2open on
localhost
[31/Jan/2001:13:25:49][17438.4101][-conn0-] Notice: Querying '
                update sec_sessions
                set last_hit = 980965548
                where session_id = 2318;'
[31/Jan/2001:13:25:49][17438.2051][-sched-] Notice:
Ns_PgOpenDb(postgres):  Openned connection to localhost::2open.
[31/Jan/2001:13:25:49][17438.2051][-sched-] Notice: Checking for spam
jobs which were left in the 'sending' state
[31/Jan/2001:13:25:49][17438.2051][-sched-] Notice: Done running
scheduled proc flag_interrupted_spams.
[31/Jan/2001:13:25:49][17438.4101][-conn0-] Notice: dbinit:
sql(localhost::2open): '
                update sec_sessions
                set last_hit = 980965548
                where session_id = 2318
            '
[31/Jan/2001:13:25:51][17438.4101][-conn0-] Notice: Ns_PgExec: Trying
to reopen database connection
[31/Jan/2001:13:25:51][17438.4101][-conn0-] Notice: Opening 2open on
localhost
[31/Jan/2001:13:25:51][17438.4101][-conn0-] Error:
Ns_PgOpenDb(postgres):  Could not connect to localhost::2open:  The
Data Base System is in recovery mode

[31/Jan/2001:13:25:51][17438.4101][-conn0-] Error: dbinit:
error(localhost::2open,pqReadData() -- backend closed the channel
unexpectedly.
        This probably means the backend terminated abnormally
        before or while processing the request.
): 'update users
set last_visit = sysdate()
where user_id = 1'
[31/Jan/2001:13:25:51][17438.4101][-conn0-] Error:
ad_maintain_last_visits filter got an error:  Database operation "dml"
failed
[31/Jan/2001:13:25:51][17438.4101][-conn0-] Notice: Querying '
        select user_id, token, secure_token,
               last_ip, last_hit from sec_sessions
        where session_id = 2318;'
NOTICE:  Message from PostgreSQL backend:
        The Postmaster has informed me that some other backend died
abnormally and possibly corrupted shared memory.
        I have rolled back the current transaction and am going to
terminate your database system connection and exit.
        Please reconnect to the database system and repeat your query.

Thank you for your help!
Collapse
Posted by Don Baccus on
Hmmm...I do this all the time with no problem...

You're only running one postmaster, I hope ????  If you're running
multiple postmasters you're really asking for trouble.

What happens if you run psql, is everything OK then?

Don, yes I'm just running one postmaster, when I access the database with psql dbname there's no problem, so I don't know what to do.
I guess is a shared memory problem...

I had problems before when I tried to do this:
su -l postgres -c '/usr/local/pgsql/bin/postmaster -B 1000 -o "-S 2000" -S -D /usr/local/pgsql/data'
then I've reinstalled all and everything seems to work ok, I don't know whats going on...

any help?

I'm also using aolserver3.2+ad10