Hi,
I am posting this here in case anyone runs into the same problem someday;
it might help someone.
We have a particular BQ with a lot of database usage. Suddenly a user
repeated a query more than 10 times, which tied up 8 processors
running queries.
This caused high CPU usage and all services slowed down.
As this happened, mail users' outgoing emails started to queue up and
time out. When the dovecot count reached 100 (I have this setting at 100,
yours might be 50 or lower),
dfix did what it is programmed to do: it stopped dovecot, killed all dovecot
processes, and ran dbrecover.
Immediately after this it starts dovecot again (as there are no dovecot
processes anymore).
However, in our case the problem was not mail related (it was MySQL), so the
server kept running dbrecover in an endless loop, causing even more CPU
overhead (and more customer disappointment).
So the obvious first step was to kill the runaway MySQL queries.
But that was no longer enough, because dfix kept running dbrecover over and
over.
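For anyone who needs to find the runaway queries first, here is a minimal sketch of how I would list them. It assumes the stock mysql command-line client and a MySQL user allowed to see all threads; the helper name long_queries and the 60-second limit are just my choices for illustration.

```shell
# Reads the tab-separated output of
#   mysql -N -B -e 'SHOW FULL PROCESSLIST'
# on stdin (columns: Id, User, Host, db, Command, Time, State, Info)
# and prints the Id of every thread whose query has been running
# longer than the number of seconds given as $1.
long_queries() {
    awk -F'\t' -v limit="$1" '$5 == "Query" && $6 + 0 > limit { print $1 }'
}

# Usage, as a MySQL user with the PROCESS privilege:
#   mysql -N -B -e 'SHOW FULL PROCESSLIST' | long_queries 60
# Review the ids, then kill each one by hand:
#   mysqladmin kill <id>
```

Only threads in the Query state are listed, so idle (Sleep) connections are ignored however old they are.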
Luckily, this command, run as root, helped:
mv /usr/local/sbin/dfix.sh /usr/local/sbin/dfix.sh.tmp
Then I restarted sendmail and dovecot once more,
waited until users were able to send mail (5-10 minutes in my case),
gave dfix a new threshold of 200, and renamed the file back to dfix.sh.
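The same steps, written out as a rough sketch. The path is the one from my server; the function names dfix_disable and dfix_enable are made up for this example, and where the threshold is set depends on your copy of dfix, so that step is left as a manual one.

```shell
# Path as on my server; set DFIX to something else to try this harmlessly.
DFIX="${DFIX:-/usr/local/sbin/dfix.sh}"

# Move the script aside so it cannot be started again.
dfix_disable() { mv "$DFIX" "$DFIX.tmp"; }

# Put the script back under its original name.
dfix_enable() { mv "$DFIX.tmp" "$DFIX"; }

# Sequence, run as root:
#   dfix_disable
#   /sbin/service sendmail restart
#   /sbin/service dovecot restart
#   (wait until users can send mail again, then raise the threshold by hand)
#   dfix_enable
```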
Hope it helps someone
Regards
Rodrigo O
Xnet
/sbin/service dovecot stop
# killall -9 dovecot
# killall -9 dovecot-auth
# /etc/init.d/dbrecover start
-----Original Message-----
From: Colin Jack [mailto:colin (at mark) mainline.co.uk]
Sent: Saturday, 11 July 2009 04:55
To: coba-e (at mark) bluequartz.org
Subject: [coba-e:15811] FTP Server problems
One of my BQ servers has suddenly developed a problem with the FTP server
stopping and starting every few minutes.
This server is one of my 'old faithful' BQ servers and has been rock solid
for years. I am the only admin on it and I haven't done anything on there
for ages. There have been no recent yum updates.
I have looked through the logs but cannot find what the problem is - just
lots of these in messages:
Jul 11 10:15:03 server1 cced(smd)[29921]: client 0:[0:29902]: SET 18 . FTP
lastChange = 1247303703 currentState = G currentMessage =
"[[base-ftp.amStatusOK]]"
Jul 11 10:30:13 server1 cced(smd)[3704]: client 0:[0:3666]: SET 18 . FTP
lastChange = 1247304613 currentState = R currentMessage =
"[[base-ftp.amNotResponding]]"
Jul 11 10:30:19 server1 proftpd[3932]: server1.mainline.co.uk
(127.0.0.1[127.0.0.1]) - FTP session opened.
Jul 11 10:30:19 server1 proftpd[3932]: server1.mainline.co.uk
(127.0.0.1[127.0.0.1]) - FTP session closed.
Jul 11 10:45:13 server1 cced(smd)[8114]: client 0:[0:8059]: SET 18 . FTP
lastChange = 1247305513 currentState = R currentMessage =
"[[base-ftp.amNotResponding]]"
Jul 11 10:45:19 server1 proftpd[9308]: server1.mainline.co.uk
(127.0.0.1[127.0.0.1]) - FTP session opened.
Jul 11 10:45:19 server1 proftpd[9308]: server1.mainline.co.uk
(127.0.0.1[127.0.0.1]) - FTP session closed.
Jul 11 11:00:04 server1 proftpd[14127]: server1.mainline.co.uk
(127.0.0.1[127.0.0.1]) - FTP session opened.
Jul 11 11:00:04 server1 proftpd[14127]: server1.mainline.co.uk
(127.0.0.1[127.0.0.1]) - FTP session closed.
Jul 11 11:00:04 server1 cced(smd)[13915]: client 0:[0:13900]: SET 18 . FTP
lastChange = 1247306404 currentState = G currentMessage =
"[[base-ftp.amStatusOK]]"
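To see just the state changes without the surrounding noise, a small filter along these lines may help. It is only a sketch: it assumes each cced entry is a single line in /var/log/messages (they are wrapped here by the mail client), and the function name ftp_states is made up for this example.

```shell
# Reads syslog lines on stdin and, for every cced(smd) entry about the FTP
# monitor, prints the timestamp followed by the reported currentState
# (G or R in the excerpt above).
ftp_states() {
    awk '/cced\(smd\)/ && /base-ftp\.am/ {
        state = ""
        for (i = 1; i <= NF; i++)
            if ($i == "currentState") state = $(i + 2)
        print $1, $2, $3, state
    }'
}

# Usage:
#   ftp_states < /var/log/messages
```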
No log in /var/log/proftpd
[root@server1 proftpd]# proftpd -v
ProFTPD Version 1.3.2
Is anybody able to help me track down the problem?
Thanks
Colin