One more attempt: stuck processes

Sebastian Hagedorn Hagedorn at uni-koeln.de
Fri Nov 16 11:20:00 EST 2007


--On 16. November 2007 16:52:27 +0100 Gabor Gombas <gombasg at sztaki.hu> 
wrote:

> On Fri, Nov 16, 2007 at 12:36:49PM +0100, Sebastian Hagedorn wrote:
>
>> He suggested that the trace is unreliable. Perhaps a bug in RHEL 3's
>> version of OpenSSL messes up the stack. That would also explain why
>> nobody  else seems to have this problem.
>
> FYI I also know a system that has problems with hung Cyrus processes.
> AFAIR they have problems with pop3s only, but that may be because there
> are more POP3 than IMAP users, I don't know. The system in question runs
> 2.3.8 on Debian Etch currently.

That's a 2.6 kernel, right?

> I intend to help diagnose that system but had no time so far; they're
> now running a script that does a POP3 connection every couple of minutes
> and if that takes too long it restarts Cyrus.

Hm, we don't suffer any actual slowdown, it's just that the number of 
processes increases over time.

> There is nothing interesting in the logs:
>
> Oct 15 02:39:31 host cyrus/master[6102]: about to exec
> /usr/local/cyrus/sbin/pop3d Oct 15 02:39:31 host cyrus/pop3s[6102]:
> executed
> Oct 15 02:39:31 host cyrus/pop3s[6102]: accepted connection

That's what I'm seeing. Could you get a stack trace?

> OTOH there are a lot of messages like the following:
>
> Oct 16 14:13:10 host cyrus/master[26136]: about to exec
> /usr/local/cyrus/sbin/pop3d Oct 16 14:13:10 host cyrus/pop3s[26136]:
> executed
> Oct 16 14:13:10 host cyrus/pop3s[26136]: accepted connection
> Oct 16 14:13:10 host cyrus/pop3s[26136]: pop3s failed:XXXXXXXXXXXX
> [XX.XXX.XX.XXX] Oct 16 14:13:10 host cyrus/pop3s[26136]: Fatal error:
> tls_start_servertls() failed Oct 16 14:13:10 host cyrus/master[15923]:
> process 26136 exited, status 75 Oct 16 14:13:10 host cyrus/master[15923]:
> service pop3s pid 26136 in BUSY state: terminated abnormally
>
> Any idea what's causing that?

I have many of those as well. I suppose that could be any number of things. 
Faulty protocol or dropped connections.
-- 
     .:.Sebastian Hagedorn - RZKR-R1 (Gebäude 52), Zimmer 18.:.
Zentrum für angewandte Informatik - Universitätsweiter Service RRZK
.:.Universität zu Köln / Cologne University - ✆ +49-221-478-5587.:.
                   .:.:.:.Skype: shagedorn.:.:.:.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 186 bytes
Desc: not available
Url : http://lists.andrew.cmu.edu/pipermail/info-cyrus/attachments/20071116/5955aa9a/attachment.bin 


More information about the Info-cyrus mailing list