Problems with Cyrus IMAP 2.2.5 on Solaris 9 on production server

Igor Brezac igor at ipass.net
Wed Jun 2 09:14:08 EDT 2004


On Wed, 2 Jun 2004, Shawn Sivy wrote:

>
> Igor Brezac wrote:
>
> >On Tue, 1 Jun 2004, Shawn Sivy wrote:
> >
> >
> >
> >>I'm having all kinds of problems with Cyrus IMAP 2.2.5 on Solaris 9.
> >>System I/O errors, imap processing dying, IOERRORs.
> >>
> >>Does anyone have suggestions on what could be the cause?  Has anyone
> >>gotten version 2.2.5 working on Solaris (SPARC) 9?
> >>
> >>  -Shawn
> >>
> >>May 30 17:52:57 cyrus lmtpunix[14954]: [ID 240394 local6.error] IOERROR:
> >>opening quota file /var/imap/quota/m/user.macey2: Too many open files
> >>May 30 17:52:57 cyrus lmtpunix[14954]: [ID 335833 local6.error] DBERROR:
> >>error fetching user.macey2: cyrusdb error
> >>May 30 17:52:57 cyrus lmtpunix[14954]: [ID 860734 local6.debug]
> >>verify_user(user.macey2) failed: System I/O error
> >>May 30 17:53:20 cyrus lmtpunix[14954]: [ID 240394 local6.error] IOERROR:
> >>opening quota file /var/imap/quota/s/user.sdhugg: Too many open files
> >>May 30 17:53:20 cyrus lmtpunix[14954]: [ID 335833 local6.error] DBERROR:
> >>error fetching user.sdhugg: cyrusdb error
> >>May 30 17:53:20 cyrus lmtpunix[14954]: [ID 860734 local6.debug]
> >>verify_user(user.sdhugg) failed: System I/O error
> >>May 30 17:53:40 cyrus lmtpunix[14954]: [ID 240394 local6.error] IOERROR:
> >>opening quota file /var/imap/quota/s/user.samuel2: Too many open files
> >>May 30 17:53:40 cyrus lmtpunix[14954]: [ID 335833 local6.error] DBERROR:
> >>error fetching user.samuel2: cyrusdb error
> >>May 30 17:53:40 cyrus lmtpunix[14954]: [ID 860734 local6.debug]
> >>verify_user(user.samuel2) failed: System I/O error
> >>May 30 17:54:05 cyrus lmtpunix[14954]: [ID 240394 local6.error] IOERROR:
> >>opening quota file /var/imap/quota/b/user.balaisi2: Too many open files
> >>May 30 17:54:05 cyrus lmtpunix[14954]: [ID 335833 local6.error] DBERROR:
> >>error fetching user.balaisi2: cyrusdb error
> >>May 30 17:54:05 cyrus lmtpunix[14954]: [ID 860734 local6.debug]
> >>verify_user(user.balaisi2) failed: System I/O error
> >>
> >>Jun  1 08:42:19 cyrus master[21185]: [ID 970914 local6.error] process
> >>21886 exited, signaled to death by 11
> >>Jun  1 08:43:26 cyrus master[21185]: [ID 970914 local6.error] process
> >>20660 exited, signaled to death by 11
> >>Jun  1 08:43:43 cyrus master[21185]: [ID 970914 local6.error] process
> >>20133 exited, signaled to death by 11
> >>Jun  1 08:47:02 cyrus master[21185]: [ID 970914 local6.error] process
> >>23236 exited, signaled to death by 11
> >>Jun  1 08:47:20 cyrus master[21185]: [ID 970914 local6.error] process
> >>23972 exited, signaled to death by 11
> >>Jun  1 08:47:58 cyrus master[21185]: [ID 970914 local6.error] process
> >>23751 exited, signaled to death by 11
> >>Jun  1 08:48:05 cyrus master[21185]: [ID 970914 local6.error] process
> >>21258 exited, signaled to death by 11
> >>Jun  1 08:49:53 cyrus master[21185]: [ID 970914 local6.error] process
> >>19939 exited, signaled to death by 11
> >>Jun  1 08:51:27 cyrus master[21185]: [ID 970914 local6.error] process
> >>24807 exited, signaled to death by 11
> >>Jun  1 08:51:37 cyrus master[21185]: [ID 970914 local6.error] process
> >>23457 exited, signaled to death by 11
> >>
> >>
> >>
> >
> >This looks like a berkeley db problem, although it could be a file
> >descriptor leak somewhere. Have you applied sleepycat 4.2.52 patches
> >(there are two of them, although the first is more important)?  Does
> >checkpointing of the cyrus databases complete successfully (look for
> >ctl_cyrusdb in the syslog)  I start master from 'configdirectory',
> >otherwise berkeley checkpointing does not work (neither does duplicate
> >db expiration)
> >
> >
> >
> I have both patches installed for db 4.2.52.  Below are the messages
> from the log regarding ctl_cyrusdb.  Looks like it completed fine.  I
> took your suggestion of starting master from /var/imap.
>
> Jun  2 08:47:05 cyrus master[17927]: [ID 392559 local6.debug] about to
> exec /local/cyrus/bin/ctl_cyrusdb
> Jun  2 08:47:06 cyrus ctl_cyrusdb[17927]: [ID 702911 local6.notice]
> recovering cyrus databases
> Jun  2 08:47:09 cyrus ctl_cyrusdb[17927]: [ID 275131 local6.notice]
> skiplist: recovered /var/imap/mailboxes.db (85526 records, 6516904
> bytes) in 3 seconds
> Jun  2 08:47:13 cyrus ctl_cyrusdb[17927]: [ID 127214 local6.notice] done
> recovering cyrus databases
> Jun  2 08:47:13 cyrus master[17935]: [ID 392559 local6.debug] about to
> exec /local/cyrus/bin/ctl_cyrusdb
> Jun  2 08:47:13 cyrus ctl_cyrusdb[17935]: [ID 702911 local6.notice]
> checkpointing cyrus databases
> Jun  2 08:47:13 cyrus ctl_cyrusdb[17935]: [ID 578205 local6.debug]
> archiving database file: /var/imap/mailboxes.db
> Jun  2 08:47:13 cyrus ctl_cyrusdb[17935]: [ID 448116 local6.debug]
> archiving log file: /var/imap/db/log.0000000008
> Jun  2 08:47:13 cyrus ctl_cyrusdb[17935]: [ID 578205 local6.debug]
> archiving database file: /var/imap/annotations.db
> Jun  2 08:47:13 cyrus ctl_cyrusdb[17935]: [ID 448116 local6.debug]
> archiving log file: /var/imap/db/log.0000000008
> Jun  2 08:47:13 cyrus ctl_cyrusdb[17935]: [ID 127214 local6.notice] done
> checkpointing cyrus databases
>
>

Have things improved since you restarted master?

You can use pfiles and pmap (and lsof) to check for open files and memory
usage.  Try to use pfile against a running imapd process and see if a
number of open files increases.

What does ulimit -a say?

-- 
Igor
---
Cyrus Home Page: http://asg.web.cmu.edu/cyrus
Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu
List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html




More information about the Info-cyrus mailing list