Restart from....? (DRP)

Albert Shih Albert.Shih at obspm.fr
Mon Jun 18 04:48:16 EDT 2018


Le 18/06/2018 à 10:22:03+0200, Niels Dettenbach via Info-cyrus a écrit
> Am Montag, 18. Juni 2018, 09:46:02 CEST schrieb Albert Shih:
> > What do you think ? What's your DRP ?
> I shoot snapshots from the underlying FS of the spool partition(s) and the
> main DB files (skiplist) - incl. (incremental) filesystem dumps of them.

How you do that ?

Because at the beginning my plan was to do both (replication and snapshot).

The problem is currently I'm encounter big issue with the snapshot. I don't
know if this is the right place because I don't know if it's related to
Cyrus, so that's why I didn't talk about at the
first time. But I got a server (Dell PowerEdge, 192Go, 28 mechanicals disk,
2 ssd, 2 SAS (for the OS)).

The system is FreeBSD 11 running on the 2 SAS disk on UFS

The cyrus imap run inside a jail on the 2 ssd ( on zfs pool)

The mailbox and xapian index are on two zfs dataset on a zpool with 28
mechanicals disk.

Everything seem working fine, until I try to send the dataset on other
server. I just cannot send a zfs snapshot from this server to another. If
the dataset are small that's OK, but with the mailbox (~4To) the zfs
command just hang after 10-40 minutes during 1-10 minutes, come back work
during 1 or 2 hours and hang again etc.

> in a desaster scenario it usually works well to reinstantiate the last
> snapshot and start the server(s) with a forced full reconstruct run. But this
> only offers "low resolution" recovery (mails / mods since last snapshot are
> gone then).
>
> Beside this we run daily FS backups (incl. cyrus DB dumps) which allows us to

How you do that ? Because cyrus got a lot of DB....

> reinstall from zero (i.e. autmated by ansible or similiar) on system and FS

Yes we using puppet, reinstalling the system and configuration are easy.
The hard part are the data.

> level.
>
> I'm a bit new to the new included backup mechs and repo features in cyrus 3
> and interested in experiences with setups, allowing a efficient "lossless"
> recovery too.

I'm a bit new with cyrus so... ;-) All I can say is the replication seem to
works well. I got

  master --> first slave (same room) --> second slave (distant datacenter).

I'll will try today to see if it's easy or not to restart with a slave by
cloning it.

Best regards.

--
Albert SHIH
DIO bâtiment 15
Observatoire de Paris
xmpp: jas at obspm.fr
Heure local/Local time:
Mon Jun 18 10:36:19 CEST 2018


More information about the Info-cyrus mailing list