What's happening in future branch - March 25

Ken Murchison murch at andrew.cmu.edu
Thu Mar 25 07:33:40 EDT 2010


Sounds good!


Bron Gondwana wrote:
> Gosh, it's getting awfully close to my self imposed deadline of April
> isn't it - though I think I'll be pulling an Ubuntu and saying "I
> meant the end of April, honest".  Sysadmin stuff got in the way for
> rather longer than I intended.
> 
> Due to making changes all over the place, I currently have a giant change
> entry on the future branch with all the changes on it.  I'll try to factor
> them out into individual line items later, keeping it reviewable.  I'm not
> going to try to make every intermediate stage work - let alone even
> compile - because it's too much makework for no benefit.
> 
> That said, here's where things are! (long, but hopefully interesting to
> at least some other people... if nothing else a handy memory jogger for
> me if I get side-tracked at all)
> 
> Rewritten (greatly simplified) API for getting file names:
> ==========================================================
> 
> char *fname = mailbox_meta_fname(&mailbox, META_INDEX);
> char *newfname = mailbox_meta_newfname(&mailbox, META_INDEX);
> 
> (uses a separate static buffer so you can just do this and then
> your rename logic!)
> 
> char *fname = mailbox_message_fname(&mailbox, uid);
> 
> So that simplifies HEAPS of code and hides all the "is it meta"
> behind one interface.
> 
> Mboxlist functions return a struct
> ==================================
> 
> struct mboxlist_entry mbentry;
> r = mboxlist_lookup(name, &mbentry, &tid);
> 
> mbentry.mbtype, mbentry.partition, mbentry.acl
> 
> Simplifies parameter passing to functions considerably
> 
> Locking
> =======
> 
> Here's the big one!  I'll write a separate section on this in more
> detail, but here are the basic contracts:
> 
> New file: cyrus.lock - empty file.
> 
> exclusive lock on cyrus.lock:
>   * free reign with all files.  Delete what you like, rewrite
>     what you like
>   * no other locks required
> 
> shared lock on cyrus.lock: 
>   * MUST NOT replace cyrus.index or cyrus.cache files (change inode)
>   * MUST NOT change any files at all without cyrus.index lock
>   * MAY rewrite cyrus.squat.
> 
> exclusive lock on cyrus.index:
>   * MUST have a shared lock on cyrus.lock
>   * MAY rewrite flags/modseq on any index record
>   * MAY append new index records
>   * MAY append to cyrus.cache
>   * MUST NOT rewrite older parts of cyrus.cache
>   * MAY rewrite cyrus.header (write .NEW, rename)
>   * MAY rewrite user.seen file
> 
> shared lock on cyrus.index:
>   * optional - gives consistent reads.  Otherwise modseqs higher than
>     last read header may be found, seen may be out of sync.  I have a
>     simple concurrency test case that can see unintended data by being
>     clever.
> 
> All this infrastructure is in place in the future branch now, though
> untested so far.  APIs are:
> 
> mailbox_open_shared(name, auth_state, &mailbox);
> mailbox_open_exclusive(name, auth_state, &mailbox);
> 
> mailbox_lock_index(&mailbox);
> mailbox_unlock_index(&mailbox);
> 
> mailbox_close(&mailbox);
> 
> CRCs and automatic bookkeeping
> ==============================
> 
> (remember the discussion on recno replacing msgno because expunged records
>  remain in cyrus.index rather than cyrus.expunge, so need to be skipped?
>  It's necessary for the above, because cyr_expire will get an exclusive lock
>  on the mailbox for the duration of its index rewrite!)
> 
> Move "deleted / flagged / answered" meta count management into mailbox.c:
> 
> mailbox_rewrite_index_record(&mailbox, recno, &record);
>  * reads the OLD record again, updates counts, updates CRC32 values including
>    XOR_CRC which is the XOR of all record CRCs.
>  * updates "exists" if it's an expunge.
>  * updates modseq value.  Calls:
> 
> mailbox_index_dirty(&mailbox)
>  * anything doing a write to ANYTHING does this.  It sets a flag,
>    increments the highestmodseq and grabs a timestamp.  All changes
>    use these values.
> 
> mailbox_header_write(&mailbox);
>  * rewrites the header file.  Also updates the mailbox->header_file_crc
>    value which will be saved when we call:
> 
> mailbox_write_index_header(&mailbox);
>  * MUST be called after making any change to ensure all checksums and
>    XOR_CRC are correctlly stored - along with the bumped highestmodseq
>    and correct record counts.
> 
> Oh, we missed:
> 
> mailbox_append_index_records(&mailbox, &records, num);
> 
> Stores "num" records to the index file.  It's a pointer to an array of
> "struct index_record" structs.
> 
> I'm still reworking everything to use these rather than doing the
> bookkeeping by hand!
> 
> index.c - use a mailbox object
> ==============================
> 
> All the globals go away!  There's just a global mailbox struct, which
> will have a lock_fd in shared mode, meaning that all the offsets into
> the cyrus.index and cyrus.cache mmaps are ALWAYS VALID.
> 
> Along with compulsary CONDSTORE - vast swathes of code complexity are
> being removed :)  You don't need to track times when you have modseq
> to determine if flags should be sent again.  You don't have to keep
> statting files, just re-read the header and then scan for updates.
> 
> It WILL be necessary to keep a global array mapping from msgno to recno
> so non-UID commands can be efficient.  This can be rewritten whenever
> highestmodseq is detected to have changed :)
> 
> I'm just starting on this - hence there are lots of global references to
> remove.
> 
> Also - no more UID(msgno) and CACHE_OFFSET(msgno) and all that jazz.  We
> read a record with mailbox_read_index_record() and then use it.  In native
> form, in an easily accessible struct.  Joy.  No more 
> htonl((bit32 *)base_offset+SOME_CONSTANT) nonsense everywhere!  Contain it,
> wrap it, ensure accesses to it go via consistency checking, checksum
> updating and header counter maintaining paths!
> 
> 
> So - that's the status of the future branch!  It also has most of the
> interesting patches from the fastmail branch cherry-picked forwards to 
> sit on top of CVS trunk.  
> 
> Feel free to poke around, make suggestions, send patches, etc.  Some of
> the less popular (i.e. I don't use them) utilities could probably do with
> some TLC or maybe merging into uber-tools if they don't do much.  While I
> haven't drifted too far incompatible just yet, it's getting that way!
> 
> Speaking of which: cyrus.expunge.  It's pretty much all stripped now.  I'll
> be writing something simple in to mailbox_upgrade_index which detects its
> presence and does a sorted merge of the two files into the new cyrus.index,
> complete with system_flags & FLAG_EXPUNGED.  There's also FLAG_UNLINKED
> which will allow us to have immediate expunge of the actual on-disk-file
> (with the associated inabilithy to unexpunge of course!) if sites want that.
> 
> 
> Thanks for reading!
> 
> Bron.
> 

-- 
Kenneth Murchison
Systems Programmer
Carnegie Mellon University


More information about the Cyrus-devel mailing list