Xapian searches of the body of an email

Robert Stepanek rsto at fastmailteam.com
Mon Jan 7 10:42:48 EST 2019


Hi,

Sebastian is right:

On Mon, Jan 7, 2019, at 3:57 PM, Sebastian Hagedorn wrote:
> 
> squatter is nowadays a bit of a misnomer, because it uses
> whatever index> you have configured. In cyrus 2.4, squatter would always create a
> SQUAT> index. When you run squatter with Xapian, it will build the
> index, but for> the index to actually work, you also need the conversationsdb. 

conversations.db is indeed a misnomer now. The database was only used to
keep track of mail threads (hence the name), but its role expanded. One
of the indexes it stores is the SHA1 hashes of every message, and
separate hashes for each of that message MIME parts. Such a hash is
named the GUID, and for each GUID we store a list of all
mailbox:UID[bodypart] pairs where this content occurs in.
For search, we keep track of the indexed messages by GUID, so we can
avoid reindexing duplicate mails. To return a search result, we can now
map that GUID back to its mailbox:message pairs. That's why we need
conversations.db for search.
I can't help with upgrading from 2.4, unfortunately, but if you re-index
your mailboxes once in conversations.db, you should be all set.
Cheers,
Robert
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.andrew.cmu.edu/pipermail/info-cyrus/attachments/20190107/11f9d4e2/attachment.html>


More information about the Info-cyrus mailing list