De-duping attachments

Bron Gondwana brong at fastmail.fm
Wed Sep 15 18:24:04 EDT 2010


On Wed, Sep 15, 2010 at 05:24:11PM +0100, Gavin McCullagh wrote:
> Hi,
> 
> On Wed, 15 Sep 2010, Nik Conwell wrote:
> 
> > Isn't the easy hack for dedup just looking at the above md5 files and 
> > then doing appropriate hard links?  This could be done by a nightly 
> > trawl of the spool space.  A bigger win would be to separate the headers 
> > from the messages but that's a lot more work.
> 
> For what it's worth, I believe the fsdup tool which is part of fslint will
> do this for you.
> 
> 	http://www.pixelbeat.org/fslint/

Or this lovely little toy.  It uses the fact that in current versions of
Cyrus the "GUID" field is actually the sha1 of the underlying file.
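
For illustration only, here is a minimal Python sketch of the hard-link idea discussed above. It is not the attached script. It recomputes the SHA-1 of each file instead of reading the GUID from the Cyrus index, and it treats every regular file under the given directory as a candidate. The function names, the `dry_run` flag and the `.dedup-tmp` suffix are all made up for this example. A real spool would need the walk restricted to message files and would need to respect Cyrus locking.

```python
import hashlib
import os
from collections import defaultdict


def sha1_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-1 digest of the file's contents."""
    digest = hashlib.sha1()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def dedupe_by_hardlink(spool_root, dry_run=True):
    """Hard-link files under spool_root whose SHA-1 digests match.

    Returns a list of (duplicate_path, kept_path) pairs. With dry_run=True
    nothing on disk is changed; the list shows what would be linked.
    """
    # Group every regular file by the SHA-1 of its contents.
    by_digest = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(spool_root):
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            if os.path.islink(path) or not os.path.isfile(path):
                continue
            by_digest[sha1_of_file(path)].append(path)

    linked = []
    for paths in by_digest.values():
        keeper = paths[0]
        keeper_stat = os.stat(keeper)
        for dup in paths[1:]:
            dup_stat = os.stat(dup)
            # Hard links cannot cross filesystems.
            if dup_stat.st_dev != keeper_stat.st_dev:
                continue
            # Already the same inode: nothing to do.
            if dup_stat.st_ino == keeper_stat.st_ino:
                continue
            if not dry_run:
                # Link under a temporary name, then rename over the
                # duplicate so the path never disappears.
                tmp = dup + ".dedup-tmp"  # hypothetical suffix
                os.link(keeper, tmp)
                os.replace(tmp, dup)
            linked.append((dup, keeper))
    return linked
```

Calling `dedupe_by_hardlink("/path/to/spool")` with the default `dry_run=True` only reports what it would link, which seems the safer way to try this on a copy of the spool first.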

Bron ( warning: may contain FastMail specific assumptions )
-------------- next part --------------
A non-text attachment was scrubbed...
Name: fix_duplicate_guids.pl
Type: text/x-perl
Size: 1945 bytes
Desc: not available
Url : http://lists.andrew.cmu.edu/pipermail/info-cyrus/attachments/20100916/bedd1aa3/attachment.bin 

