Page 1 of 1

Find Duplicate Media

Posted: 28 Oct 2012 22:38
by johnmorrisoniom
Hi Jane,
This plugin is superb, and I run it once a week (ish) after I have done a batch of census entries (mainly).
I have a large file (32061 individuals), so the plugin takes longer to run each time.
Would it be possible to set a last run date, and then only check files added/updated since that date. Much the same as Mike's find duplicated individuals does.
The thinking behind the suggestion, is that once all files have been checked, then only the newly added files need checking against what is already there. Thus reducing the rune time required.

ID:6557

Find Duplicate Media

Posted: 29 Oct 2012 19:37
by Jane
Thanks for the compliment, I'll put the request on my To-Do list, which is a little full at the moment, but I will see what needs doing, I think it's just a case of caching the MD5 codes, but it might miss duplicates if files were edited with out updating the records in FH.

Find Duplicate Media

Posted: 02 Feb 2013 12:47
by johnmorrisoniom
Jane,
Any progress with an update for this plug in.?
I now have over 5,500 multimedia items in my file, and the plugin takes about 10 minutes to run.

Find Duplicate Media

Posted: 03 Feb 2013 11:08
by Jane
Sorry John, I'll try and allocate a couple of hours to it.

Find Duplicate Media

Posted: 08 Feb 2013 18:41
by Jane
John, I have added a Cache, it still processes all the records as I need to build a list of the media id's but it caches the md5 for the records.

When starting it asks if you want to use the cache if one exists, so you can do a full scan at any time should you need to, by saying No.

https://www.dropbox.com/s/wiebe40dbl8c6 ... dia.fh_lua

Find Duplicate Media

Posted: 19 Feb 2013 12:11
by johnmorrisoniom
Hi Jane,
I have tried the new version a couple of times now. It definitely runs a bit quicker, more-so on my W7 machine.

Does the cache update each time, or do you have to do a run without the cache for it to update?