* Find Duplicate Media

For users to report plugin bugs and request plugin enhancements; and for authors to test new/new versions of plugins, and to discuss plugin development (in the Programming Technicalities sub-forum). If you want advice on choosing or using a plugin, please ask in General Usage or an appropriate sub-forum.
Post Reply
User avatar
johnmorrisoniom
Megastar
Posts: 882
Joined: 18 Dec 2008 07:40
Family Historian: V7
Location: Isle of Man

Find Duplicate Media

Post by johnmorrisoniom » 28 Oct 2012 22:38

Hi Jane,
This plugin is superb, and I run it once a week (ish) after I have done a batch of census entries (mainly).
I have a large file (32061 individuals), so the plugin takes longer to run each time.
Would it be possible to set a last run date, and then only check files added/updated since that date. Much the same as Mike's find duplicated individuals does.
The thinking behind the suggestion, is that once all files have been checked, then only the newly added files need checking against what is already there. Thus reducing the rune time required.

ID:6557

User avatar
Jane
Site Admin
Posts: 8441
Joined: 01 Nov 2002 15:00
Family Historian: V7
Location: Somerset, England
Contact:

Find Duplicate Media

Post by Jane » 29 Oct 2012 19:37

Thanks for the compliment, I'll put the request on my To-Do list, which is a little full at the moment, but I will see what needs doing, I think it's just a case of caching the MD5 codes, but it might miss duplicates if files were edited with out updating the records in FH.
Jane
My Family History : My Photography "Knowledge is knowing that a tomato is a fruit. Wisdom is not putting it in a fruit salad."

User avatar
johnmorrisoniom
Megastar
Posts: 882
Joined: 18 Dec 2008 07:40
Family Historian: V7
Location: Isle of Man

Find Duplicate Media

Post by johnmorrisoniom » 02 Feb 2013 12:47

Jane,
Any progress with an update for this plug in.?
I now have over 5,500 multimedia items in my file, and the plugin takes about 10 minutes to run.

User avatar
Jane
Site Admin
Posts: 8441
Joined: 01 Nov 2002 15:00
Family Historian: V7
Location: Somerset, England
Contact:

Find Duplicate Media

Post by Jane » 03 Feb 2013 11:08

Sorry John, I'll try and allocate a couple of hours to it.
Jane
My Family History : My Photography "Knowledge is knowing that a tomato is a fruit. Wisdom is not putting it in a fruit salad."

User avatar
Jane
Site Admin
Posts: 8441
Joined: 01 Nov 2002 15:00
Family Historian: V7
Location: Somerset, England
Contact:

Find Duplicate Media

Post by Jane » 08 Feb 2013 18:41

John, I have added a Cache, it still processes all the records as I need to build a list of the media id's but it caches the md5 for the records.

When starting it asks if you want to use the cache if one exists, so you can do a full scan at any time should you need to, by saying No.

https://www.dropbox.com/s/wiebe40dbl8c6 ... dia.fh_lua
Jane
My Family History : My Photography "Knowledge is knowing that a tomato is a fruit. Wisdom is not putting it in a fruit salad."

User avatar
johnmorrisoniom
Megastar
Posts: 882
Joined: 18 Dec 2008 07:40
Family Historian: V7
Location: Isle of Man

Find Duplicate Media

Post by johnmorrisoniom » 19 Feb 2013 12:11

Hi Jane,
I have tried the new version a couple of times now. It definitely runs a bit quicker, more-so on my W7 machine.

Does the cache update each time, or do you have to do a run without the cache for it to update?

Post Reply