Better grouping algorithm
Reported by Virgil Dupras | August 27th, 2009 @ 10:04 AM | in se2.8.0 me5.7.0 pe1.8.0 (closed)
A soon as dupeGuru is looking for duplicates that are not exactly the same, there's the issue of discarded matches coming up. For some discarded matches, it's impossible not to discard them because one side of the match is already part of a group that the other side of the match can't be in.
But after a quick glance at the grouping code, it seems possible that a match is discarded on the basis that one side is an unconfirmed part of a group. If that file is never confirmed, it means that some discarded matches could be used to safely make new groups without conflicting with any other group.
Comments and changes to this ticket
-
Virgil Dupras August 30th, 2009 @ 07:02 PM
- Milestone set to se2.8.0 me5.7.0 pe1.8.0
-
Virgil Dupras September 5th, 2009 @ 04:38 PM
- State changed from new to open
- Assigned user set to Virgil Dupras
-
Virgil Dupras September 5th, 2009 @ 04:58 PM
- State changed from open to fixed
Please Sign in or create a free account to add a new ticket.
With your very own profile, you can contribute to projects, track your activity, watch tickets, receive and update tickets through your email and much more.
Create your profile
Help contribute to this project by taking a few moments to create your personal profile. Create your profile ยป
People watching this ticket
Referenced by
- 51 Better grouping algorithm (from [116]) [#51 state:fixed] Improved the grouping algo...