Deduping an mp3 collection
I made a couple of minor modifications to a script outlined at ASPN by Bill Bumgarner to make a great deduping tool for my mp3 collection.
It checks the first couple of thousand bytes of a file for matches, and if it finds a match then checks the whole file. So unlike some other tools, it actually finds byte-for-byte matches.
iTunes, for instance, just finds files with similar song names, which returned a lot of false positives when I tried it out.
This script deletes files, so run some tests using small files in a temp folder first. Use it at your own risk!
Comments(1)
An interesting way to find dupes more efficiently, but In my case, I have a big false negative problem due to differing ID3 tags, so I have few qualms about wasting CPU and I/O time.