Deduping an mp3 collection

I made a couple of minor modifications to a script outlined at ASPN by Bill Bumgarner to make a great deduping tool for my mp3 collection.

It compares the first couple of thousand bytes of each file, and only when those match does it go on to check the whole file. So unlike some other tools, it finds true byte-for-byte matches.
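
Here is a rough sketch of that two-pass idea. It is not the ASPN script or my modified version, just an illustration under a few assumptions: the names `PREFIX_BYTES`, `_digest` and `find_duplicates` are made up for this example, 2048 stands in for "a couple of thousand bytes", and matching SHA-1 digests of the whole file are treated as identical content. Unlike the real script, it only reports duplicates and deletes nothing.

```python
import hashlib
import os
from collections import defaultdict

PREFIX_BYTES = 2048  # assumed value for "a couple of thousand bytes"


def _digest(path, limit=None, chunk_size=65536):
    """Return the SHA-1 hex digest of a file, or of its first `limit` bytes."""
    h = hashlib.sha1()
    remaining = limit
    with open(path, "rb") as f:
        while remaining is None or remaining > 0:
            size = chunk_size if remaining is None else min(chunk_size, remaining)
            chunk = f.read(size)
            if not chunk:
                break
            h.update(chunk)
            if remaining is not None:
                remaining -= len(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Return sorted lists of paths under `root` whose full-file digests match."""
    # Pass 1: cheap check, group files by a digest of their first bytes only.
    by_prefix = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.isfile(path):
                by_prefix[_digest(path, limit=PREFIX_BYTES)].append(path)

    # Pass 2: read whole files only where the prefix digest is shared.
    duplicates = []
    for candidates in by_prefix.values():
        if len(candidates) < 2:
            continue  # unique prefix, so the whole file never gets read
        by_full = defaultdict(list)
        for path in candidates:
            by_full[_digest(path)].append(path)
        duplicates.extend(sorted(g) for g in by_full.values() if len(g) > 1)
    return duplicates
```

Calling `find_duplicates("/path/to/music")` returns a list of groups, each group being a list of paths with the same content. Printing the groups and checking them by eye before removing anything is the safer way to go.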

iTunes, for instance, just finds files with similar song names, which returned a lot of false positives when I tried it out.

This script deletes files, so run some tests using small files in a temp folder first. Use it at your own risk!

1 Comment so far

  1. Stephan Sokolow on May 20th, 2007

    An interesting way to find dupes more efficiently, but in my case, I have a big false negative problem due to differing ID3 tags, so I have few qualms about wasting CPU and I/O time.
