[NBLUG/talk] Finding duplicate files
ME
dugan at passwall.com
Mon Jul 7 03:00:05 PDT 2003
Ross Thomas said:
> Lincoln Peters wrote:
>> Is there an easy way to do a recursive search for duplicate files,
>> preferably from the shell?
>
> Put this into a file, make it executable and run it with the directory
> you wish to check. It defaults to the current directory. This will
> only really work on Linux. Other *nix OS's don't support the -print0
> and -0 args so you would need to insert a 'sed' command or make the
> xargs more restrictive.
>
> This one only finds real files. Symlinks are ignored.
>
> Also handles embedded blanks and tabs. Misbehaves when new-lines are
> embedded in a file name (sort & uniq aren't that sophisticated).
>
> ------------- Cut Here ----------------
> #!/bin/sh
>
> if [ $# -ge 2 ]
> then
> echo "Usage: `basename \"$0\"` [<dir_name>]"
> exit 1
> fi
>
> find "${1:-.}" -type f -print0 | xargs -0 md5sum | sort | uniq -D -w 32
> ------------- Cut Here ----------------
>
> HTH.
>
> Ross.
Nice! :-)
-ME
More information about the talk
mailing list