[NBLUG/talk] Finding duplicate files

ME dugan at passwall.com
Mon Jul 7 03:00:05 PDT 2003


Ross Thomas said:
> Lincoln Peters wrote:
>> Is there an easy way to do a recursive search for duplicate files,
>> preferably from the shell?
>
> Put this into a file, make it executable, and run it with the directory
> you wish to check.  It defaults to the current directory.  This will
> only really work on Linux.  Other *nix OSes may not support the -print0
> and -0 args, so you would need to insert a 'sed' command or make the
> xargs more restrictive.
>
> This one only finds real files.  Symlinks are ignored.
>
> It also handles embedded blanks and tabs, but misbehaves when newlines
> are embedded in a file name (sort & uniq aren't that sophisticated).
>
> ------------- Cut Here ----------------
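> #!/bin/sh
> # List groups of regular files that share an MD5 checksum.
>
> if [ $# -ge 2 ]
> then
>     echo "Usage: $(basename "$0") [<dir_name>]" >&2
>     exit 1
> fi
>
> # Hash every regular file, sort by hash, then print every line whose
> # first 32 characters (the MD5 digest) occur more than once.
> find "${1:-.}" -type f -print0 | xargs -0 md5sum | sort | uniq -D -w 32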
> ------------- Cut Here ----------------
>
> HTH.
>
> Ross.

Nice! :-)
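
One note for anyone whose uniq is not the GNU one: as far as I know, -D
and -w are GNU extensions, so the last stage of Ross's pipeline may fail
even where -print0 and -0 work.  Below is a minimal sketch that swaps in
awk for that stage.  The function name find_dupes is made up for
illustration, and it still assumes find -print0, xargs -0 and md5sum are
available.  It has the same newline-in-file-name limitation Ross
describes.

```sh
# find_dupes is a hypothetical helper, not part of Ross's script.
# Same pipeline as above, with awk standing in for "uniq -D -w 32".
find_dupes() {
    find "${1:-.}" -type f -print0 | xargs -0 md5sum | sort |
    awk '{
        hash = substr($0, 1, 32)        # MD5 digest is the first 32 chars
        if (hash == prev) {
            if (!shown) print prevline  # first member of a duplicate group
            print                       # current member
            shown = 1
        } else {
            shown = 0                   # new digest, nothing printed yet
        }
        prev = hash
        prevline = $0
    }'
}
```

Paste it into a script or source it, then run "find_dupes some_dir".
With no argument it checks the current directory, as Ross's script does.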

-ME



