find-dupes.awk: edits for Linux?

Hi,

Any chance you could help me adapt your `find-dupes.awk` script to work on a Linux system?  Based on your notes, I was able to figure out the following changes:

- Instead of `ls -lTR`, use `ls -l --full-time -R | grep -v ^d`
- Use `md5_exec = "md5sum"`
- Change `$9` to `$8`: `file = substr($0,match($0, $8)+length($8)+1,length($0))`
- Change `$2` to `$1` since we are using `md5sum`: `hash = $1`
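
A minimal sketch of the filename extraction from the third bullet, assuming the GNU `ls -l --full-time` layout (size in field 5, timezone offset in field 8, filename after it). The helper name `parse_ls_line` and the sample line are hypothetical and not part of `find-dupes.awk`. It swaps `match()` for `index()`, because `match()` treats its second argument as a regular expression and an offset such as `+0000` begins with the regex operator `+`, which may not match as intended:

```shell
# Hypothetical helper: print "<size><TAB><filename>" for each line of
# `ls -l --full-time` output read from stdin (GNU ls layout assumed).
parse_ls_line() {
    awk '{
        # Field 5 is the size; field 8 is the timezone offset, e.g. "+0000".
        # index() is a literal substring search, so "+" is not a regex operator here.
        file = substr($0, index($0, $8) + length($8) + 1)
        print $5 "\t" file
    }'
}

# Made-up sample line in the GNU `ls -l --full-time` layout
printf '%s\n' '-rw-r--r-- 1 user group 1234 2024-01-01 12:00:00.000000000 +0000 my file.txt' | parse_ls_line
```

Filenames containing spaces survive because the name is taken from the rest of the line rather than from a single whitespace-delimited field.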

I couldn't figure out the rest, starting with the line `sizes[$5]`, as I don't know `awk`. I'd appreciate any help: I'm currently looking for dupes with the `md5sum` approach from the Stack Exchange thread you referenced, and it's still running after a day on 1.3 TB of data.

Thanks in advance.
