Massive backup speed increase with optional bypass of 'existing' files in backup #5094

@jumper444

As I watch backups using Borg, it seems pretty apparent that the backup is laboriously going file by file to look for changes in data. I have a backup source of about 45 GB with 45,000 files, and a run takes 15 minutes or longer even though I often have only 200 or so new/changed files (100 MB or so) per backup.

Obviously, rigorously reading ALL files for changes will never miss anything (and for a critical backup program that is very important), but I would really like to see the following optional switch for the create command:

--quickcheck
(random idea for what to call switch)

When that switch is passed to create, borg will NOT process the data of every file being backed up. Instead it will read only the file attributes (as ls/dir would) and skip processing any file when:

  1. the archive already contains a file with the same name and same path, AND
  2. the file size is the same, AND
  3. the file date is the same
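
To make the three conditions above concrete, here is a minimal sketch in Python. It is not Borg's code or API: `ArchiveIndex`, `needs_processing` and `scan` are hypothetical names made up for illustration, and it assumes "file date" means the modification time reported by `os.stat`.

```python
import os
from typing import Dict, List, Tuple

# Hypothetical index of the previous archive (an assumption, not Borg's format):
# relative path -> (size in bytes, modification time in nanoseconds)
ArchiveIndex = Dict[str, Tuple[int, int]]


def needs_processing(root: str, rel_path: str, index: ArchiveIndex) -> bool:
    """Return True if the file's data must be read, False if it can be skipped.

    A file is skipped only when the index already has an entry for the same
    path AND the size matches AND the modification time matches.
    """
    st = os.stat(os.path.join(root, rel_path))
    # index.get() returns None for unknown paths, which never equals the tuple.
    return index.get(rel_path) != (st.st_size, st.st_mtime_ns)


def scan(root: str, index: ArchiveIndex) -> List[str]:
    """Walk root and list the relative paths that still need full processing."""
    changed = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            rel = os.path.relpath(os.path.join(dirpath, name), root)
            if needs_processing(root, rel, index):
                changed.append(rel)
    return sorted(changed)
```

Only the attribute lookup (`os.stat`) touches each file; file contents are read only for the paths that `scan` returns.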

For my backups, this switch would cut the backup time by 90% or more, to maybe a minute or two. Yes, there are cases where a file has the same name, location, size and date but has somehow changed, but they are extremely rare (unless malicious action is going on). Besides, that is why the switch would be optional: anyone who doesn't want to take that risk can leave it off.

Thoughts?
