tar export / import tuning ideas #6608

@ThomasWaldmann

idea from DiVOC borg session: speed-optimize tar pipe

Problem: for big repos, the tar pipe can take rather long:

  • tar-export reads the same content chunks again and again over a relatively slow connection from a remote repo
    • maybe use borg.remote.cache_if_remote and a big local persistent cache to avoid transferring the same chunks from the remote repeatedly.
  • the tar pipe should run between 2 borg processes on the same machine, because a lot of data flows through it!
  • tar-import does lots of chunking and hashing, but gets faster for the content of the 2nd and later archives due to dedup
    • it could additionally use a files cache like create does (for already "seen" and unmodified tarstream items)
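
To illustrate the local persistent cache idea for tar-export, here is a minimal sketch of a disk-backed read-through cache keyed by chunk id. It does not use borg's code: `PersistentChunkCache`, `fetch_remote` and `demo` are hypothetical names, and the "remote repo" is just a dict standing in for the slow connection.

```python
import hashlib
import tempfile
from pathlib import Path
from typing import Callable


class PersistentChunkCache:
    """Disk-backed read-through cache: chunk id -> chunk bytes (hypothetical helper)."""

    def __init__(self, cache_dir: Path, fetch_remote: Callable[[bytes], bytes]):
        self.cache_dir = Path(cache_dir)
        self.cache_dir.mkdir(parents=True, exist_ok=True)
        self.fetch_remote = fetch_remote  # assumed: the slow remote lookup
        self.remote_fetches = 0           # counts real remote transfers

    def _path(self, chunk_id: bytes) -> Path:
        # One file per chunk, named by the hex chunk id.
        return self.cache_dir / chunk_id.hex()

    def get(self, chunk_id: bytes) -> bytes:
        path = self._path(chunk_id)
        if path.exists():
            return path.read_bytes()      # cache hit: no remote transfer
        data = self.fetch_remote(chunk_id)
        self.remote_fetches += 1
        # Write to a temp file first, then rename, so a crash never
        # leaves a truncated chunk under its final name.
        tmp = path.with_suffix(".tmp")
        tmp.write_bytes(data)
        tmp.replace(path)
        return data


def demo() -> int:
    # Stand-in for a remote repo: chunk id -> chunk content.
    store = {hashlib.sha256(d).digest(): d for d in (b"alpha", b"beta")}
    with tempfile.TemporaryDirectory() as tmp:
        cache = PersistentChunkCache(Path(tmp), store.__getitem__)
        # Export two "archives" that reference the same chunks.
        for _archive in range(2):
            for cid in store:
                assert cache.get(cid) == store[cid]
        return cache.remote_fetches


if __name__ == "__main__":
    print("remote fetches:", demo())
```

In this sketch the second archive is served entirely from disk, so each chunk crosses the slow link only once. A real cache would also need a size limit and eviction, which are left out here.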

Note: an optimisation that sends chunk id lists over the tar pipe (instead of the content) requires the id hash algorithm and its secret, as well as the chunker secret, to be identical in both repos.
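
A small sketch of why the id hash secret matters, assuming an HMAC-SHA256 style keyed id hash. `chunk_id` and `ids_are_transferable` are hypothetical helpers for illustration, not borg functions, and the chunker secret (which influences where chunks are cut) is not modelled here.

```python
import hashlib
import hmac
from typing import Iterable


def chunk_id(id_key: bytes, data: bytes) -> bytes:
    # Keyed hash of the chunk content: same data, different key -> different id.
    return hmac.new(id_key, data, hashlib.sha256).digest()


def ids_are_transferable(key_a: bytes, key_b: bytes, chunks: Iterable[bytes]) -> bool:
    # A chunk id list from repo A is only meaningful in repo B
    # if both repos compute the same id for every chunk.
    return all(chunk_id(key_a, c) == chunk_id(key_b, c) for c in chunks)


if __name__ == "__main__":
    chunks = [b"some chunk", b"another chunk"]
    print(ids_are_transferable(b"same-key", b"same-key", chunks))
    print(ids_are_transferable(b"key-of-repo-a", b"key-of-repo-b", chunks))
```

With different id keys the ids do not match, so the importing side could not look up the chunks by id and would have to receive and hash the content itself.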
