In a project I am working on, I want the data in an S3 bucket to be available to pull locally while preserving the folder structure and the data exactly as they are in the bucket. This is because I also need to perform AWS operations on that data. Since I also need the data for local operations (and want to track any changes to it), DVC control would be beneficial.
Essentially, S3 would be the GitHub for the data, while DVC would be the Git for the data.
I apologize if this feature already exists. If so, could someone please point me to how to achieve this?
Thanks.