
@HafeezOJ HafeezOJ commented Jul 14, 2025

Description

This PR renames bags from the format [article_id]_[version]_[first_depositor_full_name]_[metadata_hash] to azu_[article_id]-[version]-[first_author_lastname]-[metadata_hash]_bagXofY_[YYYYMMDD]. Code that depends on the bag name (extraction of the metadata hash, item ID, version, name, and bag count) has been updated to decompose a bag name into its constituent components under the new format. The PR also adds more tags to the bag info and exposes more metadata in the Metadata class, so that these values can be extracted and assigned to the bag info tags easily.
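For reviewers, the decomposition of a new-format bag name can be sketched roughly as follows. This is a minimal illustration, not the code in Utils.py: the names `BAG_NAME_RE`, `BagNameParts`, and `parse_bag_name` are hypothetical, and the sketch assumes the article ID is numeric, the version contains no hyphen or underscore, and the metadata hash is alphanumeric.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Hypothetical pattern for:
#   azu_[article_id]-[version]-[first_author_lastname]-[metadata_hash]_bagXofY_[YYYYMMDD]
# Assumptions (not confirmed by the PR): numeric article ID, version without
# "-" or "_", alphanumeric metadata hash.
BAG_NAME_RE = re.compile(
    r"^(?P<prefix>[^_]+)_"
    r"(?P<article_id>\d+)-"
    r"(?P<version>[^-_]+)-"
    r"(?P<last_name>.+)-"  # greedy, so hyphenated last names stay intact
    r"(?P<metadata_hash>[0-9A-Za-z]+)"
    r"_bag(?P<bag_index>\d+)of(?P<bag_count>\d+)"
    r"_(?P<bag_date>\d{8})$"
)


@dataclass(frozen=True)
class BagNameParts:
    prefix: str
    article_id: str
    version: str
    last_name: str
    metadata_hash: str
    bag_index: int
    bag_count: int
    bag_date: str  # kept as the raw YYYYMMDD string


def parse_bag_name(name: str) -> Optional[BagNameParts]:
    """Split a new-format bag name into its components, or return None."""
    match = BAG_NAME_RE.match(name)
    if match is None:
        return None
    fields = match.groupdict()
    return BagNameParts(
        prefix=fields["prefix"],
        article_id=fields["article_id"],
        version=fields["version"],
        last_name=fields["last_name"],
        metadata_hash=fields["metadata_hash"],
        bag_index=int(fields["bag_index"]),
        bag_count=int(fields["bag_count"]),
        bag_date=fields["bag_date"],
    )
```

With these assumptions, `parse_bag_name("azu_12345-v01-Smith-abc123def_bag1of3_20250714")` yields the individual components, while a name in the old underscore-separated format does not match and returns `None`.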

See #117

Documentation Update

  • I have updated README.md and other relevant documentation

Implementation Notes

Changes were made to

  • Article.py: Article bags are now named according to the new format. bag_date and bag_count are included in the bag name of an article.
  • Collection.py: Collection bags are now named according to the new format. bag_date and bag_count are included in the bag name of a collection.
  • metadata.py: Sets the bag info tags. More tags (bag_count, last_name, bag_date) have been added to the tags available in the Metadata class.
  • bag.py: Creates bags using the workflow.
  • Utils.py: Houses all utility functions. Dedicated functions that extract the individual components of a bag name have been added.
  • default.example.toml: A Bag-Count tag has been added to the bag info, along with a placeholder for the last name from the Metadata class.
  • .env.sample.ini: A bag_name_prefix field has been added to the sample configuration file.
  • READMEs: The ReBACH and Bagger READMEs have been updated.

HafeezOJ added 29 commits July 8, 2025 10:09
…-{first author lastname}-{hash}_bag_YYYYMMDD
@HafeezOJ HafeezOJ self-assigned this Jul 14, 2025
@HafeezOJ HafeezOJ linked an issue Jul 14, 2025 that may be closed by this pull request
5 tasks
@HafeezOJ HafeezOJ requested a review from zoidy July 14, 2025 21:33
zoidy previously approved these changes Jul 16, 2025
@zoidy zoidy merged commit e384c84 into main Jul 16, 2025
1 check passed
@zoidy zoidy deleted the 117-make-bag-names-conform-to-format-to-format-of-bag-names-in-ual-preservation-workflow branch July 16, 2025 19:53
Development

Successfully merging this pull request may close these issues.

Make bag names conform to format of bag names in UAL preservation workflow

3 participants