reExportAll Improvements #12081

Open
qqmyers wants to merge 4 commits into IQSS:develop from GlobalDataverseCommunityConsortium:DANS-2139-reExportAll

qqmyers commented Jan 14, 2026

What this PR does / why we need it: This PR makes several improvements to the admin/metadata/reExportAll API call:

  • Fatal runtime problems, such as a problem with the database connection (seen at DANS), are now logged to the export log before the run terminates, making it easier to see when there has been a problem.
  • A new optional ?olderThan=<YYYY-MM-DD> parameter allows incremental reExportAll: running the re-export in smaller chunks, re-exporting only datasets last updated before some exporter change occurred, or restarting after a failure (use olderThan=)
  • The overall exportAllDatasets method is now explicitly marked as not supporting a transaction, to avoid pulling all datasets into one transaction (exportAllFormatsInNewTransaction(dataset) adds datasets to individual transactions as they are processed).
  • Unused fallback code that would write exportLogger info to the main server.log file has been removed.
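
To illustrate the olderThan selection and the fatal-error logging described in the list above, here is a minimal sketch. It is not the code in this PR: `ReExportSketch`, `DatasetInfo`, `parseOlderThan`, `selectForReExport`, and `runExport` are hypothetical names, and an in-memory list stands in for the database. It also assumes that the cutoff is compared against each dataset's last export date, that "older than" means strictly before the given date, and that datasets that were never exported are always selected.

```java
import java.time.LocalDate;
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical sketch; none of these names come from the Dataverse code base.
final class ReExportSketch {

    // Stand-in for a dataset row: id plus the date of its last export (null = never exported).
    static final class DatasetInfo {
        final long id;
        final LocalDate lastExport;

        DatasetInfo(long id, LocalDate lastExport) {
            this.id = id;
            this.lastExport = lastExport;
        }
    }

    // Parses an olderThan value in YYYY-MM-DD form; null or blank means "no cutoff".
    static LocalDate parseOlderThan(String value) {
        if (value == null || value.trim().isEmpty()) {
            return null;
        }
        // LocalDate.parse expects ISO-8601 (yyyy-MM-dd) and throws DateTimeParseException otherwise.
        return LocalDate.parse(value.trim());
    }

    // Assumption: select datasets never exported, or last exported strictly before the cutoff.
    static List<DatasetInfo> selectForReExport(List<DatasetInfo> all, LocalDate cutoff) {
        List<DatasetInfo> selected = new ArrayList<>();
        for (DatasetInfo d : all) {
            if (cutoff == null || d.lastExport == null || d.lastExport.isBefore(cutoff)) {
                selected.add(d);
            }
        }
        return selected;
    }

    // Exports datasets one at a time; a fatal runtime error is written to the
    // export log before it is rethrown and the run stops.
    static int runExport(List<DatasetInfo> selected, Consumer<DatasetInfo> exporter, Logger exportLog) {
        int exported = 0;
        try {
            for (DatasetInfo d : selected) {
                exporter.accept(d);
                exported++;
            }
        } catch (RuntimeException e) {
            exportLog.log(Level.SEVERE, "Fatal error after " + exported + " datasets; stopping run", e);
            throw e;
        }
        return exported;
    }
}
```

Under these assumptions, rerunning with the same cutoff after a failure skips the datasets already re-exported in the failed run, because their last export date is no longer before the cutoff.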

Which issue(s) this PR closes:

  • Closes #

Special notes for your reviewer: Most of the changes are to error handling, and testing the olderThan date param would require runs spread over multiple days, so I don't see any new tests to add. The existing tests should provide regression coverage.

Suggestions on how to test this: Rely on the DatasetsIT tests for regression testing, and test performance on large databases (hopefully, not having a single overall transaction will noticeably improve memory use/performance). The new olderThan param can be tested on an existing db with a range of lastExport dates. For error handling, the only thing I can think to try (if desired) is to kill the database during a reExportAll run and verify that the export log records the problem. (I assume the db and Payara would have to be restarted after that.)
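
For manual testing, a call with the new parameter could be issued along these lines. This is a sketch under assumptions, not a documented client: `ReExportAllCall`, `buildUri`, and `call` are hypothetical names, the base URL is whatever your installation uses, and the `/api` prefix and GET method are assumed to match how the existing admin/metadata/reExportAll endpoint is normally invoked.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.LocalDate;

// Hypothetical helper for exercising reExportAll with the olderThan parameter.
final class ReExportAllCall {

    // Builds the request URI; a null cutoff omits the olderThan parameter entirely.
    static URI buildUri(String baseUrl, LocalDate olderThan) {
        // Assumption: the endpoint lives under /api on the given base URL.
        String uri = baseUrl + "/api/admin/metadata/reExportAll";
        if (olderThan != null) {
            // LocalDate.toString() yields ISO-8601 yyyy-MM-dd, matching <YYYY-MM-DD>.
            uri += "?olderThan=" + olderThan;
        }
        return URI.create(uri);
    }

    // Sends the request and returns the HTTP status code (assumes the endpoint accepts GET).
    static int call(URI uri) throws IOException, InterruptedException {
        HttpRequest request = HttpRequest.newBuilder(uri).GET().build();
        HttpResponse<String> response =
                HttpClient.newHttpClient().send(request, HttpResponse.BodyHandlers.ofString());
        return response.statusCode();
    }
}
```

For example, `ReExportAllCall.call(ReExportAllCall.buildUri("http://localhost:8080", LocalDate.of(2026, 1, 14)))` would target a local test instance; the host and port here are placeholders.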

Does this PR introduce a user interface change? If mockups are available, please link/include them here:

Is there a release notes update needed for this change?:

Additional documentation:

qqmyers added the GDCC: DANS (related to GDCC work for DANS) and Size: 3 (a percentage of a sprint; 2.1 hours) labels Jan 14, 2026
qqmyers added this to the 6.10 milestone Jan 14, 2026
cmbz moved this to Ready for Review ⏩ in IQSS Dataverse Project Feb 11, 2026
cmbz added the FY26 Sprint 17 (2026-02-11 - 2026-02-25) label Feb 11, 2026
stevenwinship self-assigned this Feb 17, 2026
stevenwinship moved this from Ready for Review ⏩ to In Review 🔎 in IQSS Dataverse Project Feb 17, 2026
qqmyers force-pushed the DANS-2139-reExportAll branch from 5b44ef8 to 4162bae February 17, 2026 15:51
github-project-automation bot moved this from In Review 🔎 to Ready for QA ⏩ in IQSS Dataverse Project Feb 17, 2026
stevenwinship removed their assignment Feb 17, 2026