Skip to content

docs: clarify v2.2 nested drop rollback risk#5999

Merged
Xuanwo merged 2 commits intomainfrom
xuanwo/docs-v22-nested-drop-risk
Feb 24, 2026
Merged

docs: clarify v2.2 nested drop rollback risk#5999
Xuanwo merged 2 commits intomainfrom
xuanwo/docs-v22-nested-drop-risk

Conversation

@Xuanwo
Copy link
Copy Markdown
Collaborator

@Xuanwo Xuanwo commented Feb 24, 2026

  • add a warning that drop_columns is metadata-only but data can become unrecoverable after compact_files + cleanup_old_versions
  • add operational guidance for rollback windows (tag/snapshot, delayed cleanup, validation before aggressive cleanup)

Parts of this PR were drafted with assistance from Codex (with gpt-5.3-codex) and fully reviewed and edited by me. I take full responsibility for all changes.

@github-actions github-actions Bot added the documentation Improvements or additions to documentation label Feb 24, 2026
@github-actions
Copy link
Copy Markdown
Contributor

Code Review

Summary: Documentation-only change adding clarification about v2.2 nested sub-column drops and operational guidance for rollback windows.

Verification

I verified the v2.2 claim against the test code in rust/lance/src/dataset/schema_evolution.rs:1400-1429. The tests confirm:

  • Pre-v2.2 versions (Legacy, V2_0, V2_1): dropping a nested sub-column from list(struct) removes the entire column
  • v2.2: dropping a nested sub-column preserves the parent structure with remaining fields

Review Result

No P0/P1 issues found. The documentation is factually accurate and the warning about data becoming unrecoverable after compact_files() + cleanup_old_versions() is appropriate operational guidance.

LGTM ✓

Copy link
Copy Markdown
Member

@westonpace westonpace left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One minor nit, good otherwise

Comment thread docs/src/guide/data_evolution.md Outdated
Comment on lines +161 to +163
Starting with Lance file format `2.2`, nested sub-column removal is supported for
nested types (for example `people.item.city` on `list<struct<...>>`), instead of
being limited to `struct` only.
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Minor nit: could you move this paragraph above the previous one? Right now this breaks up the structure a little...

  • To actually cleanup use cleanup_old_versions
  • Did you know, in newer versions, you can now delete nested subfields
  • Warning! If you cleanup, you might delete actual data and make rollback impossible

Copy link
Copy Markdown
Collaborator Author

@Xuanwo Xuanwo Feb 24, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed.

@Xuanwo Xuanwo merged commit 693f394 into main Feb 24, 2026
6 checks passed
@Xuanwo Xuanwo deleted the xuanwo/docs-v22-nested-drop-risk branch February 24, 2026 17:11
wjones127 pushed a commit to wjones127/lance that referenced this pull request Feb 25, 2026
- add a warning that `drop_columns` is metadata-only but data can become
unrecoverable after `compact_files` + `cleanup_old_versions`
- add operational guidance for rollback windows (tag/snapshot, delayed
cleanup, validation before aggressive cleanup)

---

**Parts of this PR were drafted with assistance from Codex (with
`gpt-5.3-codex`) and fully reviewed and edited by me. I take full
responsibility for all changes.**
wjones127 pushed a commit to wjones127/lance that referenced this pull request Feb 25, 2026
- add a warning that `drop_columns` is metadata-only but data can become
unrecoverable after `compact_files` + `cleanup_old_versions`
- add operational guidance for rollback windows (tag/snapshot, delayed
cleanup, validation before aggressive cleanup)

---

**Parts of this PR were drafted with assistance from Codex (with
`gpt-5.3-codex`) and fully reviewed and edited by me. I take full
responsibility for all changes.**
wjones127 pushed a commit that referenced this pull request Feb 26, 2026
- add a warning that `drop_columns` is metadata-only but data can become
unrecoverable after `compact_files` + `cleanup_old_versions`
- add operational guidance for rollback windows (tag/snapshot, delayed
cleanup, validation before aggressive cleanup)

---

**Parts of this PR were drafted with assistance from Codex (with
`gpt-5.3-codex`) and fully reviewed and edited by me. I take full
responsibility for all changes.**
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants