refactor: remove not used storage class and blob dataset #5131

Merged: Xuanwo merged 5 commits into main from remove-not-used-storage-class on Nov 4, 2025
@Xuanwo (Collaborator) commented Nov 4, 2025

This PR removes the unused storage class and blob dataset to make our logic cleaner and easier to follow. That’s especially important now that we are working on blob v2 support.

This PR includes breaking changes at the API level, but since we never used a blob dataset before, it won’t break users’ existing datasets.


This PR was primarily authored with Codex using GPT-5-Codex and then hand-reviewed by me. I AM responsible for every change made in this PR. I aimed to keep it aligned with our goals, though I may have missed minor issues. Please flag anything that feels off, and I'll fix it quickly.

@chatgpt-codex-connector (Bot) left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

Comment thread rust/lance-table/src/format/manifest.rs
@westonpace (Member) left a comment

Thanks for cleaning out this technical debt. I think we could maybe even get rid of more stuff. I'm not sure we have to worry about removing field metadata keys or migrating old use to the file version of blob data.

Looks like CI is not quite passing but once it is green I think this is good to go.

Comment thread protos/file.proto
Comment thread rust/lance-core/src/datatypes/field.rs Outdated
Comment thread rust/lance-core/src/datatypes/field.rs Outdated
Comment thread rust/lance-core/src/datatypes/field.rs Outdated
Comment thread rust/lance-core/src/datatypes/schema.rs Outdated
Comment thread rust/lance-file/src/datatypes.rs Outdated
Signed-off-by: Xuanwo <github@xuanwo.io>
@Xuanwo force-pushed the remove-not-used-storage-class branch from 5caccda to 9817357 on November 4, 2025 17:21
@codecov-commenter

Codecov Report

❌ Patch coverage is 97.82609% with 2 lines in your changes missing coverage. Please review.
✅ Project coverage is 81.85%. Comparing base (b229e47) to head (9817357).
⚠️ Report is 5 commits behind head on main.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| rust/lance/src/dataset/write/insert.rs | 90.90% | 0 Missing and 1 partial ⚠️ |
| rust/lance/src/io/commit.rs | 50.00% | 0 Missing and 1 partial ⚠️ |

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #5131      +/-   ##
==========================================
+ Coverage   81.73%   81.85%   +0.12%     
==========================================
  Files         341      341              
  Lines      140755   140506     -249     
  Branches   140755   140506     -249     
==========================================
- Hits       115044   115017      -27     
+ Misses      21894    21683     -211     
+ Partials     3817     3806      -11     

| Flag | Coverage Δ |
|---|---|
| unittests | 81.85% <97.82%> (+0.12%) ⬆️ |
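
The percentages in the coverage diff follow from the Hits and Lines rows. Here is a minimal Python sketch of that arithmetic. It assumes that Codecov truncates percentages to two decimals rather than rounding to nearest; this is consistent with the report showing 97.82% for a patch coverage of 97.82609%, but it is not stated in the report itself. The helper name `coverage_pct` is hypothetical.

```python
import math


def coverage_pct(hits: int, lines: int) -> float:
    """Coverage percentage truncated (not rounded) to two decimals.

    Truncation is an assumption about how the report displays values.
    """
    return math.floor(hits / lines * 10_000) / 100


# Hits and Lines taken from the "main" and "#5131" columns of the diff.
base = coverage_pct(115044, 140755)
head = coverage_pct(115017, 140506)
delta = round(head - base, 2)

print(f"base={base}% head={head}% delta={delta:+}%")
```

Under that truncation assumption, the sketch gives the same base, head, and delta figures as the report. Rounding to nearest would instead give 81.86% for the head commit.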

@Xuanwo merged commit 79d2a35 into main on Nov 4, 2025
30 of 31 checks passed
@Xuanwo deleted the remove-not-used-storage-class branch on November 4, 2025 18:34
jackye1995 pushed a commit to jackye1995/lance that referenced this pull request Jan 21, 2026
…t#5131)