refactor: rename RowIdSelection to RowAddrSelection#5263
refactor: rename RowIdSelection to RowAddrSelection#5263yanghua merged 2 commits intolance-format:mainfrom
Conversation
|
Try reopen PR to fix CI |
f10a1ce to
cb9e613
Compare
| /// fragment k is selected. If there is a pair (k, Partial(v)) then the | ||
| /// fragment k has the selected rows in v. | ||
| inner: BTreeMap<u32, RowIdSelection>, | ||
| inner: BTreeMap<u32, RowAddrSelection>, |
There was a problem hiding this comment.
Why is this appropriate? The docs above say that it can be row ids. Do stable row ids not use this structure at all?
lance/rust/lance-core/src/utils/mask.rs
Lines 365 to 368 in cb9e613
There was a problem hiding this comment.
Will change related comments when doing RowIdTreeMap -> RowAddrMap in #5266 .
| /// fragment k is selected. If there is a pair (k, Partial(v)) then the | ||
| /// fragment k has the selected rows in v. | ||
| inner: BTreeMap<u32, RowIdSelection>, | ||
| inner: BTreeMap<u32, RowAddrSelection>, |
There was a problem hiding this comment.
I think for this one we should use the term row ID? Because this is more used to accommodate stable row ID, and for row address technically directly use RoaringTreemap?
There was a problem hiding this comment.
It seems no matter RowIdSelection nor RowAddrSelection can cover the two semantics of row ID and row address. There is a full plan in my mind(sorry, did not write it down before):
It would be better to split the responsibility of the struct for distinguishing row ID(stable) and row address.
-
for row address:
RowIdTreeMap->RowAddrMap(BTreeMap<u32, RowAddrSelection>)RowIdSelection→RowAddrSelection(Full | Partial(RoaringBitmap))SearchResult(scalar.rs) →RowAddrSearchResult(Exact(RowAddrMap) | AtMost(RowAddrMap))RowIdMask→RowAddrMask
-
for (stable) row id:
- add rowid::
RowIdSet(RoaringTreemap,64-bit set) - add rowid::
RowIdMask - add
RowIdResolver(row_id -> row_addr)
- add rowid::
-
for both:
- add
RowSetOpsforRowIdSetandRowAddrMap
- add
WDYT? cc @wjones127
There was a problem hiding this comment.
That's okay. Could you put this design in an issue?
Remember that you're collaborating with other developers and they need to be able to follow what you are doing.
There was a problem hiding this comment.
Thanks for the suggestion. Have filed a ticket: #5326
d99959e to
41df6f4
Compare
41df6f4 to
e2dcf67
Compare
Codecov Report❌ Patch coverage is
📢 Thoughts on this report? Let us know! |
jackye1995
left a comment
There was a problem hiding this comment.
overall looks good to me, 2 minor comments
As a step to distinguish the row ID and the row address, we should rename
RowIdSelectiontoRowAddrSelection. To show its effect more clearly.