Index merging without garbage

### Current state

Currently the data in several partial (or just one - for transformations) indexes is transformed during merged in the following way:

0) Iterator < TimeAndDims + Object[] metrics (entry in `IncrementalIndex`) >
--> sorting dimension value indexed, aka unsortedToSorted
1) Iterator < Rowboat (Object[] dims, Object[] metrics) >
--> optionally, reordering dims
2) Iterator < Rowboat (Object[] dims, Object[] metrics) > 
   // here array elements are the same objects as at the previous step, but `Object[]` arrays are new, if reordering or dims and/or metrics is actually required <br>
--> another one reindexing, based on merged dictionary
3) Iterator < Rowboat (Object[] dims, Object[] metrics) >
--> final merge.

Here, `Object[]` elements are either `int[]` (DimensionSelector), `Long`, `Double` or `Float` (numeric ColumnValueSelectors, correspondingly).

So in the process of merge, each entry generates 2-3 extra `Rowboat` objects, 4-7 new `Object[]` arrays, and N (the number of string dimensions) * 2 new `int[]` arrays, and new boxed primitive objects, if merging is done with `QueryableIndex` as a source.

### Garbage-free approach

`Rowboat` contains an array of *ColumnValueSelector* objects, representing the stream of dimensions, and another array of ColumnValueSelector objects, representing a stream of metrics, both "under cursor". When `QueryableIndex`is used as source for merging, the existing `Cursor` and `ColumnValueSelectorFactory` infrastructure is reused with minimal modifications.

0->1 and 2-3 conversions, as described above, implemented as ColumnValueSelector transformations, without creating new arrays, boxed primitives, etc. 1->2 transformation is essentially a no-op: create a Rowboat object with array of ColumnValueSelectors, ordered differently.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Index merging without garbage #4622

Current state

Garbage-free approach

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Index merging without garbage #4622

Description

Current state

Garbage-free approach

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions