Usecases: Efficiently process large columns of low cardinality Strings
- BatchIterator should accept both DictionaryBatch and RecordBatch
- Type Coercion optimizer rule should inject expression for converting dictionary value types to index types (for equality expressions, and IN(values, ...)
- Physical expression would lookup index for dictionary values referenced in the query so that at runtime, only indices are being compared per batch
Reporter: Andy Grove / @andygrove
Related issues:
Note: This issue was originally created as ARROW-8464. Please see the migration documentation for further details.