Currently we cache the repetition index for small data types (mini block layout) and pay the 2-IOP cost for large data types (full zip). If the data is in remote storage and users have enough RAM then it may make sense to cache the repetition index in all scenarios.
Currently we cache the repetition index for small data types (mini block layout) and pay the 2-IOP cost for large data types (full zip). If the data is in remote storage and users have enough RAM then it may make sense to cache the repetition index in all scenarios.