Feature Request / Improvement
Currently (Iceberg 0.14.1), Iceberg supports bucketing on only a single data column, i.e. bucket(N, col).
Bucketing is a very important feature in Iceberg: it lets the engine filter out data files and narrow down the set of files needed to answer a query. Lookups by primary key are a major query pattern, so bucketing on the primary key speeds up those queries. In many cases, however, the primary key is made up of multiple columns, so supporting bucketing on multiple columns would significantly improve primary-key-based queries.
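To illustrate the idea, here is a minimal sketch of how a bucket over several columns could prune files for a composite-key lookup. It is not Iceberg's implementation: the function `bucket_multi`, the way the per-column hashes are combined, and the use of `zlib.crc32` as the hash are all assumptions made for this example (Iceberg's single-column bucket transform uses a 32-bit Murmur3 hash).

```python
import zlib


def bucket_multi(num_buckets: int, *key_parts) -> int:
    """Hypothetical multi-column bucket: hash each key part, combine, take modulo.

    zlib.crc32 is only a stand-in hash to keep the sketch dependency-free.
    """
    combined = 0
    for part in key_parts:
        part_hash = zlib.crc32(repr(part).encode("utf-8"))
        # Combine the per-column hashes and keep the result non-negative.
        combined = (31 * combined + part_hash) & 0x7FFFFFFF
    return combined % num_buckets


NUM_BUCKETS = 16

# Simulate a table whose primary key is (tenant_id, order_id).
rows = [(tenant, order) for tenant in ("a", "b", "c") for order in range(100)]

# Each bucket stands in for the group of data files written for that bucket.
files = {}
for row in rows:
    files.setdefault(bucket_multi(NUM_BUCKETS, *row), []).append(row)


def lookup(tenant_id, order_id):
    """Answer a primary-key query by scanning only the matching bucket."""
    target = bucket_multi(NUM_BUCKETS, tenant_id, order_id)
    return [r for r in files.get(target, []) if r == (tenant_id, order_id)]


print(lookup("b", 42))
```

With a bucket on only one of the key columns, the same lookup could prune on that column alone; bucketing on the full key lets an equality predicate on all key columns map to a single bucket.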
Query engine
Spark