Skip to content

Question: Intergration Plan/Solution for Lance with Iceberg #13438

@littleDrew

Description

@littleDrew

Query engine

Spark engine

Question

hi @pvary, it seems that you are working intergrate lance into iceberg, can you show the detail implementation plan/solution.

Here i want to have a discuss or know previous analysis about select which way to intergrate with iceberg:

1st way: use lance file format to intergrate with iceberg, thus like following implementation by @westonpace
2nd way: use lance table format to intergrate with iceberg
Here i thought, considering about performance issue, it seems the 2st way(use lance table format) will be better for good performance

Thus want to have a dissuss as for implementation way. if use 1st way, can we achive good performance, or have we evalute the pros and cons of both implementation way.

public enum FileFormat {
PUFFIN("puffin", false),
ORC("orc", true),
PARQUET("parquet", true),
AVRO("avro", true),
LANCE("lance", true) // add lance as an file format of iceberg instead of table format
}
westonpace@84bf5c5

Hope we have a disscus for this together, as i also have noticed lance have a good performance in proccessing/reading multi-dimentional data. @electrum @martint @massdosage @sullis

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requestedstale

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions