Closed
Labels
enhancement (New feature or request)
Description
Search before asking
- I searched in the issues and found nothing similar.
Motivation
Currently, when reading a Paimon table, the computation engine frequently reads the schema file and the sizes of the data files from the filesystem. This leads to additional filesystem accesses. The purpose of this issue is to reduce them by caching the schema, the Paimon table, and so on.
Solution
- Hive:
  - Cache the table in the Split to avoid loading the Paimon table when creating the reader. PR: [Hive]Pass Table into Split to Avoid Loading from FileSystem when createRecordReader #2927
  - Cache the TableSchema in the Configuration to avoid reading the schema file in PaimonSerDe. PR: [Hive] Cache TableSchema into Configuration to avoid loading read scheme file in PaimonSerDe #2946
- Core:
  - Obtain the size of the data file from the metadata. PR: [core] orc/parquet reader obtain the fileSize from metadata #2918
  - Cache the schema in SchemaManager. PR: [Core]add table schema cache for SchemaManager #2939
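To illustrate the schema-caching idea from the Core item, here is a minimal sketch, not the code from the linked PRs. The class `SchemaCacheSketch`, its `schema(long)` method, and the string stand-in for a schema are all hypothetical names introduced for this example. It assumes that a schema version, once written, is never modified, so a cached entry never needs invalidation.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Function;

/** Hypothetical illustration of a per-id schema cache; not Paimon's SchemaManager. */
final class SchemaCacheSketch {

    // Cached schemas keyed by schema id. Assumes a schema version is immutable once written.
    private final Map<Long, String> cache = new ConcurrentHashMap<>();

    // Stand-in for the expensive step: reading the schema file from the filesystem.
    private final Function<Long, String> loader;

    SchemaCacheSketch(Function<Long, String> loader) {
        this.loader = loader;
    }

    /** Returns the schema for the given id, invoking the loader only on a cache miss. */
    String schema(long id) {
        return cache.computeIfAbsent(id, loader);
    }

    public static void main(String[] args) {
        AtomicInteger reads = new AtomicInteger();
        SchemaCacheSketch schemas = new SchemaCacheSketch(id -> {
            reads.incrementAndGet(); // counts simulated filesystem reads
            return "schema-" + id;
        });

        schemas.schema(0L);
        schemas.schema(0L); // served from the cache, no second read
        schemas.schema(1L);

        System.out.println("filesystem reads: " + reads.get()); // prints "filesystem reads: 2"
    }
}
```

In this sketch, repeated lookups of the same schema id cost one simulated read in total. A real implementation would also need to decide how to discover the latest schema id, which this example does not cover.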
Anything else?
No response
Are you willing to submit a PR?
- I'm willing to submit a PR!
Zouxxyy