@wg1026688210 wg1026688210 commented Feb 28, 2024

Purpose

To reduce FileSystem accesses, the ORC/Parquet format readers now use the fileSize recorded in metadata instead of asking the file system for it.

Linked issue: related #2978
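
For readers of this thread, the idea can be sketched roughly as follows. ORC and Parquet keep their footer at the end of the file, so a reader needs the file length before it can locate it. This is only an illustration, not the code in this PR: `CountingFileIO`, `footerOffsetWithLookup` and `footerOffsetWithKnownSize` are hypothetical names, and the in-memory map stands in for a real file system.

```java
import java.util.HashMap;
import java.util.Map;

public class FileSizeFromMetadataSketch {

    /** Hypothetical stand-in for a file system client; it counts size lookups. */
    static class CountingFileIO {
        private final Map<String, Long> sizes = new HashMap<>();
        int sizeLookups = 0;

        void put(String path, long size) {
            sizes.put(path, size);
        }

        /** Simulates a (possibly remote) file status call. */
        long getFileSize(String path) {
            sizeLookups++;
            Long size = sizes.get(path);
            if (size == null) {
                throw new IllegalArgumentException("No such file: " + path);
            }
            return size;
        }
    }

    /** Locates the footer by asking the file system for the length: one extra call per file. */
    static long footerOffsetWithLookup(CountingFileIO io, String path, int footerLength) {
        return io.getFileSize(path) - footerLength;
    }

    /** Locates the footer from the length already recorded in metadata: no file system call. */
    static long footerOffsetWithKnownSize(long fileSizeFromMetadata, int footerLength) {
        return fileSizeFromMetadata - footerLength;
    }

    public static void main(String[] args) {
        CountingFileIO io = new CountingFileIO();
        io.put("data-0.orc", 1000L);

        long viaLookup = footerOffsetWithLookup(io, "data-0.orc", 8);
        long viaMetadata = footerOffsetWithKnownSize(1000L, 8);

        System.out.println("offsets equal: " + (viaLookup == viaMetadata));
        System.out.println("size lookups issued: " + io.sizeLookups);
    }
}
```

Both paths arrive at the same offset, but only the first one touches the file system, which is the access this change aims to avoid.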

Tests

KeyValueFileReadWriteTest#testReaderUseFileSizeFromMetadata

API and Format

Documentation

@wg1026688210 wg1026688210 changed the title from "[core] orc/parquet use the fileSize which metadata record to reduce the access of fileSystem" to "[core] orc/parquet use the fileSize which metadata record to reduce the access of FileSystem" Feb 28, 2024
@wg1026688210 wg1026688210 changed the title from "[core] orc/parquet use the fileSize which metadata record to reduce the access of FileSystem" to "[core] orc/parquet use the fileSize which metadata record" Mar 4, 2024
@wg1026688210 wg1026688210 changed the title from "[core] orc/parquet use the fileSize which metadata record" to "[core] orc/parquet reader obtain the fileSize from metadata" Mar 13, 2024
/** A factory to create {@link RecordReader} for file. */
public interface FormatReaderFactory extends Serializable {

    RecordReader<InternalRow> createReader(FileIO fileIO, Path file) throws IOException;
Contributor

Can you introduce a Context for FormatReaderFactory?

Contributor Author

done
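
One way the requested Context could look is sketched below. This is an assumption-laden illustration rather than the merged code: the nested `Context` interface, its accessor names, `SimpleContext` and `demo` are hypothetical, and `FileIO` and `RecordReader` are reduced to small placeholders (with a `String` path) so the example compiles on its own.

```java
import java.io.IOException;
import java.io.Serializable;
import java.io.UncheckedIOException;

public class ReaderContextSketch {

    /** Placeholder for the real FileIO; only the call this sketch cares about. */
    interface FileIO {
        long getFileSize(String path) throws IOException;
    }

    /** Placeholder for RecordReader<InternalRow>; exposes the size it was opened with. */
    interface RecordReader {
        long fileSize();
    }

    /** Factory taking a single Context instead of a growing parameter list. */
    interface FormatReaderFactory extends Serializable {
        RecordReader createReader(Context context) throws IOException;

        /** Bundles per-file arguments so new ones can be added without changing the signature. */
        interface Context {
            FileIO fileIO();

            String filePath();

            long fileSize();
        }
    }

    /** Plain value holder implementing the hypothetical Context. */
    static final class SimpleContext implements FormatReaderFactory.Context {
        private final FileIO fileIO;
        private final String filePath;
        private final long fileSize;

        SimpleContext(FileIO fileIO, String filePath, long fileSize) {
            this.fileIO = fileIO;
            this.filePath = filePath;
            this.fileSize = fileSize;
        }

        @Override
        public FileIO fileIO() {
            return fileIO;
        }

        @Override
        public String filePath() {
            return filePath;
        }

        @Override
        public long fileSize() {
            return fileSize;
        }
    }

    /** Opens a reader with a size taken from metadata; the FileIO fails if it is ever asked. */
    static long demo(long fileSizeFromMetadata) {
        FileIO failingIo = path -> {
            throw new IOException("file system should not be queried for " + path);
        };
        // A toy factory whose reader simply remembers the size found in the context.
        FormatReaderFactory factory = context -> () -> context.fileSize();
        try {
            return factory
                    .createReader(new SimpleContext(failingIo, "data-0.parquet", fileSizeFromMetadata))
                    .fileSize();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println("reader opened with size " + demo(4096L));
    }
}
```

Passing a single context object keeps the factory signature stable: a later change that needs another per-file value only adds an accessor to the context instead of touching every implementation's parameter list.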

@wg1026688210 wg1026688210 marked this pull request as ready for review March 19, 2024 03:48
@JingsongLi JingsongLi left a comment

Looks good to me! Thanks @wg1026688210

@JingsongLi JingsongLi merged commit 7175bd8 into apache:master Mar 19, 2024
zhu3pang pushed a commit to zhu3pang/incubator-paimon that referenced this pull request Mar 29, 2024