-
Notifications
You must be signed in to change notification settings - Fork 408
doc: Update Iceberg cache documentation for clarity and detail #3277
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
…ng strategies and best practices
- Add new section 'Iceberg Metadata Cache Enhancements (Since 4.0.3)' - Update Manifest Cache default to disabled (false) - Clarify that Manifest Cache does not affect data visibility - Restore original simple sections for Iceberg Table Information and Snapshot - Update disable cache instructions to distinguish 4.0.3+ from earlier versions Applied same changes as master branch to version-4.x documentation.
|
|
||
| 该缓存由 `IcebergMetadataCache` 维护,每个 Iceberg Catalog 都有自己独立的实例,包含 `tableCache` 和 `viewCache` 两个缓存。 | ||
|
|
||
| 缓存的表对象(`IcebergTableCacheValue`)中还包含 Snapshot 信息,该信息按需懒加载(主要用于 MTMV 场景)。 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| 缓存的表对象(`IcebergTableCacheValue`)中还包含 Snapshot 信息,该信息按需懒加载(主要用于 MTMV 场景)。 | |
| 缓存的表对象(`IcebergTableCacheValue`)中还包含 Snapshot 信息,该信息按需懒加载(主要用于多表物化视图场景)。 |
| - **Partition(分区)**:分区信息使用缓存的表对象的元数据(分区规范、快照)加载。缓存越旧,分区信息越滞后。 | ||
|
|
||
| :::tip | ||
| 要实时看到最新的 Schema、Snapshot 和 Partition 信息,需要禁用表缓存,设置 `iceberg.table.meta.cache.ttl-second=0`。Schema 缓存不影响使用的版本——它只是为了性能缓存已解析的结果。 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Schema 缓存不影响使用的版本是啥意思?
|
|
||
| - **最短刷新时间** | ||
|
|
||
| 由 FE 配置项 `external_cache_refresh_time_minutes` 控制,单位为分钟。默认为 10 分钟。这是异步刷新,不会阻塞当前操作。 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这个参数如果是0,和上面的参数有什么关系?
| 该缓存存储的是**已解析的** Iceberg Manifest 文件内容——具体是从 Manifest 文件中提取的 `DataFile` 和 `DeleteFile` 对象(而不是原始文件字节): | ||
|
|
||
| - `DataFile` 对象:文件元数据,包括路径、分区值、统计信息等 | ||
| - `DeleteFile` 对象:Equality Delete 的删除元数据 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
posistion delete有么?
| CREATE CATALOG iceberg_catalog PROPERTIES ( | ||
| 'type' = 'iceberg', | ||
| ... | ||
| 'iceberg.manifest.cache.enable' = 'true', -- 启用 Doris Manifest Cache |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| 'iceberg.manifest.cache.enable' = 'true', -- 启用 Doris Manifest Cache | |
| 'iceberg.manifest.cache.enable' = 'true', -- 启用 Iceberg Manifest Cache |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这个名字和原生的太像了,最好换一个
relate prs:
Versions
Languages
Docs Checklist