Skip to content

Conversation

@suxiaogang223
Copy link
Contributor

@suxiaogang223 suxiaogang223 commented Jan 12, 2026

suxiaogang223 and others added 5 commits January 12, 2026 16:27
- Add new section 'Iceberg Metadata Cache Enhancements (Since 4.0.3)'
- Update Manifest Cache default to disabled (false)
- Clarify that Manifest Cache does not affect data visibility
- Restore original simple sections for Iceberg Table Information and Snapshot
- Update disable cache instructions to distinguish 4.0.3+ from earlier versions

Applied same changes as master branch to version-4.x documentation.

该缓存由 `IcebergMetadataCache` 维护,每个 Iceberg Catalog 都有自己独立的实例,包含 `tableCache``viewCache` 两个缓存。

缓存的表对象(`IcebergTableCacheValue`)中还包含 Snapshot 信息,该信息按需懒加载(主要用于 MTMV 场景)。
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
缓存的表对象(`IcebergTableCacheValue`)中还包含 Snapshot 信息,该信息按需懒加载(主要用于 MTMV 场景)。
缓存的表对象(`IcebergTableCacheValue`)中还包含 Snapshot 信息,该信息按需懒加载(主要用于多表物化视图场景)。

- **Partition(分区)**:分区信息使用缓存的表对象的元数据(分区规范、快照)加载。缓存越旧,分区信息越滞后。

:::tip
要实时看到最新的 Schema、Snapshot 和 Partition 信息,需要禁用表缓存,设置 `iceberg.table.meta.cache.ttl-second=0`。Schema 缓存不影响使用的版本——它只是为了性能缓存已解析的结果。
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Schema 缓存不影响使用的版本是啥意思?


- **最短刷新时间**

由 FE 配置项 `external_cache_refresh_time_minutes` 控制,单位为分钟。默认为 10 分钟。这是异步刷新,不会阻塞当前操作。
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个参数如果是0,和上面的参数有什么关系?

该缓存存储的是**已解析的** Iceberg Manifest 文件内容——具体是从 Manifest 文件中提取的 `DataFile``DeleteFile` 对象(而不是原始文件字节):

- `DataFile` 对象:文件元数据,包括路径、分区值、统计信息等
- `DeleteFile` 对象:Equality Delete 的删除元数据
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

posistion delete有么?

CREATE CATALOG iceberg_catalog PROPERTIES (
'type' = 'iceberg',
...
'iceberg.manifest.cache.enable' = 'true', -- 启用 Doris Manifest Cache
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
'iceberg.manifest.cache.enable' = 'true', -- 启用 Doris Manifest Cache
'iceberg.manifest.cache.enable' = 'true', -- 启用 Iceberg Manifest Cache

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个名字和原生的太像了,最好换一个

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants