From 19c07e948560a924f8063fa038f22d89e4477596 Mon Sep 17 00:00:00 2001 From: TieweiFang Date: Mon, 18 Nov 2024 18:06:53 +0800 Subject: [PATCH 1/2] fix 1 --- docs/data-operate/export/export-overview.md | 6 ++++++ .../current/data-operate/export/export-overview.md | 6 ++++++ 2 files changed, 12 insertions(+) diff --git a/docs/data-operate/export/export-overview.md b/docs/data-operate/export/export-overview.md index 29a45cf5c19df..2734f7401650c 100644 --- a/docs/data-operate/export/export-overview.md +++ b/docs/data-operate/export/export-overview.md @@ -107,6 +107,9 @@ The following table shows the mapping between Doris data types and Parquet, ORC |map|map| |array|array| |json| Not supported| + |bitmap | binary | binary + |quantile_state | binary | binary + |hll | binary | binary 2. When Doris exports to Parquet file format, it first converts Doris in-memory data to Arrow in-memory data format, then writes out to Parquet file format. The mapping relationship between Doris data types and Arrow data types is: @@ -130,3 +133,6 @@ The following table shows the mapping between Doris data types and Parquet, ORC | map | map | | array | list | |json| utf8 | + |bitmap | binary | binary + |quantile_state | binary | binary + |hll | binary | binary diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md b/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md index 4a5441a640708..a9cdf05c2f436 100644 --- a/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md +++ b/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md @@ -107,6 +107,9 @@ Parquet、ORC 文件格式拥有自己的数据类型。Apache Doris 的导出 |map|map| |array|array| |json|不支持| + |bitmap | binary | binary + |quantile_state | binary | binary + |hll | binary | binary 2. Apache Doris 导出到 Parquet 文件格式时,会先将 Apache Doris 内存数据转换为 Arrow 内存数据格式,然后由 Arrow 写出到 Parquet 文件格式。Apache Doris 数据类型到 Arrow 数据类的映射关系为: @@ -131,3 +134,6 @@ Parquet、ORC 文件格式拥有自己的数据类型。Apache Doris 的导出 |map|map| |array|list| |json|utf8| + |bitmap | binary | binary + |quantile_state | binary | binary + |hll | binary | binary From f52e442b106c1bf0af1d6cc7dcb1cee2655e29d9 Mon Sep 17 00:00:00 2001 From: TieweiFang Date: Tue, 19 Nov 2024 20:20:37 +0800 Subject: [PATCH 2/2] fix 2 --- docs/data-operate/export/export-overview.md | 12 +++++++----- .../current/data-operate/export/export-overview.md | 4 +++- 2 files changed, 10 insertions(+), 6 deletions(-) diff --git a/docs/data-operate/export/export-overview.md b/docs/data-operate/export/export-overview.md index 2734f7401650c..8745298930706 100644 --- a/docs/data-operate/export/export-overview.md +++ b/docs/data-operate/export/export-overview.md @@ -106,7 +106,8 @@ The following table shows the mapping between Doris data types and Parquet, ORC |struct|struct| |map|map| |array|array| - |json| Not supported| + |json | string| + |variant | string| |bitmap | binary | binary |quantile_state | binary | binary |hll | binary | binary @@ -132,7 +133,8 @@ The following table shows the mapping between Doris data types and Parquet, ORC | struct | struct | | map | map | | array | list | - |json| utf8 | - |bitmap | binary | binary - |quantile_state | binary | binary - |hll | binary | binary + | json | utf8 | + | variant | utf8| + | bitmap | binary | binary + | quantile_state | binary | binary + | hll | binary | binary diff --git a/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md b/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md index a9cdf05c2f436..419db931a3f81 100644 --- a/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md +++ b/i18n/zh-CN/docusaurus-plugin-content-docs/current/data-operate/export/export-overview.md @@ -106,7 +106,8 @@ Parquet、ORC 文件格式拥有自己的数据类型。Apache Doris 的导出 |struct|struct| |map|map| |array|array| - |json|不支持| + |json|string| + |variant | string| |bitmap | binary | binary |quantile_state | binary | binary |hll | binary | binary @@ -134,6 +135,7 @@ Parquet、ORC 文件格式拥有自己的数据类型。Apache Doris 的导出 |map|map| |array|list| |json|utf8| + |variant | utf8 | |bitmap | binary | binary |quantile_state | binary | binary |hll | binary | binary