From ae90d054d2bbf9eb57feae4d2baaab39e98d570e Mon Sep 17 00:00:00 2001 From: Shingo Kitagawa Date: Tue, 2 May 2023 17:34:38 +0900 Subject: [PATCH 1/4] update documentation about multiValueHandling --- docs/ingestion/ingestion-spec.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/ingestion/ingestion-spec.md b/docs/ingestion/ingestion-spec.md index 06583e79f20c..dde994859afe 100644 --- a/docs/ingestion/ingestion-spec.md +++ b/docs/ingestion/ingestion-spec.md @@ -226,7 +226,7 @@ Dimension objects can have the following components: | type | Either `string`, `long`, `float`, `double`, or `json`. | `string` | | name | The name of the dimension. This will be used as the field name to read from input records, as well as the column name stored in generated segments.

Note that you can use a [`transformSpec`](#transformspec) if you want to rename columns during ingestion time. | none (required) | | createBitmapIndex | For `string` typed dimensions, whether or not bitmap indexes should be created for the column in generated segments. Creating a bitmap index requires more storage, but speeds up certain kinds of filtering (especially equality and prefix filtering). Only supported for `string` typed dimensions. | `true` | -| multiValueHandling | Specify the type of handling for [multi-value fields](../querying/multi-value-dimensions.md). Possible values are `sorted_array`, `sorted_set`, and `array`. `sorted_array` and `sorted_set` order the array upon ingestion. `sorted_set` removes duplicates. `array` ingests data as-is | `sorted_array` | +| multiValueHandling | Specify the type of handling for [multi-value fields](../querying/multi-value-dimensions.md). Possible values are `sorted_array`, `sorted_set`, and `array`. `sorted_array` and `sorted_set` order the array upon ingestion. For `string` typed dimensions, `sorted_set` removes duplicates. For `string` typed dimensions, `array` ingests data as-is. | `sorted_array` | #### Inclusions and exclusions From a87e4dca8842f9310c8a294cfa8ee1705758b304 Mon Sep 17 00:00:00 2001 From: Shingo Kitagawa Date: Wed, 3 May 2023 09:17:29 +0900 Subject: [PATCH 2/4] Update docs/ingestion/ingestion-spec.md Co-authored-by: Victoria Lim --- docs/ingestion/ingestion-spec.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/ingestion/ingestion-spec.md b/docs/ingestion/ingestion-spec.md index dde994859afe..615d73d2072e 100644 --- a/docs/ingestion/ingestion-spec.md +++ b/docs/ingestion/ingestion-spec.md @@ -226,7 +226,7 @@ Dimension objects can have the following components: | type | Either `string`, `long`, `float`, `double`, or `json`. | `string` | | name | The name of the dimension. This will be used as the field name to read from input records, as well as the column name stored in generated segments.

Note that you can use a [`transformSpec`](#transformspec) if you want to rename columns during ingestion time. | none (required) | | createBitmapIndex | For `string` typed dimensions, whether or not bitmap indexes should be created for the column in generated segments. Creating a bitmap index requires more storage, but speeds up certain kinds of filtering (especially equality and prefix filtering). Only supported for `string` typed dimensions. | `true` | -| multiValueHandling | Specify the type of handling for [multi-value fields](../querying/multi-value-dimensions.md). Possible values are `sorted_array`, `sorted_set`, and `array`. `sorted_array` and `sorted_set` order the array upon ingestion. For `string` typed dimensions, `sorted_set` removes duplicates. For `string` typed dimensions, `array` ingests data as-is. | `sorted_array` | +| multiValueHandling | Specify the type of handling for [multi-value fields](../querying/multi-value-dimensions.md). Possible values are `sorted_array`, `sorted_set`, and `array`. Both `sorted_array` and `sorted_set` order the array upon ingestion. For string typed dimensions, `sorted_set` removes duplicates. For string typed dimensions, `array` ingests data as is. | `sorted_array` | #### Inclusions and exclusions From ab350ff859ddc43d5439c82a52e3f060a318e761 Mon Sep 17 00:00:00 2001 From: Shingo Kitagawa Date: Wed, 3 May 2023 21:58:04 +0900 Subject: [PATCH 3/4] Update docs/ingestion/ingestion-spec.md Co-authored-by: Gian Merlino --- docs/ingestion/ingestion-spec.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/ingestion/ingestion-spec.md b/docs/ingestion/ingestion-spec.md index 615d73d2072e..1fb79ca8e9fb 100644 --- a/docs/ingestion/ingestion-spec.md +++ b/docs/ingestion/ingestion-spec.md @@ -226,7 +226,7 @@ Dimension objects can have the following components: | type | Either `string`, `long`, `float`, `double`, or `json`. | `string` | | name | The name of the dimension. This will be used as the field name to read from input records, as well as the column name stored in generated segments.

Note that you can use a [`transformSpec`](#transformspec) if you want to rename columns during ingestion time. | none (required) | | createBitmapIndex | For `string` typed dimensions, whether or not bitmap indexes should be created for the column in generated segments. Creating a bitmap index requires more storage, but speeds up certain kinds of filtering (especially equality and prefix filtering). Only supported for `string` typed dimensions. | `true` | -| multiValueHandling | Specify the type of handling for [multi-value fields](../querying/multi-value-dimensions.md). Possible values are `sorted_array`, `sorted_set`, and `array`. Both `sorted_array` and `sorted_set` order the array upon ingestion. For string typed dimensions, `sorted_set` removes duplicates. For string typed dimensions, `array` ingests data as is. | `sorted_array` | +| multiValueHandling | For `string` typed dimensions, specifies the type of handling for [multi-value fields](../querying/multi-value-dimensions.md). Possible values are `array` (ingest string arrays as-is), `sorted_array` (sort string arrays during ingestion), and `sorted_set` (sort and deduplicate string arrays during ingestion). This parameter is ignored for types other than `string`. | `sorted_array` | #### Inclusions and exclusions From 184403b0b6021afee72ce9ba41a79755cd6b8ca8 Mon Sep 17 00:00:00 2001 From: Shingo Kitagawa Date: Mon, 8 May 2023 21:55:23 +0900 Subject: [PATCH 4/4] fix spelling --- docs/ingestion/ingestion-spec.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/ingestion/ingestion-spec.md b/docs/ingestion/ingestion-spec.md index 1fb79ca8e9fb..d2e9a52a80a3 100644 --- a/docs/ingestion/ingestion-spec.md +++ b/docs/ingestion/ingestion-spec.md @@ -226,7 +226,7 @@ Dimension objects can have the following components: | type | Either `string`, `long`, `float`, `double`, or `json`. | `string` | | name | The name of the dimension. This will be used as the field name to read from input records, as well as the column name stored in generated segments.

Note that you can use a [`transformSpec`](#transformspec) if you want to rename columns during ingestion time. | none (required) | | createBitmapIndex | For `string` typed dimensions, whether or not bitmap indexes should be created for the column in generated segments. Creating a bitmap index requires more storage, but speeds up certain kinds of filtering (especially equality and prefix filtering). Only supported for `string` typed dimensions. | `true` | -| multiValueHandling | For `string` typed dimensions, specifies the type of handling for [multi-value fields](../querying/multi-value-dimensions.md). Possible values are `array` (ingest string arrays as-is), `sorted_array` (sort string arrays during ingestion), and `sorted_set` (sort and deduplicate string arrays during ingestion). This parameter is ignored for types other than `string`. | `sorted_array` | +| multiValueHandling | For `string` typed dimensions, specifies the type of handling for [multi-value fields](../querying/multi-value-dimensions.md). Possible values are `array` (ingest string arrays as-is), `sorted_array` (sort string arrays during ingestion), and `sorted_set` (sort and de-duplicate string arrays during ingestion). This parameter is ignored for types other than `string`. | `sorted_array` | #### Inclusions and exclusions