@hubgeter (Contributor) commented Oct 9, 2024

Backport of #40225, #40888, and #41386.

Proposed changes

Of these:
#40225 introduces the new MaxCompute (mc) API,
#40888 fixes a bug in reading null values between the new and old APIs,
#41386 adds compatibility between the new and old versions.

@doris-robot

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR

Since 2024-03-18, the Document has been moved to doris-website.
See Doris Document.

@hubgeter hubgeter changed the title from "Pick 30 mc storage api" to "[Enhancement](MaxCompute)Refactoring maxCompute catalog using Storage API.(#40225 , #40888 ,#41386 )" Oct 9, 2024
@hubgeter hubgeter marked this pull request as draft October 9, 2024 10:43
… API. (apache#40225)

Refactor the MaxCompute catalog to use the Storage API.

Storage API:
https://help.aliyun.com/zh/maxcompute/user-guide/open-storage-sample-java-sdk?spm=a2c4g.11186623.0.i0

```
The following are required:

CREATE CATALOG mc PROPERTIES (
"type" = "max_compute",
"mc.default.project" = "xxx",
"mc.access_key" = "xxx",
"mc.secret_key" = "xxxx",
"mc.endpoint" = "xxxx"
);

Optional parameters (shown with their default values):
"mc.quota" = "pay-as-you-go"

Split by byte size:
"mc.split_strategy"  = "byte_size"    Split according to file size
"mc.split_byte_size" = "268435456"    File size of each split (268435456 bytes = 256 MiB)

Split by row count:
"mc.split_strategy"  = "row_count"    Split according to the number of rows of data
"mc.split_row_count" = "1048576"      Number of rows to read for each split
```
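
As an illustration of how the optional parameters might be combined with the required ones, here is a minimal sketch. It assumes the optional items go in the same `PROPERTIES` clause as the required ones, which the listing above does not state explicitly. The catalog name `mc_by_rows` is hypothetical, and every `xxx` value is a placeholder.

```sql
-- Hypothetical catalog that splits reads by row count instead of byte size.
-- Assumption: optional properties sit alongside the required ones.
CREATE CATALOG mc_by_rows PROPERTIES (
    "type" = "max_compute",
    "mc.default.project" = "xxx",
    "mc.access_key" = "xxx",
    "mc.secret_key" = "xxx",
    "mc.endpoint" = "xxx",
    "mc.quota" = "pay-as-you-go",
    "mc.split_strategy" = "row_count",
    "mc.split_row_count" = "1048576"
);
```

To split by size instead, the last two properties would presumably be replaced with `"mc.split_strategy" = "byte_size"` and a `"mc.split_byte_size"` value.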
… maxcompute. (apache#40888)

before pr apache#40225 
## Proposed changes

Fixed a bug where, when reading from MaxCompute, if a batch contained
null values, null values would always be read out.
…pute catalogs from previous versions. (apache#41386)

before pr apache#40225

(cherry picked from commit e8a1a16)
@hubgeter hubgeter force-pushed the pick_30_mc_storage_api branch from 410922f to 9fa6aae October 10, 2024 10:41
@hubgeter hubgeter marked this pull request as ready for review October 10, 2024 10:44
@hubgeter
Contributor Author

run buildall

@hubgeter hubgeter marked this pull request as draft October 10, 2024 11:01
@hubgeter hubgeter marked this pull request as ready for review October 10, 2024 14:17
@morningman morningman merged commit a1f8383 into apache:branch-3.0 Oct 11, 2024
@gavinchou gavinchou mentioned this pull request Nov 26, 2024