@zhangstar333
Contributor

Proposed changes

This aggregation is used to get distinct rows. The group-by keys are customer_sk and item_sk, and the node has a single projection: customer_sk.
After clear_column_data is called with num_materialized_slots, the output block therefore no longer contains the item_sk column.
That block is then pushed back to data_queue for reuse, so the reused block's column count no longer equals the number of group-by keys.

| 6:VAGGREGATE (merge finalize)
| | group by: customer_sk[#58], item_sk[#59]
| | cardinality=35,999,628
| | projections: customer_sk[#60]
| | project output tuple id: 7

| 4:VAGGREGATE (update serialize)
| | STREAMING
| | group by: customer_sk[#56], item_sk[#57]
| | cardinality=71,999,256
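
The mismatch described above can be reproduced with a toy model. This is only a sketch: `Block`, `Column`, `layout_matches_keys`, `prepare_for_reuse`, and `reused_block_is_consistent` are hypothetical stand-ins rather than the Doris classes, the `clear_column_data` here only imitates the behavior described in this PR, and the guard shown is one possible way to restore the layout, not necessarily the change made in this PR.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <deque>
#include <utility>
#include <vector>

// Toy stand-ins for illustration only; these are NOT the Doris types.
using Column = std::vector<std::int64_t>;

struct Block {
    std::vector<Column> columns;
};

// Assumed behavior, modeled on the description: keep only the first
// num_materialized_slots columns and empty the data of the ones kept.
void clear_column_data(Block& block, std::size_t num_materialized_slots) {
    if (block.columns.size() > num_materialized_slots) {
        block.columns.resize(num_materialized_slots);
    }
    for (Column& column : block.columns) {
        column.clear();
    }
}

// True when the block has exactly one column per group-by key.
bool layout_matches_keys(const Block& block, std::size_t key_count) {
    return block.columns.size() == key_count;
}

// Hypothetical guard: restore one empty column per key before reuse.
void prepare_for_reuse(Block& block, std::size_t key_count) {
    for (Column& column : block.columns) {
        column.clear();
    }
    block.columns.resize(key_count);
}

// Simulates: fill a block with both keys, project it down to customer_sk,
// push it back to the reuse queue, then check the layout of the reused block.
bool reused_block_is_consistent(bool apply_guard) {
    const std::size_t key_count = 2;        // customer_sk, item_sk
    const std::size_t projected_count = 1;  // customer_sk only
    std::deque<Block> data_queue;

    Block block;
    block.columns = {Column{1, 1, 2}, Column{10, 11, 10}};

    clear_column_data(block, projected_count);  // drops the item_sk column
    if (apply_guard) {
        prepare_for_reuse(block, key_count);
    }
    data_queue.push_back(std::move(block));

    return layout_matches_keys(data_queue.front(), key_count);
}
```

Without the guard the reused block has one column for two keys; with it, the block again has one empty column per key.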

Issue Number: close #xxx

@zhangstar333
Contributor Author

run buildall

@github-actions
Contributor

clang-tidy review says "All clean, LGTM! 👍"

@hello-stephen
Contributor

(From new machine) TeamCity pipeline, ClickBench performance test result:
the sum of best hot time: 46.74 seconds
stream load tsv: 511 seconds loaded 74807831229 Bytes, about 139 MB/s
stream load json: 20 seconds loaded 2358488459 Bytes, about 112 MB/s
stream load orc: 65 seconds loaded 1101869774 Bytes, about 16 MB/s
stream load parquet: 32 seconds loaded 861443392 Bytes, about 25 MB/s
insert into select: 29.2 seconds inserted 10000000 Rows, about 342K ops/s
storage size: 17162073772 Bytes

@github-actions github-actions bot added the approved Indicates a PR has been approved by one committer. label Jul 29, 2023
@github-actions
Contributor

PR approved by at least one committer and no changes requested.

@github-actions
Contributor

PR approved by anyone and no changes requested.

Contributor

@yiguolei yiguolei left a comment

LGTM

@yiguolei yiguolei added the dev/2.0.0 2.0.0 release label Jul 29, 2023
@yiguolei yiguolei merged commit bc88d34 into apache:master Jul 29, 2023
xiaokang pushed a commit that referenced this pull request Jul 31, 2023
…key size (#22357)

* [imporve](flex) support scientific notation(aEb) parser

* update

* [bug](distinct-agg) fix distinct-agg outblock columns size not equal key size
morningman pushed a commit to morningman/doris that referenced this pull request Aug 2, 2023
…key size (apache#22357)

Labels

approved Indicates a PR has been approved by one committer. dev/2.0.0-merged merge_conflict reviewed
