[branch-2.1](fix) fix snappy decompressor bug #40862

suxiaogang223 · 2024-09-14T07:31:38Z

Proposed changes

Hadoop snappycodec source :
https://github.com/apache/hadoop/blob/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-nativetask/src/main/native/src/codec/SnappyCodec.cc
Example:
OriginData(The original data will be divided into several large data block.) :
large data block1 | large data block2 | large data block3 | ....
The large data block will be divided into several small data block.
Suppose a large data block is divided into three small blocks:
large data block1: | small block1 | small block2 | small block3 |
CompressData: <A [B1 compress(small block1) ] [B2 compress(small block1) ] [B3 compress(small block1)]>

A : original length of the current block of large data block.
sizeof(A) = 4 bytes.
A = length(small block1) + length(small block2) + length(small block3)
Bx : length of small data block bx.
sizeof(Bx) = 4 bytes.
Bx = length(compress(small blockx))

doris-robot · 2024-09-14T07:31:43Z

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR

Since 2024-03-18, the Document has been moved to doris-website.
See Doris Document.

suxiaogang223 · 2024-09-14T07:36:30Z

run buildall

github-actions · 2024-09-14T07:38:06Z

clang-tidy review says "All clean, LGTM! 👍"

github-actions · 2024-09-14T07:42:15Z

clang-tidy review says "All clean, LGTM! 👍"

doris-robot · 2024-09-14T09:13:02Z

TeamCity be ut coverage result:
Function Coverage: 36.14% (9319/25784)
Line Coverage: 27.70% (76549/276325)
Region Coverage: 26.48% (39267/148312)
Branch Coverage: 23.30% (20024/85944)
Coverage Report: http://coverage.selectdb-in.cc/coverage/530158e2f18d897c8852078547f4c1f70c37840a_530158e2f18d897c8852078547f4c1f70c37840a/report/index.html

morningman · 2024-09-14T15:19:36Z

please add test case

yiguolei

Please modify BE UT to test the modification

suxiaogang223 · 2024-09-18T15:11:26Z

run buildall

github-actions · 2024-09-18T15:16:43Z

clang-tidy review says "All clean, LGTM! 👍"

github-actions · 2024-09-18T15:18:05Z

clang-tidy review says "All clean, LGTM! 👍"

doris-robot · 2024-09-18T15:38:42Z

TeamCity be ut coverage result:
Function Coverage: 36.16% (9328/25794)
Line Coverage: 27.73% (76656/276453)
Region Coverage: 26.51% (39363/148483)
Branch Coverage: 23.31% (20047/86000)
Coverage Report: http://coverage.selectdb-in.cc/coverage/98087effcf0fb41f8bf108022a15e2de3a196839_98087effcf0fb41f8bf108022a15e2de3a196839/report/index.html

morningman

LGTM

github-actions · 2024-09-20T03:56:48Z

PR approved by at least one committer and no changes requested.

github-actions · 2024-09-20T03:56:49Z

PR approved by anyone and no changes requested.

) ## Proposed changes Hadoop snappycodec source : https://github.com/apache/hadoop/blob/trunk/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-nativetask/src/main/native/src/codec/SnappyCodec.cc Example: OriginData(The original data will be divided into several large data block.) : large data block1 | large data block2 | large data block3 | .... The large data block will be divided into several small data block. Suppose a large data block is divided into three small blocks: large data block1: | small block1 | small block2 | small block3 | CompressData: <A [B1 compress(small block1) ] [B2 compress(small block1) ] [B3 compress(small block1)]> A : original length of the current block of large data block. sizeof(A) = 4 bytes. A = length(small block1) + length(small block2) + length(small block3) Bx : length of small data block bx. sizeof(Bx) = 4 bytes. Bx = length(compress(small blockx)) Co-authored-by: Socrates <suxiaogang223@icloud.com>

…n` (#46982) related pr: #40862 Doris `branch-2.1` modified this regression case without modifying the `master` branch. So this pr fixes the regression case `test_local_tvf_compression`

…n` (apache#46982) related pr: apache#40862 Doris `branch-2.1` modified this regression case without modifying the `master` branch. So this pr fixes the regression case `test_local_tvf_compression`

suxiaogang223 changed the title ~~[]fix snappy decompressor bug~~ [branch-2.1](fix) fix snappy decompressor bug Sep 14, 2024

suxiaogang223 force-pushed the fix_decompressor branch from 1dafd77 to 530158e Compare September 14, 2024 07:35

yiguolei reviewed Sep 15, 2024

View reviewed changes

suxiaogang223 added 2 commits September 18, 2024 23:11

fix snappy decompressor bug

49eec77

add regression test

98087ef

suxiaogang223 force-pushed the fix_decompressor branch from e3bf3ab to 98087ef Compare September 18, 2024 15:11

morningman approved these changes Sep 20, 2024

View reviewed changes

github-actions bot added the approved Indicates a PR has been approved by one committer. label Sep 20, 2024

github-actions bot added the reviewed label Sep 20, 2024

morningman merged commit e0fac66 into apache:branch-2.1 Sep 20, 2024

suxiaogang223 deleted the fix_decompressor branch September 26, 2024 17:12

yiguolei mentioned this pull request Nov 6, 2024

Release Note 2.1.7 #43319

Closed

BePPPower mentioned this pull request Jan 14, 2025

[fix](regression-test) fix regression case test_local_tvf_compression #46982

Merged

16 tasks

yiguolei mentioned this pull request Jan 19, 2025

Release Note 2.1.8 #47198

Closed

[branch-2.1](fix) fix snappy decompressor bug #40862

[branch-2.1](fix) fix snappy decompressor bug #40862

Uh oh!

Conversation

suxiaogang223 commented Sep 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Proposed changes

Uh oh!

doris-robot commented Sep 14, 2024

Uh oh!

suxiaogang223 commented Sep 14, 2024

Uh oh!

github-actions bot commented Sep 14, 2024

Uh oh!

github-actions bot commented Sep 14, 2024

Uh oh!

doris-robot commented Sep 14, 2024

Uh oh!

morningman commented Sep 14, 2024

Uh oh!

yiguolei left a comment

Choose a reason for hiding this comment

Uh oh!

suxiaogang223 commented Sep 18, 2024

Uh oh!

github-actions bot commented Sep 18, 2024

Uh oh!

github-actions bot commented Sep 18, 2024

Uh oh!

doris-robot commented Sep 18, 2024

Uh oh!

morningman left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Sep 20, 2024

Uh oh!

github-actions bot commented Sep 20, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

suxiaogang223 commented Sep 14, 2024 •

edited

Loading