support compress & decompress by stream（block by block） by sisong · Pull Request #384 · ebiggers/libdeflate

sisong · 2024-08-07T01:25:54Z

for decompressor

Added several lower-level APIs with little as possible source code changes, to support decompress block by block;
And updated the gzip cmdline with new APIs, the memory usage when decompress will be greatly reduced, usually <1MB!
Also, because it needs to support any .gz format (not just libdeflate), the decompress buffer will automatically adjust the size to fit the input maximum block size.

Because decompressor reused a small memory, the CPU cache hit ratio will be greatly improved, resulting in an increase in decompress speed;
In my multiple test cases, it was able to get 20%--50% faster.

for compressor

Added several lower-level APIs with minor source code changes, to support compress block by block;
And updated the gzip cmdline by new APIs, the memory usage when compress will be greatly reduced;
The memory requires is related to the block size set and compress level, not related to input file size; when the block size is 2 MB and compress level is 12, the memory usage ~ 13 MB.
These new APIs can be used for multi-threaded parallel compression, and there is almost no loss of compress ratio.

( I used libdeflate for my multiple actual projects and make a lot of feature changes; Hopefully commit some useful changes back to libdeflate. )

…ytes on error; gzip stream decompressor fast fail when input bad data.

housisong added 9 commits August 6, 2024 20:53

add API for decompress block by block (like stream);

64282b6

refactoring internal utility functions from gzip_decompress();

7098e9c

gzip decompress by stream;

2fcb510

recode gzip_compress() & build setting & API exegesis

e3ca05e

optimize HT_MATCHFINDER_BUCKET_SIZE == 2

ec46143

add API for compress block by block (like stream);

ee91aa2

gzip compress by stream;

fdebb9f

deflate decompress func only one exit point, & also return working nb…

1931053

…ytes on error; gzip stream decompressor fast fail when input bad data.

Merge branch 'dec-stream' into enc-stream

0199133

sisong changed the title ~~support decompress by stream（block by block）~~ support compress & decompress by stream（block by block） Aug 8, 2024

housisong added 3 commits August 12, 2024 09:01

decompressor support multi concatenated gzip in one file;

f6f7551

fix read head data for gzip when decompress multi concatenated gzip;

d9cdfa4

fix re read head data for gzip when decompress multi concatenated gzip;

0633cd1

KanjiMonster mentioned this pull request Aug 17, 2025

Use parallel gzip compression via host pigz openwrt/openwrt#19792

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support compress & decompress by stream（block by block）#384

support compress & decompress by stream（block by block）#384
sisong wants to merge 12 commits into
ebiggers:masterfrom
sisong:dec-stream

sisong commented Aug 7, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sisong commented Aug 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

for decompressor

for compressor

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sisong commented Aug 7, 2024 •

edited

Loading