Uncompress streams without having to download to tmp first#4364

Merged
fjy merged 2 commits into apache:master from niketh:uncompress-without-download
Jun 9, 2017

@niketh niketh commented Jun 5, 2017

Uncompress streams without having to download to tmp first

@@ -0,0 +1,210 @@
<?xml version="1.0" encoding="UTF-8"?>

Is this change needed?

@niketh niketh Jun 5, 2017

Not required; removing it.

b-slim commented Jun 5, 2017

@niketh How about the HDFS puller? Also, I'm curious what the main driver is: performance, stability, or something else?

niketh commented Jun 5, 2017

@b-slim The HDFS puller already uncompresses directly from the stream. This PR is a result of #4302.

b-slim commented Jun 5, 2017

OK, looks like it is a performance improvement, plus it makes the pullers more uniform. 👍

fjy commented Jun 9, 2017

👍

@fjy fjy added this to the 0.10.1 milestone Jun 9, 2017
@fjy fjy merged commit 2cd91b6 into apache:master Jun 9, 2017

@drcrallen drcrallen left a comment

If I recall correctly, downloading to tmp first allows mmap of the zip file for decompression, and lets you know all the components and their sizes up front. This change does things on heap as the bytes come in. Should be OK for most use cases.
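
For readers following the trade-off discussed above, here is a minimal sketch of unzipping entries directly from an incoming stream, with no intermediate copy of the archive on disk. It is not the code from this PR: the class name `StreamingUnzip` and the method `unzip` are hypothetical, and it uses only `java.util.zip` and `java.nio.file` from the JDK.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Hypothetical helper for illustration only; not part of Druid.
public class StreamingUnzip {

  /**
   * Reads zip entries sequentially from {@code in} and writes each one under
   * {@code outDir}. The archive itself is never stored on disk.
   *
   * @return total number of uncompressed bytes written
   */
  public static long unzip(InputStream in, Path outDir) throws IOException {
    Path root = outDir.toAbsolutePath().normalize();
    long total = 0;
    try (ZipInputStream zin = new ZipInputStream(in)) {
      ZipEntry entry;
      while ((entry = zin.getNextEntry()) != null) {
        Path target = root.resolve(entry.getName()).normalize();
        // Reject entries such as "../x" that would land outside outDir.
        if (!target.startsWith(root)) {
          throw new IOException("Entry escapes output dir: " + entry.getName());
        }
        if (entry.isDirectory()) {
          Files.createDirectories(target);
          continue;
        }
        Files.createDirectories(target.getParent());
        // Files.copy reads until the end of the current entry, not the whole stream,
        // and leaves the stream open for the next entry.
        total += Files.copy(zin, target, StandardCopyOption.REPLACE_EXISTING);
      }
    }
    return total;
  }
}
```

One consequence of reading sequentially: `ZipInputStream` sees only the per-entry local headers, so `ZipEntry.getSize()` may return -1 until the entry has been read, whereas opening a downloaded file with `java.util.zip.ZipFile` exposes the full entry list and sizes from the central directory before any data is decompressed.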

4 participants