Contributor

@lw-lin lw-lin commented Jun 4, 2016

What changes were proposed in this pull request?

Currently, some code is duplicated between continuous Parquet writes (as in Structured Streaming) and non-continuous writes; see ParquetFileFormat#prepareWrite() and ParquetFileFormat#ParquetOutputWriterFactory.

This can lead to inconsistent behavior when one copy of the code is changed but the other is not.

By extracting the common code, this patch removes the inconsistency. As a result, Structured Streaming now also benefits from SPARK-15719.
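To illustrate the shape of the refactoring, here is a minimal sketch in plain Java. It is not Spark's actual code: the class ParquetWriteSetup, its methods, the "compression" key, and the "snappy" default are all hypothetical and chosen only for illustration. The point is that both write paths derive their settings from one shared helper, so a change made there reaches both.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: none of these names are Spark's actual API.
final class ParquetWriteSetup {
    private ParquetWriteSetup() {}

    // The single shared place where write options become writer settings.
    // Any future change to defaults happens here and applies to both paths.
    static Map<String, String> commonConf(Map<String, String> options) {
        Map<String, String> conf = new HashMap<>();
        // "compression" and its "snappy" default are illustrative assumptions.
        conf.put("compression", options.getOrDefault("compression", "snappy"));
        return conf;
    }

    // Non-continuous (batch) path: delegates to the shared helper.
    static Map<String, String> prepareBatchWrite(Map<String, String> options) {
        return commonConf(options);
    }

    // Continuous (streaming) path: delegates to the same helper,
    // so it cannot drift away from the batch path.
    static Map<String, String> prepareStreamingWrite(Map<String, String> options) {
        return commonConf(options);
    }
}
```

With this layout, calling prepareBatchWrite and prepareStreamingWrite with the same options yields equal settings by construction, which is the consistency this patch is after.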

How was this patch tested?

This is a pure refactoring with no logic change, so the existing test suites should cover it.

@lw-lin lw-lin changed the title [SPARK-15765][SQL][Streaming] Make continuous Parquet writing consistent with non-consistent Parquet writing [SPARK-15765][SQL][Streaming] Make continuous Parquet writing consistent with non-continuous Parquet writing Jun 4, 2016

SparkQA commented Jun 4, 2016

Test build #59983 has finished for PR 13507 at commit 60a2c8e.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

Contributor Author

lw-lin commented Jun 4, 2016

@liancheng @tdas @zsxwing Would you mind taking a look? Thanks!

@lw-lin lw-lin changed the title [SPARK-15765][SQL][Streaming] Make continuous Parquet writing consistent with non-continuous Parquet writing [SPARK-15765][SQL][Streaming] Make continuous Parquet writes consistent with non-continuous Parquet writes Jun 5, 2016
Contributor Author

lw-lin commented Oct 21, 2016

I'm closing this in favor of SPARK-17924, thanks!

@lw-lin lw-lin closed this Oct 21, 2016