Skip to content

Conversation

@amitsela
Copy link
Member

@amitsela amitsela commented Apr 14, 2016

Be sure to do all of the following to help us incorporate your contribution
quickly and easily:

  • Make sure the PR title is formatted like:
    [BEAM-<Jira issue #>] Description of pull request
  • Make sure tests pass via mvn clean verify. (Even better, enable
    Travis-CI on your fork and ensure the whole test matrix passes).
  • Replace <Jira issue #> in the title with the actual Jira issue
    number, if there is one.
  • If this contribution is large, please file an Apache
    Individual Contributor License Agreement.

@amitsela
Copy link
Member Author

R: @tomwhite and also anyone from the thread in beam-dev ("question about windowed values") - @kennknowles / @tgroh / @robertwb

SparkPipelineOptions options = SparkPipelineOptionsFactory.create();
options.setRunner(SparkPipelineRunner.class);
Pipeline p = Pipeline.create(options);
PCollection<String> inputWords = p.apply(Create.of(WORDS)).setCoder(StringUtf8Coder
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not particularly relevant to the content of your change, but the recommended way to set the coder here would be Create.of(WORDS).withCoder(StringUtf8Coder.of()).

Separately, it isn't entirely necessary: it is a bit of a hack, but the methods in the default Create implementation that infer coders based on the values are made public just so that runner overrides can still invoke them, as the DataflowPipelineRunner does.

I hope the new whole-graph analysis will make it so individual runners no longer deal with this, but until then since it is not too hard to do, you might consider doing it for the Spark runner.

@tomwhite
Copy link
Member

+1 from me

@asfgit asfgit closed this in 135cb73 Apr 19, 2016
@amitsela amitsela deleted the BEAM-189 branch April 19, 2016 20:07
swegner pushed a commit to swegner/beam that referenced this pull request Apr 22, 2016
Give root transforms step names
iemejia pushed a commit to iemejia/beam that referenced this pull request Jan 12, 2018
mareksimunek pushed a commit to mareksimunek/beam that referenced this pull request May 9, 2018
 [euphoria-core] remove checkpointing from datasets - unusable feature
damccorm pushed a commit that referenced this pull request Dec 19, 2022
Co-authored-by: Elias Segundo <elias.segundo@luisrazo.local>

Co-authored-by: Elias Segundo Antonio <eliassegundo.segundo@gmail.com>
Co-authored-by: Elias Segundo <elias.segundo@luisrazo.local>
lostluck pushed a commit to lostluck/beam that referenced this pull request Dec 22, 2022
Co-authored-by: Elias Segundo <elias.segundo@luisrazo.local>

Co-authored-by: Elias Segundo Antonio <eliassegundo.segundo@gmail.com>
Co-authored-by: Elias Segundo <elias.segundo@luisrazo.local>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants