Skip to content

Conversation

@ryanthompson591
Copy link
Contributor

This is an example of a regression model using an sklearn pipeline.

This illustrates how to separate examples based on what features are available in the example and then using an sklearn pipeline made for that data.

Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:

  • Choose reviewer(s) and mention them in a comment (R: @username).
  • Mention the appropriate issue in your description (for example: addresses #123), if applicable. This will automatically add a link to the pull request in the issue. If you would like the issue to automatically close on merging the pull request, comment fixes #<ISSUE NUMBER> instead.
  • Update CHANGES.md with noteworthy changes.
  • If this contribution is large, please file an Apache Individual Contributor License Agreement.

See the Contributor Guide for more tips on how to make review process smoother.

To check the build health, please visit https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md

GitHub Actions Tests Status (on master branch)

Build python source distribution and wheels
Python tests
Java tests

See CI.md for more information about GitHub Actions CI.

@asf-ci
Copy link

asf-ci commented Jun 28, 2022

Can one of the admins verify this patch?

4 similar comments
@asf-ci
Copy link

asf-ci commented Jun 28, 2022

Can one of the admins verify this patch?

@asf-ci
Copy link

asf-ci commented Jun 28, 2022

Can one of the admins verify this patch?

@asf-ci
Copy link

asf-ci commented Jun 28, 2022

Can one of the admins verify this patch?

@asf-ci
Copy link

asf-ci commented Jun 28, 2022

Can one of the admins verify this patch?

@ryanthompson591
Copy link
Contributor Author

R: @AnandInguva

@github-actions
Copy link
Contributor

Stopping reviewer notifications for this pull request: review requested by someone other than the bot, ceding control

@codecov
Copy link

codecov bot commented Jun 28, 2022

Codecov Report

Merging #22088 (12c24f1) into master (48513ad) will decrease coverage by 0.14%.
The diff coverage is 34.48%.

@@            Coverage Diff             @@
##           master   #22088      +/-   ##
==========================================
- Coverage   74.18%   74.03%   -0.15%     
==========================================
  Files         706      706              
  Lines       93229    93428     +199     
==========================================
+ Hits        69159    69168       +9     
- Misses      22802    22994     +192     
+ Partials     1268     1266       -2     
Flag Coverage Δ
python 83.55% <34.48%> (+<0.01%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files Coverage Δ
...s/inference/sklearn_japanese_housing_regression.py 34.48% <34.48%> (ø)
.../python/apache_beam/testing/test_stream_service.py 88.09% <0.00%> (-4.77%) ⬇️
sdks/go/pkg/beam/io/filesystem/memfs/memory.go 92.00% <0.00%> (-4.16%) ⬇️
.../apache_beam/runners/interactive/dataproc/types.py 93.10% <0.00%> (-3.45%) ⬇️
...che_beam/runners/interactive/interactive_runner.py 90.06% <0.00%> (-1.33%) ⬇️
...ks/go/pkg/beam/runners/dataflow/dataflowlib/job.go 21.55% <0.00%> (-1.13%) ⬇️
sdks/python/apache_beam/internal/metrics/metric.py 93.00% <0.00%> (-1.00%) ⬇️
sdks/go/pkg/beam/core/runtime/exec/input.go 50.96% <0.00%> (-1.00%) ⬇️
...hon/apache_beam/runners/direct/test_stream_impl.py 93.28% <0.00%> (-0.75%) ⬇️
sdks/python/apache_beam/io/fileio.py 95.98% <0.00%> (-0.28%) ⬇️
... and 34 more

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

@ryanthompson591
Copy link
Contributor Author

R: @tvalentyn

ryanthompson591 and others added 8 commits June 30, 2022 13:55
…using_regression.py

Co-authored-by: Anand Inguva <34158215+AnandInguva@users.noreply.github.com>
…using_regression.py

Co-authored-by: Anand Inguva <34158215+AnandInguva@users.noreply.github.com>
…using_regression.py

Co-authored-by: Anand Inguva <34158215+AnandInguva@users.noreply.github.com>
@ryanthompson591
Copy link
Contributor Author

Run Python 3.7 PostCommit

@ryanthompson591
Copy link
Contributor Author

Run Python 3.7 PostCommit

@tvalentyn
Copy link
Contributor

Let me know when this is ready to review/merge.

@ryanthompson591
Copy link
Contributor Author

Run Python 3.9 PostCommit

@ryanthompson591
Copy link
Contributor Author

Run Python 3.9 PostCommit

@ryanthompson591
Copy link
Contributor Author

Run Python 3.7 PostCommit

@ryanthompson591
Copy link
Contributor Author

Run Python 3.9 PostCommit

@ryanthompson591
Copy link
Contributor Author

@tvalentyn This looks ready to merge, the failing test is not the new test I added but rather the issue described here: #22605

@ryanthompson591
Copy link
Contributor Author

Run Python 3.9 PostCommit


# pylint: disable=wrong-import-order, wrong-import-position, ungrouped-imports, unused-import
try:
from apache_beam.io.gcp.gcsfilesystem import GCSFileSystem
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why is this necessary? I am not seeing any usage of GCSFileSystem

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This might have been preemptive. Unless we are able to save files, I think these tests could fail. However, it doesn't seem relevant to this PR and we should defer fixing it until it becomes an issue.

Copy link
Contributor Author

@ryanthompson591 ryanthompson591 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@tvalentyn PTAL


# pylint: disable=wrong-import-order, wrong-import-position, ungrouped-imports, unused-import
try:
from apache_beam.io.gcp.gcsfilesystem import GCSFileSystem
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This might have been preemptive. Unless we are able to save files, I think these tests could fail. However, it doesn't seem relevant to this PR and we should defer fixing it until it becomes an issue.

@ryanthompson591
Copy link
Contributor Author

Run Python 3.9 PostCommit

1 similar comment
@tvalentyn
Copy link
Contributor

Run Python 3.9 PostCommit

@tvalentyn tvalentyn merged commit 6df4939 into apache:master Aug 17, 2022
MarcoRob pushed a commit to MarcoRob/beam that referenced this pull request Sep 5, 2022
Co-authored-by: Anand Inguva <34158215+AnandInguva@users.noreply.github.com>
Co-authored-by: tvalentyn <tvalentyn@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants