[SPARK-25764][ML][EXAMPLES] Update BisectingKMeans example to use ClusteringEvaluator #22786

mgaido91 · 2018-10-21T07:06:02Z

What changes were proposed in this pull request?

Using computeCost for evaluating a model is a very poor approach. We should advice the users to a better approach which is available, ie. using the ClusteringEvaluator to evaluate their models. The PR updates the examples for BisectingKMeans in order to do that.

How was this patch tested?

running examples

… deprecated computeCost method

mgaido91 · 2018-10-21T07:06:37Z

cc @cloud-fan @dongjoon-hyun

SparkQA · 2018-10-21T07:23:37Z

Test build #97681 has finished for PR 22786 at commit 9def7e8.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

dongjoon-hyun

The difference from the reverted one is only the commit message, isn't it? Is there another difference in the code, @mgaido91 ?

And, at this time, this should go to master branch only because this is not a blocker issue. Could you review and merge this please, @mengxr ?

cloud-fan · 2018-10-22T13:39:18Z

also cc @WeichenXu123

mgaido91 · 2018-10-22T14:26:01Z

@dongjoon-hyun , no there aren't differences. I resubmitted in answer to #22763 (comment). Thanks.

mgaido91 · 2018-10-28T09:37:51Z

any comments on this?

mgaido91 · 2018-11-05T09:59:40Z

cc also @holdenk @srowen

cloud-fan · 2018-11-05T10:26:28Z

cc @dbtsai

dbtsai · 2018-11-05T22:41:34Z

LGTM. Merged into master. Thanks!

…steringEvaluator ## What changes were proposed in this pull request? Using `computeCost` for evaluating a model is a very poor approach. We should advice the users to a better approach which is available, ie. using the `ClusteringEvaluator` to evaluate their models. The PR updates the examples for `BisectingKMeans` in order to do that. ## How was this patch tested? running examples Closes apache#22786 from mgaido91/SPARK-25764. Authored-by: Marco Gaido <marcogaido91@gmail.com> Signed-off-by: DB Tsai <d_tsai@apple.com>

[SPARK-25764][ML][EXAMPLES] Update BisectingKMeans example not to use…

9def7e8

… deprecated computeCost method

dongjoon-hyun reviewed Oct 21, 2018

View reviewed changes

asfgit closed this in 0b59170 Nov 5, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-25764][ML][EXAMPLES] Update BisectingKMeans example to use ClusteringEvaluator #22786

[SPARK-25764][ML][EXAMPLES] Update BisectingKMeans example to use ClusteringEvaluator #22786

Uh oh!

mgaido91 commented Oct 21, 2018

Uh oh!

mgaido91 commented Oct 21, 2018

Uh oh!

SparkQA commented Oct 21, 2018

Uh oh!

dongjoon-hyun left a comment •

edited

Loading

Uh oh!

cloud-fan commented Oct 22, 2018

Uh oh!

mgaido91 commented Oct 22, 2018

Uh oh!

mgaido91 commented Oct 28, 2018

Uh oh!

mgaido91 commented Nov 5, 2018

Uh oh!

cloud-fan commented Nov 5, 2018

Uh oh!

dbtsai commented Nov 5, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[SPARK-25764][ML][EXAMPLES] Update BisectingKMeans example to use ClusteringEvaluator #22786

[SPARK-25764][ML][EXAMPLES] Update BisectingKMeans example to use ClusteringEvaluator #22786

Uh oh!

Conversation

mgaido91 commented Oct 21, 2018

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

mgaido91 commented Oct 21, 2018

Uh oh!

SparkQA commented Oct 21, 2018

Uh oh!

dongjoon-hyun left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cloud-fan commented Oct 22, 2018

Uh oh!

mgaido91 commented Oct 22, 2018

Uh oh!

mgaido91 commented Oct 28, 2018

Uh oh!

mgaido91 commented Nov 5, 2018

Uh oh!

cloud-fan commented Nov 5, 2018

Uh oh!

dbtsai commented Nov 5, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

dongjoon-hyun left a comment •

edited

Loading