Skip to content

[Core][nightly-test] better way of calculating num features#22158

Merged
scv119 merged 3 commits intoray-project:masterfrom
scv119:better-filter
Feb 7, 2022
Merged

[Core][nightly-test] better way of calculating num features#22158
scv119 merged 3 commits intoray-project:masterfrom
scv119:better-filter

Conversation

@scv119
Copy link
Contributor

@scv119 scv119 commented Feb 7, 2022

Why are these changes needed?

previously, we use num_columns - 2 to calculate the feature size where we assumes both the "label" and arrow internal column __index_level_0__ exists in the schema.

however, the __index_level_0__ might not exist in the dataset's schema, which causes this check fail.
in this pr, we use a better way to filter out arrow internal column and label column.

Related issue number

Checks

@scv119 scv119 added the tests-ok The tagger certifies test failures are unrelated and assumes personal liability. label Feb 7, 2022
@scv119 scv119 merged commit 1381930 into ray-project:master Feb 7, 2022
simonsays1980 pushed a commit to simonsays1980/ray that referenced this pull request Feb 27, 2022
…ect#22158)

* better filter of column length

* address comments

* more
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

tests-ok The tagger certifies test failures are unrelated and assumes personal liability.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants