[RLlib] RecSim Interest evolution environment should use custom video sampler: `IEvVideoSampler` due to only one cluster being used. by gjoliver · Pull Request #22211 · ray-project/ray

gjoliver · 2022-02-08T09:25:58Z

Why are these changes needed?

Interest evolution env should probably use IEV video sampler, instead of the utility model video sampler.
Docs returned from this sampler contains actual features, instead of utility indices.
This may help your slateq runs.

Related issue number

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Docs returned from this sampler contains actual features, instead of utility indices.

sven1977

Thanks for the fix @gjoliver !

…video_sampler

sven1977 · 2022-02-08T12:14:04Z

Waiting for LINT to pass.

sven1977 · 2022-02-08T12:15:21Z

rllib/examples/env/recsim_recommender_system_envs.py

    return iev.IEvUserModel(
        env_ctx["slate_size"],
-        choice_model_ctor=choice_model.MultinomialProportionalChoiceModel,
+        choice_model_ctor=choice_model.MultinomialLogitChoiceModel,


What's the difference and why did we need to change this? In the original RecSim repo, they use:
choice_model.MultinomialProportionalChoiceModel.

The difference is how to handle negative logits.

MultinomialLogitChoiceModel uses p(x) = exp(x) / Sum_{y in scores} exp(y), while
MultinomialProportionalChoiceModel uses p(x) = (x - min_normalizer) / sum(x - min_normalizer). You need to know the lower bound of your output logits before you can convert everything to be positive.
intuitively, these 2 should work similarly, like the more negative model output is, the less likely it will get clicked on.

gjoliver · 2022-02-08T18:49:01Z

I actually noticed another issue with RecSim.

IEvUserModel is hardcoded to use UtilityModelUserSampler:
https://github.com/google-research/recsim/blob/55e50e4be736d222ffe8c2477ed1981b40f91605/recsim/environments/interest_evolution.py#L491-L492

So I don't know if switching to IEvVideoSampler actually works or not ...

… sampler: `IEvVideoSampler` due to only one cluster being used. (ray-project#22211)

Interest evolution environment should probably use IEvVideoSampler.

34f8845

Docs returned from this sampler contains actual features, instead of utility indices.

gjoliver requested a review from sven1977 February 8, 2022 09:25

gjoliver requested a review from avnishn as a code owner February 8, 2022 09:25

sven1977 self-assigned this Feb 8, 2022

sven1977 approved these changes Feb 8, 2022

View reviewed changes

sven1977 added 2 commits February 8, 2022 13:13

Merge branch 'master' of https://github.com/ray-project/ray into iev_…

f064fb0

…video_sampler

LINT.

a63b435

sven1977 reviewed Feb 8, 2022

View reviewed changes

sven1977 changed the title ~~Interest evolution environment should probably use IEvVideoSampler.~~ [RLlib] RecSim Interest evolution environment should use custom video sampler: IEvVideoSampler. Feb 8, 2022

sven1977 changed the title ~~[RLlib] RecSim Interest evolution environment should use custom video sampler: IEvVideoSampler.~~ [RLlib] RecSim Interest evolution environment should use custom video sampler: IEvVideoSampler due to only one cluster being used. Feb 8, 2022

sven1977 merged commit 3207f53 into ray-project:master Feb 9, 2022

simonsays1980 pushed a commit to simonsays1980/ray that referenced this pull request Feb 27, 2022

[RLlib] RecSim Interest evolution environment should use custom video…

a899657

… sampler: `IEvVideoSampler` due to only one cluster being used. (ray-project#22211)

simonsays1980 pushed a commit to simonsays1980/ray that referenced this pull request Mar 8, 2022

[RLlib] RecSim Interest evolution environment should use custom video…

608f3de

… sampler: `IEvVideoSampler` due to only one cluster being used. (ray-project#22211)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] RecSim Interest evolution environment should use custom video sampler: `IEvVideoSampler` due to only one cluster being used.#22211

[RLlib] RecSim Interest evolution environment should use custom video sampler: `IEvVideoSampler` due to only one cluster being used.#22211
sven1977 merged 3 commits intoray-project:masterfrom
gjoliver:iev_video_sampler

gjoliver commented Feb 8, 2022 •

edited by sven1977

Loading

Uh oh!

sven1977 left a comment

Uh oh!

sven1977 commented Feb 8, 2022

Uh oh!

sven1977 Feb 8, 2022

Uh oh!

gjoliver Feb 8, 2022

Uh oh!

gjoliver commented Feb 8, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

gjoliver commented Feb 8, 2022 • edited by sven1977 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why are these changes needed?

Related issue number

Checks

Uh oh!

sven1977 left a comment

Choose a reason for hiding this comment

Uh oh!

sven1977 commented Feb 8, 2022

Uh oh!

sven1977 Feb 8, 2022

Choose a reason for hiding this comment

Uh oh!

gjoliver Feb 8, 2022

Choose a reason for hiding this comment

Uh oh!

gjoliver commented Feb 8, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

gjoliver commented Feb 8, 2022 •

edited by sven1977

Loading