Add Thompson Sampling to ReAgent MAB and refactor the UCB classes and methods to unify by alexnikulkov · Pull Request #565 · facebookresearch/ReAgent

alexnikulkov · 2021-10-15T00:13:58Z

Summary:

Add 2 Thompson sampling MAB algorithms: 1 for Bernoulli rewards, 1 for Normal rewards
Refactor UCB code so that Thompson sampling could reuse as much as possible

Differential Revision: D31642370

facebook-github-bot · 2021-10-15T00:14:25Z

This pull request was exported from Phabricator. Differential Revision: D31642370

codecov-commenter · 2021-10-15T00:42:38Z

Codecov Report

Merging #565 (70eea9b) into main (57b58a8) will increase coverage by 0.03%.
The diff coverage is 97.95%.

@@            Coverage Diff             @@
##             main     #565      +/-   ##
==========================================
+ Coverage   86.65%   86.69%   +0.03%     
==========================================
  Files         337      339       +2     
  Lines       20955    21004      +49     
  Branches       44       44              
==========================================
+ Hits        18159    18209      +50     
+ Misses       2770     2769       -1     
  Partials       26       26

Impacted Files	Coverage Δ
reagent/mab/mab_algorithm.py	`93.93% <93.93%> (ø)`
reagent/mab/thompson_sampling.py	`97.87% <97.87%> (ø)`
reagent/mab/ucb.py	`82.05% <100.00%> (-6.75%)`	⬇️
reagent/test/mab/test_mab.py	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 57b58a8...70eea9b. Read the comment docs.

… methods to unify (facebookresearch#565) Summary: Pull Request resolved: facebookresearch#565 1. Add 2 Thompson sampling MAB algorithms: 1 for Bernoulli rewards, 1 for Normal rewards 2. Refactor UCB code so that Thompson sampling could reuse as much as possible Differential Revision: D31642370 fbshipit-source-id: f5f9e2227bef9b5caafee9c3894f494be0e9e1a5

facebook-github-bot · 2021-10-15T17:49:45Z

This pull request was exported from Phabricator. Differential Revision: D31642370

facebook-github-bot · 2021-10-19T00:43:22Z

This pull request has been merged in 471defa.

… methods to unify (#565) Summary: Pull Request resolved: #565 1. Add 2 Thompson sampling MAB algorithms: 1 for Bernoulli rewards, 1 for Normal rewards 2. Refactor UCB code so that Thompson sampling could reuse as much as possible Reviewed By: czxttkl Differential Revision: D31642370 fbshipit-source-id: c4447a22ad11e1bb9696cf269ea9f45523d22f28

facebook-github-bot added the fb-exported label Oct 15, 2021

facebook-github-bot added the cla signed label Oct 15, 2021

alexnikulkov force-pushed the export-D31642370 branch from f10936a to 70eea9b Compare October 15, 2021 17:49

facebook-github-bot closed this in 471defa Oct 19, 2021

facebook-github-bot added the Merged label Oct 19, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Thompson Sampling to ReAgent MAB and refactor the UCB classes and methods to unify#565

Add Thompson Sampling to ReAgent MAB and refactor the UCB classes and methods to unify#565
alexnikulkov wants to merge 1 commit intofacebookresearch:mainfrom
alexnikulkov:export-D31642370

alexnikulkov commented Oct 15, 2021

Uh oh!

facebook-github-bot commented Oct 15, 2021

Uh oh!

codecov-commenter commented Oct 15, 2021 •

edited

Loading

Uh oh!

facebook-github-bot commented Oct 15, 2021

Uh oh!

facebook-github-bot commented Oct 19, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

alexnikulkov commented Oct 15, 2021

Uh oh!

facebook-github-bot commented Oct 15, 2021

Uh oh!

codecov-commenter commented Oct 15, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

facebook-github-bot commented Oct 15, 2021

Uh oh!

facebook-github-bot commented Oct 19, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov-commenter commented Oct 15, 2021 •

edited

Loading