Skip to content

CRR with additional hyperparameter beta for KL-divergence (policy constraint) regularization#522

Closed
TengyuX wants to merge 2 commits intofacebookresearch:masterfrom
TengyuX:export-D30183147
Closed

CRR with additional hyperparameter beta for KL-divergence (policy constraint) regularization#522
TengyuX wants to merge 2 commits intofacebookresearch:masterfrom
TengyuX:export-D30183147

Conversation

@TengyuX
Copy link
Contributor

@TengyuX TengyuX commented Aug 9, 2021

Summary:
(1) Add coefficient "beta" (regularization factor of policy constraint regularization) on CRR's objective.
(2) Change default value for "DEFAULT_MAX_UNIQUE_ENUM" from 100 to 10

Reviewed By: czxttkl

Differential Revision: D30183147

czxttkl and others added 2 commits August 8, 2021 18:02
Summary: The feature importance is based on data perturbation

Differential Revision: D30136375

fbshipit-source-id: 3d6db8f2fd0de30cf542653c95e0ca2007649f0d
…straint) regularization

Summary:
(1) Add coefficient "beta" (regularization factor of policy constraint regularization) on CRR's objective.
(2) Change default value for "DEFAULT_MAX_UNIQUE_ENUM" from 100 to 10

Reviewed By: czxttkl

Differential Revision: D30183147

fbshipit-source-id: 97abe7477beb78e17f758938d6f956b42d4d0bbb
@facebook-github-bot
Copy link

This pull request was exported from Phabricator. Differential Revision: D30183147

@codecov-commenter
Copy link

codecov-commenter commented Aug 9, 2021

Codecov Report

Merging #522 (eb91cd4) into master (97e3da3) will decrease coverage by 0.22%.
The diff coverage is 3.84%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #522      +/-   ##
==========================================
- Coverage   85.06%   84.84%   -0.23%     
==========================================
  Files         326      328       +2     
  Lines       19195    19246      +51     
  Branches       44       44              
==========================================
+ Hits        16329    16330       +1     
- Misses       2840     2890      +50     
  Partials       26       26              
Impacted Files Coverage Δ
...tion/feature_importance/feature_importance_base.py 0.00% <0.00%> (ø)
...ture_importance/feature_importance_perturbation.py 0.00% <0.00%> (ø)
reagent/preprocessing/identify_types.py 97.05% <100.00%> (ø)
reagent/training/discrete_crr_trainer.py 100.00% <100.00%> (ø)

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 97e3da3...eb91cd4. Read the comment docs.

@facebook-github-bot
Copy link

This pull request has been merged in e3ea217.

xuruiyang pushed a commit that referenced this pull request Sep 20, 2025
…straint) regularization (#522)

Summary:
Pull Request resolved: #522

(1) Add coefficient "beta" (regularization factor of policy constraint regularization) on CRR's objective.
(2) Change default value for "DEFAULT_MAX_UNIQUE_ENUM" from 100 to 10

Reviewed By: czxttkl

Differential Revision: D30183147

fbshipit-source-id: 1c18610678482397bdd669fd064a27d34967881f
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants