Add support for BCE loss for reward decomposition.#573
Closed
j-jiafei wants to merge 1 commit intofacebookresearch:mainfrom
Closed
Add support for BCE loss for reward decomposition.#573j-jiafei wants to merge 1 commit intofacebookresearch:mainfrom
j-jiafei wants to merge 1 commit intofacebookresearch:mainfrom
Conversation
|
This pull request was exported from Phabricator. Differential Revision: D31783145 |
1 similar comment
|
This pull request was exported from Phabricator. Differential Revision: D31783145 |
j-jiafei
pushed a commit
to j-jiafei/ReAgent
that referenced
this pull request
Nov 3, 2021
) Summary: Pull Request resolved: facebookresearch#573 Add BCE loss for reward decomposition. BCE loss decomposition loss is defined between avg(p) and avg(y). Reviewed By: czxttkl Differential Revision: D31783145 fbshipit-source-id: af46052812f517f5679babc9be2a7633e79f64be
bb96fd7 to
fbafe12
Compare
j-jiafei
pushed a commit
to j-jiafei/ReAgent
that referenced
this pull request
Nov 3, 2021
) Summary: Pull Request resolved: facebookresearch#573 Add BCE loss for reward decomposition. BCE loss decomposition loss is defined between avg(p) and avg(y). Reviewed By: czxttkl Differential Revision: D31783145 fbshipit-source-id: 180134dba517ca507ab86693bf165221d08fa21d
fbafe12 to
4b5c467
Compare
|
This pull request was exported from Phabricator. Differential Revision: D31783145 |
j-jiafei
pushed a commit
to j-jiafei/ReAgent
that referenced
this pull request
Nov 3, 2021
) Summary: Pull Request resolved: facebookresearch#573 Add BCE loss for reward decomposition. BCE loss decomposition loss is defined between avg(p) and avg(y). Reviewed By: czxttkl Differential Revision: D31783145 fbshipit-source-id: 83807361904ca198d2b5470fd0868877f8dd5a0f
|
This pull request was exported from Phabricator. Differential Revision: D31783145 |
4b5c467 to
6ddb6da
Compare
j-jiafei
pushed a commit
to j-jiafei/ReAgent
that referenced
this pull request
Nov 3, 2021
) Summary: Pull Request resolved: facebookresearch#573 Add BCE loss for reward decomposition. BCE loss decomposition loss is defined between avg(p) and avg(y). Reviewed By: czxttkl Differential Revision: D31783145 fbshipit-source-id: d58ced069be20f472bf3c64b00efab7ec64e7918
6ddb6da to
650d2e6
Compare
|
This pull request was exported from Phabricator. Differential Revision: D31783145 |
) Summary: Pull Request resolved: facebookresearch#573 Add BCE loss for reward decomposition. BCE loss decomposition loss is defined between avg(p) and avg(y). Reviewed By: czxttkl Differential Revision: D31783145 fbshipit-source-id: 9fe83444d47243e0507b861638644cf0819c4c8d
650d2e6 to
bbbecc0
Compare
|
This pull request was exported from Phabricator. Differential Revision: D31783145 |
|
This pull request has been merged in e2c2674. |
xuruiyang
pushed a commit
that referenced
this pull request
Sep 20, 2025
Summary: Pull Request resolved: #573 Add BCE loss for reward decomposition. BCE loss decomposition loss is defined between avg(p) and avg(y). Reviewed By: czxttkl Differential Revision: D31783145 fbshipit-source-id: fd5c46f406a708ed3f9b9323b70cfdf8252478ec
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary:
Add BCE loss for reward decomposition.
BCE loss decomposition loss is defined between avg(p) and avg(y).
Differential Revision: D31783145