Log_props output in TRT-LLM #11009

Closed
oyilmaz-nvidia wants to merge 52 commits into NVIDIA-NeMo:main from oyilmaz-nvidia:onur/log-props

@oyilmaz-nvidia
Collaborator

What does this PR do ?

Adds the ability to return log probabilities (log_probs) from TRT-LLM through PyTriton.

oyilmaz-nvidia and others added 30 commits June 13, 2024 10:12
Signed-off-by: Onur Yilmaz <oyilmaz@nvidia.com>
@oyilmaz-nvidia
Collaborator Author

@athitten If this PR works for you, I'll do some cleanup and start the CI.

@athitten
Collaborator

athitten commented Oct 25, 2024

> @athitten If this PR works for you, I'll do some cleanup and start the CI.

@oyilmaz-nvidia sorry, I didn't get a chance to test it out yet. I will test it tomorrow. But the PR looks good otherwise. Thank you!

Collaborator

@athitten athitten left a comment

Tested it locally and it works. The logProbs of the output tokens are as expected, but the logProbs of the input tokens are all 0.0. There is probably a flag or setting on the TRT-LLM side that enables logProbs for input tokens as well.
Thank you @oyilmaz-nvidia
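
The behaviour described in this review (input-token logProbs all 0.0, output-token logProbs populated) can be checked with a small helper. The following is only a sketch and not code from this PR: it assumes the returned log probabilities form a flat per-token list with the prompt tokens first and the generated tokens after them, and the function names and sample values are hypothetical.

```python
import math
from typing import List, Tuple


def split_log_probs(log_probs: List[float], num_input_tokens: int) -> Tuple[List[float], List[float]]:
    """Split a per-token log-prob list into (input, output) parts.

    Assumption: the list covers the prompt tokens first, then the generated tokens.
    """
    if not 0 <= num_input_tokens <= len(log_probs):
        raise ValueError("num_input_tokens is out of range")
    return log_probs[:num_input_tokens], log_probs[num_input_tokens:]


def input_log_probs_missing(input_log_probs: List[float]) -> bool:
    """Return True when every prompt position holds exactly 0.0 (looks unfilled)."""
    return len(input_log_probs) > 0 and all(lp == 0.0 for lp in input_log_probs)


def sequence_log_likelihood(output_log_probs: List[float]) -> float:
    """Sum the generated-token log-probs to get the log-likelihood of the output."""
    return math.fsum(output_log_probs)


# Made-up values for illustration: three prompt tokens, three generated tokens.
example = [0.0, 0.0, 0.0, -0.5, -1.25, -0.25]
prompt_lp, generated_lp = split_log_probs(example, num_input_tokens=3)

if input_log_probs_missing(prompt_lp):
    print("input-token log-probs are all 0.0; they were probably not computed")
print("output log-likelihood:", sequence_log_likelihood(generated_lp))
```

A check like this only flags the symptom; whether input-token log probabilities can be enabled depends on the TRT-LLM configuration, which this thread does not settle.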

@oyilmaz-nvidia oyilmaz-nvidia enabled auto-merge (squash) October 28, 2024 15:44
@oyilmaz-nvidia
Collaborator Author

Closing this since it'll be included here: #11233
