You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
@SunMarc it's the other way around, the chat shouldn't pass the request ID (it's not a field in the API) 🤗
I thought this was already solved, a user opened a related PR a few days ago. Perhaps not 🤔
Yeah, it is not part of the create oai api but this is why we are passing it in extra_body + TransformersCompletionCreateParamsStreaming no so that we can pass it in generate_chat_completion just like generation_config no ?
In generate_chat_completion, we try to get the request_id also request_id = req.get("request_id", "req_0")
The ID is something that is created by the server at generation time, mostly for internal server logging purposes. It is returned by the server (API), but not passed through a request.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
What does this PR do?
This PR fixes transformers serve as it is currently not working due to an unexpected field being passed
request_idreproducer
the second message will trigger an error.