-
Notifications
You must be signed in to change notification settings - Fork 264
Closed
Labels
Description
Overview
To enable automatic speech recognition(ASR) capabilities in Studio, we need to create a Whisper backend class that implements the base Backend class in the appnexus library. The Whisper backend class should be able to make a request to any whisper model hosted anywhere, for example Hugging Face, Open AI, Assembly AI, etc. As a start, it will perform transcription requests for Studio.
Description and outcomes
- Create a new class
Whisperthat implementsBackend:
CLASS Whisper IMPLEMENTS Backend:
METHOD make_request(params):
# make request to DeepLearning models hosted as services(e.g via Hugging Face)
METHOD connect(params):
# Implement the connect method for hosted service if any
CLASS METHOD _create_instance(cls)
# initialize a Whisper Backend instance- The class should connect to whichever backend the whisper model is being hosted(if required) by implementing
connect(). - Implement the
make_request()method that performs transcription requests. It should return the result of a transcription
Accessibility requirements
Not applicable
Acceptance criteria
- The
Whisperbackend is created and it implements theBackendclass. - The class connects to whichever service(Hugging Face, Open AI, etc) the Whisper model is being hosted(if required).
- The
make_requestmethod is implemented correctly and performs transcriptions as expected - Tests are written to validate correctness of the
Whisperbackend logic. - Documentation has been added to the backend class, explaining its purpose, inputs and outputs.