Fast inference #7

Merged
jlamypoirier merged 74 commits into main from fast_inference on Mar 2, 2023
@jlamypoirier
Collaborator

No description provided.

bigximik and others added 30 commits August 31, 2022 04:41
* Added onnx config whisper

* added whisper support onnx

* add audio input data

* added whisper support onnx

* fixed the seqlength value

* Updated the whisper onnx config

* restore files to old version

* removed attention mask from inputs

* Updated get_dummy_input_onnxruntime docstring

* Updated relative imports and token generation

* update docstring
* Add ESMFold code sample

* sorry sylvain

* make fixup

* sorry sylvain again
@jlamypoirier jlamypoirier changed the base branch from main to more_optimizations February 24, 2023 02:01
self.upcast = self.softmax_dtype != self.dtype

do_unscale = attn.scale_attention_softmax_in_fp32 and self.upcast
self.unscale = [i + 1.0 if do_unscale else 1.0 for i in range(self.n_layer)]
Collaborator

What exactly does this line do?
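
For reference, here is a minimal sketch of what the list comprehension in the snippet evaluates to. The helper name build_unscale and the example layer count are hypothetical, not from the PR. The conditional expression binds more loosely than `+`, so each element is `(i + 1.0) if do_unscale else 1.0`: one factor per layer, equal to the 1-based layer index when do_unscale is true and 1.0 otherwise. How those factors are consumed later (presumably to undo a per-layer scaling of the attention scores when the softmax runs in fp32) is an assumption; the diff shown here does not say.

```python
from typing import List


def build_unscale(n_layer: int, do_unscale: bool) -> List[float]:
    """Hypothetical standalone copy of the expression from the diff."""
    # Parsed as: [(i + 1.0) if do_unscale else 1.0 for i in range(n_layer)]
    return [i + 1.0 if do_unscale else 1.0 for i in range(n_layer)]


if __name__ == "__main__":
    # With unscaling enabled, layer i gets the factor i + 1.
    print(build_unscale(4, True))   # [1.0, 2.0, 3.0, 4.0]
    # With unscaling disabled, every layer gets a neutral factor of 1.
    print(build_unscale(4, False))  # [1.0, 1.0, 1.0, 1.0]
```

Multiplying by these factors is a no-op in the disabled case, which would let the same code path serve both settings without branching per layer.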

Base automatically changed from more_optimizations to main March 2, 2023 19:23
@jlamypoirier jlamypoirier merged commit d4451b4 into main Mar 2, 2023
@jlamypoirier jlamypoirier deleted the fast_inference branch March 2, 2023 19:26

6 participants