Remove custom pre attention scaling and use computed value instead.

51f0bd5
Merged

Add attention and final logit soft-capping, update scaling factor to Gemma2 #8197
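
For readers unfamiliar with the term, soft-capping is commonly implemented as a scaled `tanh` that squashes logits smoothly into the range `(-cap, cap)`. The sketch below illustrates that general technique only; it is not the code from this PR. The function name `soft_cap` and the cap values (50.0 for attention logits, 30.0 for final logits) are assumptions chosen for illustration.

```python
import math


def soft_cap(logits, cap):
    """Smoothly bound each logit to (-cap, cap) via cap * tanh(x / cap).

    Hypothetical helper for illustration; not taken from the PR.
    """
    return [cap * math.tanh(x / cap) for x in logits]


# Assumed, illustrative cap values (not read from the PR).
ATTN_CAP = 50.0
FINAL_CAP = 30.0

attn_scores = [-200.0, -1.0, 0.0, 1.0, 200.0]
capped_attn = soft_cap(attn_scores, ATTN_CAP)    # applied before softmax
capped_final = soft_cap(attn_scores, FINAL_CAP)  # applied to output logits
```

Small values pass through almost unchanged because `tanh(x) ≈ x` near zero, while large values saturate toward the cap instead of being clipped abruptly.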
