Highlights
- Pro
Pinned Loading
-
arithmetic
arithmetic PublicCode to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)
-
gemstone-scaling-laws
gemstone-scaling-laws PublicGemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)
-
-
-
retrofitting-recurrence
retrofitting-recurrence PublicTeaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.




