Pretraining and inference code for a large-scale depth-recurrent language model (Python; updated Dec 29, 2025).
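For orientation, here is a minimal sketch of what depth recurrence means in practice: a single weight-tied core block is applied a variable number of times, with the token embeddings re-injected at every step, so compute depth can be scaled at inference time without adding parameters. All names and shapes below are hypothetical illustrations, not this repo's API:

```python
import torch
import torch.nn as nn

class DepthRecurrentLM(nn.Module):
    """Toy depth-recurrent LM: one shared block looped `recurrences` times."""

    def __init__(self, vocab_size: int, d_model: int = 512, n_heads: int = 8):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        # One weight-tied block, reused at every recurrence step.
        self.core = nn.TransformerEncoderLayer(
            d_model, n_heads, dim_feedforward=4 * d_model,
            batch_first=True, norm_first=True,
        )
        self.inject = nn.Linear(2 * d_model, d_model)  # mixes state with input
        self.lm_head = nn.Linear(d_model, vocab_size)

    def forward(self, tokens: torch.Tensor, recurrences: int = 4) -> torch.Tensor:
        e = self.embed(tokens)                                   # (B, T, D)
        mask = nn.Transformer.generate_square_subsequent_mask(tokens.size(1))
        state = torch.zeros_like(e)  # initial latent state (random init also common)
        for _ in range(recurrences):  # depth grows with this loop; parameters do not
            state = self.core(
                self.inject(torch.cat([state, e], dim=-1)), src_mask=mask
            )
        return self.lm_head(state)                               # (B, T, vocab)

logits = DepthRecurrentLM(32000)(torch.randint(0, 32000, (2, 16)))  # (2, 16, 32000)
```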
Research and training stack for AVA — a tool-using, memory-aware virtual assistant targeting 4 GB VRAM. Spans custom transformers, verifier-RL, external memory, multi-domain benchmarks, and Gemma 4 inference optimization.
Recurrent-depth transformer, fixed. Fork of kyegomez/OpenMythos with scatter-based MoE (2.94x faster), proper ACT halting, DeepSeekMoE load balancing, SDPA kernels, and a working training loop.
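The scatter-based dispatch likely refers to a standard MoE optimization: rather than running every expert over a masked copy of all tokens, tokens are sorted by their routed expert so that each expert performs one dense matmul over a contiguous slice, and the outputs are scattered back to the original order. A hypothetical top-1 sketch (load-balancing loss and capacity limits omitted; none of these names come from the fork):

```python
import torch
import torch.nn as nn

class ScatterMoE(nn.Module):
    """Top-1 MoE with sort-based (scatter) dispatch instead of masked loops."""

    def __init__(self, d_model: int = 512, n_experts: int = 8):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                          nn.Linear(4 * d_model, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        B, T, D = x.shape
        flat = x.reshape(-1, D)                       # (N, D) flattened tokens
        gate = self.router(flat).softmax(-1)          # (N, E) routing probs
        weight, expert = gate.max(-1)                 # top-1 expert per token
        order = expert.argsort()                      # group tokens by expert
        grouped = flat[order]
        counts = torch.bincount(expert, minlength=len(self.experts))
        out = torch.empty_like(grouped)
        start = 0
        for i, n in enumerate(counts.tolist()):       # one dense matmul per expert
            out[start:start + n] = self.experts[i](grouped[start:start + n])
            start += n
        restored = torch.empty_like(out)
        restored[order] = out                         # scatter back to token order
        return (restored * weight.unsqueeze(-1)).reshape(B, T, D)

y = ScatterMoE()(torch.randn(2, 16, 512))  # (2, 16, 512)
```

The speedup in such schemes comes from replacing per-expert boolean masking of the full token batch with contiguous slices, so each expert launches a single dense matmul over only its assigned tokens; whether that accounts for the quoted 2.94x is specific to this fork's benchmarks.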