Implementation of the MC-ViT model from the paper "Memory Consolidation Enables Long-Context Video Understanding"
Updated Mar 13, 2026 - Python
Multi-modal BiTransformer (reimplementation) in PyTorch that actually works!
ARC-AGI Hybrid Architecture: a standalone neural-symbolic solver. Features: Perception (multi-modal Transformers for grid-to-latent mapping), Reasoning (GNNs for entity-relational graph processing), and Synthesis (efficient DSL search guided by neural embeddings and D4 group symmetry).
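For readers unfamiliar with the term, D4 is the symmetry group of the square: four rotations, each with and without a reflection, giving eight transforms in total. The sketch below only illustrates what those eight transforms look like on a small grid and how a canonical representative can be picked. It is not taken from the repository; the names `rotate90`, `flip_horizontal`, `d4_orbit`, and `canonical_form` are hypothetical, and how the solver actually uses D4 in its search is not described here.

```python
from typing import List, Sequence, Tuple

Grid = Tuple[Tuple[int, ...], ...]


def rotate90(grid: Grid) -> Grid:
    """Rotate a grid 90 degrees clockwise."""
    # Reverse the row order, then transpose.
    return tuple(zip(*grid[::-1]))


def flip_horizontal(grid: Grid) -> Grid:
    """Mirror a grid left-to-right."""
    return tuple(row[::-1] for row in grid)


def d4_orbit(grid: Sequence[Sequence[int]]) -> List[Grid]:
    """Return the 8 images of a grid under D4 (4 rotations, each with and without a flip)."""
    g: Grid = tuple(tuple(row) for row in grid)
    images: List[Grid] = []
    for _ in range(4):
        images.append(g)
        images.append(flip_horizontal(g))
        g = rotate90(g)
    return images


def canonical_form(grid: Sequence[Sequence[int]]) -> Grid:
    """Pick one representative per orbit (here: the lexicographically smallest image)."""
    return min(d4_orbit(grid))
```

One plausible use, stated as an assumption: if two candidate grids share the same `canonical_form`, a search procedure could treat them as equivalent and explore only one of them.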