Skip to content
@swiss-ai

swiss-ai

Popular repositories Loading

  1. mmore mmore Public

    Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets …

    Python 180 37

  2. apertus-tech-report apertus-tech-report Public

    Tech Report of the Apertus LLM Suite

    127 4

  3. pretrain-data pretrain-data Public

    Pretraining data reconstruction scripts for Apertus

    Python 111 10

  4. Megatron-LM Megatron-LM Public

    Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    Python 43 19

  5. MoE MoE Public

    some mixture of experts architecture implementations

    Python 25 3

  6. apertus-finetuning-recipes apertus-finetuning-recipes Public

    Python 22 11

Repositories

Showing 10 of 59 repositories
  • swiss-ai/benchmark-image-tokenzier’s past year of commit activity
    Jupyter Notebook 0 2 0 1 Updated Jan 6, 2026
  • mmore Public

    Massive Multimodal Open RAG & Extraction A scalable multimodal pipeline for processing, indexing, and querying multimodal documents Ever needed to take 8000 PDFs, 2000 videos, and 500 spreadsheets and feed them to an LLM as a knowledge base? Well, MMORE is here to help you!

    swiss-ai/mmore’s past year of commit activity
    Python 180 Apache-2.0 37 10 7 Updated Jan 6, 2026
  • verl Public Forked from volcengine/verl

    verl: Volcano Engine Reinforcement Learning for LLMs

    swiss-ai/verl’s past year of commit activity
    Python 0 Apache-2.0 2,981 0 0 Updated Jan 6, 2026
  • sglang Public Forked from sgl-project/sglang

    SGLang is a fast serving framework for large language models and vision language models.

    swiss-ai/sglang’s past year of commit activity
    Python 0 Apache-2.0 3,981 0 0 Updated Jan 5, 2026
  • nanotron_climllama Public

    Minimalistic large language model 3D-parallelism training

    swiss-ai/nanotron_climllama’s past year of commit activity
    Python 1 Apache-2.0 0 0 0 Updated Jan 5, 2026
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    swiss-ai/Megatron-LM’s past year of commit activity
    Python 43 3,537 6 18 Updated Jan 5, 2026
  • fm-service Public Forked from ResearchComputer/fm-proxy

    newer llm service

    swiss-ai/fm-service’s past year of commit activity
    Python 1 3 0 0 Updated Jan 5, 2026
  • swiss-ai/multimodal-data’s past year of commit activity
    Jupyter Notebook 0 0 0 1 Updated Jan 4, 2026
  • swiss-ai/posttraining-data’s past year of commit activity
    Python 4 0 1 0 Updated Dec 28, 2025
  • swiss-ai/model-spinning’s past year of commit activity
    Python 9 2 0 0 Updated Dec 27, 2025

Most used topics

Loading…