Skip to content
@Dao-AILab

Dao AI Lab

We are an AI research group led by Prof. Tri Dao

Popular repositories Loading

  1. flash-attention flash-attention Public

    Fast and memory-efficient exact attention

    Python 21.4k 2.3k

  2. quack quack Public

    A Quirky Assortment of CuTe Kernels

    Python 736 68

  3. causal-conv1d causal-conv1d Public

    Causal depthwise conv1d in CUDA, with a PyTorch interface

    Cuda 693 147

  4. sonic-moe sonic-moe Public

    Accelerating MoE with IO and Tile-aware Optimizations

    Python 508 36

  5. fast-hadamard-transform fast-hadamard-transform Public

    Fast Hadamard transform in CUDA, with a PyTorch interface

    C 268 50

  6. grouped-latent-attention grouped-latent-attention Public

    Python 133 4

Repositories

Showing 8 of 8 repositories
  • sonic-moe Public

    Accelerating MoE with IO and Tile-aware Optimizations

    Dao-AILab/sonic-moe’s past year of commit activity
    Python 508 Apache-2.0 36 7 1 Updated Jan 5, 2026
  • flash-attention Public

    Fast and memory-efficient exact attention

    Dao-AILab/flash-attention’s past year of commit activity
    Python 21,430 BSD-3-Clause 2,262 923 103 Updated Jan 5, 2026
  • quack Public

    A Quirky Assortment of CuTe Kernels

    Dao-AILab/quack’s past year of commit activity
    Python 736 Apache-2.0 67 15 1 Updated Jan 1, 2026
  • causal-conv1d Public

    Causal depthwise conv1d in CUDA, with a PyTorch interface

    Dao-AILab/causal-conv1d’s past year of commit activity
    Cuda 693 BSD-3-Clause 147 32 10 Updated Dec 23, 2025
  • fast-hadamard-transform Public

    Fast Hadamard transform in CUDA, with a PyTorch interface

    Dao-AILab/fast-hadamard-transform’s past year of commit activity
    C 268 BSD-3-Clause 50 8 2 Updated Oct 20, 2025
  • cutlass Public Forked from NVIDIA/cutlass

    CUDA Templates for Linear Algebra Subroutines

    Dao-AILab/cutlass’s past year of commit activity
    C++ 1 1,618 0 0 Updated Jun 8, 2025
  • Dao-AILab/grouped-latent-attention’s past year of commit activity
    Python 133 MIT 4 5 0 Updated May 30, 2025
  • gemm-cublas Public
    Dao-AILab/gemm-cublas’s past year of commit activity
    Python 23 Apache-2.0 1 0 0 Updated May 5, 2025

Most used topics

Loading…