DoradusResearch

Doradus Research DoradusResearch

Popular repositories Loading

MiroThinker-v1.0-30B-FP8 MiroThinker-v1.0-30B-FP8 Public

MiroThinker 30B FP8 Dynamic Quantization

Dockerfile
Hermes-4.3-36B-FP8 Hermes-4.3-36B-FP8 Public

Hermes-4.3-36B FP8 Quantization

Dockerfile
RnJ-1-Instruct-FP8 RnJ-1-Instruct-FP8 Public

RnJ-1-Instruct FP8 Quantization

Python
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
blog blog Public

Forked from huggingface/blog

Public repo for HF blog posts

Jupyter Notebook
flash-attention flash-attention Public

Doradus internal fork: Dao-AILab/flash-attention + open SM_120 PR chain (#2336/#2348/#2349/#2389/#2553/#2439). Build scripts: github.com/DoradusResearch/doradus/tree/develop/infra/flash-attention-f…

Python