[DEPRECATED] Moved to ROCm/rocm-systems repo
-
Updated
Dec 5, 2025 - Python
[DEPRECATED] Moved to ROCm/rocm-systems repo
(Spring 2017) Assignment 2: GPU Executor
A self-hosted low-level functional-style programming language 🌀
A lightweight utility for monitoring and analyzing Triton kernel compilation cache behavior.
Research: Reproducible benchmarks for batch-invariant LLM inference across models & GPUs (A10, A100, H100)
Triton optimizations ran on AMD GPU
Add a description, image, and links to the gpu-kernels topic page so that developers can more easily learn about it.
To associate your repository with the gpu-kernels topic, visit your repo's landing page and select "manage topics."