Run AI models too large for your Mac's memory — at near-full speed. Intelligent expert caching, speculative execution, and 15+ research techniques for MoE inference on Apple Silicon.
Production RAG system for scientific literature synthesis with SPECTER2 embeddings, Metal GPU acceleration, multi-LLM support, and automatic BibTeX citations.
First native Apple Silicon (MLX) port of Diffusion Policy (RSS 2023 Best Paper). 6 policy variants, 472 tests, Metal GPU verified. Train and run visuomotor diffusion policies on M-series — no CUDA required.
GPU-accelerated quantum circuit simulator for Apple Silicon (MLX) with Google Willow, IBM Heron, and QuTech Tuna-9 noise models. Up to 34 qubits on M2 Ultra.
Comprehensive VAE performance benchmark comparing PyTorch vs TensorFlow on Apple Silicon (M1/M2/M3). Quantifies training speed, memory efficiency, and Metal GPU utilization across Python versions to guide framework selection for ML prototyping and production deployment.
The NVIDIA Container Toolkit for Mac — Give any Docker container full Apple Silicon Metal GPU access. 100+ GPU operations, LLM inference, training, image gen, audio, embeddings. Zero CUDA. Just Metal.
LeRobot-MLX: HuggingFace LeRobot ported to Apple MLX for native Apple Silicon robotics policy training & inference. 10 policies, 739+ tests, Metal GPU accelerated.
GPU-accelerated robot motion planning on Apple Silicon. Port of NVIDIA cuRobo (CUDA) to MLX — real-time collision-free trajectory generation on M-series Macs.