A CUDA API interception library that simulates GPU devices in non-GPU environments, enabling basic operations for PyTorch and other deep learning frameworks.
- CUDA Driver API - Device management, memory allocation, kernel launch
- CUDA Runtime API - cudaMalloc/Free, cudaMemcpy, Stream, Event
- cuBLAS/cuBLASLt - Matrix operations (GEMM, PyTorch 2.x compatible)
- NVML API - GPU information queries
- Python API Wrapper - `import fakegpu; fakegpu.init()` enables FakeGPU from inside Python
- PyTorch Support - Basic tensor ops, linear layers, neural networks
- GPU Tool Compatibility - Works with existing GPU status monitoring tools (nvidia-smi, gpustat, etc.); see the example below
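To give a taste of what the stubs surface: once FakeGPU is preloaded (setup shown in the run instructions further down), standard PyTorch device queries return the simulated hardware. The device count and name below assume the default build configuration described under Core Design.

```python
# Run under the FakeGPU preload (setup shown in the usage section below).
import torch

print(torch.cuda.is_available())      # True, even without physical GPUs
print(torch.cuda.device_count())      # 8 with the default build
print(torch.cuda.get_device_name(0))  # "Fake NVIDIA A100-SXM4-80GB"
```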
- Detailed Reporting - More comprehensive documentation and analysis reports
- Multi-Node GPU Communication - Simulate cross-node GPU communication (NCCL, etc.)
- Enhanced Testing - Expand the test suite to cover more languages and runtime environments
- Preset GPU Info - Add more preset GPU hardware configurations
- Multi-Architecture & Data Types - Support different GPU architectures and various data storage/memory types
```bash
cmake -S . -B build
cmake --build build
```

Generated libraries:

- `build/libcuda.so.1` - CUDA Driver API
- `build/libcudart.so.12` - CUDA Runtime API
- `build/libcublas.so.12` - cuBLAS/cuBLASLt API
- `build/libnvidia-ml.so.1` - NVML API
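As an optional sanity check, you can load one of the stub libraries directly with `ctypes` and call a standard CUDA Runtime entry point (this sketch assumes you run from the repository root; adjust the path otherwise):

```python
# Optional sanity check: load the stub runtime directly and query the device count.
# Assumes the build output is in ./build.
import ctypes

cudart = ctypes.CDLL("./build/libcudart.so.12")
count = ctypes.c_int(0)
status = cudart.cudaGetDeviceCount(ctypes.byref(count))  # cudaError_t cudaGetDeviceCount(int*)
print("status:", status, "devices:", count.value)
```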
Comparison test (recommended):

```bash
./test/run_comparison.sh
```

Runs identical tests on both a real GPU and FakeGPU to verify correctness.
PyTorch test:

```bash
LD_LIBRARY_PATH=./build:$LD_LIBRARY_PATH \
LD_PRELOAD=./build/libcublas.so.12:./build/libcudart.so.12:./build/libcuda.so.1:./build/libnvidia-ml.so.1 \
python3 test/test_comparison.py --mode fake
```

```python
import torch
# All PyTorch CUDA operations are intercepted by FakeGPU
device = torch.device('cuda:0')
x = torch.randn(100, 100, device=device)
y = torch.randn(100, 100, device=device)
z = x @ y # Matrix multiplication
# Simple neural network
model = torch.nn.Linear(100, 50).to(device)
output = model(x)
```

Runtime requires preloading all libraries:

```bash
LD_LIBRARY_PATH=./build:$LD_LIBRARY_PATH \
LD_PRELOAD=./build/libcublas.so.12:./build/libcudart.so.12:./build/libcuda.so.1:./build/libnvidia-ml.so.1 \
python your_script.py
```

Python wrapper (no need to start Python with LD_PRELOAD):

```python
import fakegpu
# Call early (before importing torch / CUDA-using libraries)
fakegpu.init()
import torch
```
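`fakegpu.init()` has to run before any CUDA-using library loads its bindings. One plausible way such a hook can work is to re-exec the interpreter with the stubs preloaded; the sketch below illustrates that idea only and is not the actual `fakegpu` source (the `FAKEGPU_ACTIVE` guard variable is hypothetical):

```python
# Hypothetical sketch of an init hook, not the actual fakegpu implementation:
# re-exec the current interpreter with the stub libraries in LD_PRELOAD.
import os
import sys

def init(build_dir="./build"):
    if os.environ.get("FAKEGPU_ACTIVE") == "1":  # hypothetical guard variable
        return  # already re-exec'd with the stubs preloaded
    libs = ["libcublas.so.12", "libcudart.so.12", "libcuda.so.1", "libnvidia-ml.so.1"]
    env = dict(os.environ,
               FAKEGPU_ACTIVE="1",
               LD_PRELOAD=":".join(os.path.join(build_dir, lib) for lib in libs),
               LD_LIBRARY_PATH=build_dir + ":" + os.environ.get("LD_LIBRARY_PATH", ""))
    os.execve(sys.executable, [sys.executable] + sys.argv, env)
```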
Shortcut runner:

```bash
./fgpu python your_script.py
# Optional: FAKEGPU_BUILD_DIR=/path/to/build ./fgpu python your_script.py
```

Python runner (installs the fakegpu console script):

```bash
fakegpu python your_script.py
# or: python -m fakegpu python your_script.py
```

GPU tools (nvidia-smi):
```bash
# FakeGPU-simulated devices via NVML stubs
./fgpu nvidia-smi
# Temperatures may show N/A because the TemperatureV struct is not fully emulated yet.
```
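The same NVML stubs back programmatic queries. A hedged example using the `pynvml` bindings (from the `nvidia-ml-py` package, an assumed extra dependency, not part of this repo), run under `./fgpu` so that `libnvidia-ml.so.1` resolves to the stub:

```python
# Run as: ./fgpu python nvml_query.py  (pynvml assumed installed separately)
import pynvml

pynvml.nvmlInit()
for i in range(pynvml.nvmlDeviceGetCount()):
    handle = pynvml.nvmlDeviceGetHandleByIndex(i)
    print(i, pynvml.nvmlDeviceGetName(handle))  # fake device names from the stub
pynvml.nvmlShutdown()
```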
| Test | Status | Description |
|---|---|---|
| Tensor creation | ✓ | Basic memory allocation |
| Element-wise ops | ✓ | Add, multiply, trigonometric |
| Matrix multiplication | ✓ | cuBLAS/cuBLASLt GEMM |
| Linear layer | ✓ | PyTorch nn.Linear |
| Neural network | ✓ | Multi-layer forward pass |
| Memory transfer | ✓ | CPU ↔ GPU data copy |
```
FakeGPU
├── src/
│   ├── core/     # Global state and device management
│   ├── cuda/     # CUDA Driver/Runtime API stubs
│   ├── cublas/   # cuBLAS/cuBLASLt API stubs
│   ├── nvml/     # NVML API stubs
│   └── monitor/  # Resource monitoring and reporting
└── test/         # Test scripts
```
Core Design:

- Uses `LD_PRELOAD` to intercept CUDA API calls
- Device memory backed by system RAM (malloc/free)
- Matrix operations return random values (no actual computation; see the demonstration after this list)
- Kernel launches are no-ops (logging only)
- Default build exposes eight `Fake NVIDIA A100-SXM4-80GB` devices to mirror common server nodes
- GPU parameters are edited in YAML under `profiles/*.yaml`; CMake embeds these files at build time, so no runtime file lookup is needed. Add or tweak a file, rerun `cmake -S . -B build`, and the new profiles are compiled in.
- Presets cover multiple compute capabilities (Maxwell→Blackwell) and feed the existing helpers (`GpuProfile::GTX980/P100/V100/T4/A40/A100/H100/L40S/B100/B200`), which now prefer the YAML data and fall back to code defaults if parsing fails.
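Because GEMM results are random rather than computed, numerical checks against a CPU reference fail by design. A minimal demonstration (assuming the script is launched under the FakeGPU preload, e.g. via `./fgpu`):

```python
# Assumes this runs under the FakeGPU preload (e.g. ./fgpu python demo.py).
import torch

a = torch.ones(4, 4)
b = torch.ones(4, 4)
ref = a @ b                         # real CPU result: every entry is 4.0
fake = (a.cuda() @ b.cuda()).cpu()  # FakeGPU GEMM: random values
print(torch.allclose(ref, fake))    # expected: False
```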
- ❌ No real GPU computation (kernels are no-ops)
- ❌ Complex models (e.g., Transformers) may require APIs that are not yet implemented
- ❌ No multi-GPU synchronization
⚠️ For testing and development environments only
- ✅ Running GPU code tests in CI/CD environments
- ✅ Debugging deep learning code on machines without GPUs
- ✅ Validating CUDA API call logic
- ✅ Prototyping and unit testing
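For instance, a hypothetical smoke-test module like the one below can run in a GPU-less CI job (assuming it is launched under the preload, e.g. `./fgpu python -m pytest`, against the default eight-device build):

```python
# Hypothetical CI smoke test; run under the FakeGPU preload, e.g.:
#   ./fgpu python -m pytest test_smoke.py
import torch

def test_fake_devices_visible():
    assert torch.cuda.is_available()
    assert torch.cuda.device_count() == 8  # default build: eight fake A100s

def test_linear_layer_forward_shape():
    model = torch.nn.Linear(100, 50).to('cuda')
    out = model(torch.randn(8, 100, device='cuda'))
    assert out.shape == (8, 50)  # shapes are real even though values are fake
```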
- CMake 3.14+
- C++17 compiler
- Python 3.8+ (for testing)
- PyTorch 2.x (optional, for testing)
MIT License
- Test Guide - Detailed testing instructions
- cuBLASLt Implementation - cuBLASLt support details