This repository hosts a containerized implementation of Microsoft's TRELLIS image-to-3D generation pipeline, optimized with FP16 mixed precision for efficient GPU usage and automatic GLB export. It lets users generate high-quality 3D models from single images or multi-view inputs through a simple web interface, reducing VRAM requirements by ~50% while maintaining generation quality.
To get up and running, build and run from source:
```
# Clone and run with Docker
$ git clone https://github.com/off-by-some/TRELLIS-BOX && \
    cd TRELLIS-BOX && ./trellis.sh run  # Add --dev for hot reloading
```

Or, if you prefer, pull and run the pre-built Docker image:

```
$ docker run --gpus all -it -p 8501:8501 \
    -v ~/.cache/trellis-box:/root/.cache \
    -v ~/.cache/rembg:/root/.u2net \
    -v $(pwd)/outputs:/tmp/Trellis-demo \
    cassidybridges/trellis-box:latest
```
Then simply open http://localhost:8501 in your browser to access the web interface. See the Docker Configuration Guide for more detailed instructions & configurations.
Here's the current command reference for trellis.sh:
```
🚀 TRELLIS Docker Manager

Usage: ./trellis.sh <command> [options]

Commands:
  run      - Start TRELLIS (builds image, checks GPU, etc.)
  dev      - Quick development mode (requires docker-compose)
  restart  - Restart a stopped TRELLIS container
  stop     - Stop the running TRELLIS container
  status   - Show TRELLIS status and system info
  build    - Build the Docker image only
  check    - Check GPU memory availability

Options:
  --dev, -v          Enable development mode with hot reloading
  --diagnostics, -d  Run diagnostics (run command only)

Examples:
  ./trellis.sh run                 # Start TRELLIS with full setup
  ./trellis.sh run --dev           # Start in development mode (hot reloading)
  ./trellis.sh dev                 # Quick dev mode (no GPU checks)
  ./trellis.sh status              # Check current status
  ./trellis.sh stop                # Stop TRELLIS
  ./trellis.sh run --diagnostics   # Run GPU diagnostics
```
Development Mode (with hot reloading):
```
$ ./trellis.sh run --dev
```

This enables hot reloading for UI and app changes. Perfect for interface development and debugging Streamlit issues. Note: core algorithm changes require rebuilding the image.
Transform a single 2D image into a detailed 3D model. Perfect for product visualization, character design, or architectural concepts. The pipeline automatically removes backgrounds and generates textured meshes ready for 3D printing or game engines.
Upload 2-4 images from different angles to improve generation quality and reduce artifacts. Ideal for complex objects where a single viewpoint isn't sufficient, such as detailed mechanical parts or intricate sculptures.
Integrate into automated pipelines for content creation studios. Generate multiple 3D assets from image collections with consistent quality and automatic GLB export for seamless import into downstream tools.
Rapidly prototype 3D concepts from sketches or reference images. The FP16 optimizations make it accessible for researchers working with limited GPU resources, enabling faster iteration cycles.
- Docker: Install Docker Engine with NVIDIA Container Toolkit support
- GPU: NVIDIA GPU with at least 8GB VRAM (recommended: 16GB+)
- Platform: Linux, macOS, or Windows with Docker Desktop
- Storage: ~20GB free space for models and Docker layers
- Other GPUs: NVIDIA-only for now; pull requests welcome for additional GPU support!
- Install NVIDIA Container Toolkit (Linux):
If you're running Arch/Manjaro:
$ sudo pacman -Syu nvidia-container-toolkit
Ubuntu / Debian:

```
# First add the NVIDIA container toolkit repository
$ distribution=$(. /etc/os-release; echo $ID$VERSION_ID)
$ curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | \
    sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg
# This ensures the packages you install are from NVIDIA's repository, properly signed.
$ curl -s -L https://nvidia.github.io/libnvidia-container/$distribution/libnvidia-container.list | \
    sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \
    sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list
# Update & install the toolkit
$ sudo apt-get update && sudo apt-get install -y nvidia-container-toolkit
```
- Verify GPU access:

  ```
  # Quick test
  $ docker run --rm --gpus all nvidia/cuda:12.3.2-cudnn9-runtime-ubuntu22.04 nvidia-smi
  ```
For development or custom modifications, build from source:
```
$ ./scripts/build.sh  # Build the Docker image
$ ./scripts/run.sh    # Start the container
```

If running on a remote GPU machine (e.g. via SSH), ensure proper GPU access:
- SSH into your GPU machine: `ssh user@your-gpu-server-ip`
- Verify GPU access: `./scripts/check_gpu.sh`
- Run TRELLIS: `./scripts/run.sh`
- Access the web interface:
  - Local: http://localhost:8501
  - Remote: http://your-gpu-server-ip:8501 (ensure your firewall allows port 8501)
To share your image on Docker Hub:
```
# Publish with version tag
$ ./scripts/publish.sh v1.0.0

# Or publish as latest
$ ./scripts/publish.sh latest
```

The script will prompt for your Docker Hub username and handle login if needed.
- Fork and clone the repository
- Ensure Docker and NVIDIA Container Toolkit are installed
- Build the development image: `./trellis.sh build`
- Start the container in development mode: `./trellis.sh run --dev`
- Modify `app.py` or other source files
- Restart the container: `./trellis.sh restart`
- Test the web interface at http://localhost:8501
- Check Docker logs: `docker logs trellis-box`
- Create a feature branch from `main`
- Make your changes with clear commit messages
- Test thoroughly with different GPU configurations
- Submit a pull request with a detailed description
- Address any review feedback
- Use the issue tracker for bugs and feature requests
- Include your GPU model, Docker version, and error logs
- For performance issues, include `nvidia-smi` output
Upload one image to generate a 3D model. Background removal is applied automatically with quality preservation.
Upload 2-4 images from different angles. The pipeline cycles through images during sampling for improved conditioning.
- Optional image refinement using SSD-1B after background removal (runs on CPU to avoid GPU memory conflicts)
- Automatic image resizing with multiple preset resolutions (392px to 1022px)
- Real-time preview of processed images before generation
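To make the preset resizing concrete, here is a small sketch of how an image's longest side might be snapped to the nearest preset while preserving aspect ratio. The preset ladder below is hypothetical apart from its endpoints (392px and 1022px, taken from this README); check `app.py` for the actual values.

```python
# Assumed preset ladder: only the 392 and 1022 endpoints come from the docs.
PRESET_RESOLUTIONS = [392, 518, 644, 770, 896, 1022]

def pick_preset(longest_side: int) -> int:
    """Choose the preset resolution closest to the image's longest side."""
    return min(PRESET_RESOLUTIONS, key=lambda p: abs(p - longest_side))

def target_size(width: int, height: int, preset: int) -> tuple:
    """Scale (width, height) so the longest side equals the preset,
    preserving aspect ratio and rounding to whole pixels."""
    scale = preset / max(width, height)
    return (round(width * scale), round(height * scale))

# Example: a 1920x1080 photo snaps to the 1022px preset.
preset = pick_preset(max(1920, 1080))   # -> 1022
print(target_size(1920, 1080, preset))  # -> (1022, 575)
```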
- Fast (Low Quality): Quick generation with reduced detail
- Balanced: Standard quality settings for most use cases
- High Quality (Recommended): Enhanced quality for detailed objects
- Maximum Quality (Slow): Highest quality with maximum detail
- Mesh quality presets: Low, Medium, High, Premium
- Configurable mesh simplification ratios (0.8-0.99)
- Adjustable texture sizes (256px to 4096px)
- Customizable hole filling resolution and view counts
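As a rough illustration of what the simplification ratio means for output size, here is a hedged sketch. It assumes the ratio is the fraction of triangles to *remove* (so 0.95 keeps 5% of faces), a common convention in TRELLIS-style demos; verify the actual convention in `app.py` before relying on it.

```python
def simplified_face_count(face_count: int, simplify_ratio: float) -> int:
    """Estimate remaining triangles after simplification.
    Assumption: simplify_ratio is the fraction of triangles removed."""
    if not 0.8 <= simplify_ratio <= 0.99:  # the range exposed in the UI
        raise ValueError("simplify_ratio must be in [0.8, 0.99]")
    return max(1, round(face_count * (1.0 - simplify_ratio)))

# A 1M-triangle mesh at ratio 0.95 keeps roughly 50k triangles.
print(simplified_face_count(1_000_000, 0.95))  # -> 50000
```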
The Docker image is fully configurable via build arguments and environment variables. See the Docker Configuration Guide for more detailed instructions & configurations.
```
# Memory optimization
PYTORCH_CUDA_ALLOC_CONF=max_split_size_mb:256

# CUDA device-side assertions (debugging aid)
TORCH_USE_CUDA_DSA=1
```

Adjust the sampling defaults in `app.py`:

- `DEFAULT_SS_STEPS`: Sparse structure sampling steps (default: 12)
- `DEFAULT_SLAT_STEPS`: Structured latent sampling steps (default: 12)
- `DEFAULT_SS_GUIDANCE`: Sparse structure guidance strength (default: 7.5)
- `DEFAULT_SLAT_GUIDANCE`: SLAT guidance strength (default: 3.0)
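The defaults above, as they might be expressed in `app.py`. The constant names and values come from this README; the sampler-params bundling below is an illustrative sketch of how a two-stage pipeline could consume them, not the actual TRELLIS API.

```python
DEFAULT_SS_STEPS = 12        # sparse structure sampling steps
DEFAULT_SLAT_STEPS = 12      # structured latent sampling steps
DEFAULT_SS_GUIDANCE = 7.5    # sparse structure guidance strength
DEFAULT_SLAT_GUIDANCE = 3.0  # SLAT guidance strength

# Hypothetical usage: bundle the defaults into per-stage sampler params.
sampler_params = {
    "sparse_structure_sampler_params": {
        "steps": DEFAULT_SS_STEPS,
        "cfg_strength": DEFAULT_SS_GUIDANCE,
    },
    "slat_sampler_params": {
        "steps": DEFAULT_SLAT_STEPS,
        "cfg_strength": DEFAULT_SLAT_GUIDANCE,
    },
}
print(sampler_params["slat_sampler_params"])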
- Check GPU memory: `nvidia-smi`
- Clear cache: `docker exec trellis-box python -c "import torch; torch.cuda.empty_cache()"`
- Reduce batch size for multi-view inputs
- Clean rebuild: `./trellis.sh stop && docker system prune -f && ./trellis.sh build`
- Check Docker logs: `docker logs trellis-box`
- Clear cache: `docker exec trellis-box rm -rf /app/.cache/torch/hub/checkpoints/`
- Restart: `./trellis.sh restart`
- Image Preprocessing: Background removal, cropping, resizing
- Sparse Structure Generation: Flow-based latent generation
- Structured Latent Generation: Feature refinement
- Mesh/Gaussian/Radiance Field Decoding: 3D representation generation
- GLB Export: Automatic texture baking and export
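The five stages above can be sketched as a simple chain of functions. The stubs below stand in for the real models and exist only to show the data flow; none of these names are the actual TRELLIS API.

```python
# Stub pipeline mirroring the stage order described above (illustrative only).
def preprocess(image):               # background removal, crop, resize
    return {"image": image, "clean": True}

def sample_sparse_structure(cond):   # flow-based latent generation
    return {"voxels": "sparse", **cond}

def sample_slat(structure):          # structured latent refinement
    return {"slat": "refined", **structure}

def decode(slat):                    # mesh / Gaussian / radiance-field decoding
    return {"mesh": "trimesh", "gaussian": "splats", **slat}

def export_glb(assets):              # texture baking + GLB export
    return "model.glb"

def run_pipeline(image):
    return export_glb(decode(sample_slat(sample_sparse_structure(preprocess(image)))))

print(run_pipeline("photo.png"))  # -> model.glb
```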
- FP16 conversion for transformer models
- FP32 preservation for normalization layers
- Aggressive CUDA cache clearing
- Memory defragmentation techniques
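The FP16/FP32 split above is the standard mixed-precision pattern: cast the big transformer weights to half precision, but keep numerically sensitive normalization layers in full precision. A pure-Python sketch of that dtype policy (the real code would walk `torch.nn` modules; the layer-type set here is an assumption):

```python
# Assumed set of layer types that must stay in FP32 for numerical stability.
FP32_LAYER_TYPES = {"LayerNorm", "GroupNorm", "BatchNorm2d"}

def target_dtype(layer_type: str) -> str:
    """Decide the storage dtype for a layer by its type name."""
    return "float32" if layer_type in FP32_LAYER_TYPES else "float16"

# Hypothetical transformer block: (parameter name, layer type).
layers = [("attn.qkv", "Linear"), ("norm1", "LayerNorm"), ("mlp.fc1", "Linear")]
plan = {name: target_dtype(t) for name, t in layers}
print(plan)  # -> {'attn.qkv': 'float16', 'norm1': 'float32', 'mlp.fc1': 'float16'}
```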
- Cycles through conditioning tensors during sampling
- Maintains proper tensor shapes and dtypes
- Compatible with CFG and guidance interval sampling
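The cycling behavior above can be pictured as a round-robin schedule over the per-view conditioning tensors: step 0 conditions on view 0, step 1 on view 1, and so on, wrapping around. A minimal sketch (views are plain strings here; the real pipeline cycles image-encoder embeddings):

```python
from itertools import cycle

def conditioning_schedule(views, num_steps):
    """Return which view index conditions each sampling step, round-robin."""
    it = cycle(range(len(views)))
    return [next(it) for _ in range(num_steps)]

views = ["front.png", "side.png", "back.png"]
print(conditioning_schedule(views, 8))  # -> [0, 1, 2, 0, 1, 2, 0, 1]
```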
This Docker implementation was created to make Microsoft's TRELLIS 3D generation pipeline more accessible to users who prefer containerized deployments. The original TRELLIS research introduced significant advancements in 3D generation quality and versatility, but required complex environment setup and substantial computational resources.
The key motivations for this Docker implementation include:

- Simplified Deployment: Traditional TRELLIS setup requires installing numerous dependencies across different platforms, which can be error-prone. Docker containers provide a consistent, reproducible environment.
- Resource Optimization: The original implementation could require significant VRAM. FP16 mixed precision optimizations reduce memory requirements by ~50% while maintaining generation quality.
- Production Readiness: Containerization enables easier integration into existing workflows, automated deployment pipelines, and scaling across different hardware configurations.
- Developer Experience: The web interface and automatic GLB export make the technology more accessible to non-experts while maintaining the full power of the underlying TRELLIS models.
I ran into many issues running TRELLIS myself until I found UNES97's trellis-3d-docker project, which provided the initial Dockerized implementation. Special thanks to @UNES97 for containerizing TRELLIS and making it accessible to the whole community.
This project builds upon Microsoft's TRELLIS research, which represents a significant advancement in structured 3D latent representations for scalable generation. We gratefully acknowledge the original researchers and their contributions to the field of 3D generation.
Special thanks to the open-source community for the various dependencies that make this implementation possible, including PyTorch, NVIDIA's CUDA ecosystem, and the broader machine learning tooling landscape.
MIT License. Based on Microsoft's TRELLIS research.

