Bit Harbor

Minimal container images that ship Hugging Face model weights for use as Kubernetes init containers.

What is this?

This repository automatically builds tiny container images that contain only LLM model weights from Hugging Face. The images are designed to run as Kubernetes init containers, so ML workloads start faster because the model is already downloaded.

Quick Start

Use a pre-built model container as an init container. Because these are scratch-based containers without a shell, you need a helper container to copy the files:

initContainers:
# First container provides the models (exits immediately)
- name: model-provider
  image: ghcr.io/doublewordai/bit-harbor:gemma-3-4b-it
  volumeMounts:
  - name: model-volume
    mountPath: /data
  # No command needed - container has built-in /pause binary

# Second container copies the models to shared volume
- name: model-copier
  image: busybox:stable
  volumeMounts:
  - name: model-volume
    mountPath: /data
  - name: shared-models
    mountPath: /shared
  command: ['sh', '-c', 'cp -r /data/models/* /shared/']

volumes:
- name: model-volume
  emptyDir: {}
- name: shared-models
  emptyDir: {}

Your main container can then read the models by mounting the shared-models volume.

Pre-Porter Helm Chart

For production deployments, use the Pre-Porter Helm chart to automatically pre-pull bit-harbor images on all your cluster nodes using DaemonSets:

# Install the chart
helm install pre-porter oci://ghcr.io/doublewordai/bit-harbor/pre-porter

# Configure which images to pre-pull
helm upgrade pre-porter oci://ghcr.io/doublewordai/bit-harbor/pre-porter \
  --set-json 'images=[
    {"name":"gemma-3-4b-it","enabled":true,"nodeSelector":{"gpu":"nvidia"}},
    {"name":"llama-3.1-8b-instruct","enabled":true}
  ]'

See ./pre-porter/README.md for detailed usage instructions.
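
The `images` value passed to `--set-json` above is easy to get wrong when quoting it by hand. As a rough sketch, the same command can be assembled from a Python list; the helper name `images_value` is hypothetical, and only the `name`, `enabled`, and `nodeSelector` fields shown above are assumed to exist.

```python
import json
import shlex


def images_value(images: list[dict]) -> str:
    """Serialize the image list compactly for `--set-json images=...`."""
    return "images=" + json.dumps(images, separators=(",", ":"))


images = [
    {"name": "gemma-3-4b-it", "enabled": True, "nodeSelector": {"gpu": "nvidia"}},
    {"name": "llama-3.1-8b-instruct", "enabled": True},
]

command = [
    "helm", "upgrade", "pre-porter",
    "oci://ghcr.io/doublewordai/bit-harbor/pre-porter",
    "--set-json", images_value(images),
]

# shlex.join quotes the JSON so the line can be pasted into a POSIX shell.
print(shlex.join(command))
```

Passing `command` to `subprocess.run` as a list avoids shell quoting altogether.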

Building Models

Automatic builds:

Push to main → builds missing models
Manual trigger → optionally force rebuild all
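
The selection rule described above can be sketched as follows. This is an illustration only, not the code in .github/workflows: it assumes each entry's `name` is also its image tag, and `published_tags` stands in for whatever tag lookup the workflow actually performs.

```python
def models_to_build(models: list[dict], published_tags: set[str], force: bool = False) -> list[dict]:
    """Pick the models.json entries that still need an image.

    With force=True every model is rebuilt; otherwise only entries whose
    name is not among the already published tags are returned.
    """
    if force:
        return list(models)
    return [m for m in models if m["name"] not in published_tags]
```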

Manual builds:

# Build specific model locally
docker buildx build -t ghcr.io/doublewordai/bit-harbor:gemma-3-4b-it \
  --build-arg MODEL_REPO=https://huggingface.co/google/gemma-3-4b-it \
  --build-arg MODEL_NAME=gemma-3-4b-it \
  --build-arg HF_TOKEN=your_token_here .
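
To build several models locally, the command above can be generated from a models.json entry. The sketch below is a hypothetical helper, not part of this repository; it reuses the image name and the three build arguments from the command above and assumes nothing else about the Dockerfile.

```python
import shlex

# Image repository taken from the manual build command above.
IMAGE = "ghcr.io/doublewordai/bit-harbor"


def build_command(entry: dict, token: str, context: str = ".") -> list[str]:
    """Return the docker buildx argv for one models.json entry."""
    return [
        "docker", "buildx", "build",
        "-t", f"{IMAGE}:{entry['name']}",
        "--build-arg", f"MODEL_REPO={entry['repo']}",
        "--build-arg", f"MODEL_NAME={entry['name']}",
        "--build-arg", f"HF_TOKEN={token}",
        context,
    ]


entry = {"name": "gemma-3-4b-it", "repo": "https://huggingface.co/google/gemma-3-4b-it"}
# Print the command for inspection; the token here is a placeholder.
print(shlex.join(build_command(entry, "your_token_here")))
```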

Available Models

All models are under 30B parameters. See models.json for the complete list:

Gemma 3: 4B, 12B instruction-tuned and 3n variants
Llama 3.1: 8B instruction-tuned
Llama 3.2: 1B, 3B instruction-tuned variants
Qwen 3: 1.7B, 8B, 14B models
Qwen Embeddings: 0.6B and 8B embedding models
Qwen 2.5 VL: 3B, 7B vision-language instruction-tuned models

Adding Models

Edit models.json:

{
  "models": [
    {
      "name": "my-model",
      "repo": "https://huggingface.co/org/model-name"
    }
  ]
}
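
A new entry can be sanity-checked before pushing. The following is a minimal sketch under stated assumptions, not a check this repository performs: it assumes an entry needs only the `name` and `repo` fields shown above, that `name` is used as the image tag (as in the build example), and that `repo` has the form https://huggingface.co/<org>/<model>. The function name `validate_models` is hypothetical.

```python
import re

# Docker tags allow letters, digits, '_', '.', '-' and may not start with '.' or '-'.
TAG_RE = re.compile(r"^[A-Za-z0-9_][A-Za-z0-9_.-]{0,127}$")
HF_PREFIX = "https://huggingface.co/"


def validate_models(doc: dict) -> list[str]:
    """Return a list of problems found; an empty list means the document looks fine."""
    problems = []
    seen = set()
    for i, entry in enumerate(doc.get("models", [])):
        name = entry.get("name", "")
        repo = entry.get("repo", "")
        if not TAG_RE.match(name):
            problems.append(f"models[{i}]: name {name!r} is not a valid image tag")
        if name in seen:
            problems.append(f"models[{i}]: duplicate name {name!r}")
        seen.add(name)
        # Split the part after the prefix into <org>/<model>.
        parts = repo[len(HF_PREFIX):].split("/") if repo.startswith(HF_PREFIX) else []
        if len(parts) != 2 or not all(parts):
            problems.append(f"models[{i}]: repo {repo!r} should look like {HF_PREFIX}<org>/<model>")
    return problems
```

Load the file with `json.load` and pass the result to `validate_models`; any returned strings describe entries to fix.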

License

MIT
