GitHub - tailucas/snapshot-processor: Image snapshot hub

About The Project

Overview

Note 1: See my write-up on Home Security Automation which provides an architectural overview of how this project works with others in my collection.

Note 2: While I use the word Automation in these projects, there is no integration with sensible frameworks like openHAB or Home Assistant... not yet at least. The goal behind this project was a learning opportunity by employing specific technologies and design patterns. The parts most likely to be useful are integrations with libraries like ZeroMQ, RabbitMQ, and cloud storage, where seamless behavior comes after much trial and error.

Core Functionality

This project is a sophisticated snapshot processor that fetches image snapshots from IP cameras (particularly Foscam) and orchestrates multi-threaded image processing pipelines. It extends base-app with GitHub repository at https://github.com/tailucas/base-app, and takes a git submodule dependency on pylib.

Key Features:

IP Camera Integration: Supports fetching snapshots from Foscam and other IP cameras via HTTP
Message-Driven Architecture: Routes snapshots through RabbitMQ exchange for multi-device coordination
Multiple Object Detection Engines (feature flags):
- AWS Rekognition (cloud-based): Fast, accurate human/object detection with metadata
- YOLOv8 (local): On-device object detection using PyTorch/Ultralytics for privacy/cost
Cloud Storage Integration: Asynchronous uploads to Google Drive with automatic organization
Google Drive Management: Periodic archival of snapshots into year-month-day folder structure, excluding starred images
Local FTP Server: FTP-based snapshot push support with basic authentication for IP cameras
File System Watching: Watchdog-based file system observer to detect new snapshots
Image Caching: LRU cache for snapshot image data to optimize processing
AWS Integration: Rekognition for object detection, CloudWatch metrics posting
Resilient Message Handling: Multi-threaded ZeroMQ relay architecture with error recovery

Platform Variants:

Main branch: Uses AWS Rekognition for cloud-based object detection (fast, accurate, priced per-image)
Jetson Nano branch: Includes PWM fan control and local YOLOv8 detection (for edge devices)

Detection results include human/person confidence scores and detailed metadata for automation decision-making.

Architecture & Design

This 1277-line Python application demonstrates patterns for building asynchronous, multi-threaded image processing pipelines.

Core Components (app/__main__.py):

CameraConfig (line 113): Configuration holder for camera parameters including snapshot URL, authentication, and image format
FileType: Enumeration for file handling types (SNAPSHOT, ARCHIVE, etc.)
Snapshot (line 204): Main snapshot fetching thread extending ZmqRelay. Receives camera trigger events, fetches images from HTTP endpoints, caches locally, and relays to object detection
DeviceEvent (line 342): Message wrapper for device input/output events with metadata (device ID, location, type, timestamp)
CloudStorage (line 402): Abstract base class for cloud storage implementations
GoogleDriveManager (line 408): Base class for Google Drive operations with OAuth token management
GoogleDriveArchiver (line 490): Background thread for organizing Google Drive snapshots into year-month-day folders, skipping starred images
GoogleDriveUploader (line 621): Async uploader for locally cached snapshots to Google Drive with queue management and retry logic
UploadEventHandler (line 765): File system event handler (watchdog) for detecting new local snapshots
ObjectDetector (line 902): Multi-detector abstraction supporting both AWS Rekognition and local YOLOv8 detection
FTPServer (separate file): Implements FTP server for camera image push with user authentication and event logging

Message Flow:

Device event arrives via RabbitMQ → triggers Snapshot to fetch image
Snapshot fetches from camera HTTP endpoint, caches locally, relays to ObjectDetector
ObjectDetector processes (cloud or local) and returns detection results
Results sent back through RabbitMQ to other devices
GoogleDriveUploader asynchronously uploads cache to Google Drive
GoogleDriveArchiver periodically organizes Drive folders

Configuration:

8 configurable camera inputs (INPUT_1 to INPUT_8)
8 configurable device outputs (OUTPUT_1 to OUTPUT_8)
Location-based device labeling
Feature flags for object detection, cloud detection, local detection, storage management

Technology Patterns:

The application demonstrates professional integration patterns:

ZeroMQ: Thread-safe inter-component messaging with relay pattern
RabbitMQ: External device messaging queue
PyDrive: Google Drive OAuth authentication and file management
Watchdog: File system event monitoring for uploads
AWS Rekognition: Cloud-based object/human detection (paid)
YOLOv8 + PyTorch: Local neural network detection (free, privacy-focused)
Pillow: Image manipulation and validation
Cachetools: LRU image caching for efficient memory use
pyftpdlib: FTP server for camera integrations
boto3: AWS SDK for metrics and authentication
Sentry SDK: Production error tracking with threading integration
tailucas-pylib: Shared architectural patterns and utilities

See tailucas-pylib for shared patterns and base-app for the container foundation.

(back to top)

Built With

Technologies that help make this project useful:

Also:

(back to top)

Getting Started

Here is some detail about the intended use of this package.

Prerequisites

Beyond the Python dependencies defined in pyproject.toml, the project requires:

1Password Secrets Automation: Runtime credential and configuration management (paid service with free tier)
Sentry: Error tracking and monitoring (free tier available)
RabbitMQ: Message broker for device communication (self-hosted or managed service)
Google Drive API: Cloud snapshot storage and organization (free tier available)
AWS Rekognition: Cloud object detection (optional, paid per-image)
IP Cameras: Foscam or similar HTTP-based snapshot API (for fetching images)

Optional services:

Jetson Nano: For local YOLOv8 detection with PWM fan control (see branches)
PyTorch: For local YOLOv8 model inference (CPU or GPU)

Required Tools

Install these tools and ensure they're on your environment $PATH:

task: Build orchestration - https://taskfile.dev/installation/#install-script
docker and docker-compose: Container runtime and composition - https://docs.docker.com/engine/install/
uv: Python package manager - https://docs.astral.sh/uv/getting-started/installation/

For local development (optional):

python3: Python 3.12+ runtime

Installation

🛑 Prerequisites - 1Password Secrets Automation Setup

This project relies on 1Password Secrets Automation for configuration and credential management. A 1Password Connect server container must be running in your environment.

Your 1Password Secrets Automation vault must contain an entry called ENV.snapshot-processor with the following configuration variables:

Variable	Purpose	Example
`APP_NAME`	Application identifier for logging	`snapshot-processor`
`AWS_ALT_REGION`	Alternate AWS region (dual-region setup)	`eu-west-1`
`AWS_CONFIG_FILE`	AWS configuration file path	`/home/app/.aws/config`
`AWS_DEFAULT_REGION`	Primary AWS region for Rekognition	`us-east-1`
`CRONITOR_MONITOR_KEY`	Cronitor health check API key	specific to your account
`DEVICE_NAME`	Container hostname	`snapshot-processor`
`FTP_CREATE_DIRS`	Directories to create in FTP root	`snapshots/cam1,snapshots/cam2`
`FTP_PASS`	FTP server password (basic auth)	project specific
`FTP_UPLOAD_DIR`	FTP upload root directory	`uploads/snapshots`
`FTP_USER`	FTP server username (basic auth)	project specific
`GOOGLE_DRIVE_FOLDER`	Google Drive folder name for snapshots	project specific
`HC_PING_URL`	Healthchecks.io URL for health monitoring	specific to your check
`INPUT_1_LOCATION` through `INPUT_8_LOCATION`	Camera location names	device specific
`INPUT_1_TYPE` through `INPUT_8_TYPE`	Camera types	`Camera`
`MINIMUM_HUMAN_CONFIDENCE`	Confidence threshold for human detection	`0.8`
`OBJECT_DETECTION_ENABLED`	Enable object detection feature	`true`
`OBJECT_DETECTION_MODEL`	Detection model type	`yolov8n`
`OP_CONNECT_HOST`	1Password Connect server URL	`http://1password-connect:8080`
`OP_CONNECT_TOKEN`	1Password Connect API token	specific to your server
`OP_VAULT`	1Password vault ID	specific to your vault
`OUTPUT_1_LOCATION` through `OUTPUT_8_LOCATION`	Output device location names	device specific
`OUTPUT_1_TYPE` through `OUTPUT_8_TYPE`	Output device types	`Camera`
`RABBITMQ_DEVICE_TOPIC`	RabbitMQ topic for snapshot messages	`snapshot`
`RABBITMQ_EXCHANGE`	RabbitMQ exchange name	`home_automation`
`RABBITMQ_SERVER_ADDRESS`	RabbitMQ broker IP/hostname	`192.168.1.100`
`RUN_FTP_SERVER`	Enable FTP server feature	`true`
`USER`	Process user in container	`app`

Additional Credentials (stored separately in 1Password):

Google/oath/client_secret: Google OAuth 2.0 client credentials for Drive API
FTP/username: FTP server username
FTP/password: FTP server password
Cronitor/password: Cronitor API key
Sentry/__APP_NAME__/dsn: Sentry DSN

Clone Repository and Initialize Submodules

git clone https://github.com/tailucas/snapshot-processor.git
cd snapshot-processor
git submodule init
git submodule update

Create Data Directory and Set Permissions
```
task datadir
```
Creates /data directory for FTP uploads and Google Drive token storage (UID 999).
Configure Runtime Environment
```
task configure
```
Generates .env from base.env template and 1Password secrets.
Build Docker Image
```
task build
```
Multi-stage Docker build process:
- Extends tailucas/base-app:latest base image
- Installs locale support
- Adds backup scripts
- Configures cron jobs
- Installs uv-managed Python dependencies
- Installs PyTorch, YOLOv8, OpenCV, Ultralytics
- Copies application code
- Configures FTP server (if enabled)
Run Application

Foreground (interactive, logs to console):
```
task run
```
Background (detached, logs to syslog):
```
task rund
```
The application will:
- Initialize RabbitMQ client for device messaging
- Start watchdog file system observer for FTP upload detection
- Authenticate to Google Drive (OAuth flow if token missing)
- Initialize Snapshot fetcher with camera configuration
- Start ObjectDetector (AWS Rekognition or YOLOv8)
- Start GoogleDriveUploader for async uploads
- Start GoogleDriveArchiver for folder organization
- Launch FTP server (if enabled)
- Run main event loop

(back to top)

Application Components

Snapshot Fetching

The Snapshot class (line 204) manages camera snapshot fetching:

Receives trigger events from RabbitMQ for specific cameras
Constructs camera-specific snapshot URLs (Foscam-compatible by default)
Fetches images via HTTP with retry logic and timeout handling
Validates image data (size, format, MD5 checksum)
Caches locally with LRU cache for efficiency
Relays images to ObjectDetector via ZeroMQ

Supports multiple camera models through configurable default_command and default_image_format.

Object Detection

The ObjectDetector class (line 902) provides detection abstraction:

AWS Rekognition Mode (cloud-based):

Sends images to AWS Rekognition DetectLabels API
Identifies objects, scenes, and human presence with confidence scores
Returns structured metadata for automation decisions
Scales automatically with load (pay-per-image pricing)
Primary detection for main branch

YOLOv8 Mode (local, CPU/GPU):

Uses Ultralytics YOLOv8 neural network model
Runs inference on local hardware (CPU or GPU)
Identifies human presence with confidence thresholds
Privacy-focused (no cloud dependency)
Included in optional coding dependency group
Suitable for Jetson Nano and edge deployments

Feature flags control which detector is active.

Google Drive Integration

GoogleDriveUploader (line 621):

Monitors local cache for new snapshots
Asynchronously uploads to configured Google Drive folder
Handles OAuth authentication with token caching
Implements retry logic for failed uploads
Manages upload queue to avoid rate limiting

GoogleDriveArchiver (line 490):

Periodically scans Google Drive folder
Organizes snapshots into year/month/day folder structure
Skips (preserves) starred images
Cleans up old flat-structure files

Google Drive OAuth Setup:

First run initiates OAuth flow via PyDrive
User follows URL to authorize Google Drive access
Token saved to /data/snapshot_processor_creds
Backed up regularly (see cron jobs)

FTP Server

The SnapshotFTPHandler (app/ftp_server.py) provides FTP-based upload:

IP cameras can push snapshots directly via FTP
Basic authentication with configurable username/password
Creates configured upload directories automatically
Logs all FTP events (login, upload, download)
Optional feature (controlled by RUN_FTP_SERVER flag)
Listens on port 21 (exposed in docker-compose.yml)

File System Monitoring

The UploadEventHandler (line 765) watches for local FTP uploads:

Uses watchdog library to monitor /data/ftp directory
Detects file creation, movement, and completion
De-duplicates rapid file system events
Validates file integrity before processing
Triggers snapshot processing pipeline

Build System

Task CLI (Taskfile.yml)

Primary build orchestration:

task build - Build Docker image with all dependencies and application code
task run - Run container in foreground with full log output
task rund - Run container detached (persists after terminal close)
task configure - Generate .env and docker-compose.yml from 1Password secrets
task datadir - Create data directory with proper permissions (UID/GID 999)
task python - Setup Python virtual environment with uv
task push - Push built image to Docker Hub/registry

Dockerfile

Extends tailucas/base-app:latest with:

Locale generation (LANG/LANGUAGE args)
Backup authentication token scripts
Cron job configuration (backup_auth_token, cleanup_snapshots)
Python dependencies (uv-managed via pyproject.toml)
PyTorch (CPU or GPU variants)
OpenCV (headless variant for containers)
YOLOv8 / Ultralytics
Supporting libraries (numpy, PIL, matplotlib, polars, pyyaml)
FTP server configuration
Application code (app/main.py, app/ftp_server.py)
Runs as user app (UID 999)

Dependencies

Python (pyproject.toml, managed via uv):

tailucas-pylib>=0.5.6 - Shared utilities, threading, ZeroMQ, RabbitMQ
pydrive>=1.3.1 - Google Drive API integration
pyftpdlib>=2.0.1 - FTP server implementation
cachetools>=6.2.0 - LRU image caching
watchdog>=6.0.0 - File system event monitoring
awscli>=1.42.35 - AWS CLI for Rekognition integration
pillow>=11.3.0 - Image processing
boto3 (via awscli) - AWS SDK for Rekognition

Optional/Development (coding dependency group):

torch>=2.8.0 - PyTorch ML framework (CPU variant in container)
torchvision>=0.23.0 - Computer vision model hub
ultralytics>=8.3.202 - YOLOv8 model and training
Custom PyTorch index: https://download.pytorch.org/whl/cpu

Configuration

config/app.conf (INI format):

[app]: Shutdown grace period, device name, monitor keys
[creds]: Sentry DSN and Cronitor password paths
[rabbitmq]: Exchange, topic, server address
[snapshots]: FTP root, upload directory, detection flags
[rekognition]: AWS region for Rekognition
[object_detection]: YOLOv8 model selection
[human_detection]: Confidence threshold
[gdrive]: Google Drive folder, token path
[camera]: Snapshot command, image format
[input_location] / [input_type]: 8 camera inputs
[output_location] / [output_type]: 8 device outputs

Docker Compose:

Port 21: FTP server
Syslog logging to Docker host
1Password Connect secret integration
Volume mounts: /data for FTP and tokens, /dev/log for syslog

Features & Capabilities

Image Processing Pipeline

Trigger: Device event arrives via RabbitMQ
Fetch: Snapshot thread connects to IP camera via HTTP
Validate: Checks image size, format, MD5 checksum
Cache: Stores locally in LRU memory cache
Detect: Sends to Rekognition or YOLOv8 detector
Store: GoogleDriveUploader queues for async upload
Publish: Results sent back through RabbitMQ
Archive: GoogleDriveArchiver organizes on Drive

Cron Jobs

backup_auth_token: Periodic Google Drive token backup to data directory
cleanup_snapshots: Cleanup old local snapshot cache

Error Handling

The application handles various failure scenarios:

HTTP connection errors (retries with backoff)
RabbitMQ connection loss (auto-reconnect with exponential backoff)
Google Drive API errors (retry with queue persistence)
Image validation failures (delete invalid, retry later)
AWS Rekognition throttling (request queuing)
File system errors (log and continue)
SSL/TLS errors (comprehensive exception handling)

Multi-Region Support

AWS Rekognition can use dual regions:

Primary region: AWS_DEFAULT_REGION
Alternate region: AWS_ALT_REGION
Useful for failover or regional optimization

Feature Flags

Controlled via environment/config:

FEATURE_FLAG_OBJECT_DETECTION: Master switch for all detection
FEATURE_FLAG_CLOUD_OBJECT_DETECTION: AWS Rekognition
FEATURE_FLAG_LOCAL_OBJECT_DETECTION: YOLOv8
FEATURE_FLAG_CLOUD_STORAGE_MANAGEMENT: Google Drive archival

License

Distributed under the MIT License. See LICENSE for more information.

(back to top)

Acknowledgments

(back to top)

Name		Name	Last commit message	Last commit date
Latest commit History 1,015 Commits
.devcontainer		.devcontainer
app		app
config		config
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
.python-version		.python-version
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
Taskfile.yml		Taskfile.yml
app_entrypoint.sh		app_entrypoint.sh
backup_auth_token.sh		backup_auth_token.sh
base.env		base.env
connect_to_app.sh		connect_to_app.sh
docker-compose.yml		docker-compose.yml
dot_env_setup.sh		dot_env_setup.sh
pyproject.toml		pyproject.toml
settings.yaml		settings.yaml
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

About The Project

Overview

Core Functionality

Architecture & Design

Built With

Getting Started

Prerequisites

Required Tools

Installation

Application Components

Snapshot Fetching

Object Detection

Google Drive Integration

FTP Server

File System Monitoring

Build System

Task CLI (Taskfile.yml)

Dockerfile

Dependencies

Configuration

Features & Capabilities

Image Processing Pipeline

Cron Jobs

Error Handling

Multi-Region Support

Feature Flags

License

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages