Document Context

A Go library for converting documents into context-friendly formats suitable for LLM consumption and analysis.

Status: v0.1.0 Release Ready

document-context has completed Phase 2 development and is ready for v0.1.0 release. The library provides production-ready PDF processing with image caching, enhancement filters, and comprehensive documentation.

Documentation

ARCHITECTURE.md: Technical specifications and implementation details
PROJECT.md: Project scope, philosophy, and roadmap
CLAUDE.md: Development principles and conventions

Overview

This library provides format-agnostic interfaces for document processing with extensible format support. It was created as a tooling extension for the go-agents project but can be used standalone for document processing needs.

Current Capabilities:

PDF document processing
Page-level image extraction
Multiple output formats (PNG, JPEG)
Configurable quality and resolution
Persistent filesystem caching for rendered images
Structured logging infrastructure
Base64 data URI encoding for LLM APIs

Prerequisites

Go Version

Go 1.25.4 or later

External Dependencies

ImageMagick (Required):

Used for high-quality PDF page rendering
Must use version 7.0+ with the magick command
Installation varies by platform:

Verify Installation:

magick --version

Installation

go get github.com/JaimeStill/document-context

Usage Examples

Basic PDF to Image Conversion

package main

import (
    "fmt"
    "os"

    "github.com/JaimeStill/document-context/pkg/config"
    "github.com/JaimeStill/document-context/pkg/document"
    "github.com/JaimeStill/document-context/pkg/image"
)

func main() {
    // Create configuration (or use config.DefaultImageConfig() for PNG, 300 DPI)
    cfg := config.ImageConfig{
        Format:  "png",    // "png" or "jpg"
        DPI:     300,      // Resolution
        Quality: 85,       // JPEG quality (1-100, ignored for PNG)
        Options: map[string]any{
            "brightness": 110,  // 0-200, 100=neutral
            "contrast":   10,   // -100 to +100, 0=neutral
        },
    }

    // Transform configuration to renderer
    renderer, err := image.NewImageMagickRenderer(cfg)
    if err != nil {
        fmt.Fprintf(os.Stderr, "Invalid configuration: %v\n", err)
        return
    }

    // Open PDF and extract page
    doc, err := document.OpenPDF("report.pdf")
    if err != nil {
        fmt.Fprintf(os.Stderr, "Failed to open PDF: %v\n", err)
        return
    }
    defer doc.Close()

    page, err := doc.ExtractPage(1)
    if err != nil {
        fmt.Fprintf(os.Stderr, "Failed to extract page: %v\n", err)
        return
    }

    // Convert to image (nil = no caching)
    imageData, err := page.ToImage(renderer, nil)
    if err != nil {
        fmt.Fprintf(os.Stderr, "Failed to convert page: %v\n", err)
        return
    }

    // Save image
    err = os.WriteFile("page-1.png", imageData, 0644)
    if err != nil {
        fmt.Fprintf(os.Stderr, "Failed to write image: %v\n", err)
        return
    }

    fmt.Println("Successfully converted page to image")
}

Processing Multiple Pages:

// Extract all pages
pages, err := doc.ExtractAllPages()
if err != nil {
    return err
}

// Convert each page
for _, page := range pages {
    imageData, err := page.ToImage(renderer, nil)
    // Handle imageData...
}

Using the Filesystem Cache

Enable persistent caching for faster repeated conversions:

// Create cache configuration
cacheCfg := &config.CacheConfig{
    Name: "filesystem",
    Logger: config.LoggerConfig{Level: config.LogLevelInfo},
    Options: map[string]any{
        "directory": "/var/cache/document-context",
    },
}

// Create cache instance
c, err := cache.Create(cacheCfg)
if err != nil {
    return err
}

// Use cache with ToImage()
page, _ := doc.ExtractPage(1)
imageData, err := page.ToImage(renderer, c)  // First call renders and caches
cachedData, err := page.ToImage(renderer, c)  // Second call returns cached data

Cache keys are generated from document path, page number, and all rendering parameters (format, DPI, quality, filters). The same configuration always produces the same cache key.

For detailed cache behavior, troubleshooting, and advanced usage, see GUIDE.md

Data URI Encoding for LLM APIs

Convert pages to base64 data URIs for LLM vision APIs:

import "github.com/JaimeStill/document-context/pkg/encoding"

// After converting page to image (as shown above)
imageData, _ := page.ToImage(renderer, nil)

// Encode as data URI
dataURI, err := encoding.EncodeImageDataURI(imageData, document.PNG)
if err != nil {
    return err
}

// Use dataURI with LLM vision API
// response := llm.Vision("Analyze this document", []string{dataURI})

For integration with go-agents, see the go-agents documentation for vision API usage patterns.

Examples

See examples/document-converter for a comprehensive CLI tool demonstrating all library features.

Configuration

The library uses a configuration-to-renderer transformation pattern where configuration (data) transforms into renderers (behavior):

// Create configuration
cfg := config.ImageConfig{
    Format:  "png",    // "png" or "jpg"
    Quality: 85,       // JPEG quality (1-100), ignored for PNG
    DPI:     300,      // Resolution (72/150/300/600)
    Options: map[string]any{  // ImageMagick filters
        "brightness": 110,     // 0-200, 100=neutral
        "contrast":   10,      // -100 to +100, 0=neutral
        "saturation": 100,     // 0-200, 100=neutral
        "rotation":   0,       // 0-360 degrees
        "background": "white", // Color name for alpha channel
    },
}

// Transform to renderer (validates configuration)
renderer, err := image.NewImageMagickRenderer(cfg)

// Or use defaults: PNG, 300 DPI, no filters
renderer, _ := image.NewImageMagickRenderer(config.DefaultImageConfig())

Format Selection: PNG (lossless, larger) vs JPEG (lossy, smaller). DPI: 72 (screen), 150 (web), 300 (print/default), 600 (professional).

Testing

The library includes comprehensive unit tests. Tests requiring ImageMagick will be skipped if the binary is not available.

Run All Tests

go test ./tests/... -v

Run Tests for Specific Package

# Test document package
go test ./tests/document/... -v

# Test encoding package
go test ./tests/encoding/... -v

Error Handling

All operations return descriptive errors with context:

// Common error scenarios
doc, err := document.OpenPDF("file.pdf")     // File not found, invalid/corrupted PDF
page, err := doc.ExtractPage(999)            // Page out of range
imageData, err := page.ToImage(renderer, c)  // ImageMagick not installed, config invalid
dataURI, err := encoding.EncodeImageDataURI(data, format)  // Empty data, unsupported format

Error messages include operation context and external command output for debugging.

Deployment

Container Deployment - Ensure ImageMagick is available:

FROM golang:1.25-alpine
RUN apk add --no-cache imagemagick
COPY . /app
WORKDIR /app
RUN go build -o service .
CMD ["./service"]

Startup Verification:

if _, err := exec.LookPath("magick"); err != nil {
    log.Fatal("ImageMagick not installed")
}

Limitations

Current Limitations

PDF Only: Only PDF format currently supported
Image Output Only: Cannot extract raw text (planned for future)
Sequential Processing: Pages processed one at a time (parallel processing planned)
No OCR: Cannot extract text from image-based PDFs (OCR support planned)
ImageMagick Required: External binary dependency for PDF rendering

Roadmap

Planned features include additional document formats (Office, HTML, Markdown), alternative outputs (text extraction, structured content), and processing enhancements (parallel processing, streaming). See PROJECT.md for the complete roadmap and current development status.

License

This project is licensed under the MIT License.

Related Projects

go-agents: Go library for building LLM-powered applications

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.github/workflows		.github/workflows
_context		_context
examples/document-converter		examples/document-converter
pkg		pkg
tests		tests
ARCHITECTURE.md		ARCHITECTURE.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
GUIDE.md		GUIDE.md
PROJECT.md		PROJECT.md
README.md		README.md
coverage.out		coverage.out
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Document Context

Status: v0.1.0 Release Ready

Documentation

Overview

Prerequisites

Go Version

External Dependencies

Installation

Usage Examples

Basic PDF to Image Conversion

Using the Filesystem Cache

Data URI Encoding for LLM APIs

Examples

Configuration

Testing

Run All Tests

Run Tests for Specific Package

Error Handling

Deployment

Limitations

Current Limitations

Roadmap

License

Related Projects

About

Uh oh!

Releases 2

Packages

Languages

JaimeStill/document-context

Folders and files

Latest commit

History

Repository files navigation

Document Context

Status: v0.1.0 Release Ready

Documentation

Overview

Prerequisites

Go Version

External Dependencies

Installation

Usage Examples

Basic PDF to Image Conversion

Using the Filesystem Cache

Data URI Encoding for LLM APIs

Examples

Configuration

Testing

Run All Tests

Run Tests for Specific Package

Error Handling

Deployment

Limitations

Current Limitations

Roadmap

License

Related Projects

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages