Skip to content

Alcray/MiniDP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MiniDP

A minimal, JSON-recipe-first data processing pipeline inspired by NVIDIA NeMo SDP.

MiniDP provides a lightweight, modality-agnostic spine for building data transformation pipelines. It is designed to be easily authored and edited by both humans and LLMs.

Features

  • Deterministic execution engine
  • JSON recipe format (tool-calling friendly, human editable)
  • Streaming JSONL manifest processing
  • Composable processor API with drop/modify/expand semantics
  • Optional multiprocessing support
  • Zero external dependencies (stdlib only)

Installation

pip install -e .

Quick Start

# Run a pipeline
minidp run examples/demo_recipe.json

# Preview output
minidp preview examples/demo_recipe.json -n 5

# List available processors
minidp list-processors

Documentation

License

MIT

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages