Repository for meeting notes and links to resources of the Dutch HTR Knowledge Exchange meetings
-
Updated
Oct 15, 2025
Repository for meeting notes and links to resources of the Dutch HTR Knowledge Exchange meetings
A high-performance Python library for extracting structured content from PDF documents with layout-aware text extraction. pdf_to_json preserves document structure including headings (H1-H6) and body text, outputting clean JSON format.
The MDF Parser CLI Tool is a command-line application designed to parse .mdf (Molecular Data Format) files and extract molecular topology information. It efficiently processes molecular structures, extracting details like atoms, bonds, angles, and dihedrals. The tool supports structured output in CSV and JSON formats.
Add a description, image, and links to the structure-extraction topic page so that developers can more easily learn about it.
To associate your repository with the structure-extraction topic, visit your repo's landing page and select "manage topics."