AudioMarkGenerator

AudioMarkGenerator is an opinionated desktop Python tool that builds a per-book SQLite search database from an EPUB3 file with SMIL media overlays.

The generated database is designed specifically for use with: Audio Mark (Android Viewer)

Tested With Storyteller EPUBs

AudioMarkGenerator is developed and tested primarily against EPUB3 files with SMIL media overlays generated using the Storyteller application from GitLab.

If you are generating your EPUB + audio alignment using Storyteller, AudioMarkGenerator should work as expected.

Other EPUB3 + SMIL workflows may work, but Storyteller-generated EPUBs are the reference implementation used during development.

⚠️ Important

This repository contains only the database generator.

It does not include:

Any Android application
Any audio player
Any EPUB editor
Any alignment tool

To use the generated database, you must install: Audio Mark (Android Viewer)

Getting Started

Requirements

Python 3.11+
A valid EPUB3 file with SMIL media overlays

1. Clone the Repository

git clone https://github.com/ujwalnk/AudioMarkGenerator.git
cd AudioMarkGenerator

2. Create a Virtual Environment

python3 -m venv ./.virt

Activate it:

macOS / Linux

source ./.virt/bin/activate

Windows

.\.virt\Scripts\activate

3. Install Dependencies

pip install -r requirements.txt

4. Run the Generator

python3 build_index.py path/to/book.epub

Example:

python3 build_index.py ./TheBook.epub

Output

A SQLite database will be created next to the EPUB file:

<BookTitle>.db

If the EPUB contains a <dc:title>, that will be used as the filename. Otherwise, the EPUB filename is used.

You can now import this .db file into Audio Mark.

What It Does

Given an EPUB3 file with SMIL media overlays, AudioMarkGenerator:

Extracts metadata (title, author)
Reads the spine (reading order)
Parses the TOC (toc.ncx or nav.xhtml)
Resolves every SMIL <par> entry
Extracts:
- 📖 Chapter title
- 🎧 Audio file path
- ⏱ Timestamp (HH:MM:SS)
- 📝 Exact inner text of the referenced HTML fragment
Writes everything into a single SQLite .db file

Each generated .db file represents exactly one book.

Requirements

The EPUB must contain:

META-INF/container.xml
A valid content.opf
SMIL media overlays declared in the OPF manifest
HTML/XHTML files with proper fragment IDs referenced by SMIL
A TOC (toc.ncx or nav.xhtml) for chapter title extraction

If these are missing or malformed, the generator will abort with a clear error.

Search Model

Each row in the generated database corresponds to exactly one SMIL <par> entry.

Search in the Android app uses exact substring matching:

SELECT chapter_title, audio_file, timestamp
FROM paragraphs
WHERE text LIKE '%<selected_text>%'

No normalization. No lowercasing. No fuzzy matching.

The extracted HTML text is stored exactly as-is.

Database Format

Each book uses one SQLite file containing:

metadata

CREATE TABLE metadata (
    id INTEGER PRIMARY KEY,
    title TEXT,
    author TEXT
);

paragraphs

CREATE TABLE paragraphs (
    id INTEGER PRIMARY KEY,
    chapter_title TEXT NOT NULL,
    audio_file TEXT NOT NULL,
    timestamp TEXT NOT NULL,
    text TEXT NOT NULL
);

An index on text supports search queries.

Timestamp Handling

The following SMIL clipBegin formats are supported:

12.34s
75s
HH:MM:SS(.fff)
MM:SS(.fff)
Plain seconds

All timestamps are stored as:

HH:MM:SS

Fractional seconds are dropped.

Design Choices

Strict requirement: SMIL-based EPUBs only.
No partial builds.
No silent fallbacks.
No text normalization.
No ranking.
One SQLite database per book.
Built specifically for integration with Audio Mark.

This tool is intentionally opinionated and optimized for self-hosted audiobook workflows.

Typical Workflow

Align audiobook + EPUB using Storyteller.
Export EPUB with SMIL overlays.
Run AudioMarkGenerator on the EPUB.
Import the generated .db into Audio Mark.
Select text → jump to audiobook timestamp.

Platform Support

✅ macOS
✅ Linux
✅ Windows (Python 3.11+)

License

You are free to use, modify, and distribute this software under the GNUGPLv3 license.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
README.md		README.md
build_index.py		build_index.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AudioMarkGenerator

Tested With Storyteller EPUBs

⚠️ Important

Getting Started

Requirements

1. Clone the Repository

2. Create a Virtual Environment

macOS / Linux

Windows

3. Install Dependencies

4. Run the Generator

Output

What It Does

Requirements

Search Model

Database Format

metadata

paragraphs

Timestamp Handling

Design Choices

Typical Workflow

Platform Support

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AudioMarkGenerator

Tested With Storyteller EPUBs

⚠️ Important

Getting Started

Requirements

1. Clone the Repository

2. Create a Virtual Environment

macOS / Linux

Windows

3. Install Dependencies

4. Run the Generator

Output

What It Does

Requirements

Search Model

Database Format

metadata

paragraphs

Timestamp Handling

Design Choices

Typical Workflow

Platform Support

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages