Fix hardcoded TinyStories data path in train_large/train_large_ane by nabbilkhan · Pull Request #29 · maderix/ANE

nabbilkhan · 2026-03-03T19:20:18Z

Why I worked on this

First, thank you for building this project. ANE is honestly awesome, and I’m actively using it in real workflows.

I’m running these training pipelines as part of my Open Claw agent environment across multiple Apple machines and different launch contexts (manual runs, scripted runs, and restart-driven runs). In that setup, I kept hitting the same issue: the static trainers expected token data at a fixed path.

The run would work in one context, then break in another, and it could also fail after exec() restart because the path context was not explicit. That created unnecessary friction for real usage and for onboarding other people.

What this PR changes

Adds --data PATH to train_large and train_large_ane
Replaces hardcoded token data path with a runtime-configurable path
Preserves --data across exec() restart so resumed training keeps the same dataset
Improves missing-data error text with clear next steps
Updates training/README.md with examples and flag documentation

Why this helps the community

Makes the project easier to run from any working directory
Supports custom dataset locations without source edits
Prevents restart-related path regressions
Reduces first-run setup failures for new contributors
Improves compatibility with scripted/automated setups

Validation

I validated this on Apple Silicon macOS with explicit absolute paths:

train_large --steps 11 --data <abs-path> (forces restart path)
train_large_ane --no-ane-extras --steps 11 --data <abs-path> (same restart validation)
Missing-path negative check confirms clear error guidance

Both restart paths resumed correctly and continued reading token data as expected.

Personal note

I’m very excited about this project and would love to keep contributing to its growth. I’ve contributed multiple times to Open Claw, and I’m using ANE in serious, practical workflows, so I plan to keep sending real-world fixes and useful benchmark data upstream.

maderix · 2026-03-04T12:25:29Z

Thanks — threading --data through execl() restarts was the tricky part and you got it right. Merged!

Add --data path support for static training pipelines

c04168e

maderix merged commit 032f866 into maderix:main Mar 4, 2026

maderix mentioned this pull request Mar 4, 2026

Community contributions: M1-M4 compat, security fixes, docs, benchmarks, and community dashboard #25

Closed

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix hardcoded TinyStories data path in train_large/train_large_ane#29

Fix hardcoded TinyStories data path in train_large/train_large_ane#29
maderix merged 1 commit intomaderix:mainfrom
nabbilkhan:contrib/fix-training-data-paths

nabbilkhan commented Mar 3, 2026 •

edited

Loading

Uh oh!

maderix commented Mar 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

nabbilkhan commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why I worked on this

What this PR changes

Why this helps the community

Validation

Personal note

Uh oh!

maderix commented Mar 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

nabbilkhan commented Mar 3, 2026 •

edited

Loading