A ~10M-parameter LLM designed for reasoning
TerryLM is a compact Transformer project for training and chatting with Terry, a tiny synthetic assistant. The model is designed for long-context reasoning and supports sequences of up to 25K tokens using efficient sliding window attention.
- Long Context Support: Handle 10K-25K token sequences with sliding window attention
- Memory Efficient: Gradient checkpointing and mixed precision training
- Reasoning Capabilities: Improved attention mechanism for better reasoning over long contexts
- Compact Architecture: 256-dimensional embeddings, 8 layers, 8 attention heads
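
Sliding window attention keeps each token's attention local: a query attends only to itself and a fixed number of preceding tokens, so attention cost grows roughly linearly with sequence length instead of quadratically. A minimal sketch of the corresponding attention mask (the function name and tensor layout are illustrative assumptions, not the project's actual implementation):

```python
import torch

def sliding_window_causal_mask(seq_len: int, window: int) -> torch.Tensor:
    # True marks key positions a query may attend to: causal (no future
    # tokens) and at most `window - 1` tokens behind the query.
    pos = torch.arange(seq_len)
    rel = pos[:, None] - pos[None, :]   # query index minus key index
    return (rel >= 0) & (rel < window)

# Example: with window=4, token 6 attends to tokens 3..6 only.
print(sliding_window_causal_mask(seq_len=8, window=4).int())
```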
- Generate Terry conversations:

  ```bash
  python data/generate_terry_dataset.py
  ```

  This writes:
  - src/terry_daily_chat_train.jsonl
  - src/terry_daily_chat_valid.jsonl
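  A quick way to sanity-check the generated data, for example (assumes the standard one-JSON-object-per-line .jsonl layout; the exact record schema is not specified here):

  ```python
  import json

  # Print the first few generated conversations from the training split.
  with open("src/terry_daily_chat_train.jsonl", encoding="utf-8") as f:
      for i, line in enumerate(f):
          if i == 3:
              break
          print(json.loads(line))
  ```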
- Prepare tokenized training data:

  ```bash
  python prepare_data.py
  ```

  This writes:
  - src/processed/terry_train_tokens.txt
  - src/processed/terry_valid_tokens.txt
  - tokenizer/terry_byte/tokenizer_config.json
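  To confirm the prepared data, a rough sketch (the whitespace-separated integer token format is an assumption, not documented behavior):

  ```python
  # Hypothetical sanity check: count tokens in the prepared training split.
  # Assumes the .txt file stores whitespace-separated integer token IDs;
  # the project's actual on-disk format may differ.
  with open("src/processed/terry_train_tokens.txt", encoding="utf-8") as f:
      n_tokens = sum(len(line.split()) for line in f)
  print(f"training tokens: {n_tokens}")
  ```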
- Train:

  ```bash
  python train.py
  ```

  Key parameters in config.py:

  ```python
  @dataclass
  class ModelConfig:
      d_model: int = 256
      n_layers: int = 8
      n_heads: int = 8
      max_seq_len: int = 8192       # Maximum sequence length
      sliding_window: int = 2048    # Local attention window
      use_sliding_window: bool = True
  ```
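  The feature list above mentions sequences up to 25K tokens; a hedged example of overriding the defaults for a longer-context run (assumes `ModelConfig` is importable from `config.py` as shown; whether other settings also need to change is not covered here):

  ```python
  from config import ModelConfig

  # Hypothetical long-context setup: raise the maximum sequence length toward
  # the 25K-token range while keeping the 2048-token local attention window.
  long_ctx_config = ModelConfig(
      max_seq_len=25_000,
      sliding_window=2048,
      use_sliding_window=True,
  )
  print(long_ctx_config)
  ```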
The project uses a local byte-level tokenizer with fixed special token IDs:

- 0: `<|pad|>`
- 1: `<|im_start|>`
- 2: `<|im_end|>`
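
The `<|im_start|>`/`<|im_end|>` pair suggests a ChatML-style conversation format. A sketch of how a prompt could be assembled with these tokens (the exact role names and template the project uses are assumptions here):

```python
def build_chat_prompt(user_message: str) -> str:
    # ChatML-style layout using the special tokens listed above; the precise
    # template (role labels, newlines) is an assumption, not verified behavior.
    return (
        "<|im_start|>user\n"
        f"{user_message}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )

print(build_chat_prompt("Hi Terry, how was your day?"))
```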
Run the example script:

```bash
python example_usage.py
```