[Proposal] support simplestories models #950

@mivanit

Proposal

Support the SimpleStories family of models.

Motivation

The TinyStories models already included in TransformerLens are very useful, both as objects of study in their own right and for debugging research code in low-resource environments before moving on to larger models. The SimpleStories models improve on the TinyStories models and are trained on a much more diverse dataset.

Links

models on HF: https://huggingface.co/SimpleStories
training repo: https://github.com/danbraunai/simple_stories_train
dataset paper: https://arxiv.org/pdf/2504.09184

Checklist

  • I have checked that there is no similar issue in the repo (required)

Labels

  • complexity-moderate: Moderately complicated issues for people who have intermediate experience with the code
  • model-request: Any issues related to requesting additional model support
