An independent reproduction study of "Attention Is Not What You Need" (arXiv 2512.19428).
This repository contains a reproduction of Grassmann flow layers for sequence modeling. The original paper claims performance "within 10-15% of size-matched Transformers" on Wikitext-2. Our reproduction measures a 22.6% perplexity gap, significantly larger than claimed.
| Model | Parameters | Test PPL |
|---|---|---|
| Grassmann (paper arch) | 17.70M | 242.94 |
| Transformer | 17.67M | 198.17 |
Gap: 22.6% (vs claimed 10-15%)
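The reported gap is the relative difference in test perplexity between the two size-matched models. A quick check against the table above (perplexity itself is just `exp` of the mean per-token cross-entropy):

```python
import math

# Values from the results table above
grassmann_ppl = 242.94
transformer_ppl = 198.17

# Relative perplexity gap
gap = (grassmann_ppl - transformer_ppl) / transformer_ppl
print(f"{gap:.1%}")  # → 22.6%

# Sanity check: PPL = exp(mean negative log-likelihood per token)
assert math.isclose(math.exp(math.log(transformer_ppl)), transformer_ppl)
```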
Custom CUDA kernels provide a 2.0x inference speedup:
| Metric | PyTorch | CUDA | Speedup |
|---|---|---|---|
| Full model inference | 9.16 ms | 4.53 ms | 2.0x |
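The timings above can be reproduced with a simple warmup-then-average harness; the sketch below is a generic helper (`bench` is hypothetical, not the repo's actual benchmark script), and for GPU kernels you would additionally call `torch.cuda.synchronize()` before reading the clock:

```python
import time

def bench(fn, warmup=10, iters=100):
    """Average wall-clock milliseconds per call, after warmup iterations.

    For CUDA workloads, synchronize the device before each timestamp so
    asynchronous kernel launches are actually included in the measurement.
    """
    for _ in range(warmup):
        fn()
    start = time.perf_counter()
    for _ in range(iters):
        fn()
    return (time.perf_counter() - start) / iters * 1e3

# Speedup is the ratio of the two averages, e.g. 9.16 ms / 4.53 ms ≈ 2.0x
```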
Full analysis and discussion: blog.md
```shell
# Install dependencies
pip install torch datasets transformers tqdm

# Run reproduction
python train_wikitext2.py --model both --epochs 20

# Build CUDA kernels (optional)
cd src/cuda && python setup.py install
```

Repository layout:

- `train_wikitext2.py` - Training script
- `src/models/grassmann_v4.py` - Paper-exact implementation
- `src/cuda/` - CUDA kernel implementation
- `blog.md` - Full reproduction report
- `technical.md` - Technical details
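The optional `setup.py install` step builds the custom kernels as a PyTorch C++/CUDA extension. A typical `setup.py` for this follows the pattern below (a generic sketch; the package and source file names are hypothetical, not taken from this repo):

```python
from setuptools import setup
from torch.utils.cpp_extension import BuildExtension, CUDAExtension

setup(
    name="grassmann_cuda",  # hypothetical package name
    ext_modules=[
        CUDAExtension(
            name="grassmann_cuda",
            # hypothetical source files: CUDA kernels plus pybind11 bindings
            sources=["grassmann_kernels.cu", "bindings.cpp"],
        )
    ],
    # BuildExtension handles nvcc/host-compiler flags for mixed .cu/.cpp builds
    cmdclass={"build_ext": BuildExtension},
)
```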
Experiments run on NVIDIA H100 SXM5 80GB (Voltage Park Cloud).
```bibtex
@article{arledge2025grassmann,
  title={Grassmann Flows for Sequence Modeling: An Independent Reproduction Study},
  author={Arledge, Elliot},
  year={2025},
  month={December},
  url={https://github.com/Infatoshi/grassmann-flows}
}
```

License: MIT