GitHub - azeebneuron/GPTFromScratch: Building a GPT model from the ground up, implementing self-attention, multi-head attention, and transformer blocks. Developed a custom autograd engine for neural networks, applying it to binary classification using micrograd.

This repository documents my journey of implementing a GPT model from the ground up.
Every step is explained in Jupyter notebooks with code, notes, and experiments, making it a resource for anyone curious about how GPTs really work under the hood.

Current Progress

micrograd (autograd engine from scratch)
makemore Part 1 (building a character-level language model)

What’s Next

Tokenizer and transformer implementation
Profiling and benchmarking
Custom Triton kernels (e.g., FlashAttention2)
Distributed and memory-efficient training
Scaling experiments
Data preprocessing and filtering from raw sources
Alignment methods: supervised finetuning, reinforcement learning, and DPO

Acknowledgments

Following Andrej Karpathy’s neural net series for inspiration and guidance.
Can’t thank him enough for making this stuff feel fun instead of intimidating.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
Makemore		Makemore
Micrograd		Micrograd
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Current Progress

What’s Next

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Current Progress

What’s Next

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages