A deep-learning chess engine trained on Lichess game data. The project implements both CNN and Transformer architectures for chess move prediction, along with a minimal UI and a training pipeline.
- Install Poetry (package manager):

  ```shell
  curl -sSL https://install.python-poetry.org | python3 -
  ```

- Clone the repository:

  ```shell
  git clone https://github.com/yourusername/chess-ai.git
  cd chess-ai
  ```

- Install dependencies:
  ```shell
  poetry install
  ```

Project structure:

```
chess-ai/
├── chess_ai/
│   ├── models/
│   │   ├── cnn/          # CNN architecture
│   │   └── transformer/  # Transformer architecture
│   ├── data/             # Data loading and processing
│   ├── training/         # Training logic
│   ├── ui/               # User interfaces
│   └── utils/            # Helper functions
├── scripts/              # Command-line tools
└── notebooks/            # Jupyter notebooks
```
Train a model using the command-line interface:

```shell
poetry run python scripts/train.py \
    --model-type cnn \
    --data-path /path/to/pgn/data \
    --rating-range "1600-2000" \
    --batch-size 64 \
    --epochs 10 \
    --learning-rate 0.001
```

Available options:

- `--model-type`: Choose between `cnn` or `transformer`
- `--rating-range`: Elo rating range for training data
- `--batch-size`: Training batch size
- `--epochs`: Number of training epochs
- `--learning-rate`: Learning rate
- `--save-dir`: Directory to save model checkpoints
Play against a trained model from the command line:

```shell
poetry run python scripts/play_cli.py --checkpoint-path <path> [--model-type <type>] [--value-checkpoint-path <path>]
```

TODO: Check and complete this section.
The CNN architecture uses:
- Convolutional layers with batch normalization
- Residual connections
- Global average pooling
- Dense layers for move prediction
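The components above could be assembled roughly as follows. This is a hypothetical sketch in PyTorch, not the project's actual code: the class names, channel counts, block depth, and the 4096-way move space (64 from-squares × 64 to-squares) are illustrative assumptions.

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Conv -> BN -> ReLU -> Conv -> BN, with a skip connection."""
    def __init__(self, channels=64):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1)
        self.bn2 = nn.BatchNorm2d(channels)

    def forward(self, x):
        h = torch.relu(self.bn1(self.conv1(x)))
        h = self.bn2(self.conv2(h))
        return torch.relu(x + h)  # residual connection

class ChessCNN(nn.Module):
    """Stem conv -> residual tower -> global average pooling -> dense head."""
    def __init__(self, in_channels=12, channels=64, n_moves=4096):
        super().__init__()
        self.stem = nn.Sequential(
            nn.Conv2d(in_channels, channels, 3, padding=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(),
        )
        self.blocks = nn.Sequential(*[ResidualBlock(channels) for _ in range(4)])
        self.pool = nn.AdaptiveAvgPool2d(1)       # global average pooling
        self.head = nn.Linear(channels, n_moves)  # dense move-prediction head

    def forward(self, x):                         # x: (batch, 12, 8, 8)
        h = self.blocks(self.stem(x))
        h = self.pool(h).flatten(1)
        return self.head(h)                       # (batch, n_moves) logits
```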
Input format:
- 12-channel 8x8 board representation (6 piece types × 2 colors)
- Additional features (castling rights, turn indicator)
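The 12-channel board encoding can be sketched in plain NumPy. The plane ordering and the FEN-based helper below are illustrative assumptions, not the project's actual data pipeline; the extra features (castling rights, turn) would be appended separately.

```python
import numpy as np

# Assumed plane ordering: white P,N,B,R,Q,K in planes 0-5, black in 6-11.
PIECE_TO_PLANE = {p: i for i, p in enumerate("PNBRQKpnbrqk")}

def encode_fen(fen: str) -> np.ndarray:
    """Encode the board part of a FEN string as a 12x8x8 one-hot array."""
    board, turn, castling, *_ = fen.split()  # turn/castling: extra features, not encoded here
    planes = np.zeros((12, 8, 8), dtype=np.float32)
    for rank, row in enumerate(board.split("/")):  # rank 0 = 8th rank
        file = 0
        for ch in row:
            if ch.isdigit():
                file += int(ch)  # digit = run of empty squares
            else:
                planes[PIECE_TO_PLANE[ch], rank, file] = 1.0
                file += 1
    return planes

start = "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w KQkq - 0 1"
x = encode_fen(start)  # 32 pieces set across the 12 planes
```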
The experimental transformer architecture includes:
- Separate embeddings for pieces and positions
- Multi-head self-attention
- Positional encoding
- Encoder-decoder architecture
This architecture is still under development and may not perform as well as the CNN model.
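The piece/position embedding idea can be sketched as below. This is a simplified, hypothetical encoder-only illustration in PyTorch (the project describes an encoder-decoder model); the token vocabulary (12 pieces + empty), model width, and 4096-way move head are assumptions.

```python
import torch
import torch.nn as nn

class ChessTransformer(nn.Module):
    """Each of the 64 squares is a token: piece embedding + position embedding."""
    def __init__(self, n_piece_ids=13, d_model=64, n_moves=4096):
        super().__init__()
        self.piece_emb = nn.Embedding(n_piece_ids, d_model)  # 12 pieces + empty
        self.pos_emb = nn.Embedding(64, d_model)             # learned positional encoding
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(d_model, n_moves)              # move-prediction logits

    def forward(self, squares):  # squares: (batch, 64) integer piece ids
        pos = torch.arange(64, device=squares.device)
        h = self.piece_emb(squares) + self.pos_emb(pos)      # sum the two embeddings
        h = self.encoder(h)                                  # multi-head self-attention
        return self.head(h.mean(dim=1))                      # pool over squares
```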
The tests were written quickly and are not robust; they will most likely not pass in your environment. Run them with:

```shell
poetry run pytest
```