SelectivBench: Evaluating Selectivity in Linear Recurrent Models

This repository contains source code and simulation scripts to perform the experiments presented in the paper titled "Dissecting Linear Recurrent Models: How Different Gating Strategies Drive Selectivity and Generalization" (2026), arXiv preprint arXiv:2601.12598.

This work introduces SelectivBench, a benchmark suite that extends SymSeqBench to evaluate small-scale language models on tasks of increasing complexity, targeting NLP-relevant capabilities such as selective information routing and generalization.

Note: The code in symseqbench is based on older versions of SeqBench and SymSeq and will be updated to the latest versions soon.

Installation

Install uv:

pip install uv

# or

pipx install uv

Clone the repository:

git clone https://github.com/symseqbench/selectivbench.git
cd selectivbench

Create and activate a virtual environment:

uv venv
source .venv/bin/activate

Install dependencies:

uv pip install -e ".[dev]"

Running Experiments

Exemplary training command

python train.py \
  --batch_size 64 \
  --warmup_steps 1000 \
  --lr 0.001 \
  --train_gap_min 0.1 \
  --classify \
  --base_set one_hot \
  --save_model True \
  --run_main \
  --hash_save \
  --test_seqlen_max 200 \
  --grad_acc -1 \
  --grammar_scale True \
  --amb 24 \
  --d_state 128 \
  --feat_size 1024 \
  --layers 8 \
  --model transformer \
  --seed 3 \
  --train_gap_max 0.1 \
  --train_noise 0 \
  --train_seqlen_max 200

Note: the first run of the above command generate the sequence data in the folder Data/SeqBench. It takes about half an hour to generate the data. For debugging purposes, you could consider changing the number of generated sequences using the --dataset_size argument.

Retrieve and Use W&B Sweeps

This project uses Weights & Biases for experiment tracking and hyperparameter sweeps.

Hyperparameter sweep configuration files for the four benchmark tasks are located in the wb_configs/ directory:

wb_configs/
├── task1_complexity.yaml
├── task1_model_size.yaml
├── task2.yaml
├── task3.yaml
└── task4.yaml

Each YAML file defines a W&B sweep for a specific task.

Launch a Sweep

First, log in to W&B and execute:

python generate_sweep_id.py wb_configs/taskx.yaml

The output will include a Sweep ID (e.g., username/project/sweep_id). Use this ID in the next step.

Run the Agent

Now execute the agent to run the experiments:

wandb agent username/project/sweep_id

You can run multiple agents in parallel across machines to accelerate experimentation.

Citation

If you use SelectivBench code base or its derived tasks from SymSeqBench, please cite the following paper:

Bouhadjar, Y., Fabre, M., Schmidt, F & Neftci, E (2026). Dissecting Linear Recurrent Models: How Different Gating Strategies Drive Selectivity and Generalization. arXiv:2601.12598.

@misc{Bouhadjar26_SelectivBench,
      title={Dissecting Linear Recurrent Models: How Different Gating Strategies Drive Selectivity and Generalization}, 
      author={Younes Bouhadjar and Maxime Fabre and Felix Schmidt and Emre Neftci},
      year={2026},
      eprint={2601.12598},
      publisher={arXiv},
      doi={10.48550/arXiv.2601.12598},
      url={https://arxiv.org/abs/2601.12598}, 
}

Project Structure

SelectivBench/
├── symseqbench/      # Core library for the SymSeqBench library
├── wb_configs/       # W&B sweep config files for each benchmark task
├── models/           # Models implementation
├── train.py          # Training and evaluation script
└── README.md

License

This project is licensed under the MIT License. See the LICENSE file for details.

symseqbench Directory

The symseqbench/ directory contains code derived from both SeqBench and SymSeq projects, both of which are licensed under the MIT License. The code in this directory is therefore also licensed under the MIT License. See the symseqbench/LICENSE file for details.

Additional Note: The dataset/shd_ssc.py file contains code from the Idiap Research Institute and is licensed under BSD-3-Clause. See the file header for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SelectivBench: Evaluating Selectivity in Linear Recurrent Models

Installation

Running Experiments

Exemplary training command

Retrieve and Use W&B Sweeps

Launch a Sweep

Run the Agent

Citation

Project Structure

License

symseqbench Directory

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
models		models
symseqbench		symseqbench
wb_configs		wb_configs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
train.py		train.py
utils.py		utils.py

Folders and files

Latest commit

History

Repository files navigation

SelectivBench: Evaluating Selectivity in Linear Recurrent Models

Installation

Running Experiments

Exemplary training command

Retrieve and Use W&B Sweeps

Launch a Sweep

Run the Agent

Citation

Project Structure

License

symseqbench Directory

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages