Skip to content

rodcochran/kan_bert

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

KANBERT

KANBERT is an advanced model that integrates Kolmogorov-Arnold Networks (KANs) into the BERT framework. This repository provides the codebase to reproduce the experiments and results described in our work.

For a detailed overview of the experimentation process, refer to the kanbert.pdf file.

Overview

This project includes:

  • main.py: A predefined script to execute all the experiments as defined in experiments.py.
  • kan_bert.py: Contains the implementation of the KAN layers integrated into the BERT model using PyTorch.
  • train.py: Defines the training class to set up and run training for a single experiment with default parameters.
  • evaluation.py: Contains the evaluation class to run the same evaluation as in the experiments for a single experiment.
  • experiments.py: Defines experiments for testing KANBERT and BERT-baseline models.

Getting Started

Prerequisites

Ensure you have Python 3.9 installed. You can install the required Python packages using the requirements.txt file:

pip install -r requirements.txt

The models used in the predefined experiments were trained on systems equipped with A100 GPUs, each with up to 40GB of GPU RAM.

License

This project is licensed under the MIT License. See the LICENSE file for details.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors