[NAACL 2025] Official Implementation of H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables.
In this paper, we introduce H-STAR, a novel algorithm that integrates both symbolic and semantic approaches to perform tabular reasoning tasks. H-STAR decomposes table reasoning into two stages: 1) Table Extraction and 2) Adaptive Reasoning.
Create and activate the environment by running
conda create -n hstar python=3.9
conda activate hstar
pip install -r requirements.txt
pip install records==0.5.3
Create a file named <file_name>.pth in the [PATH to Conda]/envs/hstar/lib/python3.9/site-packages/ directory, and paste the project root path [PATH to H-STAR] into it.
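The step above can be sketched as follows. This is a demonstration only: a temporary directory stands in for your environment's site-packages, and the project path is a placeholder, so substitute the real paths on your machine.

```shell
# Demonstration: a temp dir stands in for your conda env's site-packages.
# In practice use [PATH to Conda]/envs/hstar/lib/python3.9/site-packages/.
SITE_PACKAGES=$(mktemp -d)
PROJECT_ROOT="/path/to/H-STAR"   # placeholder: your clone's root
# Python appends every path listed in a site-packages .pth file to sys.path.
echo "$PROJECT_ROOT" > "$SITE_PACKAGES/hstar.pth"
```

With the .pth file in place, modules under the project root become importable from anywhere inside the `hstar` environment.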
Benchmark datasets studied in the paper have been provided in the datasets/ directory.
Apply for and obtain an API key from the OpenAI API, and save the key in key.txt.
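For example, the key file can be written as below; the key value shown is a placeholder, not a real credential.

```shell
# Placeholder value -- replace with your actual OpenAI API key.
printf 'sk-your-openai-key\n' > key.txt
```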
To run the Gemini models, generate an API key from Vertex AI and store it as a .json file in the directory.
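If the Google client libraries do not pick up the JSON file automatically, it can be exposed via the standard `GOOGLE_APPLICATION_CREDENTIALS` environment variable. This is a minimal sketch; the filename `vertex_key.json` is an assumption, so use whatever name you saved the file under.

```python
# Sketch: point Google's client libraries at the Vertex AI
# service-account JSON. "vertex_key.json" is a placeholder filename.
import os

os.environ["GOOGLE_APPLICATION_CREDENTIALS"] = "vertex_key.json"
```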
Run the H-STAR pipeline for different Large Language Models (LLMs) using:
For OpenAI models:
python run_gpt.py
For Gemini/PaLM models:
python run_gemini.py
The outputs for every intermediate step in the pipeline are saved in the results/ directory.
Evaluate the results for TabFact/WikiTQ using the notebook
evaluate.ipynb
Evaluate FetaQA using the command-line instruction
python fetaqa_score.py --model_name [MODEL_NAME]
Set model_name to the desired LLM.
If you find our paper or the repository helpful, please cite us with
@article{abhyankar2024h,
title={H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables},
author={Abhyankar, Nikhil and Gupta, Vivek and Roth, Dan and Reddy, Chandan K},
journal={arXiv preprint arXiv:2407.05952},
year={2024}
}
This implementation is based on Binding Language Models in Symbolic Languages. The work has also benefitted from TabSQLify: Enhancing Reasoning Capabilities of LLMs Through Table Decomposition. Thanks to the authors for releasing their code.
For any questions or issues, you are welcome to open an issue in this repo, or contact us at nikhilsa@vt.edu, keviv9@gmail.com.
