LawBot: Enhancing LLMs with RAG for Legal Precision

Group Project of COMP0087: Statistical Natural Language Processing

Introduction

LawBot is a simple yet effective framework tailored to specialized legal domains, operational without training. Utilizing Chinese legal and regulatory documents as the knowledge base, LawBot enhances the breadth of retrieval through multi-query generation and hybrid search strategies. It increases precision with metadata filtering and confirms the plausibility of knowledge through context-based reranking. Remarkably, all these procedures are conducted via zero-shot prompting, making LawBot broadly applicable even when LLMs are accessible only through a black-box API.

Installation

Follow these steps to set up the LawBot environment on your local machine:

Clone the repository:

git clone https://github.com/yix8/LawBot.git

Navigate to the LawBot directory:
```
cd LawBot
```
Install the required packages:
```
pip install -r requirements.txt
```
Build the general vector store:
```
python framework/embed_laws.py
```

Build the specific vector store:

python framework/finetune_data/embed_query.py

Configuration

Set up the necessary API keys in the .env file located in the framework folder:

OPENAI_API_KEY: Your OpenAI API key.
COHERE_RERANK_KEY: Your Cohere rerank API key.
LANGCHAIN_API_KEY: Your LangChain API key.
LANGCHAIN_PROJECT: Your LangChain project identifier.

Running LawBot

To interact with the model via a web interface:

python framework/App.py

China Law Query Synthetic

We also proposed an open-source QA dataset, the Chinese Legal Question Answering dataset (CLQS) which can be utilized as an instruction dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
CLQS		CLQS
__pycache__		__pycache__
evaluation_results		evaluation_results
framework		framework
imgs		imgs
models		models
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LawBot: Enhancing LLMs with RAG for Legal Precision

Introduction

Installation

Configuration

Running LawBot

China Law Query Synthetic

About

Uh oh!

Releases

Packages

Uh oh!

Languages

yix8/LawBot

Folders and files

Latest commit

History

Repository files navigation

LawBot: Enhancing LLMs with RAG for Legal Precision

Introduction

Installation

Configuration

Running LawBot

China Law Query Synthetic

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages