SMARTFinRAG: An Interactive Modular Framework for Financial RAG Systems

SMARTFinRAG is a comprehensive, modular live-demo system specifically designed for financial domains that addresses critical challenges in Retrieval-Augmented Generation (RAG) systems. Our framework enables customizable RAG evaluation, real-time component swapping, and document-centric assessment to promote trustworthy, document-grounded financial question answering research.

Deployed Live-Demo Available!!!

https://smartfinrag-a9cawsutx5f4pl4j8wnox7.streamlit.app/

Key Capabilities

Interactive Evaluation Framework

Customizable Parameter Configuration: Adjust both RAG and LLM parameters to dynamically configure the generation process
Modular Component Architecture: Selectively enable/disable components in the RAG pipeline for ablation studies and bottleneck identification
Document-Based Evaluation: Utilize a document-centric evaluation paradigm with LLM-as-a-Judge to generate and assess QA pairs
Comprehensive Metrics Suite: Measure both retrieval quality (hit rate, MRR, precision, recall, NDCG) and response quality (faithfulness, relevancy)

Financial Domain Specialization

Finance-Specific Processing: Tailored for financial document understanding with domain-specific components
Multi-Dataset Support: Unified QA schema covering multiple financial datasets
SEC Filings Support: Compatible with the "Generative AI rewritten SEC filings" dataset (Lehner, 2024)
Timeliness Evaluation: Just-in-time document ingestion to assess model performance on recent financial information

Advanced RAG Components

QueryPreprocessor: Named Entity Recognition and query enhancement capabilities
RetrieverFactory: Configure BM25, vector-based, and hybrid retrieval approaches
Document Processing Pipeline: Support for PDF, TXT, DOCX with chunking optimization
Extensible Evaluation System: Automated metrics calculation and result exportation

Implementation

Setup

Create a virtual environment (recommended):

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Create a .env file in the root directory and add your API keys:

OPENAI_API_KEY=your_openai_api_key_here

HUGGINGFACE_API_KEY=your_hf_api_key_here

OPENROUTER_API_KEY=your_openrouter_api_key_here

Usage

Start the application:

streamlit run app.py

Upload financial documents using the sidebar
Click "Process Documents" to index the uploaded files
Configure RAG components through the interface
Start asking questions in the chat interface
View evaluation metrics and experiment with different configurations

Document Guidelines

Upload relevant financial documents (PDF, TXT, DOCX)
For optimal performance, use well-structured financial documents
The system supports both batch processing and real-time document indexing
Documents are stored securely in the local environment

Architecture

SMARTFinRAG implements a modular architecture with the following key components:

Document Processing: Ingestion, parsing, and chunking of financial documents
Retrieval System: Configurable retrieval mechanisms with multiple strategies
Generation Module: LLM-based response synthesis with context integration
Evaluation Framework: Automatic assessment of retrieval and generation quality
Web Interface: Interactive UI for real-time experimentation and visualization

Security Considerations

Never upload documents containing sensitive personal information
API keys should be kept secure and never shared
Document storage is local to the application

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.devcontainer		.devcontainer
__pycache__		__pycache__
config		config
pages		pages
src		src
storage		storage
.gitignore		.gitignore
README.md		README.md
app.py		app.py
experiments.ipynb		experiments.ipynb
llm_providers.py		llm_providers.py
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SMARTFinRAG: An Interactive Modular Framework for Financial RAG Systems

Deployed Live-Demo Available!!!

Key Capabilities

Interactive Evaluation Framework

Financial Domain Specialization

Advanced RAG Components

Implementation

Setup

Usage

Document Guidelines

Architecture

Security Considerations

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SMARTFinRAG: An Interactive Modular Framework for Financial RAG Systems

Deployed Live-Demo Available!!!

Key Capabilities

Interactive Evaluation Framework

Financial Domain Specialization

Advanced RAG Components

Implementation

Setup

Usage

Document Guidelines

Architecture

Security Considerations

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages