GenRAG

GenRAG is a terminal tool designed to set up a Retrieval-Augmented Generation Pipeline locally from scratch, without utilizing any high-level frameworks like LangChain or vector databases. It includes features such as recursive text splitting, chunking, and building embeddings. The embeddings are stored in a CSV file,without using any Vector Databases and searching is based on cosine similarity.

Features

Recursive text splitting
Text chunking
Building embeddings
Storing embeddings in a CSV file
Searching based on cosine similarity
Inference from local LLM/API

Installation

Clone the repository:

(https://github.com/CoDIngDEMon018/PDF-Summarizer)
cd GenRAG

Create and activate a virtual environment:

python3 -m venv env
source env/bin/activate  # On Windows use `env\Scripts\activate`

Install the required dependencies:
```
pip install -r requirements.txt
```

Usage

Add a PDF file to the data folder and change the path in the script as necessary.
Create embeddings by running:
```
python3 create_embeddings.py
```
You should see a CSV file generated in the data folder.
Run the main script:
```
python3 main.py
```
Enter your query when prompted.

LLM Response

You can use both a local LLM or an LLM from an API like Gemini for generating responses.

Local LLM: If you have the capability to run a local LLM, you can use it for generating responses. Cause mine is too slow :(
LLM from API: If your system is not powerful enough for local inference, you can use an API like Gemini. To do this, create a .env file and pass the Gemini API key.

Using Gemini API

Create a .env file in the root directory of the project.
Add your Gemini API key to the .env file:
```
GEMINI_API_KEY=your_api_key_here
```
The system will use the API key for generating responses through the Gemini API.

Credits

Special thanks to the following YouTube channels and research papers for their invaluable resources and insights:

YouTube Channels

Research Papers

Patrick Lewis ., "Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks" arXiv:2005.11401
Vaswani et al., "Attention is All You Need" arXiv:1706.03762
Reimers et al., "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks" arXiv:1908.10084

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data		data
env		env
llm		llm
services		services
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
create_embeddings.py		create_embeddings.py
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GenRAG

Features

Installation

Usage

LLM Response

Using Gemini API

Credits

YouTube Channels

Research Papers

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

CoDIngDEMon018/PDF-Summarizer

Folders and files

Latest commit

History

Repository files navigation

GenRAG

Features

Installation

Usage

LLM Response

Using Gemini API

Credits

YouTube Channels

Research Papers

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages