Skip to content

tanmay271/RAG-Qdrant-AI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

High-Fidelity RAG Engine: Qdrant & OpenAI Integration

About This Project

This repository features an end-to-end Retrieval-Augmented Generation (RAG) system built with a focus on clean software architecture and production-readiness.

Rather than relying on abstracted, heavy-weight orchestration frameworks, this project utilizes a custom-built, highly modular Python backend. It securely ingests unstructured PDF documents, processes them using token-aware semantic chunking, and leverages Qdrant Cloud and OpenAI's GPT models to deliver grounded, context-aware AI responses.

🏗️ Architectural Highlights

This application is designed with strict Separation of Concerns (SoC), making the codebase highly scalable, testable, and maintainable. The backend pipeline is decoupled into specialized micro-services:

  • ingestion.py: Handles robust document parsing and extraction using PyPDF2.
  • embeddings.py: Executes token-optimized semantic chunking using tiktoken to maximize OpenAI API efficiency and generate high-dimensional vector embeddings.
  • qdrant_client.py: Manages secure cloud connections and vector indexing within the Qdrant database.
  • retrieval.py & generation.py: Orchestrates similarity search and precise prompt engineering to feed context-rich data to gpt-3.5-turbo / gpt-4, effectively mitigating LLM hallucinations.

⚙️ Tech Stack

  • AI & Embeddings: OpenAI API, tiktoken
  • Vector Database: Qdrant Cloud
  • Backend: Python, numpy, python-dotenv
  • Data Processing: PyPDF2

This project demonstrates my ability to not only implement cutting-edge Generative AI concepts but to engineer them using scalable, enterprise-grade software design patterns.


About

High-performance RAG pipeline engineered to eliminate LLM hallucinations during complex document analysis. Leverages token-aware chunking via tiktoken, custom PyPDF2 data extraction, OpenAI embeddings, and Qdrant vector databases to transform unstructured data into highly accurate, grounded, and context-rich AI insights.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages