I build:
- Retrieval-Augmented Generation (RAG) systems
- AI-powered information retrieval pipelines
- Machine learning prediction models
- AI backend services
AI-powered personalized news retrieval system using LLM-based agent, query expansion, web search, credibility filtering, and summarization.
User Email + Query
β
Fetch User Interests (SQL)
β
AI Agent (Tool Calling)
β
LLM Query Expansion
β
DuckDuckGo Search
β
Credibility Filtering
β
News Summarization
- Python
- Groq LLM API
- DuckDuckGo Search
- SQL
Retrieval-Augmented Generation system for answering medical queries using domain documents.
Medical Documents
β
Text Chunking
β
Embeddings (Sentence Transformers)
β
Vector Database (Chroma)
β
Semantic Retrieval
β
LLM Answer Generation
- PDF ingestion and chunking
- Embeddings and semantic search
- Vector database retrieval
- LLM-based answer generation
- Python
- LangChain
- ChromaDB
- Sentence Transformers
- LLaMA / Mistral
Machine learning model predicting turbine failures using 40+ sensor variables.
Sensor Data
β
Data Cleaning
β
Feature Scaling
β
Neural Network Model
β
Failure Prediction
- Improved minority-class recall 32% β 92%
- Achieved 95% overall accuracy
- Addressed severe class imbalance
- Python
- Scikit-learn
- Neural Networks
- Pandas / NumPy
Computer vision model detecting helmet usage using VGG16 transfer learning.
Image Dataset
β
Data Augmentation
β
Transfer Learning (VGG16)
β
CNN Training
β
Helmet Classification
- 96.8% validation accuracy
- Implemented data augmentation and transfer learning
- Python
- TensorFlow
- CNN
- VGG16
Python, SQL
Scikit-learn
Feature Engineering
Model Evaluation
Classification / Regression
TensorFlow
PyTorch
CNNs
Transfer Learning
RAG Pipelines
LangChain
Embeddings
Prompt Engineering
Query Expansion
FastAPI
Git
Streamlit
I also contribute to open source using another account:
https://github.com/sakeena-7878
