Software Engineer | Data & AI Engineer · Fremont, CA
🎓 MS Applied Data Science (AI concentration) · University of Michigan, May 2026
🎓 B.Tech Electrical Engineering · IIT Hyderabad (Indian Institute of Technology)
Languages & data
Python · SQL · PySpark · Pandas · HTML/CSS/JS
Cloud & infrastructure
AWS (Lambda · S3 · RDS · SQS · API Gateway · Bedrock) · Docker · CI/CD
AI & GenAI
LangChain · RAG · VectorDB · AWS Bedrock · Prompt Engineering
ML
Scikit-Learn · PyTorch · XGBoost · Random Forest · KMeans · PCA · NLP
APIs & backend
Flask · FastAPI · SQLAlchemy · RESTful APIs · Microservices
| Project | What it does | Stack |
|---|---|---|
| MoveSmart | End-to-end city intelligence platform — ingests 5 government datasets (Census, FBI, CDC, NOAA, EPA), engineers features, applies PCA + KMeans clustering and content-based filtering to score 900+ U.S. metros across 6 life themes. RAG pipeline with ChromaDB embeddings and semantic search generates personalized AWS Bedrock LLM summaries. Fully deployed. | Python · PySpark · ChromaDB · RAG · LangChain · AWS Bedrock · Streamlit |
| Airbnb Success Prediction | What makes an Airbnb listing successful? Predicts review activity across 52K California listings using XGBoost, Random Forest, KNN and Linear Regression, enriched with FBI crime, EPA walkability, and Census data. Unsupervised segmentation via PCA + KMeans reveals 4 distinct listing archetypes by property scale, stay strategy, and neighborhood context. | Python · XGBoost · Random Forest · KMeans · PCA · GeoPandas · Scikit-Learn · Pandas |
| US Automotive Trade Analysis | How do GDP and tariffs shape U.S. automotive trade? Analyzes imports, exports and trade balance across 30+ countries from 2008–2022, integrating ITA trade data, World Bank GDP, and WITS tariff datasets. Includes Spearman correlation, OLS regression, and animated visualizations revealing how the 2018 tariff spike and COVID-19 impacted trade flows. | Python · Pandas · Matplotlib · Seaborn · Scipy · OLS Regression |
- Built serverless REST APIs on AWS Lambda + API Gateway, containerized with Docker, deployed via GitLab CI/CD — exposing golden records for insurance workflows at Accenture
- Designed event-driven microservices with AWS SQS processing real-time insurance domain events for transactional downstream systems
- Automated a 4M+ row Excel process into a Python pipeline — cut processing time from half a day to under 10 minutes at MatrixIntelligence
- Built end-to-end ETL pipelines delivering structured datasets to S3 for downstream ML systems, saving 20+ hours/month of manual effort
- Built an HR data agent with LangChain and custom tools enabling natural language queries over employee datasets via prompt engineering and LLM pipelines
- 📍 Fremont, CA · Open to relocation
- 🎓 Graduating April 2026
- 💼 Open to Software Engineer, Data Engineer, and AI Engineer roles
- 📬 divya.andem.1@gmail.com · LinkedIn · GitHub · Portfolio
"Build things that work. Then make them work better."

