This collection of open-source projects was created for educational purposes, to help students and researchers learn about:
- System Design and Distributed Platforms
- AI Agents and Large Language Models
- Data Engineering and Analytics
- Research-to-Implementation Workflows
- Cloud Orchestration and Scalability
- Modern Software Development Practices
Declarative LLM Programming with DSPy: Progressive labs for learning
LLM Fine-Tuning Practice: PEFT, LoRA, and advanced techniques
DuckDB Analytics Practice: SQL optimization and analytics
Apache Spark Practice: Batch and stream processing
Apache Iceberg Lakehouse Practice: Time travel and schema evolution
Apache Beam Practice: Unified batch and stream processing
Scala Data Analysis Practice: Functional programming and big data
Automated Research Collection: Multi-source paper aggregation and scoring
Research to Implementation: Agentic framework for converting papers to code
Quantitative Finance Research: Framework for implementing trading strategies
Cloud Orchestration Gateway: Scaling research-to-repo pipelines
AGI/ASI Educational Resources: Curated collection for learning
AI-Powered Research Assistant: Academic paper discovery and analysis
Table Formats Education: Comparison of Iceberg, Delta Lake, and Hudi
Technical Learning Notes: AI/ML, System Design, Data Engineering
- Start with practice repositories for hands-on learning
- Progress to framework projects for understanding architecture
- Explore research resources for advanced concepts
- Use research tools for literature review automation
- Study implementation frameworks for research-to-code workflows
- Leverage cloud orchestration for scaling research
- Practice with code repositories for skill development
- Study framework architecture for system design
- Explore data engineering tools for infrastructure learning
⭐ Star repositories you find helpful for your learning journey!
All projects are created for educational purposes.
