- Machine Learning Engineer at American Express
- Alumnus of IIT Delhi
- Passionate about AI, Machine Learning, and Data Science
- Always learning more about ML/AI advancements and automated systems
Pinned
smolagents (Public, forked from huggingface/smolagents)
🤗 smolagents: a barebones library for agents that think in code.
Python
grpo_vs_gdpo (Public)
GDPO vs GRPO: Dominance-Based Multi-Objective Optimization in RL for LLMs
Jupyter Notebook



