Popular repositories
-
bird-sql-forked (Public, forked from ContextualAI/bird-sql)
ContextualAI's text-to-SQL pipeline for BIRD benchmark
Jupyter Notebook
-
daytona-hackday (Public): Browser Use for execution + Galileo for RCA + Claude-code to refine
TypeScript 1
-
text-to-sql-reasoning-dbml-2026 (Public): Text-to-SQL with reasoning and DBML
Python
-
tau2-bench (Public, forked from sierra-research/tau2-bench)
τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment
Python
-
appworld-leaderboard (Public, forked from StonyBrookNLP/appworld-leaderboard)
🌍 Leaderboard Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents", ACL 2024
Python
-
agentbeats-mini-swe-baseline (Public): Baseline mini-swe-agent purple agent for AgentBeats SWE-bench Pro
Python