|
My long-term goal is to develop intelligent machines capable of understanding, generation, reasoning and agentic action on multi-modality content Currently working on Multimodal Foundation Model Based in Abu Dhabi | PhD Candidate |
ML PhD at MBZUAI
Pinned Loading
-
MetaAgentX/OpenCaptchaWorld
MetaAgentX/OpenCaptchaWorld Public[NeurIPS 2025] The first web-based benchmark and platform to evaluate visual reasoning and interaction capabilities of MLLM powered agents through diverse and dynamic CAPTCHA puzzles.
-
MetaAgentX/NextGen-CAPTCHAs
MetaAgentX/NextGen-CAPTCHAs Public[ICML 2026]A defense framework against MLLM-based web GUI agents. This repository provides both the generative CAPTCHA system and tools for evaluating agent resistance.
Python 20
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.




