SaFoLab : Security and Safe Foundation Model Systems
Pinned Loading
Repositories
- armor Public
SaFo-Lab/armor’s past year of commit activity - AdaShield Public
[ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting."
SaFo-Lab/AdaShield’s past year of commit activity - ReasoningBomb Public
The official implementation of our preprint paper "ReasoningBomb: A Stealthy Denial-of-Service Attack by Inducing Pathologically Long Reasoning in Large Reasoning Models"
SaFo-Lab/ReasoningBomb’s past year of commit activity - DoxBench Public
[ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"
SaFo-Lab/DoxBench’s past year of commit activity - AutoDAN-Turbo Public
[ICLR 2025 Spotlight] The official implementation of our ICLR2025 paper "AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs".
SaFo-Lab/AutoDAN-Turbo’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Most used topics
Loading…