Exploiting and defending neural networks (neural network attack and defense series)
Updated Mar 2, 2021 - Python
LLM-as-a-Judge security layer for Microsoft Copilot Studio agents
AiShields is an open-source sanitizer for artificial intelligence data inputs and outputs
Inspired by dynamic taint tracking, PoisonSpot uses a fine-grained training provenance tracker that: (1) tags and traces the impact of every training sample on model updates, (2) probabilistically scores suspect samples based on the lineage of their impact on model weights, and (3) separates clean samples from poisoned ones before the model is retrained.
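The three steps in the PoisonSpot description can be illustrated with a toy sketch. This is not PoisonSpot's code or algorithm: the function names, the 1-D linear model, the "sum of absolute weight updates" impact log, the median-ratio score, and the threshold of 3.0 are all assumptions chosen to keep the example small, and the score is a simple heuristic standing in for the probabilistic scoring the description mentions.

```python
import statistics

def train_with_provenance(samples, epochs=5, lr=0.05):
    """Per-sample SGD on the toy model y = w * x.

    Alongside training, keep a hypothetical provenance log: for each
    sample, the total size of the weight updates it caused.
    """
    w = 0.0
    impact = [0.0] * len(samples)
    for _ in range(epochs):
        for i, (x, y) in enumerate(samples):
            grad = 2.0 * (w * x - y) * x   # d/dw of squared error
            step = -lr * grad
            w += step
            impact[i] += abs(step)         # step (1): tag and trace impact
    return w, impact

def score_suspects(impact):
    """Step (2), simplified: score each sample relative to the median impact."""
    med = statistics.median(impact)
    return [v / med for v in impact]

def split_clean_poison(samples, scores, threshold=3.0):
    """Step (3): separate samples whose score exceeds an assumed threshold."""
    clean = [s for s, z in zip(samples, scores) if z <= threshold]
    poison = [s for s, z in zip(samples, scores) if z > threshold]
    return clean, poison

# Ten clean samples following y = 2x, plus one label-flipped sample at the end.
data = [(0.5 + 0.05 * i, 2.0 * (0.5 + 0.05 * i)) for i in range(10)]
data.append((1.0, -2.0))

w_all, impact = train_with_provenance(data)
scores = score_suspects(impact)
clean, poison = split_clean_poison(data, scores)
w_clean, _ = train_with_provenance(clean)  # retrain on the filtered set
print("flagged:", poison)
print("w before filtering:", w_all, "after:", w_clean)
```

On this toy data the label-flipped sample repeatedly pulls the weight away from the value the clean samples agree on, so its accumulated impact stands out and it is the only sample flagged. Real poisoning attacks are far subtler, which is why a fine-grained lineage tracker and a more principled scoring method would be needed in practice.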