Fix run_all attacks configuration by marcorosa · Pull Request #116 · SAP/STARS

marcorosa · 2025-09-29T14:16:45Z

Define the configuration for the run_all attack
Fix pyrit's tools (they were not showing results after completion)

github-actions · 2025-09-29T14:17:03Z

These changes refine attack handling: output printing is enabled for better visibility, default configurations are adjusted, and more attack types are covered. The default attack configurations in default.json now set additional parameters, giving finer control over how each attack runs.

Walkthrough

Refactor: Changed print_output from False to True in several attack execution cases for increased visibility during operations.
Documentation: Improved logging with detailed attack start messages.
Chore: Revised attack model configurations in default.json, optimizing parameters for various attack types, e.g., adding max_turns and max_backtracks.
New Feature: Introduced new attack types (artprompt, encoding, goodside, etc.) expanding the system capabilities and versatility.

Model: gpt-4o | Prompt Tokens: 1427 | Completion Tokens: 176

github-actions

Here's a friendly code review powered by AI assistance. These insights offer suggestions and observations that may help improve your work, though they're not absolute truths. Please take what serves you best and feel free to disregard anything that doesn't fit your approach. You remain the expert on your project—AI simply provides another perspective to consider as you make your development choices.

Always critique what AI says. Do not let AI replace YOUR I.
Model: anthropic--claude-4-sonnet | Prompt Tokens: 2923 | Completion Tokens: 695

github-actions · 2025-09-29T14:17:14Z

+    {
+      "attack": "encoding",
+      "target-model": "<target>"
+    },
+    {
+      "attack": "goodside",
+      "target-model": "<target>"
+    },
+    {
+      "attack": "latentinjection",
+      "target-model": "<target>"
+    },
+    {
+      "attack": "malwaregen",
+      "target-model": "<target>"
+    },
+    {
+      "attack": "phrasing",
+      "target-model": "<target>"
+    },
+    {
+      "attack": "promptinject",
+      "target-model": "<target>"
+    },
+    {
+      "attack": "suffix",
+      "target-model": "<target>"
    }


Consider adding a validation schema or documentation for the configuration file. The current structure mixes attack types with different parameter requirements, which could lead to runtime errors. Also, ensure that every attack type is supported by the codebase before including it in the default configuration.
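
The validation suggested in this comment could look roughly like the sketch below. It is illustrative only and not taken from the STARS codebase: the function name `validate_attack_specs`, the `REQUIRED_KEYS` rule, and the hard-coded `KNOWN_ATTACKS` set are assumptions, with the attack names copied from the diff and the walkthrough. A real check would more likely derive the known attacks from whatever registry the project uses.

```python
import json

# Assumption: every entry must carry these two keys, as in the diff above.
REQUIRED_KEYS = {"attack", "target-model"}

# Assumption: attack names copied from the diff and the walkthrough;
# a real project would derive this set from its own attack registry.
KNOWN_ATTACKS = {
    "artprompt", "encoding", "goodside", "latentinjection",
    "malwaregen", "phrasing", "promptinject", "suffix",
}


def validate_attack_specs(specs):
    """Return a list of problems found; an empty list means the specs passed."""
    problems = []
    for index, spec in enumerate(specs):
        if not isinstance(spec, dict):
            problems.append(
                f"entry {index}: expected an object, got {type(spec).__name__}"
            )
            continue
        missing = REQUIRED_KEYS - spec.keys()
        if missing:
            problems.append(f"entry {index}: missing keys {sorted(missing)}")
        attack = spec.get("attack")
        if attack is not None and attack not in KNOWN_ATTACKS:
            problems.append(f"entry {index}: unknown attack {attack!r}")
    return problems


# Hypothetical sample: one valid entry and two deliberately broken ones.
sample = json.loads("""[
  {"attack": "encoding", "target-model": "<target>"},
  {"attack": "suffix"},
  {"attack": "not-a-real-attack", "target-model": "<target>"}
]""")

for problem in validate_attack_specs(sample):
    print(problem)
```

Running such a check before the attacks start would surface a malformed or unsupported entry up front rather than partway through a run_all execution.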

marcorosa added 2 commits September 29, 2025 16:13

Fix pyrit tools results shown

31f3712

Add attack specification for run_all

039666a

marcorosa requested a review from a team as a code owner September 29, 2025 14:16

github-actions bot reviewed Sep 29, 2025

marcorosa merged commit 8dc09ca into develop Sep 29, 2025
6 of 7 checks passed

marcorosa deleted the suite/all branch September 29, 2025 14:26
