Add 2 PyRIT orchestrators ((Crescendo, PAIR)) and re-strucutre PyRIT code. by samailguliyev · Pull Request #93 · SAP/STARS

samailguliyev · 2025-09-03T16:49:53Z

Summary

This PR adds comprehensive PyRIT orchestrator enhancements including new orchestrator types (Crescendo, PAIR), tools, and CLI adjustements.

Changes Made

Added system prompt file for SelfAskTrueFalseScorer
Added agent instruction for orchestrator type input in agent mode
Added clean_json() method to LLMAdapter
Changed from inheritance to wrapper class approach for orchestrator agnostic functionality
Added one runner function per orchestrator
Added one tool per orchestrator
Added 1 CLI command per orchestrator
Added 1 attack specification case per orchestrator
Added tools to agent

Tested by:

Running individual CLI scripts for each PYRIT attack
Running main.py and running vulnerability scan
Running main.py and asking to run pyrit separately and passing inputs

2. Change inheritance based approach (InstrumentedRedTeamingOrchestrator) to wrapper class approach for orchestrator agnostic functionality 3. Minor code adjustments to incorporate 2 new orchestrator types to pyrit.

1. start_pyrit_attack_red_teaming() 2. start_pyrit_attack_crescendo() 3. start_pyrit_attack_pair() Delegate orchestrator agnostic PyRIT logic to start_pyrit_attack()

1. run_pyrit_red_teaming 2. run_pyrit_screscendo 3. run_pyrit_pair

… develop

marcorosa

Some notes while I do the code review, in no particular order:

Missing note (in /backend-agent/data/pyrit) to explain to the agent how to use these attacks (the current one is still the old one, and does not explain to the agent that there are 3 sub-attacks in case the users ask for "pyrit".
do not mix ' and " in your strings. Be consistent (and better use ' for strings in code). For reference, this random guy on the interned explains it very well: link
Chose wisely the names of the attack/attack specification. Indeed, consider the name that will be written in the db, and think in advance of any issue it may cause using a name too long or containing _ or - or with spaces inside. Will users write this correctly?
Please, re-run a python linter because I am not fully convinced it worked correctly

marcorosa · 2025-09-04T09:50:39Z

I confirm the linter action did not work:

Error: This action does not have permission to create annotations on forks. You may want to run it only on `pull_request_target` events with checks permissions set to write. See https://docs.github.com/en/actions/learn-github-actions/workflow-syntax-for-github-actions#permissions for details.

 Flake8 found 82 errors (failure)

So, you have 82 linter violations to fix 😄

marcorosa · 2025-09-04T10:21:36Z

If you fetch the upstream develop branch, I may have fixed the linter setup and it would run automatically

get latest linter changes

samailguliyev · 2025-09-14T21:42:00Z

I confirm the linter action did not work:

Error: This action does not have permission to create annotations on forks. You may want to run it only on `pull_request_target` events with checks permissions set to write. See https://docs.github.com/en/actions/learn-github-actions/workflow-syntax-for-github-actions#permissions for details.

 Flake8 found 82 errors (failure)

So, you have 82 linter violations to fix 😄

@marcorosa I fixed all your comments and all linter violations locally and fetched upstream develop branch, but backend and frontend actions fail to run for some reason.

samailguliyev · 2025-09-15T08:03:06Z

Some notes while I do the code review, in no particular order:

Missing note (in /backend-agent/data/pyrit) to explain to the agent how to use these attacks (the current one is still the old one, and does not explain to the agent that there are 3 sub-attacks in case the users ask for "pyrit".

do not mix ' and " in your strings. Be consistent (and better use ' for strings in code). For reference, this random guy on the interned explains it very well: link

Chose wisely the names of the attack/attack specification. Indeed, consider the name that will be written in the db, and think in advance of any issue it may cause using a name too long or containing _ or - or with spaces inside. Will users write this correctly?

Please, re-run a python linter because I am not fully convinced it worked correctly

done, github action does not work though.

marcorosa

small adjustments still needed, mostly in the note phrasing

github-actions · 2025-09-15T15:00:36Z

This update introduces a significant overhaul of LLM attack functionalities, specifically enhancing the PyRIT attack framework. It refines how attacks are executed by expanding the variety and specificity of attack types available. The changes aim to make the attack process easier to configure and more versatile, thereby improving the user experience for security professionals working with LLM vulnerability assessments.

Walkthrough

New Feature: Introduced new attack types — "redteaming", "crescendo", and "pair" — to the PyRIT framework, along with corresponding tools and orchestrators.
Refactor: Updated naming conventions and reframed orchestrator usage, providing more modular and maintainable code architecture.
Chore: Expanded attack configuration capabilities and adjusted default attack parameters for better personalization.
Documentation: Enhanced instructions and notes to guide users on specifying attack types and parameters efficiently.
Style: Minor styling adjustments for improved readability and consistency in code formatting.

_{Model: gpt-4o | Prompt Tokens: 8804 | Completion Tokens: 191}

github-actions

Here's a collaborative code review enhanced by AI assistance. These insights offer suggestions and observations that may help improve your work, though they're not absolute truths. You remain the expert on your project's needs and goals. Consider these recommendations as supportive guidance while you make the final decisions that align best with your vision and requirements.

Always critique what AI says. Do not let AI replace YOUR I.
_{Model: anthropic--claude-4-sonnet | Prompt Tokens: 14851 | Completion Tokens: 2533}

marcorosa

File true_false_system_prompt.yaml is copied in both backend-agent/data and backend-agent/libs/data. It should appear only in the latter.

samailguliyev · 2025-09-18T15:08:54Z

File true_false_system_prompt.yaml is copied in both backend-agent/data and backend-agent/libs/data. It should appear only in the latter.

done

samailguliyev added 14 commits August 27, 2025 08:27

Add 'orchestrator_type' argument to pyrit CLI command

2571c58

Add orchestrator_type input variable to run_pyrit() method

e5da6fd

Add the system prompt that is used for SelfAskTrueFalseScorer to a file

7793def

Add agent instruction to ask for orchestrator_type input in agent mode

e5f74ab

1. Add clean_json() method to LLMAdapter

9c9e6c5

2. Change inheritance based approach (InstrumentedRedTeamingOrchestrator) to wrapper class approach for orchestrator agnostic functionality 3. Minor code adjustments to incorporate 2 new orchestrator types to pyrit.

Add one runner function per orchestrator, namely:

30a5271

1. start_pyrit_attack_red_teaming() 2. start_pyrit_attack_crescendo() 3. start_pyrit_attack_pair() Delegate orchestrator agnostic PyRIT logic to start_pyrit_attack()

Add a tool per orchestrator, namely:

eb592b9

1. run_pyrit_red_teaming 2. run_pyrit_screscendo 3. run_pyrit_pair

monir fixes

aa615b1

Add 1 CLI command per orchestrator

8a456b9

Add 1 attack specification case per orchestator

033f102

Add tools to agent

f2816df

Merge branch 'develop' of https://github.com/samailguliyev/STARS into…

4a99961

… develop

Change the file to show how to use parameters for orchestrators

00da0d6

Fix inputs to adapt to new input structure

e6ff234

samailguliyev requested a review from a team as a code owner September 3, 2025 16:49

marcorosa requested changes Sep 4, 2025

View reviewed changes

samailguliyev added 12 commits September 4, 2025 14:08

Rename "Args:" to "@params"

2d71151

Remove start_pyrit_attack

af1a79e

Delete unnecessary comment

f5213c3

Merge remote-tracking branch 'origin/develop' into develop

f2af4d9

get latest linter changes

Retain only 1 agent tool for PyRIT

2f12505

Rename attacks to be lowercase and no special characters

66e7d39

Rename CLI commands

9d2579f

Add boilerplate prompt to bypass GPT content filter

83707fe

Keep 1 agent tool for PyRIT attack

ff555af

Add a list of attacksfor PyRIT as in Garak implementation

165a828

Update PyRIT notes , inspired by Garak notes

a17e97d

Show usage of PyRIT attacks

2a76e92

samailguliyev added 3 commits September 12, 2025 20:23

Rename attacks

e15e8ea

Fix flake8 linter errors

142b897

Make the quote usage consistent and fix linter errors

fa2b1aa

Merge branch 'SAP:develop' into develop

1ed4ed6

samailguliyev and others added 2 commits September 15, 2025 13:59

Revert changes in main.py

424dd60

Merge branch 'SAP:develop' into develop

9bb2171

marcorosa requested changes Sep 15, 2025

View reviewed changes

Merge branch 'SAP:develop' into develop

c5d7baf

samailguliyev marked this pull request as draft September 15, 2025 14:59

samailguliyev marked this pull request as ready for review September 15, 2025 15:00

github-actions bot reviewed Sep 15, 2025

View reviewed changes

samailguliyev added 8 commits September 16, 2025 15:47

Fix inconsistent parameter naming. Make all snakecase

d3bdf48

Refactor run_pyrit_attack , improve parameter handling

b80db97

Fix misspelling

6d80389

Minor fix, improve code readability and naming consistency

a1a8955

Move file to proper location

d5da1dc

Improve agent instructions

132cd7a

Remove unexpected parameter.

a112b22

Fix linter warnings

cfb1d65

samailguliyev requested a review from marcorosa September 18, 2025 13:34

marcorosa requested changes Sep 18, 2025

View reviewed changes

Delete file copy from wring directory

88aa35b

marcorosa approved these changes Sep 18, 2025

View reviewed changes

marcorosa merged commit 6b95ec8 into SAP:develop Sep 18, 2025
3 of 5 checks passed

Conversation

samailguliyev commented Sep 3, 2025

Summary

Changes Made

Tested by:

Uh oh!

marcorosa left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

marcorosa commented Sep 4, 2025

Uh oh!

marcorosa commented Sep 4, 2025

Uh oh!

samailguliyev commented Sep 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

samailguliyev commented Sep 15, 2025

Uh oh!

marcorosa left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Sep 15, 2025

Walkthrough

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

marcorosa left a comment

Choose a reason for hiding this comment

Uh oh!

samailguliyev commented Sep 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

samailguliyev commented Sep 14, 2025 •

edited

Loading