Detecting Mental Manipulation in Speech via Synthetic Multi-Speaker Dialogue

Accepted at IWSDS 2026

TL;DR

We extend the text-only mental manipulation benchmark into speech by rendering each dialogue as multi-speaker, voice-consistent TTS. Because every audio clip corresponds to exactly one text dialogue, text and audio can be compared 1:1. Models and humans both struggle more on audio than on text, highlighting modality-specific ambiguity and the subjectivity of mental manipulation.
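
As an illustration of what "voice-consistent" means in practice, here is a minimal sketch of the planning step: each speaker is mapped to one voice the first time they appear and keeps it for every later turn. It is not the code in this repo. The `Speaker: utterance` line format, the voice identifiers, and the function name `plan_turns` are all assumptions made for the example, and the actual TTS call is left out.

```python
import re
from itertools import cycle

# Hypothetical voice identifiers; real names depend on the TTS engine used.
VOICE_POOL = ["voice_a", "voice_b", "voice_c"]

# Assumed turn format: "<speaker label>: <utterance>", one turn per line.
TURN_PATTERN = re.compile(r"^\s*([^:]+?)\s*:\s*(.+)$")


def plan_turns(dialogue, voice_pool=VOICE_POOL):
    """Return one (speaker, voice, utterance) tuple per dialogue turn.

    A speaker gets a voice on first appearance and keeps it afterwards.
    If there are more speakers than voices, voices are reused in order.
    """
    voices = cycle(voice_pool)
    speaker_to_voice = {}
    plan = []
    for line in dialogue.splitlines():
        match = TURN_PATTERN.match(line)
        if match is None:
            continue  # skip blank lines and lines without a speaker label
        speaker, utterance = match.groups()
        if speaker not in speaker_to_voice:
            speaker_to_voice[speaker] = next(voices)
        plan.append((speaker, speaker_to_voice[speaker], utterance.strip()))
    return plan


if __name__ == "__main__":
    example = "Person1: You never listen.\nPerson2: That is not true.\nPerson1: See?"
    for speaker, voice, utterance in plan_turns(example):
        print(f"{speaker} [{voice}]: {utterance}")
```

Each tuple in the plan could then be passed to a TTS engine turn by turn, with the resulting clips concatenated in order to form one composed audio file per dialogue.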

What's in this repo?

speech_mentalmanip/
├─ README.md                                       # You are here
├─ multi_speaker_TTS_audios_generation_scripts/    # Scripts that generate TTS audio turn by turn
├─ prediction/                                     # Model evaluations
├─ composed_audios_dataset/                        # Composed audio dataset
│  ├─ text_conversations_mental_manipulative_composed_audios_batch_01.zip
│  ├─ text_conversations_mental_manipulative_composed_audios_batch_02.zip
│  └─ text_conversations_non_mental_manipulative_composed_audios.zip
└─ human_annotations/                              # Human annotation data and results
   ├─ annotations_audio_compiled.csv              # Compiled audio annotations
   ├─ annotations_text_compiled.csv               # Compiled text annotations
   ├─ audio_files/                                 # Audio files used for annotation
   └─ template/                                    # Annotation templates
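
The composed audio ships as zip archives, so a first step for most users is unpacking them. The following is a minimal sketch using only the Python standard library. The helper names `extract_archives` and `label_from_archive` are hypothetical, and reading the class label off the archive file name is an assumption based on the names listed above, not documented behavior of this repo.

```python
import zipfile
from pathlib import Path


def label_from_archive(stem):
    """Guess the class label from an archive name (an assumption, see lead-in)."""
    # Check the "non_" variant first: it contains the other name as a substring.
    if "non_mental_manipulative" in stem:
        return "non_manipulative"
    if "mental_manipulative" in stem:
        return "manipulative"
    return "unknown"


def extract_archives(dataset_dir, out_dir):
    """Extract every .zip in dataset_dir into out_dir/<archive name>/.

    Returns a dict mapping each archive name (without .zip) to the sorted
    list of file paths it contained.
    """
    dataset_dir, out_dir = Path(dataset_dir), Path(out_dir)
    extracted = {}
    for archive in sorted(dataset_dir.glob("*.zip")):
        target = out_dir / archive.stem
        target.mkdir(parents=True, exist_ok=True)
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(target)
            # Directory entries end with "/"; keep only real files.
            extracted[archive.stem] = sorted(
                name for name in zf.namelist() if not name.endswith("/")
            )
    return extracted


if __name__ == "__main__":
    contents = extract_archives("composed_audios_dataset", "extracted_audio")
    for name, files in contents.items():
        print(f"{name}: {len(files)} files, label={label_from_archive(name)}")
```

Extracting each archive into its own subfolder keeps the two manipulative batches and the non-manipulative set separate, which makes it easy to attach labels later without relying on the file names inside the archives.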
