Skip to content

runjchen/speech_mentalmanip

Repository files navigation

Detecting Mental Manipulation in Speech via Synthetic Multi-Speaker Dialogue

Accepted at IWSDS 2026

TL;DR

We extend the text-only mental manipulation benchmark into speech by rendering each dialogue as multi-speaker, voice-consistent TTS. This enables 1:1 comparisons between text and audio. Models and humans both struggle more on audio, highlighting modality-specific ambiguity and the subjectivity of mental manipulation.

What's in this repo?

speech_mentalmanip/
├─ README.md                                       # You are here
├─ multi_speaker_TTS_audios_generation_scripts/    # TTS by turn
├─ prediction/                                     # Model Evaluations
├─ composed_audios_dataset/                        # Composed audio dataset
│  ├─ text_conversations_mental_manipulative_composed_audios_batch_01.zip
│  ├─ text_conversations_mental_manipulative_composed_audios_batch_02.zip
│  └─ text_conversations_non_mental_manipulative_composed_audios.zip
└─ human_annotations/                              # Human annotation data and results
   ├─ annotations_audio_compiled.csv              # Compiled audio annotations
   ├─ annotations_text_compiled.csv               # Compiled text annotations
   ├─ audio_files/                                 # Audio files used for annotation
   └─ template/                                    # Annotation templates

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors