WhyFlow: Interrogative Debugger for Taint Analysis

Overview

WhyFlow is an interrogative debugging tool for taint analysis that enables developers to ask why, why-not, and what-if questions about dataflows. This artifact accompanies our ICSE 2026 paper: "WhyFlow: Interrogative Debugger for Sensemaking Taint Analysis".

WhyFlow addresses the challenge of making sense of taint analysis results by providing:

Interrogative Debugging: Ask questions about the existence or absence of specific dataflows
Speculative Analysis: Explore the impact of different third-party library models and configurations
Visual Sensemaking: Graph-based visualization with color-coded annotations for global connectivity reasoning
Interactive Q&A Interface: Template-based queries with contextualized selections for sources, sinks, and APIs
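
To make the three question types concrete, here is a minimal sketch. It is not WhyFlow's implementation, which builds on CodeQL and Souffle Datalog. It assumes a taint graph stored as a plain adjacency dictionary; the node names and the helpers `why`, `why_not`, and `what_if` are hypothetical and exist only for illustration.

```python
from collections import deque

# Hypothetical taint graph: each key flows into the nodes in its list.
# "lib.decode" stands for a third-party API with no model, so taint stops there.
EDGES = {
    "source:getParameter": ["parse"],
    "parse": ["lib.decode"],
    "lib.decode": [],
    "render": ["sink:execute"],
}


def why(edges, source, sink):
    """Why: return one shortest path from source to sink, or None if no flow exists."""
    parent = {source: None}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == sink:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        for succ in edges.get(node, []):
            if succ not in parent:
                parent[succ] = node
                queue.append(succ)
    return None


def reachable(edges, start):
    """Return every node reachable from start, including start itself."""
    seen = {start}
    queue = deque([start])
    while queue:
        for succ in edges.get(queue.popleft(), []):
            if succ not in seen:
                seen.add(succ)
                queue.append(succ)
    return seen


def why_not(edges, source, sink):
    """Why-not: list tainted nodes with no outgoing flow, i.e. where taint stops."""
    if why(edges, source, sink) is not None:
        return []  # the flow exists, so there is nothing to explain
    return sorted(n for n in reachable(edges, source) if not edges.get(n))


def what_if(edges, extra_edges, source, sink):
    """What-if: re-ask the why question with speculative edges added to a copy."""
    speculative = {node: list(succs) for node, succs in edges.items()}
    for src, dst in extra_edges:
        speculative.setdefault(src, []).append(dst)
    return why(speculative, source, sink)


src, snk = "source:getParameter", "sink:execute"
print(why(EDGES, src, snk))      # no path: the flow is absent
print(why_not(EDGES, src, snk))  # the unmodeled API where taint stops
print(what_if(EDGES, [("lib.decode", "render")], src, snk))
```

In this toy graph the what-if call models `lib.decode` as propagating taint into `render`, which is the kind of speculative library model the second feature above describes. The original graph is copied rather than mutated, so speculative edges never leak into later queries.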

Key Features

Interactive question-answer debugging interface for taint analysis
Support for why, why-not, and what-if queries about dataflows
Integration with CodeQL and Souffle Datalog for static analysis
Visual graph representation of taint flows with color-coded paths
Efficient handling of large-scale analysis results using MongoDB
User study data and statistical analysis scripts included

Repository Structure

WhyFlow/
├── taint_debug_app/          # Main WhyFlow application
│   ├── taint_debug/          # Meteor web application
│   │   ├── client/           # Frontend UI components
│   │   ├── server/           # Backend API and data loading
│   │   └── imports/          # Shared code and collections
│   ├── analysis_files/       # Analysis data and fact files
│   ├── app_souffle_queries/  # Souffle Datalog query files
│   └── souffle_output/       # Generated query outputs
├── Subject_Prog_CodeQL_Taint/ # Subject program and CodeQL results
│   ├── src/                  # Source code (Apache Dubbo)
│   ├── codeql-custom-queries-java/ # Custom CodeQL queries
│   └── *.json, *.csv         # CodeQL analysis results
├── statistical_tests/        # User study statistical analysis
│   ├── statistical_tests.py  # Python scripts for analysis
│   └── *.csv                 # User study data and results
├── data/                     # User study materials
│   ├── data/                 # Questionnaire responses
│   ├── extension_queries/    # Additional query examples
│   ├── tutorials/            # Tutorial materials
│   └── *.png, *.ipynb        # Plots and analysis notebooks
└── souffle_output/           # Additional Souffle outputs

Prerequisites

Meteor (v2.13 or higher)
Node.js (v14 or higher)
MongoDB (bundled with Meteor; no separate installation needed for local development)
Souffle (optional, for running custom Datalog queries)
CodeQL (optional, for analyzing new programs)

Installation

1. Install Meteor

# macOS/Linux
curl https://install.meteor.com/ | sh

# Windows
# Download installer from https://www.meteor.com/install

2. Clone the Repository

git clone https://github.com/UCLA-SEAL/WhyFlow.git
cd WhyFlow

3. Install Dependencies

# Install root-level dependencies
npm install

# Install WhyFlow app dependencies
cd taint_debug_app/taint_debug
meteor npm install
cd ../..

Running WhyFlow

Start the Application

cd taint_debug_app/taint_debug
meteor run

The application will be available at http://localhost:3000

Environment Variables (Optional)

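Two optional variables customize the configuration: PWD, the root of the WhyFlow checkout, and SOURCE_CODE_ROOT_DIR, the root of the subject program's source code. The shell resets PWD on every cd, so export it after changing into the directory from which you run meteor: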

export PWD=/path/to/WhyFlow
export SOURCE_CODE_ROOT_DIR=/path/to/subject/program

Using WhyFlow

1. Access the Interface: Open http://localhost:3000 in your browser
2. Select Query Type: Choose from templated why, why-not, or what-if questions
3. Contextualize Query: Select specific sources, sinks, and third-party APIs from dropdowns
4. View Results: Explore results in the graph view with color-coded annotations
5. Iterate: Refine queries based on initial results for deeper investigation

Reproducing the User Study

Statistical Analysis

The statistical_tests/ directory contains all user study data and analysis scripts:

cd statistical_tests
python3 statistical_tests.py

This will regenerate the statistical test results reported in the paper.

Data Visualization

Generate plots from the user study data:

cd data
jupyter notebook plots.ipynb

Extending WhyFlow

Adding Custom Queries

Place Souffle Datalog query files in taint_debug_app/app_souffle_queries/

Analyzing New Programs

1. Run CodeQL analysis on your target program
2. Export results in JSON/CSV format
3. Place results in Subject_Prog_CodeQL_Taint/
4. Update paths in the Meteor application configuration
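
Before placing exported results in Subject_Prog_CodeQL_Taint/, it can help to check what they contain. The sketch below assumes results in the header-less CSV layout that the CodeQL CLI writes with `--format=csv` (name, description, severity, message, path, start line, start column, end line, end column). The sample rows and the function `summarize_alerts` are hypothetical, and the exact format WhyFlow's data loader expects is not specified in this README.

```python
import csv
import io
from collections import Counter

# Column order assumed from the CodeQL CLI's CSV output, which has no header row.
COLUMNS = [
    "name", "description", "severity", "message", "path",
    "start_line", "start_col", "end_line", "end_col",
]

# Hypothetical sample rows, made up for illustration only.
SAMPLE = "\n".join([
    '"Tainted flow","User input reaches a sink","error","Flow found","/src/A.java","10","5","10","20"',
    '"Tainted flow","User input reaches a sink","error","Flow found","/src/A.java","31","9","31","27"',
    '"Tainted flow","User input reaches a sink","warning","Flow found","/src/B.java","7","1","7","15"',
])


def summarize_alerts(csv_text):
    """Count alerts per source file in header-less CodeQL-style CSV text."""
    reader = csv.DictReader(io.StringIO(csv_text), fieldnames=COLUMNS)
    return Counter(row["path"] for row in reader)


counts = summarize_alerts(SAMPLE)
for path, n in counts.most_common():
    print(f"{path}: {n} alert(s)")
```

To run it on a real export, read the file's text and pass it to `summarize_alerts` in place of `SAMPLE`.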

Customizing the UI

Modify the Meteor application in taint_debug_app/taint_debug/:

client/ - Frontend React components
server/ - Backend API methods
imports/ - Shared collections and utilities

Data Availability

This repository includes:

✅ User study questionnaire responses
✅ Statistical analysis scripts and results
✅ Subject program (Apache Dubbo) with CodeQL results
✅ Tutorial materials and task descriptions
✅ NASA-TLX and accuracy data

Citation

If you use WhyFlow in your research, please cite our paper:

@inproceedings{yetistiren2026whyflow,
  title={WhyFlow: Interrogative Debugger for Sensemaking Taint Analysis},
  author={Yetiştiren, Burak and Kang, Hong Jin and Kim, Miryung},
  booktitle={Proceedings of the 48th International Conference on Software Engineering},
  year={2026},
  organization={ACM}
}

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contact

For questions or issues, please:

Open an issue on GitHub
Contact: burak@cs.ucla.edu

Acknowledgments

This work is supported by the National Science Foundation under grant numbers 2426162, 2106838, and 2106404, with additional support from Amazon and Samsung.
