VeriOS

Research code for the paper "VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents".

Paper link: https://arxiv.org/abs/2509.07553

🚀 Quick Start

1. Environment Setup

Clone the repository:

git clone https://github.com/Wuzheng02/VeriOS

Navigate into the project directory:
```
cd VeriOS
```
Download the VeriOS-Bench dataset:

https://huggingface.co/datasets/wuuuuuz/VeriOS-Bench
Download the pre-trained models:

VeriOS-Agent-7B: https://huggingface.co/wuuuuuz/VeriOS-Agent-7B

VeriOS-Agent-32B: https://huggingface.co/wuuuuuz/VeriOS-Agent-32B

2. Evaluation

Evaluate VeriOS-Agent performance:

python test_interaction_loop.py --model_path /path/to/VeriOS-Agent --json_path /path/to/test.json

Evaluate dual-agent system performance:

python dual_agent.py --model_path1 /path/to/scenarioagent --model_path2 /path/to/actionagent --json_path /path/to/test.json

Evaluate other baselines:

python test_loop_{name}.py --model_path /path/to/agent --json_path /path/to/test.json

3. Training

This work is based on full fine-tuning of LLMs using LLaMA-Factory. We gratefully acknowledge the support from the LLaMA-Factory project.

To reproduce the training process of VeriOS-Agent from scratch:

Replace the .yaml files in the LLaMA-Factory repository with those provided in this repository.
Follow the official training tutorials provided in the LLaMA-Factory repository.

📋 Citation

@article{wu2025verios,
  title={VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents},
  author={Zheng Wu and Heyuan Huang and Xingyu Lou and Xiangmou Qu and Pengzhou Cheng and Zongru Wu and Weiwen Liu and Weinan Zhang and Jun Wang and Zhaoxiang Wang and Zhuosheng Zhang},
  journal={arXiv preprint arXiv:2509.07553},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
README.md		README.md
VeriOS-Agent-32B.yaml		VeriOS-Agent-32B.yaml
VeriOS-Agent-7B.yaml		VeriOS-Agent-7B.yaml
dual_agent.py		dual_agent.py
get_answer.py		get_answer.py
test_interaction_loop.py		test_interaction_loop.py
test_loop_atlas.py		test_loop_atlas.py
test_loop_qwen25.py		test_loop_qwen25.py
test_loop_tars.py		test_loop_tars.py
test_loop_tars15.py		test_loop_tars15.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VeriOS

🚀 Quick Start

1. Environment Setup

2. Evaluation

3. Training

📋 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

VeriOS

🚀 Quick Start

1. Environment Setup

2. Evaluation

3. Training

📋 Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages