Improve model card: Add pipeline tag, library, license, and quick start

#1 opened by nielsr (HF Staff)
Files changed (1)
  1. README.md +51 -4
README.md CHANGED
@@ -1,12 +1,58 @@
1
- ### Model Overview
2
 
3
  This model is a **Query-Driven Trustworthy OS Agent** implemented as described in the paper:
 
 
4
 
5
- **VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents**
6
 
7
- It is initialized with weights from the **Qwen2.5-VL-32B** model.
8
 
9
- ### Citation
10
 
11
  ```bibtex
12
  @article{wu2025verios,
@@ -15,3 +61,4 @@ It is initialized with weights from the **Qwen2.5-VL-32B** model.
15
  journal={arXiv preprint arXiv:2509.07553},
16
  year={2025}
17
  }
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ library_name: transformers
4
+ pipeline_tag: image-text-to-text
5
+ ---
6
+
7
+ # VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents
8
 
9
  This model is a **Query-Driven Trustworthy OS Agent** implemented as described in the paper:
10
+ [VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents](https://arxiv.org/abs/2509.07553).
11
+ It is initialized with weights from the **Qwen2.5-VL-32B** model.
12
 
13
+ ## Quick Start
14
 
15
+ ### 1. Environment Setup
16
+ 1. Clone the repository:
17
+ ```bash
18
+ git clone https://github.com/Wuzheng02/VeriOS
19
+ ```
20
+ 2. Navigate into the project directory:
21
+ ```bash
22
+ cd VeriOS
23
+ ```
24
+ 3. Download the VeriOS-Bench dataset:
25
+
26
+ [https://huggingface.co/datasets/wuuuuuz/VeriOS-Bench](https://huggingface.co/datasets/wuuuuuz/VeriOS-Bench)
27
+
28
+ 4. Download the pre-trained models:
29
+
30
+ VeriOS-Agent-7B: [https://huggingface.co/wuuuuuz/VeriOS-Agent-7B](https://huggingface.co/wuuuuuz/VeriOS-Agent-7B)
31
+
32
+ VeriOS-Agent-32B: [https://huggingface.co/wuuuuuz/VeriOS-Agent-32B](https://huggingface.co/wuuuuuz/VeriOS-Agent-32B)
33
+
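Steps 3 and 4 of the setup can also be scripted. The following is a minimal Python sketch and is not part of the VeriOS repository. It assumes the `huggingface_hub` package is installed and relies on its `snapshot_download` function. The `downloads/` folder layout and the helper names `local_dir_for` and `download_all` are illustrative assumptions; the repository ids are copied from the links above.

```python
from typing import Dict, List

# Repository ids copied from the links in the setup steps.
REPOS: List[Dict[str, str]] = [
    {"repo_id": "wuuuuuz/VeriOS-Bench", "repo_type": "dataset"},
    {"repo_id": "wuuuuuz/VeriOS-Agent-7B", "repo_type": "model"},
    {"repo_id": "wuuuuuz/VeriOS-Agent-32B", "repo_type": "model"},
]


def local_dir_for(repo_id: str, root: str = "downloads") -> str:
    """Map a Hub repo id to a local folder (this layout is an assumption)."""
    return f"{root}/{repo_id.split('/')[-1]}"


def download_all(root: str = "downloads") -> List[str]:
    """Download every repo in REPOS and return the local paths.

    Needs network access and `pip install huggingface_hub`.
    """
    # Imported lazily so the helpers above work without the package installed.
    from huggingface_hub import snapshot_download

    paths = []
    for repo in REPOS:
        paths.append(
            snapshot_download(
                repo_id=repo["repo_id"],
                repo_type=repo["repo_type"],
                local_dir=local_dir_for(repo["repo_id"], root),
            )
        )
    return paths
```

Calling `download_all()` fetches all three repositories, so it needs network access and enough disk space for the 32B checkpoint. Trimming `REPOS` to the entries you need is likely the more practical choice.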
34
+ ### 2. Evaluation
35
+ 1. Evaluate VeriOS-Agent performance:
36
+ ```bash
37
+ python test_interaction_loop.py --model_path /path/to/VeriOS-Agent --json_path /path/to/test.json
38
+ ```
39
+ 2. Evaluate dual-agent system performance:
40
+ ```bash
41
+ python dual_agent.py --model_path1 /path/to/scenarioagent --model_path2 /path/to/actionagent --json_path /path/to/test.json
42
+ ```
43
+ 3. Evaluate other baselines:
44
+ ```bash
45
+ python test_loop_{name}.py --model_path /path/to/agent --json_path /path/to/test.json
46
+ ```
47
+
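The three evaluation commands share one shape: a script name, one or more model-path flags, and `--json_path`. As a hedged sketch, the argument lists can be assembled in Python and handed to `subprocess.run`. The helpers `eval_command` and `baseline_script` are hypothetical and not part of the repository, and the baseline name `example` is a placeholder, since the valid values of `{name}` are not listed here.

```python
import shlex
from typing import List


def eval_command(script: str, json_path: str, **model_paths: str) -> List[str]:
    """Build the argument list for one evaluation script.

    Each keyword argument becomes a `--<flag> <path>` pair, in the order given.
    """
    cmd = ["python", script]
    for flag, path in model_paths.items():
        cmd += [f"--{flag}", path]
    cmd += ["--json_path", json_path]
    return cmd


def baseline_script(name: str) -> str:
    """Fill the `{name}` placeholder in `test_loop_{name}.py`."""
    return f"test_loop_{name}.py"


if __name__ == "__main__":
    commands = [
        eval_command("test_interaction_loop.py", "/path/to/test.json",
                     model_path="/path/to/VeriOS-Agent"),
        eval_command("dual_agent.py", "/path/to/test.json",
                     model_path1="/path/to/scenarioagent",
                     model_path2="/path/to/actionagent"),
        # "example" is a placeholder baseline name, not a real script.
        eval_command(baseline_script("example"), "/path/to/test.json",
                     model_path="/path/to/agent"),
    ]
    for cmd in commands:
        # Print instead of running; pass cmd to subprocess.run to execute it.
        print(shlex.join(cmd))
```

Building the commands as lists rather than strings avoids shell-quoting problems when a path contains spaces.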
48
+ ### 3. Training
49
+ This work is based on full fine-tuning of LLMs using [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory). We gratefully acknowledge the support from the LLaMA-Factory project.
50
+
51
+ To reproduce the training process of VeriOS-Agent from scratch:
52
+ 1. Replace the `.yaml` files in the LLaMA-Factory repository with those provided in this repository.
53
+ 2. Follow the official training tutorials provided in the [LLaMA-Factory repository](https://github.com/hiyouga/LLaMA-Factory).
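Step 1 of the training instructions amounts to copying configuration files from one checkout into another. The sketch below shows one way to do that in Python. It is an assumption-laden illustration: the README does not say where the `.yaml` files live in either repository, so both directories are explicit parameters, and the helper `copy_yaml_configs` is hypothetical. It copies every `.yaml` file under the source directory to the same relative path under the destination, overwriting files that already exist there.

```python
import shutil
from pathlib import Path
from typing import List


def copy_yaml_configs(src_dir: str, dst_dir: str) -> List[Path]:
    """Copy all *.yaml files under src_dir into dst_dir, keeping relative paths.

    Existing files at the destination are overwritten. Returns the copied paths.
    """
    src, dst = Path(src_dir), Path(dst_dir)
    copied = []
    for yaml_file in sorted(src.rglob("*.yaml")):
        target = dst / yaml_file.relative_to(src)
        # Create intermediate folders so nested configs keep their layout.
        target.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(yaml_file, target)
        copied.append(target)
    return copied
```

Check the returned list against the LLaMA-Factory tutorial you are following before launching training, because the directory the trainer reads its configs from depends on the LLaMA-Factory version.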
54
 
55
+ ## Citation
56
 
57
  ```bibtex
58
  @article{wu2025verios,
 
61
  journal={arXiv preprint arXiv:2509.07553},
62
  year={2025}
63
  }
64
+ ```