This repository host the following materials for the ACM CCS 2023 accepted paper: "PolicyChecker: Analyzing the GDPR Completeness of Mobile Apps’ Privacy Policies" .
-
Code of the PolicyChecker framework
-
Download and extract NLP models to
\models. -
Download and extract meta data files to
\meta_data. Note: You need to create two addtional meta files in order to perform controller identity detection:policy-to-app_title_mapping.pickle and policy-to-deverloperID_mapping.pickleExample:('policy_id : developer_id') -
Run the code
# python = 3.8
pip install -r requirements.txt
python main.py
parameters:
-path to privacy policy
-path to output folder
-path to the folder storing metadata files
-path to the folder storing NLP models
-CUDA ID to run on the GPU (-1 to run on the CPU)
Example:
python main.py /path/to/input_folder/policy.txt /path/to/output_folder/ meta_data/ models/ 0
Anhao Xiang, Weiping Pei, and Chuan Yue. 2023. PolicyChecker: Analyzing the GDPR Completeness of Mobile Apps’ Privacy Policies. In Proceedings of the 2023 ACM SIGSAC Conference on Computer and Communications Security (CCS ’23), November 26–30, 2023, Copenhagen, Denmark. ACM, New York, NY, USA, 15 pages. https://doi.org/10.1145/3576915.3623067
This repo uses NER model published by PurPliance . The provided Android app dataset is obtained from the Androzoo collections:
Kevin Allix, Tegawendé F Bissyandé, Jacques Klein, and Yves Le Traon. 2016. Androzoo: Collecting millions of android apps for the research community. In Proceedings of the International Conference on Mining Software Repositories (MSR)
PolicyChecker is licensed under a Creative Commons Attribution International 4.0 License.