This file explains the typical flow of data through the InsightStack toolkit.
-
Raw Data Entry
- Source: XLSForms, field surveys, or CSV files
- Tools: survey_to_codebook/, data_validation/
-
Data Validation
- Scripts flag:
  - Missing values
  - Duplicates
  - Out-of-range values
- Folder: data_validation/
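
The three checks listed above can be sketched in a few lines of standard-library Python. This is an illustration only, not the toolkit's own scripts: the function name `flag_rows`, the sample rows, the `id` key, and the 0 to 120 age range are all assumptions made for the example.

```python
from collections import Counter

# Hypothetical survey rows; column names and values are assumptions.
rows = [
    {"id": "001", "age": "34", "district": "North"},
    {"id": "002", "age": "", "district": "South"},
    {"id": "002", "age": "", "district": "South"},
    {"id": "003", "age": "212", "district": "East"},
]


def flag_rows(rows, key="id", ranges=None):
    """Return (row_index, issue) tuples for missing, duplicate, and out-of-range values."""
    ranges = ranges or {}
    issues = []
    key_counts = Counter(row[key] for row in rows)
    for i, row in enumerate(rows):
        # Missing values: empty or whitespace-only cells.
        for col, value in row.items():
            if value is None or str(value).strip() == "":
                issues.append((i, f"missing:{col}"))
        # Duplicates: the key appears on more than one row.
        if key_counts[row[key]] > 1:
            issues.append((i, f"duplicate:{key}"))
        # Out-of-range values: numeric cells outside the allowed bounds.
        for col, (low, high) in ranges.items():
            value = str(row.get(col, "")).strip()
            if value == "":
                continue  # empty cells are left to the missing-value check
            try:
                number = float(value)
            except ValueError:
                issues.append((i, f"not_numeric:{col}"))
                continue
            if not low <= number <= high:
                issues.append((i, f"out_of_range:{col}"))
    return issues


issues = flag_rows(rows, key="id", ranges={"age": (0, 120)})
for index, issue in issues:
    print(index, issue)
```

The function only reports problems and leaves the rows unchanged, so the decision to drop, correct, or keep a flagged record stays with the analyst.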
-
Variable Labeling
- Apply readable labels from a dictionary
- Folder: label_variables/ (compatible across R, Python, and Stata)
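
As a rough illustration of dictionary-based labeling, the sketch below stores labels as JSON and applies them to a list of column names. The variable names, the labels, and the `label_columns` helper are hypothetical. The JSON format is also an assumption, chosen because a plain-text dictionary is one way to share the same labels across R, Python, and Stata.

```python
import json

# Hypothetical label dictionary (variable name -> readable label), stored as JSON.
labels_json = """
{
    "hh_size": "Household size",
    "q12_inc": "Monthly income (local currency)"
}
"""
labels = json.loads(labels_json)


def label_columns(columns, labels):
    """Map each column name to its label, falling back to the raw name if none exists."""
    return [labels.get(col, col) for col in columns]


columns = ["hh_size", "q12_inc", "resp_id"]
labeled = label_columns(columns, labels)
print(labeled)
```

Falling back to the raw name means an unlabeled variable such as `resp_id` stays visible in the output.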
-
Exploration & Analysis
- Summary stats, regressions, and model testing
- Folder: replication/ (can use Stata, R, or Python)
-
Documentation
- Convert survey design into Markdown codebooks
- Folder: survey_to_codebook/
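
The conversion step might look something like the following. It is a minimal sketch, not the actual contents of survey_to_codebook/. It assumes the survey design is already loaded as a list of rows with the `type`, `name`, and `label` columns used on an XLSForm survey sheet. The sample questions and the `to_markdown_codebook` function are invented for illustration.

```python
# Hypothetical rows, shaped like the type/name/label columns of an XLSForm survey sheet.
survey = [
    {"type": "integer", "name": "age", "label": "Age of respondent"},
    {"type": "select_one yes_no", "name": "employed", "label": "Currently employed?"},
]


def to_markdown_codebook(survey, title="Codebook"):
    """Render survey questions as a Markdown table with one row per variable."""
    lines = [
        f"# {title}",
        "",
        "| Variable | Type | Label |",
        "| --- | --- | --- |",
    ]
    for question in survey:
        # Escape pipes so a label cannot break the table layout.
        cells = [str(question[k]).replace("|", "\\|") for k in ("name", "type", "label")]
        lines.append("| " + " | ".join(cells) + " |")
    return "\n".join(lines)


codebook = to_markdown_codebook(survey)
print(codebook)
```

Reading a real XLSForm workbook would need a spreadsheet library, which is left out here to keep the example self-contained.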
-
Archival & Replication
- Reusable scripts, testable results, and cleaned data
- Folder: replication/, with output storage
Each folder in InsightStack corresponds to one or more of these steps, so users can enter the workflow at any stage and adapt or extend it for their own context.
