GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL
Run causal world experiments on (size, mass) settings.
You may also modify cem_planner_cw.py and cem_planner_cw_baseline.py to
test more pairs.
- run
cem_planner_cw.pyto save OOD detection with causal curiosity results under./pickle/CW/SizeMass - run
cem_planner_cw_baseline.pyto save OOD detection baseline results under./pickle/CW/SizeMass/SA - run
cem_planner_cw_res.pyto show results for 1 and 2 in matrix form, which are shown as heatmaps in the paper
A sample workflow to run all code for a given experiment is shown in run_all_mujoco.sh.
You may modify this scipt to suit your needs.
To see a list of avaliable options for any script, run python [script_name].py --help.
The workflow consists of 5 parts.
- Run GalilAI method
- run the
plan_mujoco.pyscript with desired flags.
- run the
- Collect additional (state, action) pairs for baseline experiments.
- run
plan_mujoco.pyagain, now with--baseline_only Trueand making sure you specify the same output path as before.
- run
- Train PNNs to get mean & covariance vectors for baselines.
- run
train_pnn.py.
- run
- Calculate KL divergence between (unseen, seen) causal pairs.
- run
train_pnn.pyagain with same flags, adding--compare_only true
- run
- Plot results!
- for this, please see
plot.pyand modify the directory names accordingly.
- for this, please see
If you would like to cite our paper,
Sumedh A Sontakke*, Stephen Iota*, Zizhao Hu*, Arash Mehrjou, Laurent Itti and Bernhard Schölkopf. GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL. Accepted to The 25th International Conference on Artificial Intelligence and Statistics (AISTATS), 2022. [arXiv].
@article{sontakke2021galilai,
author = {Sumedh A Sontakke and Stephen Iota and Zizhao Hu and Arash Mehrjou and Laurent Itti and Bernhard Schölkopf},
title = {GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL},
year = {2021},
eprint = {arXiv:2110.15489},
url = {https://arxiv.org/abs/2110.15489},
note = {Accepted to \textit{The 25th International Conference on Artificial Intelligence and Statistics} (AISTATS), 2022}
}