This is a work in progress and not compatible with openml-python, yet.
If you are interested in testing it, use the following openml-python fork which supports OpenMLActiveClassificationTask and try it out on the openml test server (call openml.config.start_using_configuration_for_example() after importing openml).
- How to differ between extensions? Currently, scikit-learn checks only for the dependencies via
_is_sklearn_flowin the fileopenml.extensions.sklearn.extension.py. - We cannot store the results of the individual active learning cycles. Currently, we use the
user_defined_measuresto store these results (budget, predictions, probabilities over the cycles). As a result, we needed to modify the handling of list via_to_dictin the fileopenml.runs.run.py. Particularly, we replace lists with the value0. Results are not contained in the.xmlfile as description of the run. - The datasplits are again not available for active learning tasks, likely due to a switch of the test openml to another branch.
- Where do we can add outputs/results specific to a certain learning task?
- How are flows compared to each other or matched?
- How to deal with utility scores and subsampling?