In the most recent addition to Bonsai.ML (PR is here: #70), a test was introduced to compare the results of running the Python model against the results of the Bonsai model. While the test works locally, the execution of unit tests in the GitHub workflow is failing with an "Out of disk space" exception on the Linux x64 runner (see here: https://github.com/bonsai-rx/machinelearning/actions/runs/18683815999/job/53271046154#step:9:125). Currently the test downloads the raw data, bootstraps the Python environment, and then runs the Python script to generate the test results. One of the benefits of this approach is that it ensures that both the Python code and Bonsai code are up to date and running correctly. However, the space limitation on the free GitHub runner means it's probably better to just run the Bonsai code and rely on pre-computed data for comparisons.