
Hyperparameter optimization #264

@lars-reimann

Description

Is your feature request related to a problem?

Finding appropriate values for hyperparameters by hand is tedious. There should be automation that tries different combinations of values.

Desired solution

  1. For every hyperparameter of type T of a model, it should also be possible to pass a Choice[T] (see feat: add Choice class for possible values of hyperparameter #325). Example:
# Before
class KNearestNeighbors(Classifier):
  def __init__(self, number_of_neighbors: int) -> None:
    ...

# After
class KNearestNeighbors(Classifier):
  def __init__(self, number_of_neighbors: int | Choice[int]) -> None:
    ...

# Usage
KNearestNeighbors(number_of_neighbors=Choice(1, 10, 100))
  2. Adjust the getters (Getters for hyperparameters of models #260) accordingly.
  3. When a user tries to call fit on a model that contains a Choice at any level (choices can be nested), raise an exception. Also point to the correct method (see 4. and 5.).
  4. Add a new method fit_by_exhaustive_search to Classifier and subclasses with parameter:
    • optimization_metric: The metric to use to find the best model. It should have type ClassifierMetric, which is an enum with one value for each classifier metric we have available:
    class ClassifierMetric(Enum):
        ACCURACY = "accuracy"
        PRECISION = "precision"
        RECALL = "recall"
        F1_SCORE = "f1_score"
    The parameter should be required.
  5. Add a new method fit_by_exhaustive_search to Regressor and subclasses with parameter:
    • optimization_metric: The metric to use to find the best model. It should have type RegressorMetric, which is an enum with one value for each regressor metric we have available:
    class RegressorMetric(Enum):
        MEAN_SQUARED_ERROR = "mean_squared_error"
        MEAN_ABSOLUTE_ERROR = "mean_absolute_error"
    The parameter should be required.
  6. Both methods should collect the Choices inside the model and its children and, for each possible combination of values, create a model without choices, fit it, and compute the chosen metric on it. They should keep track of the best (fitted) model according to the metric and return it at the end. scikit-learn's GridSearchCV can be useful for this.
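The exhaustive-search step can be sketched as a plain Cartesian product over the collected candidate values. Everything here is a simplified stand-in: `create_model`, `choices`, and the callable `optimization_metric` are illustrative assumptions, not the library's API — the real method would live on Classifier/Regressor and could delegate to scikit-learn's GridSearchCV instead:

```python
from itertools import product


def fit_by_exhaustive_search(create_model, choices, optimization_metric):
    """Return the best fitted model over every combination of Choice values.

    create_model: builds and fits a model from one concrete setting (a dict).
    choices: maps hyperparameter names to their collected candidate values.
    optimization_metric: scores a fitted model; higher is better.
    """
    names = list(choices)
    best_model, best_score = None, float("-inf")
    for values in product(*(choices[n] for n in names)):
        # One concrete model without any choices, fitted for this combination.
        model = create_model(dict(zip(names, values)))
        score = optimization_metric(model)
        if score > best_score:
            best_model, best_score = model, score
    return best_model


# Toy usage: "fitting" just records the setting; the metric prefers 10 neighbors.
best = fit_by_exhaustive_search(
    create_model=lambda setting: setting,
    choices={"number_of_neighbors": [1, 10, 100]},
    optimization_metric=lambda m: -abs(m["number_of_neighbors"] - 10),
)
print(best)  # → {'number_of_neighbors': 10}
```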

Metadata

Labels

released (Included in a release)

Status

✔️ Done