fixes for test #5

drammock · 2024-08-10T22:42:55Z

don't merge yet. will add some comments on monday. Thought it would be more helpful/instructive to open as PR-into-your-PR rather than just pushing to your branch.

drammock · 2024-08-12T13:58:47Z

mne/stats/cluster_level.py

+    obj_type = all_types.pop()
+    is_epo = GetEpochsMixin in obj_type.__mro__
+    is_tfr = BaseTFR in obj_type.__mro__
+    is_arr = np.ndarray in obj_type.__mro__
+    return is_epo, is_tfr, is_arr


here, in the _validate_cluster_df helper function, it turned out to be more helpful to return some boolean variables rather than return the object type itself. It would have been possible to leave this as-is, but then these checks against __mro__ would have needed to happen a few times in a few different places, so I thought it was simpler to just do it once here.

side note: FYI, obj.__mro__ stands for "method resolution order" --- a list of parent classes and the order in which they're checked when looking up an object's attributes or methods. For example: EpochsTFR defines a few methods (average, drop, ...) but doesn't define e.g. crop or get_data. So when you do my_epochs_tfr_obj.get_data(), python checks the __mro__ of the object, and goes through that list in order, looking at each parent class until it finds one that defines a get_data method, then uses the first one it finds. In this case, BaseTFR is next on the MRO list, and it does define get_data, so that's the method definition that gets used.

It turns out that EpochsTFR and EpochsTFRArray don't inherit from BaseEpochs, so isinstance(BaseEpochs) wasn't catching the EpochsTFR cases. But they do inherit from GetEpochsMixin (and so do time-domain Epochs and EpochsArray) so checking against __mro__ is a good way to capture both Epochs and EpochsTFR.

drammock · 2024-08-12T14:10:23Z

mne/stats/cluster_level.py

-    def _extract_data_array(series):
+    outer_func = np.concatenate if is_epo else np.array
+    axes = (-3, -1) if is_tfr else (-2, -1)
+
+    def func_arr(series):
        return np.concatenate(series.values)

-    def _extract_data_mne(series):  # 2D data
-        return np.array(
-            series.map(lambda inst: inst.get_data().swapaxes(-2, -1)).to_list()
+    def func_mne(series):
+        return outer_func(
+            series.map(lambda inst: inst.get_data().swapaxes(*axes)).to_list()
        )

-    def _extract_data_tfr(series):
-        return series.map(lambda inst: inst.get_data().swapaxes(-3, -1)).to_list()
-
-    if _dtype is np.ndarray:
-        func = _extract_data_array
-    elif _dtype is BaseTFR:
-        func = _extract_data_tfr
-    else:
-        func = _extract_data_mne


[REFACTORED] I realized I was going to need to write a fourth extraction function (in addition to _extract_data_array, _extract_data_tfr, and _extract_data_mne) to handle Epochs and EpochsTFRs. Looking at the mne and tfr ones we'd already written, I noticed the TFR one was wrong (needed to be wrapped in an np.array), which made it clear how similar those two were --- only different in the swapaxes arguments --- and the new func for Epochs was only going to differ in the outer wrapper function (np.concatenate instead of np.array).

So now we can have two variables: outer_func and axes and generate the extraction function we need based on whether is_epo and is_tfr.

see my comment above, I am unsure if we properly handle the case of EpochsTFR (swapping epochs) and I think a quick dimension check might do the job.

I think the test confirms that we are in fact doing the right thing:

n_epo, n_chan, n_freq, n_times = 6, 3, 4, 5

since each dimension has a different size, I think that if we'd done something wrong then the test would fail.

drammock · 2024-08-12T14:10:42Z

mne/stats/cluster_level.py

        cmap_evokeds: None | str | tuple = None,
        cmap_topo: None | str | tuple = None,
-        ci: float | bool | callable() | None = None,
+        ci: float | bool | callable | None = None,


unrelated parameter typing fix

drammock · 2024-08-12T14:18:49Z

mne/stats/cluster_level.py

+from ..epochs import BaseEpochs, EvokedArray
+from ..evoked import Evoked


Some parts of the MNE API can be imported multiple places. For user scripts, it's OK to do from mne import Evoked, but within the MNE package, we should always do from mne.evoked import Evoked (or here, from ..evoked import Evoked, which is the equivalent relative import) because it ensures that we avoid any circular import problems. the way to check for this is pytest mne/tests/test_import_nesting.py

drammock · 2024-08-12T14:21:26Z