Add functionality to export to EEGLAB .set #9192

jackz314 · 2021-03-24T22:44:09Z

Reference issue

N/A

What does this implement/fix?

This adds the ability to export Raw or Epochs instances as EEGLAB set files. As well as the ability to import channel location data from separate CSV files or another file (using read_raw).

Additional information

The exported to set function (save_set) was based on mnelab's write_set. I added channel location export and events/epochs data export so that Epochs exports can be read by EEGLAB.

The channel location expansion function (xyz to all, used for save_set) referenced MATLAB's cart2sph, as well as EEGLAB's relevant functions.

It has been roughly tested on a few data sets on both EEGLAB and MNE.

…a from csv files.

jackz314 · 2021-03-24T23:12:02Z

Seems like the tests are failing due to the lack of docstrings. Should I add them?

mmagnuski · 2021-03-24T23:44:25Z

Welcome on your first mne PR, @jackz314!
Thanks for contributing, beinging able to save files in .set format would be a nice addition (at least from my perspective).

Seems like the tests are failing due to the lack of docstrings. Should I add them?

Yes, we follow the numpy docstring format. Take a look around in the codebase - you will see the pattern. The docstrings are required for all public functions.
Adding a functionality like that also requires writing tests that cover > 90% of the added code.
Also - make sure you follow the PEP8 code style.
You will find more guidelines here.

agramfort · 2021-03-25T07:18:20Z

thx @jackz314 for the work

we should discuss how to approach this. So far we have been reluctant to support writing for something different from .fif as it's very likely that the save to something else will not persist to disk all the informations in the mne instance.

Also you add a lot of public functions to mne and we are support careful about this. Should i had started this PR I would have tried to support writing without any new public function: just use .save method and detect the file extension to know what to do.

hoechenberger · 2021-03-25T07:54:45Z

Nice job, @jackz314!
Like @agramfort said, ideally all functions except for save_set() should be private.

Also I'm wondering if instead of adding save_set(), we should consider adding a parameter to save(), allowing to save in EEGLAB format. Or it could even auto-pick "EEGLAB .set" if the output file extension is .set.
But this really needs a bit of discussion (and, most certainly, much more testing…)

jackz314 · 2021-03-25T08:27:13Z

Thanks guys. I'm currently writing tests on this, but my expertise in both EEGLAB and MNE is limited, so I would probably need help from other people to test this more thoroughly - As I'm writing these tests, I'm already discovering multiple bugs in my code. I agree maybe merging save_set with save would be better.

The only two user-facing functions would probably be save_set and import_eeg_chan_location_from_csv. get_eeglab_full_cords and cart_to_eeglab_full_coords doesn't seem to be useful outside of conversion between eeglab and mne, so I can change them to private.

As for information loss, unfortunately, it seems like there will be some unavoidable info loss, so far from my limited testing, things like re-reference, baseline, and high/low pass freq info are lost when converting between the two. For re-referencing and hi/lo pass, the modified data is preserved, but in EEGLAB there's no way of knowing what kind of re-referencing is done, and when converting back to MNE, there's no way of getting the filtering parameters.

Info lost is definitely not ideal, but in a lot of cases it's tolerable (as long as data is intact and as expected), so I think maybe some warning could be given to users who try to export as set files, that way they are aware of the info lost, but can still benefit from the conversion. I'm currently working with a lab where EEGLAB is heavily used, but they also want to use Python as well, since some things like ML is easier on Python, this is why I wrote these functions, and I think other people in similar situations would benefit from this as well.

hoechenberger · 2021-03-25T08:29:56Z

import_eeg_chan_location_from_csv

This one should probably also be integrated into read_custom_montage()

jackz314 · 2021-03-25T08:39:13Z

import_eeg_chan_location_from_csv

This one should probably also be integrated into read_custom_montage()

I just realized that channel location data in MNE is called Montage, I will integrate it into read_custom_montage(), but it should probably be a separate PR.

hoechenberger · 2021-03-25T08:45:33Z

I will integrate it into read_custom_montage(), but it should probably be a separate PR.

This actually sounds like a good idea!!

…ions.

jackz314 · 2021-03-25T09:16:38Z

I also would like to know where I should put the tests for these functions. Currently, I put the tests for Epochs' save set to tests/test_epochs.py, and I plan on putting the tests for Raw's save set to io/tests/test_raw.py, would this be appropriate? Or should I put both of them into io/eeglab/tests/test_eeglab.py since they are related to EEGLAB?

jackz314 · 2021-03-27T02:17:35Z

I noticed that channel location data is actually stored in the format of -y, x, z instead of x, y, z. Is there a particular reason for this? Additionally, is this a standard that applies to all location data or specific to some formats like EEGLAB set files?

…cation to tests. Add raw save_set test.

hoechenberger · 2021-03-27T06:35:07Z

Hello @jackz314,

I briefly talked with @agramfort about this effort and we had the feeling that adding support for writing to new formats could end up as opening Pandora's box: it would be very challenging to ensure that no data is lost (or even to quantify as to which data is lost, specifically) during an I/O roundtrip.

We felt that maybe this functionality would be better suited in an external package, which could very well be maintained under the MNE umbrella (mne-tools GitHub organization), but could be released separately from MNE, which would make a lot of sense as this would allow to push out bugfixes on an independent schedule.

Another idea I had and which is kind of contradictory to what I said above is adding a new method for exports that possibly lead to data loss. This is essentially what GIMP does: "save/save as" saves to their native format; "export" converts the data, possibly losing information. So I would vote for calling a method that exports to non-FIFF files export().

I'd love to hear your and the other developers' thoughts!

jackz314 · 2021-03-27T07:10:09Z

Personally, I think this feature would be helpful even if there is some data loss, like I mentioned above, I think it would be better to provide warnings to users who use this function (and maybe on relevant docs as well) about potential data loss than to not provide the feature at all. As you said, I think using a separate and distinct name like export instead of save or save_set could further make sure that the user knows what they are doing.

Some data will be lost unfortunately due to some features being implemented very differently in EEGLAB and MNE (e.g., reference channel/method, filtering configuration, etc.). However, since the core data and some other fields like events/trials/epochs/annotations (the events "family") and channel info can mostly be exchanged without data loss, exporting will still be helpful even if other fields might not survive the i/o cycle.

I'm leaning more towards your second idea, because I feel like since other functions that read from EEGLAB are already maintained in the main MNE package, any changes to EEGLAB that requires any updates to MNE's I/O should occur naturally together. I also think that maintaining a separate package just for exporting to EEGLAB seems a bit overkill and shouldn't be necessary, especially since reading from EEGLAB is already in MNE. Still, I'm also fine with starting a new package, if the consensus later agrees that this feature should be in a separate package, I'd be happy to port my code over.

hoechenberger · 2021-03-27T09:16:18Z

Pinging @larsoner, @jasmainak, @adam2392, @sappelhoff, and @cbrnr

agramfort · 2021-03-27T15:16:14Z

i hear you Jack but it's a non-trivial maintenance cost and I anticipate a lot of complaints because some data was lost during IO round trip and user "did not read the warning". it also means a lot of CI time for testing as suddenly we need to duplicate many IO tests to guarantee nothing breaks when people save to .set files.

adam2392 · 2021-03-27T16:32:26Z

Don't we have a "pybv" package for brainvision? Perhaps we have:

Pyedf (I think there's pyedflib but writing was non trivial)
Pyeeglab?

Maybe "raw.export" just calls these underlying packages? Unit testing should primarily be in those sub packages to address @agramfort objection?

the only reason I even use eeglab is cuz their ICA is more advanced then mne. Don't use them for anything else :p. Imo these kinds of write/export functionality will be very attractive for cross-platform usage. Could even have things like:

Pypersyst
Pynihonkohden

? This can be useful for mne-bids too for writing other supported BIDS formats of EEG/IEEG.

agramfort · 2021-03-27T18:03:18Z

+1 for a pyeeglab package now regarding the export I am not convinced

hoechenberger · 2021-03-27T19:13:32Z

@adam2392

Maybe "raw.export" just calls these underlying packages? Unit testing should primarily be in those sub packages to address @agramfort objection?

Yeah that sounds like a possibility. Raw.export() would just be a few lines then, passing self to one of those packages and emitting a warning – that's about it.

the only reason I even use eeglab is cuz their ICA is more advanced then mne.

It's off-topic alright, but could you elaborate? Do you mean the interactive exploration of components..?

adam2392 · 2021-03-27T19:24:45Z

They have this nice feature which plugs into a database of features I think that can automatically estimate ICA labels in case you have 100s of subjects and can't feasibly run manual ICA, they actually have a cool feature that will match it against known labels to try to estimate if it's eye blink, heart beat, etc.

Sorry off topic yeah :p.

hoechenberger · 2021-03-27T19:43:50Z

They have this nice feature which plugs into a database of features I think that can automatically estimate ICA labels in case you have 100s of subjects and can't feasibly run manual ICA, they actually have a cool feature that will match it against known labels to try to estimate if it's eye blink, heart beat, etc.

Sorry off topic yeah :p.

Ah, that one. Yes we should add this to MNE too. Actually shouldn't be too difficult, I believe! We already do have a pattern matching infrastructure.

hoechenberger

Two more nitpicks and then I'm happy 😃

mne/utils/check.py

hoechenberger · 2021-04-18T06:25:32Z

mne/utils/check.py

+
+    if fmt not in supported_formats:
+        raise ValueError(f"Format '{fmt}' is not supported. "
+                         f"Supported formats are {supported_formats}.")


I think we only want to print the keys here, separated by commas, i.e. something like

', '.join(supported_formats.keys()

Well someone could attempt to use file extensions to infer automatically, and if the extension is not supported I think printing out the dictionary might be more helpful compared to just the formats.

I can also print the dictionary in a better way (e.g. eeglab (.set), brainvision (.eeg, .vmrk, ...))

Great idea!

mne/utils/check.py

Co-authored-by: Richard Höchenberger <richard.hoechenberger@gmail.com>

hoechenberger

👌👌👌👌👌👌👌❤️

hoechenberger · 2021-04-18T12:34:40Z

@cbrnr Feel free to merge if you're happy!

Fantastic work, and thank you so much for your endless patience, @jackz314!

mne/tests/test_epochs.py

mne/utils/docs.py

agramfort · 2021-04-18T20:17:04Z

mne/io/base.py

+        Supported formats: EEGLAB (set, uses :mod:`eeglabio`)
+        %(export_warning)s
+        %(export_params_base)s
+        fmt : 'auto' | 'eeglab'


Suggested change

fmt : 'auto' | 'eeglab'

agramfort · 2021-04-18T20:17:27Z

mne/epochs.py

+        Supported formats: EEGLAB (set, uses :mod:`eeglabio`)
+        %(export_warning)s
+        %(export_params_base)s
+        fmt : 'auto' | 'eeglab'


Suggested change

fmt : 'auto' | 'eeglab'

cbrnr · 2021-04-19T05:46:35Z

@hoechenberger can you check what needs to be done for @agramfort's latest comments? I think we're not yet ready.

jackz314 · 2021-04-19T05:55:36Z

@hoechenberger can you check what needs to be done for @agramfort's latest comments? I think we're not yet ready.

Yeah I removed the redundant code but I'm not sure about the docstring part.

larsoner

Code looks good, just minor suggestions really!

However, I don't think we should merge until this is solved, i.e., eeglabio makes a PyPi release:

$ pip install --user eeglabio
ERROR: Could not find a version that satisfies the requirement eeglabio
ERROR: No matching distribution found for eeglabio

and then we add eeglabio to ~~github_actions_dependencies.sh in the same place as nitime~~ requirements_testing_extra.txt. I suspect that none of these lines are really being tested by CIs, and codecov agrees:

codecov/patch — 22.32% of diff hit (target 95.00%)

I know that there is already a pip install -i https://test.pypi.org/simple/ eeglabio in the eeglabio instructions, but I think users will rightly assume they should be able to pip install eeglabio, and we should wait until that's available. Also you might consider setting up a conda-forge recipe for it, maybe @hoechenberger can help with that end?

mne/io/base.py

mne/io/utils.py

mne/utils/check.py

larsoner · 2021-04-22T19:37:02Z

@jackz314 now that #9337 is merged, if you rebase or merge with upstream/main you should be able to add eeglabio to requirements_testing_extra.txt once you make a PyPi release, and it should make CIs actually test the new functionality

jackz314 · 2021-04-22T21:25:11Z

@jackz314 now that #9337 is merged, if you rebase or merge with upstream/main you should be able to add eeglabio to requirements_testing_extra.txt once you make a PyPi release, and it should make CIs actually test the new functionality

I moved eeglabio from Test PyPI to PyPI and added it as an requirement. The tests should now run on CIs as well.

agramfort · 2021-04-23T07:21:08Z

you have some remaining comments from @larsoner to address. thx !

…

jackz314 · 2021-04-23T11:26:34Z

you have some remaining comments from @larsoner to address. thx !
…

For some reason, I didn't see the comments before. I just addressed them.

larsoner · 2021-04-23T19:00:35Z

Thanks for this @jackz314 !

jackz314 added 2 commits March 24, 2021 15:38

Add functionality to export to set files. Import channel location dat…

9ed245d

…a from csv files.

Fix formatting.

7397830

Fix formatting. Remove redundant code. Add docstrings.

57007e1

jackz314 mentioned this pull request Mar 25, 2021

Crashes when using read_epochs_eeglab on files with only one trial. #9193

Closed

jackz314 force-pushed the jack branch from 770f319 to 60ac4d4 Compare March 25, 2021 09:06

Fix ch_names bug. Finish basic tests. Remove unnecessary public funct…

4efb2bb

…ions.

jackz314 force-pushed the jack branch from 60ac4d4 to 4efb2bb Compare March 25, 2021 09:08

jackz314 added 2 commits March 26, 2021 19:48

Fix docstrings & formatting. Fix channel location bug. Add channel lo…

8e570c5

…cation to tests. Add raw save_set test.

Add docstrings to tests.

bc47a78

jackz314 mentioned this pull request Mar 27, 2021

Add support for importing channel location from XYZ/CSV files. #9203

Merged

adam2392 mentioned this pull request Mar 27, 2021

Adding automatic ICA labeling capability ported from EEGLab #9206

Closed

Switch export supported formats to dict. Fix docstrings.

a2052cb

hoechenberger reviewed Apr 18, 2021

View reviewed changes

jackz314 added 2 commits April 18, 2021 00:39

Add docs for _infer_check_export_fmt.

142f42f

Pretty print supported formats.

ad6f4f4

hoechenberger reviewed Apr 18, 2021

View reviewed changes

mne/utils/check.py Outdated Show resolved Hide resolved

Expand supported string code.

c47bfad

Co-authored-by: Richard Höchenberger <richard.hoechenberger@gmail.com>

hoechenberger approved these changes Apr 18, 2021

View reviewed changes

agramfort reviewed Apr 18, 2021

View reviewed changes

Remove redundant check in test_export_set.

745813b

cbrnr added this to the 0.23 milestone Apr 19, 2021

cbrnr changed the title ~~Add functionality to export to set files. Import channel location.~~ Add functionality to export to EEGLAB .set Apr 19, 2021

jackz314 requested a review from larsoner April 22, 2021 00:00

larsoner reviewed Apr 22, 2021

View reviewed changes

mne/io/base.py Show resolved Hide resolved

mne/io/base.py Outdated Show resolved Hide resolved

mne/io/utils.py Outdated Show resolved Hide resolved

mne/utils/check.py Show resolved Hide resolved

larsoner mentioned this pull request Apr 22, 2021

MRG, MAINT: Better split of reqs #9337

Merged

jackz314 added 2 commits April 22, 2021 14:19

Merge branch 'main' of https://github.com/mne-tools/mne-python into jack

807dc27

Add eeglabio requirement.

29c5905

jackz314 added 2 commits April 23, 2021 04:19

Refactor export docdict, imports, function names.

f8ac376

Fix als coords fn name.

f7dcc7c

larsoner merged commit c6b22e8 into mne-tools:main Apr 23, 2021

sappelhoff mentioned this pull request Jul 13, 2021

Are we able to support "writing to EDF"? If so, what will it take? mne-tools/mne-bids#840

Closed

adam2392 mentioned this pull request Jul 13, 2021

Enable exporting EDF files #9566

Closed

jackz314 deleted the jack branch August 2, 2022 23:56

Uh oh!

Add functionality to export to EEGLAB .set #9192

Add functionality to export to EEGLAB .set #9192

Uh oh!

Conversation

jackz314 commented Mar 24, 2021

Reference issue

What does this implement/fix?

Additional information

Uh oh!

jackz314 commented Mar 24, 2021

Uh oh!

mmagnuski commented Mar 24, 2021

Uh oh!

agramfort commented Mar 25, 2021

Uh oh!

hoechenberger commented Mar 25, 2021

Uh oh!

jackz314 commented Mar 25, 2021

Uh oh!

hoechenberger commented Mar 25, 2021

Uh oh!

jackz314 commented Mar 25, 2021

Uh oh!

hoechenberger commented Mar 25, 2021

Uh oh!

jackz314 commented Mar 25, 2021

Uh oh!

jackz314 commented Mar 27, 2021

Uh oh!

hoechenberger commented Mar 27, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jackz314 commented Mar 27, 2021

Uh oh!

hoechenberger commented Mar 27, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

agramfort commented Mar 27, 2021 via email

Uh oh!

adam2392 commented Mar 27, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

agramfort commented Mar 27, 2021 via email

Uh oh!

hoechenberger commented Mar 27, 2021

Uh oh!

adam2392 commented Mar 27, 2021

Uh oh!

hoechenberger commented Mar 27, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hoechenberger left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

hoechenberger Apr 18, 2021

Choose a reason for hiding this comment

Uh oh!

jackz314 Apr 18, 2021

Choose a reason for hiding this comment

Uh oh!

jackz314 Apr 18, 2021

Choose a reason for hiding this comment

Uh oh!

hoechenberger Apr 18, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

hoechenberger left a comment

Choose a reason for hiding this comment

Uh oh!

hoechenberger commented Apr 18, 2021

Uh oh!

Uh oh!

Uh oh!

agramfort Apr 18, 2021

Choose a reason for hiding this comment

Uh oh!

agramfort Apr 18, 2021

hoechenberger commented Mar 27, 2021 •

edited

Loading

hoechenberger commented Mar 27, 2021 •

edited

Loading

adam2392 commented Mar 27, 2021 •

edited

Loading

hoechenberger commented Mar 27, 2021 •

edited

Loading

larsoner left a comment •

edited

Loading