MRG, ENH: Add autobad detection like MF #6940

larsoner · 2019-10-09T19:11:21Z

At the sprint @olafhauk mentioned that not having -autobad support was what he was missing in his Python pipeline. This adds support for it.

larsoner · 2019-10-09T20:47:00Z

https://16211-1301584-gh.circle-artifacts.com/0/dev/auto_tutorials/preprocessing/plot_60_maxwell_filtering_sss.html

drammock · 2019-10-09T21:18:56Z

tutorial LGTM. I didn't really have time for more than a skim of the code changes.

codecov · 2019-10-09T22:26:02Z

Codecov Report

Merging #6940 into master will increase coverage by 0.04%.
The diff coverage is 95.53%.

@@            Coverage Diff             @@
##           master    #6940      +/-   ##
==========================================
+ Coverage   89.92%   89.96%   +0.04%     
==========================================
  Files         453      451       -2     
  Lines       82111    81731     -380     
  Branches    12999    12968      -31     
==========================================
- Hits        73840    73531     -309     
+ Misses       5445     5374      -71     
  Partials     2826     2826

agramfort · 2019-10-10T09:30:13Z

@larsoner did you test on a number of datasets to which point this implementation matches with maxfilter output?

agramfort · 2019-10-10T09:30:26Z

@olafhauk maybe you can give it a try?

larsoner · 2019-10-10T15:17:47Z

I tried it on sample and got the same result. On other files I tested, in general one or two channels can differ but I think it mostly has to do with:

MaxFilter sometimes processing multiple buffers even though it says it processes them one at a time (or somehow treating time differently)
They probably band-pass differently (maybe just a frequency-domain zeroing?)

If I tweak duration and limit a bit I can usually make things match. One or two channel differences for channels near the limit boundary are to be expected I think.

agramfort · 2019-10-12T05:30:55Z

I can test on cam-can data but I would need the bads found by original maxfilter program. @olafhauk @dengemann @SherazKhan do you know where I can find this?

wmvanvliet

Cool PR!

mne/preprocessing/maxwell.py

agramfort · 2019-10-16T08:21:29Z

@wmvanvliet if you have access to original maxfilter code can you test this PR on some of your dataset? thanks

wmvanvliet · 2019-10-16T09:13:49Z

What procedure do you want me to use to check maxwell vs mne output? Here is my first attempt, giving me different bad channels: https://gist.github.com/wmvanvliet/a3944ad425f28e33fba836315f78169d

agramfort · 2019-10-16T09:22:20Z

yes I would like a public quantified evaluation of the differences so we know where we stand

…

wmvanvliet · 2019-10-16T11:21:22Z

Here are some results: https://pastebin.com/KafKgLH3
Output is quite different.

larsoner · 2019-10-16T11:28:13Z

For flats you need to use mne.preprocessing.mark_flat as a first step. Without it the reconstructions will be different so differences are expected. I can add this to the instructions. BTW it's much faster if you do raw.resample(100).filter(0.1, None) rather than just raw.filter(0.1, 50).

larsoner · 2019-10-16T11:34:20Z

@wmvanvliet you might also find that changing the duration and/or limit arguments in MNE will get you closer to what MaxFilter produces. When I ran things it seemed like sometimes it was using a duration=5 type of processing, and the deviation values it found were always larger than the ones I got in MNE, so maybe something like duration=5, limit=5 would get you closer.

wmvanvliet · 2019-10-17T06:20:24Z

With limit=5, MNE was detecting too many bads compared to maxfilter. So I tried duration=5, limit=7. Results are here: https://pastebin.com/KV9Yzgjn. Code for the script is here: https://gist.github.com/wmvanvliet/a3944ad425f28e33fba836315f78169d

agramfort · 2019-10-17T07:42:06Z

can you cook a summary figure of this? fraction of time where they agree? summary of average overlaps between sets?

…

wmvanvliet · 2019-10-17T11:59:55Z

How's this? https://pastebin.com/vcKwRbSf

larsoner · 2019-10-17T12:04:59Z

A 12% absolute match is not so great :(

I'll ping Jukka and see if he has any insight into why things might be different. It's possible that it's the filtering. Are these European / 50 Hz data? The line noise might be messing things up, and if they use a brick wall FFT that stops at 49 Hz, that will be very different from what raw.filter or even raw.resample(100) would do.

If it's easy enough to re-run, you could try raw.resample(98) to see if it helps, as that should kill the 50 Hz line noise and eliminate that possibility.

wmvanvliet · 2019-10-17T12:19:47Z

Powerline is at 50Hz, yes

larsoner · 2019-10-17T13:15:30Z

Also did you make sure the params were the same like origin, cross_talk, fine_calibration, etc.? I doubt it would make a big difference but it might make some difference

wmvanvliet · 2019-10-17T13:50:26Z

See the script: https://gist.github.com/wmvanvliet/a3944ad425f28e33fba836315f78169d

larsoner · 2019-10-17T14:16:00Z

You should probably use origin=(0., 0., 0.04) in MNE (assuming these are subject files and not empty room) -- we use a dig fit by default and they use a fixed pos
Either pass fine_calibration and cross_talk in MNE or (easier) pass -ctc off -cal off in the MF call -- by default MF for a given site will use built-in cross-talk and fine cal
If you really want to reduce differences in the expansions, use regularize=None, bad_condition='ignore' in MNE and -regularize off in MF because our regularization behaves slightly differently
If you do decide to re-run, I'd also do resample(98) just in case some line noise is sticking around right at Nyquist, since I'm not sure how MF actually does its lowpass.

olafhauk · 2019-10-17T17:31:04Z

@olafhauk maybe you can give it a try?

I've just come back from another trip. You want me to test the autobad option in maxfilter?

olafhauk · 2019-10-17T17:41:18Z

I can test on cam-can data but I would need the bads found by original maxfilter program. @olafhauk @dengemann @SherazKhan do you know where I can find this?
Do you already have an answer to this? They have just appointed a new CamCan administrator who may be able to provide this info.

wmvanvliet · 2019-10-18T07:37:05Z

That improved things a little: https://pastebin.com/M1tdsz6f

larsoner · 2019-10-18T13:22:53Z

That improved things a little:

80 hits, 14 misses, 46 false alarms (and 9958 correct rejections) -- getting better at least! I emailed Jukka to see if he has some insight into where other potential differences may arise, but it might take a bit for him to get back to me. I'll mark this WIP in the meantime

Do you already have an answer to this? They have just appointed a new CamCan administrator who may be able to provide this info.

@olafhauk no we do not have this info yet

olafhauk · 2019-10-18T15:53:05Z

That improved things a little:

80 hits, 14 misses, 46 false alarms (and 9958 correct rejections) -- getting better at least! I emailed Jukka to see if he has some insight into where other potential differences may arise, but it might take a bit for him to get back to me. I'll mark this WIP in the meantime

Do you already have an answer to this? They have just appointed a new CamCan administrator who may be able to provide this info.

@olafhauk no we do not have this info yet

If you have already applied for access to CamCan MEG data via the CamCan web-site, you should be able to find the log-files with bad channel information on your temporary CBU account. Otherwise you need to apply for access to these data. I don't work with CamCan data myself at the moment I'm afraid. For queries it's probably best for the person who applied for CamCan access to write to rik.henson@mrc-cbu.cam.ac.uk directly.

larsoner · 2019-10-21T16:06:08Z

@wmvanvliet I don't have any lines that look like Detected 2 flat channels what MaxFilter version are you using? I have Revision: 2.2.15

wmvanvliet · 2019-10-22T07:50:25Z

I have $Revision: 2.2.15 Neuromag maxfilter Dec 11 2012 14:48:36 $

…

On 21 Oct 2019, at 19:06, Eric Larson ***@***.***> wrote: @wmvanvliet I don't have any lines that look like Detected 2 flat channels what MaxFilter version are you using? I have Revision: 2.2.15 — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or unsubscribe.

larsoner · 2019-12-16T20:39:26Z

So it turns out that MaxFilter does some stuff that we can and probably should avoid. The reading, filtering, and downsampling is tied to the number of tags and number of samples per tag, which we don't need to do because of our reading abstractions. Moreover, we can just load and filter all data at once to avoid edge artifacts.

It seems like the hits/misses/false alarms are now most strongly tied to how the filtering is done (windowing, steepness, corner frequency) and I think we should just make a sensible choice (probably low-pass below the line freq) and go with that, noting that MF behavior will be different. For now I've pushed something that gets me 53/6/14/7271 for hit/miss/FA/CR on 32 files, with the numbers bouncing around depending mostly on how I change our filtering choices.

@wmvanvliet can you try re-running on your data? You can remove any flat or bad marking steps at the Python end, e.g. you no longer need to mark_flat as we now internally mimic what MF does.

larsoner · 2020-02-05T17:04:19Z

ping @wmvanvliet :

can you try re-running on your data? You can remove any flat or bad marking steps at the Python end, e.g. you no longer need to mark_flat as we now internally mimic what MF does.

See my explanation above, but I think at this point even with some mismatches we are doing about as well as we should try to do in matching what MaxFilter does. See if you agree with my reasoning above.

With this and #7290 I'll finally be able to have everything in Python (no need for pushing files to a MaxFilter workstation)...

larsoner · 2020-03-03T20:19:03Z

ping @wmvanvliet, it would be nice to move forward on this if possible

wmvanvliet · 2020-03-04T06:59:38Z

Updated script: https://gist.github.com/wmvanvliet/a3944ad425f28e33fba836315f78169d
Results: https://pastebin.com/Pf53JKVn

larsoner · 2020-03-04T16:07:39Z

@wmvanvliet I think that's pretty good! It's not perfect, but based on what I wrote above I think this is expected. Are you (sufficiently) convinced?

wmvanvliet · 2020-03-05T10:14:48Z

At the very least, it's a useful addition :) It's ok for me if things are not 100% MF compatible, as other portions such as the head position tracking also vary a little.

agramfort

We also have a

def find_outliers(X, threshold=3.0, max_iter=2):

in a file called preprocessing/bads.py

maybe we could APIs between functions that aim to find bad channels?

find_outliers -> find_bads_zscore
here then find_bads_maxwell ?

thinking out loud...

mne/preprocessing/maxwell.py

larsoner · 2020-03-05T14:49:30Z

We also have a def find_outliers(X, threshold=3.0, max_iter=2): in a file called preprocessing/bads.py

This looks like it's supposed to be a private function -- it's undocumented in python_reference.rst and only used in ica code. I think we should do a deprecation cycle to make it private (very easy by renaming to _find_outliers and adding a find_outliers with deprecation decocorator).

That frees us up from worrying about that function at all. I'm fine with find_bads_maxwell as it does make it so that future find_bads_* methods could be added.

larsoner · 2020-03-05T15:29:47Z

Pushed a commit to rename to find_bad_channels_maxwell

agramfort · 2020-03-05T15:33:17Z

+1 for find_bad_channels_maxwell and to deprecate find_outliers

larsoner · 2020-03-05T17:24:12Z

CI failures are unrelated

agramfort · 2020-03-05T20:47:31Z

awesome @larsoner !

* ENH: Add autobad detection like MF * API: Rename to find_bad_channels_maxwell * DOC: Missed a few omissions [ci skip]

wmvanvliet reviewed Oct 16, 2019

View reviewed changes

mne/preprocessing/maxwell.py Show resolved Hide resolved

mne/preprocessing/maxwell.py Show resolved Hide resolved

larsoner changed the title ~~MRG, ENH: Add autobad detection like MF~~ WIP, ENH: Add autobad detection like MF Oct 18, 2019

larsoner force-pushed the autobad branch from a4d55ed to 1f539ab Compare December 16, 2019 20:39

larsoner added this to the 0.20 milestone Feb 5, 2020

larsoner changed the title ~~WIP, ENH: Add autobad detection like MF~~ MRG, ENH: Add autobad detection like MF Feb 5, 2020

larsoner force-pushed the autobad branch from 574d7da to 7d1df83 Compare February 26, 2020 04:12

agramfort reviewed Mar 5, 2020

View reviewed changes

mne/preprocessing/maxwell.py Outdated Show resolved Hide resolved

larsoner added 2 commits March 5, 2020 09:56

ENH: Add autobad detection like MF

1cb16c9

API: Rename to find_bad_channels_maxwell

af1f974

larsoner force-pushed the autobad branch from 7d1df83 to af1f974 Compare March 5, 2020 15:29

DOC: Missed a few omissions [ci skip]

4d9efc4

agramfort approved these changes Mar 5, 2020

View reviewed changes

agramfort merged commit 76d8495 into mne-tools:master Mar 5, 2020

larsoner deleted the autobad branch March 6, 2020 04:13

AdoNunes pushed a commit to AdoNunes/mne-python that referenced this pull request Apr 6, 2020

MRG, ENH: Add autobad detection like MF (mne-tools#6940)

873ee24

* ENH: Add autobad detection like MF * API: Rename to find_bad_channels_maxwell * DOC: Missed a few omissions [ci skip]

AdoNunes pushed a commit to AdoNunes/mne-python that referenced this pull request Apr 6, 2020

MRG, ENH: Add autobad detection like MF (mne-tools#6940)

55bc3eb

* ENH: Add autobad detection like MF * API: Rename to find_bad_channels_maxwell * DOC: Missed a few omissions [ci skip]

Uh oh!

MRG, ENH: Add autobad detection like MF #6940

MRG, ENH: Add autobad detection like MF #6940

Uh oh!

Conversation

larsoner commented Oct 9, 2019

Uh oh!

larsoner commented Oct 9, 2019

Uh oh!

drammock commented Oct 9, 2019

Uh oh!

codecov bot commented Oct 9, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

agramfort commented Oct 10, 2019

Uh oh!

agramfort commented Oct 10, 2019

Uh oh!

larsoner commented Oct 10, 2019

Uh oh!

agramfort commented Oct 12, 2019

Uh oh!

wmvanvliet left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

agramfort commented Oct 16, 2019

Uh oh!

wmvanvliet commented Oct 16, 2019

Uh oh!

agramfort commented Oct 16, 2019 via email

Uh oh!

wmvanvliet commented Oct 16, 2019

Uh oh!

larsoner commented Oct 16, 2019

Uh oh!

larsoner commented Oct 16, 2019

Uh oh!

wmvanvliet commented Oct 17, 2019

Uh oh!

agramfort commented Oct 17, 2019 via email

Uh oh!

wmvanvliet commented Oct 17, 2019

Uh oh!

larsoner commented Oct 17, 2019

Uh oh!

wmvanvliet commented Oct 17, 2019

Uh oh!

larsoner commented Oct 17, 2019

Uh oh!

wmvanvliet commented Oct 17, 2019

Uh oh!

larsoner commented Oct 17, 2019

Uh oh!

olafhauk commented Oct 17, 2019

Uh oh!

olafhauk commented Oct 17, 2019

Uh oh!

wmvanvliet commented Oct 18, 2019

Uh oh!

larsoner commented Oct 18, 2019

Uh oh!

olafhauk commented Oct 18, 2019

Uh oh!

larsoner commented Oct 21, 2019

Uh oh!

wmvanvliet commented Oct 22, 2019 via email

Uh oh!

larsoner commented Dec 16, 2019

Uh oh!

larsoner commented Feb 5, 2020

Uh oh!

larsoner commented Mar 3, 2020

Uh oh!

wmvanvliet commented Mar 4, 2020

Uh oh!

larsoner commented Mar 4, 2020

Uh oh!

codecov bot commented Oct 9, 2019 •

edited

Loading