Conversation

@jona-sassenhagen
Contributor

My attempt at #1916

Please check if the general approach seems okay.
Basically, there is a function in mne.utils, _get_mkl_fft, that takes a string and n_jobs as arguments. If n_jobs isn't 'cuda', it tries to import the function named by the string from mklfft.fftpack; it also tries to import mkl from mkl-service and set the number of threads to n_jobs.
Otherwise, it returns the corresponding function from scipy.fftpack.
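
A rough sketch of what I mean (illustrative only; names and details may differ slightly from the actual commit):

def _get_mkl_fft(name, n_jobs):
    """Return the FFT function `name`, preferring mklfft when possible."""
    if n_jobs != 'cuda':
        try:
            import mkl  # provided by mkl-service
            mkl.set_num_threads(n_jobs)
            from mklfft import fftpack
            return getattr(fftpack, name)
        except ImportError:
            pass
    from scipy import fftpack
    return getattr(fftpack, name)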

mne/filter.py Outdated
Member

don't use a get function. Just make it possible to do

from mne.utils import fft

and make sure it corresponds to mkl if available.

ok?

Contributor Author

So in mne.utils, I'd have something like

def fft(*args, **kwargs):
    try:
        from mklfft.fftpack import fft
    except ImportError:
        from scipy.fftpack import fft
    return fft(*args, **kwargs)

... or if not, can you point me to some code that does what you mean?

I tried modeling this after how ica and xdawn handle fast_dot.
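
For reference, the fast_dot pattern I mean looks roughly like this (quoting from memory, not copied from the repo):

try:
    from sklearn.utils.extmath import fast_dot
except ImportError:
    from numpy import dot as fast_dot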

Member

just add to the utils.py file

try:
    from mklfft.fftpack import fft
except ImportError:
    from scipy.fftpack import fft

it should do the trick

@jona-sassenhagen
Contributor Author

Like this? (The remaining error is just PEP8.)

@agramfort
Member

so far so good.

let us know when you have benchmarks.

@jona-sassenhagen
Contributor Author

On my i5 iMac, it's actually slower to use mklfft with 4 or 8 threads. I can't currently test it on our actually powerful computers because somebody is using them to run MATLAB ...

@jona-sassenhagen
Contributor Author

I can't get the TF decomposition functions to use more than one thread (maybe because they're wrapped inside parallel? But see below).

For filtering, these are representative runs.

With MKL (48 threads):
real 0m22.330s
user 9m39.524s
sys 0m18.133s

With MKL (12 threads):
real 0m21.795s
user 3m36.406s
sys 0m9.849s

Without MKL (n_jobs=1):
real 0m22.783s
user 0m19.933s
sys 0m2.680s

Without MKL (n_jobs=12):
real 0m16.730s
user 0m36.374s
sys 0m10.653s

Without MKL (n_jobs=48):
real 0m17.527s
user 0m35.702s
sys 0m20.013s

With MKL and parallel (n_jobs=4-8, MKL threads=4-12):
real 0m17.593s
user 6m39.953s
sys 0m19.461s

So nothing much beyond regular parallel with joblib. I guess it would save memory to run 4 jobs with 10 MKL threads each rather than 40 regular jobs?
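
For reference, what I mean by that last point, as a toy sketch (the numbers and the helper are made up, not from this PR):

import numpy as np
from joblib import Parallel, delayed
from scipy.fftpack import fft  # stand-in for the MKL-backed fft

def _process(chunk, n_mkl_threads):
    # cap MKL's thread count inside each worker, if mkl-service is installed
    try:
        import mkl
        mkl.set_num_threads(n_mkl_threads)
    except ImportError:
        pass
    return fft(chunk)

# e.g. 4 joblib jobs x 10 MKL threads each, instead of 40 single-threaded jobs
chunks = np.array_split(np.random.randn(40, 2 ** 16), 4)
results = Parallel(n_jobs=4)(delayed(_process)(c, 10) for c in chunks)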

@jasmainak
Member

Thanks for taking a stab at this @jona-sassenhagen. Something looks weird with your commit history: there are merge commits. You can fix it with a rebase.

@jona-sassenhagen
Contributor Author

Thanks @jasmainak ... not sure it's worth it though, seeing as I don't see improvements. Is anyone else getting speed improvements? I can imagine it might make a difference for systems with many cores but little RAM, though I don't have such a machine available.

mne/utils.py Outdated
Member

is this necessary? I think we can live with a small exception here.

Member

better just to use # noqa at the end of the offending lines

@jasmainak
Member

From what I remember, I wasn't getting any improvements when I tried it ...

@jona-sassenhagen
Contributor Author

In that thread, Alex said it was because multithreading is required. But somehow I don't see improvements even when really hammering all of our cores with mklfft. I don't know.

@jasmainak
Member

@jona-sassenhagen : can you clarify how I should read your benchmarks? What are real, user and sys?

@jona-sassenhagen
Contributor Author

That's just the output of standard Unix time: "real" is the wall-clock time the script takes to run, "user" is the CPU time spent in user mode summed over all threads (which is why it goes way up with more threads), and "sys" is CPU time spent in the kernel.
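
You can see the same distinction from inside Python, if that helps (a toy example, nothing to do with the PR itself):

import time
import numpy as np
from scipy.fftpack import fft

x = np.random.randn(8, 2 ** 20)
wall0, cpu0 = time.perf_counter(), time.process_time()
fft(x)
wall = time.perf_counter() - wall0  # roughly "real"
cpu = time.process_time() - cpu0  # roughly "user" + "sys" for this process
print('wall: %.3f s, CPU: %.3f s' % (wall, cpu))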

@jasmainak
Member

Could it be that there aren't enough FFT operations to make a real difference? Maybe you need to benchmark something with many FFT operations ...

@jona-sassenhagen
Contributor Author

I'd think filtering raws with method='fft' is one of the most FFT-heavy tasks we have, or is there a better test?
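
Roughly, the kind of thing I've been timing (a sketch on synthetic data, not the exact script, and the parameters are placeholders):

import numpy as np
import mne

sfreq = 1000.
info = mne.create_info(64, sfreq, 'eeg')
raw = mne.io.RawArray(np.random.randn(64, int(120 * sfreq)), info)  # 2 min of noise
# run under `time python bench_filter.py`, varying n_jobs and whether
# mklfft is importable
raw.filter(1., 40., method='fft', n_jobs=1)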

@jasmainak
Member

Did you try TF decomposition on epochs? That was really slow. It might be worthwhile to profile it and figure out whether at least the FFT operations are faster.

@jona-sassenhagen
Contributor Author

Which method (multitaper, etc.) do you think would benefit the most?

@jasmainak
Member

No idea ... I remember having tried tfr_morlet with use_fft=True and that was pretty slow. Also, for profiling: https://github.com/rkern/line_profiler, in case you don't know it already.
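
In case it helps, a minimal way to point it at tfr_morlet (synthetic epochs so it runs standalone; just a sketch, not the benchmark discussed above):

import numpy as np
import mne
from mne.time_frequency import tfr_morlet
from line_profiler import LineProfiler

# toy epochs: 50 epochs x 32 EEG channels x 2 s at 250 Hz
info = mne.create_info(32, 250., 'eeg')
epochs = mne.EpochsArray(np.random.randn(50, 32, 500), info)

def run_tfr(epochs):
    freqs = np.arange(4., 40., 2.)
    return tfr_morlet(epochs, freqs=freqs, n_cycles=freqs / 2.,
                      use_fft=True, return_itc=False)

profiler = LineProfiler()
profiler(run_tfr)(epochs)
profiler.print_stats()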

@jasmainak
Member

By pretty slow, I mean it could take 10 minutes to run on my computer ...

@jona-sassenhagen
Contributor Author

@jasmainak that's sadly not disproportionately slow for TF analyses, I fear. I've had TF decompositions take days to run for a full experiment with low frequencies (= long windows) in EEGLAB.
If you really want speed, CUDA is the way to go.

@jasmainak
Member

@agramfort thoughts?

@agramfort
Member

can you share a bench script?

@jasmainak
Member

@jona-sassenhagen would you have time for this in the coming days? If not, I can take a look once the EEGLAB .set reader is merged.

@jona-sassenhagen
Contributor Author

I'm a bit demotivated due to the lack of results ... feel free to take over.

@jona-sassenhagen
Contributor Author

Closing this for now as I just opened two other PRs, @Eric89GXL opened his issue on the topic, @jasmainak indicated he might want to take over, and I'm not making progress.
