Closed
Description
Once we have closed #195 and #244, we can look into the performance of these threading improvements together with the previously threaded matrix multiply kernels.
The multiply kernel can be selected with the MULT_KERN option in the Makefile. The best place to start is ompGemm, but it is worth looking at the other options too.
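For instance, selecting the ompGemm kernel might look like the line below. This is a minimal sketch that assumes MULT_KERN is assigned as an ordinary make variable; where exactly the assignment lives and what surrounds it depend on your build setup. Since the kernel is chosen at build time, the code needs to be rebuilt after changing it.

```make
# Assumption: MULT_KERN is a plain make variable read by the build.
# ompGemm is the kernel suggested above as the starting point.
MULT_KERN = ompGemm
```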
A good test case is:
- Use `Si.ion` from test 002 in the testsuite
- Use `Conquest_input` from test 002 in the testsuite, change Grid cutoff to 200
- Use Coords.dat from the input used in Thread loops over blocks #195

--> This is the `matrix_multiply` performance test in Add input configurations used for profiling #262

- Think about strategies for reducing omp overhead