
276 thread exx phi on grid #324

Merged
connoraird merged 5 commits into 276-combine-nsf-loops from 276-thread-exx-phi-on-grid on Feb 20, 2024

Conversation

@connoraird (Contributor) commented Feb 9, 2024

Description

  • The xyz nested loop has been thread-parallelised in exx_phi_on_grid

Speedup plot

These plots show the performance of the test test_004_isol_C2H4_4proc_PBE0CRI for 1 MPI process.

[Speedup plot: 276-thread-phi-on-grid]

@tkoskela (Contributor) left a comment

This is a nice, minimal change that gets a parallel performance boost! I like it.

!print*,
xyz_offset = xyz + rst
!$omp parallel do collapse(3) schedule(runtime) default(none) &
!$omp shared(mx,my,mz,px,py,pz,grid_spacing,xyz_offset,pao,spec,phi_on_grid,i_dummy,exx_cartesian,extent) &
Contributor

Shorter lines please 🥺

Comment on lines +165 to +167
x = nx*grid_spacing + xyz_offset(1)
y = ny*grid_spacing + xyz_offset(2)
z = nz*grid_spacing + xyz_offset(3)
Contributor

This is fine. If there's spare time, I'd like to try the following:

Precompute x, y, and z into arrays outside of the loop. These are just 1D arrays, so the memory footprint should be small, and we would avoid redundant recomputation of y and z. We could use an Array of Structures data format so that the (x, y, z) triples are aligned in memory (i.e. real, dimension(3,N) :: xyz, where N = max(px-mx, py-my, pz-mz), or something like that). Then inside this loop you could just use xyz(1,nx), xyz(2,ny), xyz(3,nz). I'm not sure if this makes much performance difference. As we learned this week, "memory is expensive, flops are free".

Contributor Author

I'm not sure this would be worth it. I've just tested the concept with a short program and there seems to be no difference. When accessing xyz in the nested loop, the different values of nx, ny and nz make the memory accesses non-contiguous, so we'll get lots of cache misses.

Contributor

It should be possible to arrange the data such that the nx,ny,nz accesses are contiguous. Maybe I got it wrong in my comment. In principle I agree, if it seems like this isn't worth it, let's not spend much time on it.

@tkoskela tkoskela added the improves: speed Speed-up of code label Feb 16, 2024
@connoraird connoraird force-pushed the 276-thread-exx-phi-on-grid branch from ec90a6a to 36848b7 on February 19, 2024 15:54
@connoraird connoraird force-pushed the 276-thread-exx-phi-on-grid branch from 36848b7 to be386f1 on February 19, 2024 15:57
@connoraird connoraird changed the base branch from 276-use-blas to 276-combine-nsf-loops February 20, 2024 12:01
@connoraird connoraird marked this pull request as ready for review February 20, 2024 12:02
@connoraird connoraird merged commit 442640c into 276-combine-nsf-loops Feb 20, 2024