FIX: Handle a non-linear FrameIterator in HydrogenBondAnalysis #5202

BradyAJohnston · 2026-01-06T14:19:29Z

Fixes #5200

Changes made in this Pull Request:

If the self.start is None then return computed values by using a dictionary lookup rather than index lookup

LLM / AI generated code disclosure

LLMs or other AI-powered tools (beyond simple IDE use cases) were used in this contribution: no

PR Checklist

Issue raised/referenced?
Tests updated/added?
Documentation updated/added?
package/CHANGELOG file updated?
Is your name in package/AUTHORS? (If it is not, add it!)
LLM/AI disclosure was updated.

Developers Certificate of Origin

I certify that I can submit this code contribution as described in the Developer Certificate of Origin, under the MDAnalysis LICENSE.

📚 Documentation preview 📚: https://mdanalysis--5202.org.readthedocs.build/en/5202/

BradyAJohnston · 2026-01-06T14:32:44Z

Oops it seems like my auto-formatter went a bit wild - despite still passing Black. Will clean up.

codecov · 2026-01-06T15:02:28Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 92.73%. Comparing base (a029dcd) to head (ae5afa7).
⚠️ Report is 1 commits behind head on develop.

Additional details and impacted files

@@           Coverage Diff            @@
##           develop    #5202   +/-   ##
========================================
  Coverage    92.72%   92.73%           
========================================
  Files          180      180           
  Lines        22473    22476    +3     
  Branches      3189     3190    +1     
========================================
+ Hits         20838    20843    +5     
+ Misses        1177     1176    -1     
+ Partials       458      457    -1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

orbeckst

Looks good!

Optionally: Look at the performance, perhaps there's a faster way to do the lookup.

package/CHANGELOG

orbeckst · 2026-01-06T21:43:07Z

package/MDAnalysis/analysis/hydrogenbonds/hbond_analysis.py

+            count_lookup = dict(zip(indices, tmp_counts))
+            return np.array([count_lookup.get(i, 0) for i in range(len(self.frames))])


Looking up each frame looks slow. Perhaps there's some numpy magic (take???) ?

The only really faster approach I could figure out would be this:

if self.start is None: counts = np.zeros(len(self.frames), dtype=int) positions = np.searchsorted(self.frames, indices) counts[positions] = tmp_counts return counts

But this assumes the self.frames to be sorted. Would this always be the case, given the FrameIterator could be a non-sorted sequence of frames?

I am not sure if self.frames is sorted, possibly not when using run(frames=[2, 3, 0, 7, 6]). Maybe do a quick test?

Perhaps one could sort frames and rearrange counts in the same way and then un-sort everything again before returning?

fix hbond iterator

84e63d2

BradyAJohnston force-pushed the fix-hbond-iterator branch from 695a668 to 84e63d2 Compare January 6, 2026 14:39

BradyAJohnston added 2 commits January 6, 2026 14:40

fix changelog

a40ec44

improve lookup dict creation

9bee140

orbeckst approved these changes Jan 6, 2026

View reviewed changes

Update package/CHANGELOG

ae5afa7

orbeckst self-assigned this Jan 6, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

FIX: Handle a non-linear FrameIterator in HydrogenBondAnalysis #5202

FIX: Handle a non-linear FrameIterator in HydrogenBondAnalysis #5202

BradyAJohnston commented Jan 6, 2026 •

edited by github-actions bot

Loading

Uh oh!

BradyAJohnston commented Jan 6, 2026

Uh oh!

codecov bot commented Jan 6, 2026 •

edited

Loading

Uh oh!

orbeckst left a comment

Uh oh!

Uh oh!

orbeckst Jan 6, 2026

Uh oh!

BradyAJohnston Jan 7, 2026

Uh oh!

orbeckst Jan 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		count_lookup = dict(zip(indices, tmp_counts))
		return np.array([count_lookup.get(i, 0) for i in range(len(self.frames))])

FIX: Handle a non-linear FrameIterator in HydrogenBondAnalysis #5202

Are you sure you want to change the base?

FIX: Handle a non-linear FrameIterator in HydrogenBondAnalysis #5202

Conversation

BradyAJohnston commented Jan 6, 2026 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

LLM / AI generated code disclosure

PR Checklist

Developers Certificate of Origin

Uh oh!

BradyAJohnston commented Jan 6, 2026

Uh oh!

codecov bot commented Jan 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

orbeckst left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

orbeckst Jan 6, 2026

Choose a reason for hiding this comment

Uh oh!

BradyAJohnston Jan 7, 2026

Choose a reason for hiding this comment

Uh oh!

orbeckst Jan 7, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

BradyAJohnston commented Jan 6, 2026 •

edited by github-actions bot

Loading

codecov bot commented Jan 6, 2026 •

edited

Loading