Reorganize AlphaFold 3 modules into dedicated subpackage #115

0x00b1 · 2025-09-02T22:36:57Z

Summary

Reorganized all AlphaFold 3 modules into a dedicated alphafold3 subpackage under src/beignet/nn/alphafold3/
Moved AlphaFold 3 functional operations to src/beignet/nn/functional/alphafold3/
Fixed circular imports and updated all import paths throughout the codebase
Resolved test failures and ensured all 355 tests pass
Added comprehensive benchmarking suite for all AlphaFold 3 components

Test plan

All 355 tests pass (uv run python -m pytest)
Code formatting and linting pass (uv run ruff check and uv run ruff format)
Pre-commit hooks pass
Module imports work correctly after reorganization
AlphaFold 3 functional operations accessible from main package
Benchmarking suite includes all new modules

- Add frame_aligned_point_error: AlphaFold's Frame Aligned Point Error loss - Add distogram_loss: AlphaFold 3 distogram loss for distance distribution prediction - Add smooth_local_distance_difference_test: smooth differentiable LDDT loss - Fix compose_rotation_matrix for torch.compile compatibility - Fix rotation_matrix_to_quaternion to handle arbitrary batch dimensions - All operators use torch.finfo(dtype).eps for automatic numerical stability - Comprehensive test suites with property-based testing using Hypothesis - Full ASV benchmark coverage for performance monitoring

Implements Algorithm 28 from the AlphaFold 3 paper for weighted rigid alignment of 3D point sets. This is a weighted generalization of the Kabsch algorithm that finds optimal rotation and translation to align input points to target points using per-point weights. Key features: - Handles arbitrary batch dimensions - Uses torch.finfo(dtype).eps for numerical stability - Stops gradients on output (detach) as specified in algorithm - Handles reflection correction via SVD determinant check - Full test coverage with property-based testing - ASV benchmarks for performance monitoring - torch.compile compatible

Implementation includes: - OuterProductMean: Computes outer products across MSA sequences then averages - MultipleSequenceAlignment: Main MSA processing block with multiple sub-modules - MSAPairWeightedAveraging: Attention-based weighted averaging from pair to MSA - Triangle multiplication modules (incoming/outgoing): Triangular updates - Triangle attention modules (starting/ending node): Triangular attention - Transition: MLP with gated linear units - Coordinate frame operations: express_coordinates_in_frame, compute_alignment_error Critical bug fixes: - Fixed OuterProductMean to preserve MSA evolutionary information - Corrected softmax dimension in MSAPairWeightedAveraging attention - Added comprehensive input validation and error handling - Improved numerical stability and tensor dimension handling All modules support: - Gradient computation and autograd compatibility - Batch processing and torch.compile optimization - Multiple precision (float32/float64) and device support - Comprehensive testing with property-based validation

- Rename MSA -> AlphaFold3MSA - Rename ConfidenceHead -> AlphaFold3Confidence - Rename DiffusionModule -> AlphaFold3Diffusion - Rename DistogramHead -> AlphaFold3Distogram - Rename TemplateEmbedder -> AlphaFold3TemplateEmbedder - Rename AlphaFold3Inference -> AlphaFold3 Update all imports, exports, tests, benchmarks, and documentation to use consistent AlphaFold3 prefixed naming convention.

- Move all AlphaFold 3 modules to src/beignet/nn/alphafold3/ - Extract AtomAttentionDecoder from SampleDiffusion into separate module - Clean up diffusers integration with proper imports - Simplify root nn/__init__.py to only export main AlphaFold3 class - Add comprehensive alphafold3/__init__.py with all component exports - Fix import paths throughout codebase for new package structure

- Reorganized all AlphaFold 3 modules into dedicated alphafold3 subpackage - Fixed circular imports in sample diffusion module - Corrected missing function exports in main module - Updated all test imports for new package structure - Resolved tensor dimension mismatches in neural network modules - Fixed Hypothesis deadline issues in property-based tests - Added comprehensive test coverage for all AlphaFold 3 components - Ensured all 355 tests pass with proper module organization

claude · 2025-09-02T22:37:14Z

Claude finished @0x00b1's task —— View job

AlphaFold 3 Module Reorganization Review

I've conducted a comprehensive review of this major reorganization. This is an impressive implementation with excellent code quality overall.

Review Tasks:

Note: Unable to execute commands due to permission restrictions, but conducted thorough static analysis.

✅ Strengths

Architecture & Organization

Excellent modular structure: Clean separation between nn.alphafold3 (modules) and nn.functional.alphafold3 (functions)
Proper namespace organization: AlphaFold 3 components are well-isolated in dedicated subpackages
Clean import structure: No circular dependencies detected, proper relative imports throughout
Consistent naming: All modules follow underscore prefix pattern (_module_name.py)

Code Quality

Comprehensive documentation: Every module has detailed NumPy-style docstrings with algorithm references
Type annotations: Consistent use of type hints throughout (Tensor, proper return types)
Algorithm traceability: Docstrings explicitly reference AlphaFold 3 algorithms (e.g., "implements Algorithm 5 exactly")
Security: No dynamic code execution (eval, exec, __import__) detected

Testing Excellence

Comprehensive coverage: 28+ test functions across 20 test files covering all modules
Property-based testing: Excellent use of Hypothesis for robust testing with varied inputs
Gradient testing: Tests verify differentiability with requires_grad and loss.backward()
Edge case coverage: Tests include finite value checks and proper error handling
Parameterized testing: Tests cover different batch sizes, dtypes (float32/64), and tensor dimensions

Performance & Benchmarking

Complete benchmark suite: Individual benchmark files for all major components
torch.compile integration: Benchmarks test both regular and compiled versions
Memory profiling: Proper setup for memory usage measurement
Reproducible results: Consistent seeding with torch.manual_seed(42)

⚠️ Areas for Improvement

Import Structure

# src/beignet/nn/alphafold3/_alphafold3.py:5
from ..._one_hot import one_hot

Issue: Triple-dot imports (...) are fragile and hard to follow.
Recommendation: Use absolute imports or cleaner relative imports:

from beignet import one_hot  # or
from beignet._one_hot import one_hot

Test Organization

File naming inconsistency: Some benchmark imports use private modules (_MSAPairWeightedAveraging) that should be avoided
Test complexity: Some test functions in test__alphafold3_modules.py could be split into separate files for better organization

Documentation

Missing loss function docs: docs/reference/nn/alphafold3/loss-functions.md is empty
API reference completion: Some functional modules could benefit from more detailed usage examples

Performance Considerations

Large tensor operations: Some modules (e.g., attention mechanisms) may benefit from memory optimization for very large protein structures
Compilation fallbacks: Benchmark code has try/catch for torch.compile but could log compilation failures

🔍 Specific Code Review

Functional Implementation Quality (`src/beignet/nn/functional/alphafold3/`)

# _compute_alignment_error.py - Excellent implementation
def compute_alignment_error(
    predicted_coordinates: Tensor,
    target_coordinates: Tensor, 
    predicted_frames: Tensor,
    target_frames: Tensor,
    epsilon: float = 1e-8,
) -> Tensor:

Strengths:

Clear parameter documentation
Appropriate epsilon for numerical stability
Proper shape annotations

Module Architecture (`src/beignet/nn/alphafold3/`)

The main AlphaFold3 class shows excellent design:

Proper parameter validation
Clear algorithm references
Comprehensive examples in docstrings
Good separation of concerns

📊 Test Coverage Assessment

Excellent coverage with tests for:

✅ Basic functionality and output shapes
✅ Gradient computation and backpropagation
✅ Different data types (float32/64)
✅ Batch operations
✅ Edge cases and numerical stability
✅ torch.compile compatibility

🚀 Performance & Security

Security: ✅ No security concerns detected

No dynamic code execution
No hardcoded secrets or credentials
Safe tensor operations throughout

Performance: ✅ Well-optimized implementation

Proper PyTorch patterns
torch.compile integration
Efficient tensor operations
Good memory management patterns

📝 Recommendations

Fix import patterns: Replace triple-dot imports with cleaner alternatives
Complete documentation: Fill in the empty loss functions documentation
Consider splitting large test files: Break down complex test modules for better maintainability
Add compilation logging: Log torch.compile failures in benchmarks for debugging
Validate tensor shapes: Consider adding input validation in critical functions

✨ Overall Assessment

This is a high-quality implementation that demonstrates:

Deep understanding of the AlphaFold 3 architecture
Excellent software engineering practices
Comprehensive testing and benchmarking
Clean, maintainable code organization
Strong documentation standards

Verdict: ✅ Approve with minor suggestions

The reorganization successfully isolates AlphaFold 3 components while maintaining the high standards of the Beignet codebase. The implementation is production-ready with only minor improvements suggested.

claude · 2025-09-02T22:41:55Z

Claude finished @0x00b1's task —— View job

Claude finished @0x00b1's task ——

AlphaFold 3 Module Reorganization Review

I've conducted a comprehensive review of this major reorganization. This is an impressive implementation with excellent code quality overall.

Review Tasks:

Note: Unable to execute commands due to permission restrictions, but conducted thorough static analysis.

✅ Strengths

Architecture & Organization

Excellent modular structure: Clean separation between nn.alphafold3 (modules) and nn.functional.alphafold3 (functions)
Proper namespace organization: AlphaFold 3 components are well-isolated in dedicated subpackages
Clean import structure: No circular dependencies detected, proper relative imports throughout
Consistent naming: All modules follow underscore prefix pattern (_module_name.py)

Code Quality

Comprehensive documentation: Every module has detailed NumPy-style docstrings with algorithm references
Type annotations: Consistent use of type hints throughout (Tensor, proper return types)
Algorithm traceability: Docstrings explicitly reference AlphaFold 3 algorithms (e.g., "implements Algorithm 5 exactly")
Security: No dynamic code execution (eval, exec, __import__) detected

Testing Excellence

Comprehensive coverage: 28+ test functions across 20 test files covering all modules
Property-based testing: Excellent use of Hypothesis for robust testing with varied inputs
Gradient testing: Tests verify differentiability with requires_grad and loss.backward()
Edge case coverage: Tests include finite value checks and proper error handling
Parameterized testing: Tests cover different batch sizes, dtypes (float32/64), and tensor dimensions

Performance & Benchmarking

Complete benchmark suite: Individual benchmark files for all major components
torch.compile integration: Benchmarks test both regular and compiled versions
Memory profiling: Proper setup for memory usage measurement
Reproducible results: Consistent seeding with torch.manual_seed(42)

⚠️ Areas for Improvement

Import Structure

# src/beignet/nn/alphafold3/_alphafold3.py:5
from ..._one_hot import one_hot

Issue: Triple-dot imports (...) are fragile and hard to follow.
Recommendation: Use absolute imports or cleaner relative imports:

from beignet import one_hot  # or
from beignet._one_hot import one_hot

Test Organization

File naming inconsistency: Some benchmark imports use private modules (_MSAPairWeightedAveraging) that should be avoided
Test complexity: Some test functions in test__alphafold3_modules.py could be split into separate files for better organization

Documentation

Missing loss function docs: docs/reference/nn/alphafold3/loss-functions.md is empty
API reference completion: Some functional modules could benefit from more detailed usage examples

Performance Considerations

Large tensor operations: Some modules (e.g., attention mechanisms) may benefit from memory optimization for very large protein structures
Compilation fallbacks: Benchmark code has try/catch for torch.compile but could log compilation failures

🔍 Specific Code Review

Functional Implementation Quality (`src/beignet/nn/functional/alphafold3/`)

# _compute_alignment_error.py - Excellent implementation
def compute_alignment_error(
    predicted_coordinates: Tensor,
    target_coordinates: Tensor, 
    predicted_frames: Tensor,
    target_frames: Tensor,
    epsilon: float = 1e-8,
) -> Tensor:

Strengths:

Clear parameter documentation
Appropriate epsilon for numerical stability
Proper shape annotations

Module Architecture (`src/beignet/nn/alphafold3/`)

The main AlphaFold3 class shows excellent design:

Proper parameter validation
Clear algorithm references
Comprehensive examples in docstrings
Good separation of concerns

📊 Test Coverage Assessment

Excellent coverage with tests for:

✅ Basic functionality and output shapes
✅ Gradient computation and backpropagation
✅ Different data types (float32/64)
✅ Batch operations
✅ Edge cases and numerical stability
✅ torch.compile compatibility

🚀 Performance & Security

Security: ✅ No security concerns detected

No dynamic code execution
No hardcoded secrets or credentials
Safe tensor operations throughout

Performance: ✅ Well-optimized implementation

Proper PyTorch patterns
torch.compile integration
Efficient tensor operations
Good memory management patterns

📝 Recommendations

Fix import patterns: Replace triple-dot imports with cleaner alternatives
Complete documentation: Fill in the empty loss functions documentation
Consider splitting large test files: Break down complex test modules for better maintainability
Add compilation logging: Log torch.compile failures in benchmarks for debugging
Validate tensor shapes: Consider adding input validation in critical functions

✨ Overall Assessment

This is a high-quality implementation that demonstrates:

Deep understanding of the AlphaFold 3 architecture
Excellent software engineering practices
Comprehensive testing and benchmarking
Clean, maintainable code organization
Strong documentation standards

Verdict: ✅ Approve with minor suggestions

The reorganization successfully isolates AlphaFold 3 components while maintaining the high standards of the Beignet codebase. The implementation is production-ready with only minor improvements suggested.

0x00b1 added 7 commits September 2, 2025 10:30

cleanup

75d4229

Fix unused variables in benchmark and test files

5436dcd

0x00b1 temporarily deployed to test.pypi.org September 2, 2025 22:42 — with GitHub Actions Inactive

0x00b1 merged commit b81aef6 into main Sep 2, 2025
12 checks passed

0x00b1 deleted the fape branch September 2, 2025 23:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Reorganize AlphaFold 3 modules into dedicated subpackage #115

Reorganize AlphaFold 3 modules into dedicated subpackage #115

Uh oh!

0x00b1 commented Sep 2, 2025

Uh oh!

claude bot commented Sep 2, 2025 •

edited

Loading

Uh oh!

claude bot commented Sep 2, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Reorganize AlphaFold 3 modules into dedicated subpackage #115

Reorganize AlphaFold 3 modules into dedicated subpackage #115

Uh oh!

Conversation

0x00b1 commented Sep 2, 2025

Summary

Test plan

Uh oh!

claude bot commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

AlphaFold 3 Module Reorganization Review

Review Tasks:

✅ Strengths

Architecture & Organization

Code Quality

Testing Excellence

Performance & Benchmarking

⚠️ Areas for Improvement

Import Structure

Test Organization

Documentation

Performance Considerations

🔍 Specific Code Review

Functional Implementation Quality (src/beignet/nn/functional/alphafold3/)

Module Architecture (src/beignet/nn/alphafold3/)

📊 Test Coverage Assessment

🚀 Performance & Security

📝 Recommendations

✨ Overall Assessment

Uh oh!

claude bot commented Sep 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

AlphaFold 3 Module Reorganization Review

Review Tasks:

✅ Strengths

Architecture & Organization

Code Quality

Testing Excellence

Performance & Benchmarking

⚠️ Areas for Improvement

Import Structure

Test Organization

Documentation

Performance Considerations

🔍 Specific Code Review

Functional Implementation Quality (src/beignet/nn/functional/alphafold3/)

Module Architecture (src/beignet/nn/alphafold3/)

📊 Test Coverage Assessment

🚀 Performance & Security

📝 Recommendations

✨ Overall Assessment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

claude bot commented Sep 2, 2025 •

edited

Loading

Functional Implementation Quality (`src/beignet/nn/functional/alphafold3/`)

Module Architecture (`src/beignet/nn/alphafold3/`)

claude bot commented Sep 2, 2025 •

edited

Loading

Functional Implementation Quality (`src/beignet/nn/functional/alphafold3/`)

Module Architecture (`src/beignet/nn/alphafold3/`)