Investigate validation failures - maintain 83.8% pass rate #46

Merged
skyelaird merged 1 commit into main from claude/continue-work-01UE9xeWiKqwMQ8bgjRL6QAV
Nov 14, 2025
@skyelaird
Owner

Investigation Summary:
Analyzed the remaining 16.2% of validation tests that fail (35 of 216) to identify root causes and potential fixes. Created investigation notes documenting the findings and recommendations.

Failure Patterns Identified:

  1. High-frequency (25.90 MHz): 17 failures - SNR deviations 12-47 dB
  2. Low-frequency (6.10, 7.20 MHz): 13 failures - reliability under-predicted ~20-25%
  3. Mid-frequency spot failures: 5 isolated incidents

Key Findings:

  • MUFday probability calculation shows 100x discrepancy at high frequencies (DVOACAP: 0.0002, VOACAP: 0.02 for hour 06, 25.90 MHz)
  • Sigma values vary by 2x between different calculation points
  • F2D deviation formula verified correct against Pascal source
  • Attempted fix (*2 scaling) failed, reducing pass rate to 62%
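The 100x figure in the first finding follows directly from the two quoted MUFday probabilities. A minimal sketch of that arithmetic, using only the values stated above (the function name `discrepancy_factor` is hypothetical and is not part of DVOACAP):

```python
import math


def discrepancy_factor(reference: float, candidate: float) -> float:
    """Return how many times larger the reference value is than the candidate."""
    if candidate <= 0:
        raise ValueError("candidate must be positive")
    return reference / candidate


# Values quoted in the findings (hour 06, 25.90 MHz).
voacap_mufday = 0.02
dvoacap_mufday = 0.0002

factor = discrepancy_factor(voacap_mufday, dvoacap_mufday)
orders_of_magnitude = math.log10(factor)
print(f"VOACAP / DVOACAP = {factor:.0f}x ({orders_of_magnitude:.1f} orders of magnitude)")
```

A gap of two orders of magnitude in a probability this small suggests looking at the tail of the MUF distribution rather than at a simple scaling factor, which would be consistent with the failed *2 attempt.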

Conclusion:
The current 83.8% pass rate represents good agreement with VOACAP for most scenarios. The remaining failures are concentrated in edge cases (frequencies near or above the MUF, and low frequencies at night). Further improvement requires consultation with the VOACAP community or calibration against real-world data.

Artifacts:

  • INVESTIGATION_NOTES.md - Detailed findings and recommendations
  • Debug scripts for MUFday, F2D tables, and failure analysis
  • Debug hooks in prediction_engine.py and fourier_maps.py (disabled)
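For readers unfamiliar with the pattern, a debug hook that is disabled by default might look roughly like the sketch below. This is an illustration only: the environment variable `DVOACAP_DEBUG_MUFDAY` and the function `debug_mufday` are hypothetical names, and the actual hooks in prediction_engine.py and fourier_maps.py may be gated differently (for example, by a module-level constant).

```python
import io
import os
import sys

# Hypothetical switch: off unless the environment variable is set to "1".
DEBUG_MUFDAY = os.environ.get("DVOACAP_DEBUG_MUFDAY", "0") == "1"


def debug_mufday(hour, freq_mhz, mufday, enabled=DEBUG_MUFDAY, stream=sys.stderr):
    """Write one MUFday diagnostic line when enabled; otherwise do nothing.

    Returns True if a line was written, False if the hook was skipped.
    """
    if not enabled:
        return False
    print(
        f"[mufday] hour={hour:02d} freq={freq_mhz:.2f} MHz mufday={mufday:.4f}",
        file=stream,
    )
    return True


# Usage: explicitly enable the hook and capture its output in a buffer.
buf = io.StringIO()
debug_mufday(6, 25.90, 0.0002, enabled=True, stream=buf)
```

Because the hook returns early when disabled, leaving the calls in place does not change prediction results.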

No functional changes: all debug code is disabled by default. The validation pass rate remains at 83.8% (181/216 tests passing).
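The counts quoted in this description are mutually consistent, which can be checked with a short tally. The sketch below uses only the numbers stated above; the dictionary keys are illustrative labels, not identifiers from the test suite.

```python
# Failure counts per pattern, as listed under "Failure Patterns Identified".
failures_by_pattern = {
    "high_frequency_25.90_MHz": 17,
    "low_frequency_6.10_and_7.20_MHz": 13,
    "mid_frequency_spot": 5,
}

total_tests = 216
passed = 181

failed = sum(failures_by_pattern.values())
pass_rate = 100 * passed / total_tests
fail_rate = 100 * failed / total_tests

print(f"failed: {failed}, pass rate: {pass_rate:.1f}%, fail rate: {fail_rate:.1f}%")
```

The three patterns account for all 35 failures (216 - 181), so no failing test falls outside the categories listed.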

@skyelaird skyelaird merged commit fa37343 into main Nov 14, 2025