Model fallbacks by krissetto · Pull Request #1589 · docker/cagent

krissetto · 2026-02-04T16:09:27Z

Allows users to define fallback models per agent in the yaml config.

If something goes wrong calling a model, retry a few times with exp backoff + jitter or fallback to the next model in the list based on the type of error encountered

Makes cagent a more reliable platform for users, avoiding much pain and frustration and blocking workflows when an inference provider goes down

Has sane defaults to keep user configs minimal while still getting big advantages

Covers title generation as well

Minimal example

agents:
  root:
    model: anthropic/claude-opus-4-5
    fallback:
      models:
        - openai/gpt-5.2
    description: A reliable assistant with automatic failover
    instruction: You are a helpful and resilient assistant.

github-actions

Model Fallback Implementation Review

✅ No issues found - This is a well-implemented feature with comprehensive testing and proper safeguards.

What Was Reviewed

This PR adds a robust model fallback system with:

Retry logic with exponential backoff for retryable errors (5xx, timeouts)
Immediate fallback switching for non-retryable errors (429, 4xx)
Cooldown mechanism to stick with successful fallbacks
Configurable retry counts and cooldown durations
Comprehensive test coverage (779 lines of tests)

Code Quality Highlights

Strengths:

✅ Thread-safe cooldown state management with proper mutex usage
✅ Proper resource cleanup (stream.Close() via defer)
✅ Comprehensive error classification (retryable vs non-retryable)
✅ Context cancellation handling throughout
✅ Extensive test coverage including edge cases
✅ Well-documented configuration with sensible defaults

Architecture:

Clean separation of concerns (fallback logic in dedicated file)
Proper use of Go idioms (defer for cleanup, mutex for thread safety)
Good error wrapping and logging

Verification Process

I analyzed the implementation for common bug patterns:

✅ Resource leaks - None found (proper defer usage)
✅ Race conditions - None found (local variables + mutex protection)
✅ Bounds checking - Properly validated
✅ Nil pointer dereferences - Properly guarded
✅ Off-by-one errors - Calculations are correct

Conclusion

This feature adds significant value by making cagent more resilient to provider outages. The implementation is solid with proper safeguards and comprehensive testing.

github-actions

Review Summary

I've reviewed the model fallback implementation and found no bugs in the changed code. The implementation demonstrates solid engineering practices:

✅ Proper concurrency control - Mutex protection for cooldown state access
✅ Defensive programming - Nil checks and bounds validation throughout
✅ Error handling - Clear classification of retryable vs non-retryable errors
✅ Context handling - Proper cancellation checks in retry loops
✅ Code organization - Clean separation of concerns between fallback logic and model switching

The feature adds robust failover capabilities with sensible defaults. Good work!

Allows users to define fallback models per agent in the yaml config. If something goes wrong calling a model, retry a few times with exp backoff + jitter or fallback to the next model in the list based on the type of error encountered Signed-off-by: Christopher Petito <chrisjpetito@gmail.com>

Signed-off-by: Christopher Petito <chrisjpetito@gmail.com>

krissetto force-pushed the fallback-models branch from 33fbd12 to 11f196f Compare February 4, 2026 20:09

krissetto changed the title ~~[proposal] Model fallbacks~~ Model fallbacks Feb 4, 2026

krissetto marked this pull request as ready for review February 4, 2026 20:10

krissetto requested a review from a team as a code owner February 4, 2026 20:10

github-actions bot previously approved these changes Feb 4, 2026

View reviewed changes

krissetto marked this pull request as draft February 4, 2026 20:43

krissetto dismissed github-actions[bot]’s stale review via b4399bc February 4, 2026 20:59

krissetto force-pushed the fallback-models branch 2 times, most recently from b4399bc to 4bb5ac8 Compare February 4, 2026 21:29

krissetto marked this pull request as ready for review February 5, 2026 09:47

github-actions bot previously approved these changes Feb 5, 2026

View reviewed changes

krissetto added 2 commits February 5, 2026 15:25

docs, examples, tests n schema for model fallbacks feature

ee4a7e2

Signed-off-by: Christopher Petito <chrisjpetito@gmail.com>

krissetto dismissed github-actions[bot]’s stale review via ee4a7e2 February 5, 2026 14:26

krissetto force-pushed the fallback-models branch from 4bb5ac8 to ee4a7e2 Compare February 5, 2026 14:26

dgageot approved these changes Feb 5, 2026

View reviewed changes

dgageot merged commit 3c6d330 into docker:main Feb 5, 2026
5 checks passed

BrewTestBot mentioned this pull request Feb 7, 2026

cagent 1.20.6 Homebrew/homebrew-core#266303

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Model fallbacks#1589

Model fallbacks#1589
dgageot merged 2 commits intodocker:mainfrom
krissetto:fallback-models

krissetto commented Feb 4, 2026 •

edited

Loading

Uh oh!

github-actions bot left a comment

Uh oh!

github-actions bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

krissetto commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Minimal example

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Model Fallback Implementation Review

What Was Reviewed

Code Quality Highlights

Verification Process

Conclusion

Uh oh!

github-actions bot left a comment

Choose a reason for hiding this comment

Review Summary

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

krissetto commented Feb 4, 2026 •

edited

Loading