Remove manual ShutdownSignalException handling, rely on RabbitMQ auto-recovery. #250

ratkokostov7 · 2025-07-24T12:52:03Z

During AWS RabbitMQ maintenance windows, applications experienced connection recovery failures with errors like:

AlreadyClosedException: connection is already closed
TopologyRecoveryException: Caught an exception while recovering channel
Duplicate consumers created on queues after recovery
Service disruptions during planned maintenance

Root Cause: Manual recovery logic in shutdown callbacks was racing with RabbitMQ's built-in auto-recovery mechanism, causing conflicts when both tried to recreate channels and consumers simultaneously.

How I tested the changes:
Environment: 3-node RabbitMQ cluster with HAProxy load balancer simulating AWS MQ Multi-AZ maintenance scenarios using Docker containers.
Test Scenario: Simulated AZ failures with connection termination using custom maintenance messages (CONNECTION_FORCED - Node was put into maintenance mode) and monitored consumer recovery behavior during planned maintenance windows.
One service using the extension on 2 different ports (8080 and 8081) to simulate Kubernetes pods behavior, allowing us to test concurrent connection recovery scenarios across multiple application instances.

Results:

Before Changes:

Duplicate consumers created on queues
Inconsistent queue state during recovery
Service disruption and downtime during node transitions
AlreadyClosedException and recovery errors in logs
Application instability during maintenance windows

After Changes:

Consumer per queue maintained consistently
Queues remain stable throughout maintenance
Clean automatic recovery without errors
Seamless maintenance windows with no application impact

…-recovery.

Remove manual ShutdownSignalException handling, rely on RabbitMQ auto…

c3a219e

…-recovery.

gasper-vrhovsek approved these changes Jul 24, 2025

View reviewed changes

ctomc approved these changes Jul 24, 2025

View reviewed changes

ctomc merged commit a2fad06 into iris-events:main Jul 24, 2025
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Remove manual ShutdownSignalException handling, rely on RabbitMQ auto-recovery. #250

Remove manual ShutdownSignalException handling, rely on RabbitMQ auto-recovery. #250

Uh oh!

ratkokostov7 commented Jul 24, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Remove manual ShutdownSignalException handling, rely on RabbitMQ auto-recovery. #250

Remove manual ShutdownSignalException handling, rely on RabbitMQ auto-recovery. #250

Uh oh!

Conversation

ratkokostov7 commented Jul 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ratkokostov7 commented Jul 24, 2025 •

edited

Loading