Webhook Processing Service

This application processes and sends webhooks to specified endpoints, handling retries with an exponential backoff strategy to manage failures. It is designed for scalability, using a queue-based approach to manage webhook delivery, a retry mechanism to handle temporary failures, and a dead-letter queue to track unrecoverable errors.

Features

Exponential Backoff Retry: Retries failed webhooks with an increasing delay, capped at 1 minute.
Dead-letter Queue: Webhooks that exceed the retry threshold are stored in a dead-letter queue.
Concurrency and Threading: Uses worker threads to process webhooks concurrently.
Failure Threshold: Stops sending to any endpoint after 5 consecutive failures.
Thread-safe Locking: Ensures reliable access to shared resources like queues and failure counts.
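
The failure threshold and the locking around shared counters can be sketched as follows. This is a minimal illustration rather than the project's actual code: the class name `EndpointFailureTracker` and its method names are hypothetical, and only the threshold of 5 consecutive failures comes from the feature list.

```python
import threading
from collections import defaultdict

FAILURE_THRESHOLD = 5  # from the feature list: stop after 5 consecutive failures


class EndpointFailureTracker:
    """Hypothetical helper that counts consecutive failures per endpoint."""

    def __init__(self, threshold: int = FAILURE_THRESHOLD) -> None:
        self._threshold = threshold
        self._counts = defaultdict(int)
        self._lock = threading.Lock()  # guards _counts across worker threads

    def record_failure(self, endpoint: str) -> int:
        """Increment the streak for `endpoint` and return the new count."""
        with self._lock:
            self._counts[endpoint] += 1
            return self._counts[endpoint]

    def record_success(self, endpoint: str) -> None:
        """A success breaks the streak, so the counter starts over."""
        with self._lock:
            self._counts.pop(endpoint, None)

    def is_blocked(self, endpoint: str) -> bool:
        """True once the endpoint has reached the failure threshold."""
        with self._lock:
            return self._counts.get(endpoint, 0) >= self._threshold
```

A worker would call `is_blocked` before sending and route the webhook to the dead-letter queue when it returns `True`.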

Technologies Used

Python: Core language for application logic.
Redis: Used as a message queue for storing and retrying webhook tasks.
MySQL: Persistent database for storing webhook records and tracking their statuses.
Threading: Implements concurrent processing of webhooks with thread safety.

Setup and Installation

Prerequisites

Python 3.12+
Redis: Used as the queue storage.
MySQL: Used as the database for storing webhook records.
Dependencies: pip for installing the required Python packages, and Docker with Docker Compose for the container commands used below.

1. Clone the Repository

git clone https://github.com/thebolarin/webhook.git
cd webhook

2. Create a .env File

Create a .env file in the root directory of the project, using .env.example as a template.

3. Build and Start the Containers

docker-compose up -d --build

4. Database Migrations

To ensure that the database is up-to-date before using the application, follow these steps:

Apply the latest migrations
Run the following command to upgrade the database to the latest migration state:
```
docker-compose exec web alembic upgrade head
```

If you make changes to the models and need to create a new migration, run:

docker-compose exec web alembic revision --autogenerate -m "Your migration message"

Then apply the migration:

docker-compose exec web alembic upgrade head

6. How to Run the Webhook API

Once the application is running, you can trigger the webhook processor by making a POST request to http://127.0.0.1:8000/webhooks/process.

Example of Making a POST Request

You can use curl, Postman, or any HTTP client to make the request. Here's an example using curl:

curl -X POST http://127.0.0.1:8000/webhooks/process \
     -H "Content-Type: application/json"
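
For callers that prefer Python over curl, the same request might look like the sketch below. It uses only the standard library; the function names are hypothetical, and the URL is the local address from the curl example. The shape of the response body is not documented here, so the sketch returns only the status code.

```python
import urllib.request

# Local address taken from the curl example; adjust for other deployments.
PROCESS_URL = "http://127.0.0.1:8000/webhooks/process"


def build_process_request(url: str = PROCESS_URL) -> urllib.request.Request:
    """Build a POST request with a JSON content type and an empty body."""
    return urllib.request.Request(
        url,
        data=b"",  # the curl command sends no payload
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def trigger_processing(url: str = PROCESS_URL, timeout: float = 10.0) -> int:
    """Send the request and return the HTTP status code."""
    with urllib.request.urlopen(build_process_request(url), timeout=timeout) as resp:
        return resp.status
```

Calling `trigger_processing()` requires the containers from step 3 to be running.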

7. Running the Tests

Run the command below to execute the application's integration tests:

docker-compose exec web pytest -v

8. Monitoring

Run the command below to view the logs of the webhook processor:

docker-compose logs -f web

Explanation of the Main Components

WebhookProcessorService
- load_webhooks: Loads webhooks from a file into the Redis queue.
- send_webhook: Attempts to send a webhook, with retries on failure.
- process_retry_queue: Processes webhooks in the retry queue based on their scheduled retry times.
- worker_with_session: Starts a worker thread to process webhooks concurrently.
WebhookService: Manages database and Redis operations, including enqueueing, dequeueing, updating, and tracking webhook state.
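
To illustrate how worker threads can drain a shared queue, here is a minimal sketch. It is not the project's `worker_with_session`: the function `run_workers` is hypothetical, an in-memory `queue.Queue` stands in for the Redis queue, and the `send` callable stands in for the real delivery logic.

```python
import queue
import threading
from typing import Callable, List


def run_workers(
    tasks: "queue.Queue[str]",
    send: Callable[[str], bool],
    num_workers: int = 4,
) -> List[str]:
    """Drain `tasks` with several threads; return the items whose send failed."""
    failed: List[str] = []
    failed_lock = threading.Lock()  # protects `failed` across threads

    def worker() -> None:
        while True:
            try:
                item = tasks.get_nowait()
            except queue.Empty:
                return  # queue drained, let the thread exit
            if not send(item):
                with failed_lock:
                    failed.append(item)

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return failed
```

In the real service, failed items would go to the retry queue instead of a list.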

Design Decisions

Exponential Backoff with Randomized Delay: Exponential backoff is applied to each failed webhook with random jitter. This ensures that retries are spread over time, reducing the risk of overwhelming the endpoint if there’s a temporary outage.
Dead-letter Queue: Allows for tracking of unresolvable failures without blocking the entire queue, improving robustness.
Threading for Concurrency: Using multiple worker threads improves processing speed for high-volume webhook delivery, making the system more scalable.
Redis for Queue Management: Redis serves as the backend for both the main processing queue and the retry queue, enabling fast access and persistence for enqueue and dequeue operations. For the retry queue, a priority queue (using Redis sorted sets) is used, which stores webhooks based on the scheduled retry time. This allows the retry processor to efficiently fetch only the webhooks ready to be retried, optimizing delay handling.
Failure Thresholds: Endpoints are given up to 5 retry attempts before being marked as permanently failed. After reaching the threshold, any further webhooks for that endpoint are redirected to a dead-letter queue, maintaining endpoint stability without continuous retries.
Database for Persistence: MySQL stores the persistent record of webhooks, their status, and retry attempts, ensuring durability across service restarts.
Separation of Concerns: The solution is split into two main classes: WebhookProcessorService for processing logic and WebhookService for queue management. This approach isolates concerns for easier maintenance and testing.
Concurrency: Using multiple worker threads allows parallel processing of webhooks, improving throughput. Each worker uses a separate Redis queue to dequeue webhooks, while a retry thread manages delayed retries, contributing to scalability and speed.
Monitoring: Logs will display processing details, including success, retry, and failure notifications.
Webhook Deduplication and Idempotency: Ensuring unique payloads before sending is beneficial if duplicate events are common, reducing unnecessary retries.
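
The retry scheduling described above (a jittered exponential delay, stored in a sorted set keyed by retry time) can be sketched as follows. Several details are assumptions: the key name `webhooks:retry`, the 1-second base delay, and the "full jitter" variant are illustrative, and the function names are hypothetical. The `client` argument is expected to expose the `zadd`, `zrangebyscore`, and `zrem` calls of redis-py; a small in-memory stand-in is included so the sketch runs without a Redis server.

```python
import random
import time
from typing import List, Optional

RETRY_QUEUE_KEY = "webhooks:retry"  # hypothetical key name
BASE_DELAY = 1.0   # assumed base delay in seconds
MAX_DELAY = 60.0   # cap of 1 minute, as stated in the feature list


def backoff_delay(attempt: int) -> float:
    """Pick a random delay in [0, min(MAX_DELAY, BASE_DELAY * 2**attempt)]."""
    ceiling = min(MAX_DELAY, BASE_DELAY * (2 ** attempt))
    return random.uniform(0, ceiling)


def schedule_retry(client, webhook_id: str, attempt: int,
                   now: Optional[float] = None) -> float:
    """Add the webhook to the sorted set, scored by its next retry time."""
    now = time.time() if now is None else now
    retry_at = now + backoff_delay(attempt)
    client.zadd(RETRY_QUEUE_KEY, {webhook_id: retry_at})
    return retry_at


def pop_due(client, now: Optional[float] = None) -> List[str]:
    """Return and remove the webhooks whose retry time has passed."""
    now = time.time() if now is None else now
    # With redis-py, members come back as bytes unless decode_responses=True.
    due = client.zrangebyscore(RETRY_QUEUE_KEY, "-inf", now)
    if due:
        client.zrem(RETRY_QUEUE_KEY, *due)
    return list(due)


class InMemorySortedSet:
    """Stand-in that implements only the three calls used above."""

    def __init__(self) -> None:
        self._scores = {}

    def zadd(self, name, mapping):
        self._scores.update(mapping)
        return len(mapping)

    def zrangebyscore(self, name, min, max):
        lo, hi = float(min), float(max)
        members = [m for m, s in self._scores.items() if lo <= s <= hi]
        return sorted(members, key=self._scores.get)

    def zrem(self, name, *values):
        return sum(1 for v in values if self._scores.pop(v, None) is not None)
```

Note that the read in `pop_due` and the removal that follows are two separate calls, so with several consumers they would need a lock or a server-side script to avoid handing the same webhook to two workers.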

Security Considerations

Sanitization of Endpoint URLs: URL parsing ensures only valid URLs are processed, reducing the chance of sending requests to unintended destinations.
Thread Safety and Redis Locks: Locks (e.g., failure_counts_lock and retry_queue_lock) ensure that updates to failure counters and retries are thread-safe, avoiding race conditions in a concurrent environment.
Failure Count Tracking: Limits the number of retries for an endpoint, preventing potential abuse or accidental spamming of endpoints.
Error Logging: By logging errors and invalid URLs, we can monitor any potential issues without exposing sensitive payload data in logs.
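
As an illustration of the URL check, a minimal validator built on the standard library might look like this. The function name and the http/https-only policy are assumptions, not taken from the project.

```python
from urllib.parse import urlparse

ALLOWED_SCHEMES = {"http", "https"}  # assumed policy


def is_valid_endpoint(url: str) -> bool:
    """Accept only absolute http(s) URLs that include a host."""
    try:
        parsed = urlparse(url)
        return parsed.scheme in ALLOWED_SCHEMES and bool(parsed.hostname)
    except ValueError:
        # urlparse rejects some malformed inputs, e.g. an unclosed IPv6 bracket.
        return False
```

This only checks that the URL is well formed. It does not stop requests to internal or private addresses, which would need a separate allow-list or address check.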

Trade-offs

Threading over AsyncIO: Threading provides concurrency for I/O-bound work such as HTTP delivery without adding the complexity of an async framework. However, an asynchronous approach might be more efficient under extreme load.
Dead-letter Queue vs Real-time Handling: The dead-letter queue allows tracking of unrecoverable errors but does not automatically retry them unless manually processed. This keeps the system manageable, though it requires periodic monitoring.
Asynchronous Webhook Handler: The webhook processor currently has to be triggered before any webhooks are processed. An asynchronous approach that continuously polls the webhook queue might be more efficient.
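
The continuous-polling idea can be sketched as below. This is only an illustration of the loop: the function `poll_queue` is hypothetical, it runs in a plain thread rather than an async framework, and an in-memory `queue.Queue` stands in for the Redis webhook queue.

```python
import queue
import threading
from typing import Callable


def poll_queue(
    tasks: "queue.Queue[str]",
    handle: Callable[[str], None],
    stop: threading.Event,
    poll_interval: float = 0.1,
) -> None:
    """Process items as they arrive until `stop` is set."""
    while not stop.is_set():
        try:
            item = tasks.get(timeout=poll_interval)
        except queue.Empty:
            continue  # nothing queued yet, check `stop` and poll again
        try:
            handle(item)
        finally:
            tasks.task_done()  # lets callers wait on tasks.join()
```

Started once at application startup, such a loop would pick up new webhooks without a request to the processing endpoint.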
