Insurance claims processing often suffers from high manual overhead during the initial triage phase, where adjusters must cross-reference unstructured claim reports against dense Product Disclosure Statements (PDS).
The goal: build a lightweight, enterprise-grade, agentic Retrieval-Augmented Generation (RAG) microservice that automatically ingests a claim, retrieves the relevant policy clauses, and outputs a structured adjudication decision (Approve/Deny/Escalate). This prototype demonstrates how to reduce cycle times while maintaining strict explainability guardrails, a testing suite, and experiment tracking.
For detailed architectural overviews, debugging, and dashboard access, please see DEVELOPMENT.md.
- FastAPI Microservice: The core adjudication engine is wrapped in a RESTful API (`POST /adjudicate`), allowing seamless integration with core insurance systems.
- MLflow Governance: Every evaluation run tracks model versions and parameters, and logs the raw LLM prompts and JSON results as artifacts for full auditability and experiment tracking.
- Two-Stage Retrieval System: Dense vector embedding retrieval via Gemini, followed by a re-ranking pass with a fine-tuned cross-encoder (`ms-marco-MiniLM-L-6-v2`) to distill the context (see the first sketch after this list).
- Business Value Dashboards: Evaluation notebooks dynamically generate risk metrics and enforce a Human-in-the-Loop decision threshold.
- Docker Containerization: The application and data persistence layers are orchestrated via `docker-compose.yml`, simplifying deployment workflows.
- LLM-as-a-Judge Evaluation: A robust Pytest suite featuring quantitative LLM evaluation tools (`eval_ragas.py`) that strictly assert the agent's logic and citations against the raw claim data.
- CI/CD Automation: A GitHub Actions pipeline enforces code quality and logic preservation across all future PRs.
- Abstract Data Layer: The vector store uses the Strategy Pattern, allowing the local `ChromaDB` logic to be hot-swapped for a Databricks Vector Search implementation in the future (see the second sketch after this list).
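To illustrate the second retrieval stage, here is a minimal sketch of what the re-ranking pass might look like using the `sentence-transformers` `CrossEncoder` class. The function name `rerank`, the `top_k` default, and the full Hugging Face model id (`cross-encoder/ms-marco-MiniLM-L-6-v2`) are assumptions for illustration, not the exact contents of `src/reranker.py`:

```python
from sentence_transformers import CrossEncoder

# Cross-encoder named in the feature list; it scores (query, passage) pairs jointly,
# which is slower than dense retrieval but much more precise.
_model = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")

def rerank(claim_text: str, candidate_clauses: list[str], top_k: int = 3) -> list[str]:
    """Re-order clauses returned by dense search, keeping only the most relevant ones."""
    scores = _model.predict([(claim_text, clause) for clause in candidate_clauses])
    ranked = sorted(zip(candidate_clauses, scores), key=lambda pair: pair[1], reverse=True)
    return [clause for clause, _ in ranked[:top_k]]
```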
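And a hedged sketch of the Strategy Pattern behind the abstract data layer: the method names (`add`, `query`), collection name, and storage path below are illustrative assumptions rather than the actual `src/vector_store.py` interface. A Databricks Vector Search strategy would simply implement the same abstract base class:

```python
from abc import ABC, abstractmethod

import chromadb

class VectorStore(ABC):
    """Strategy interface: the adjudicator depends only on these two methods."""

    @abstractmethod
    def add(self, ids: list[str], texts: list[str], embeddings: list[list[float]]) -> None: ...

    @abstractmethod
    def query(self, embedding: list[float], top_k: int = 10) -> list[str]: ...

class ChromaVectorStore(VectorStore):
    """Local ChromaDB-backed strategy, hot-swappable for a Databricks implementation."""

    def __init__(self, collection: str = "pds_clauses", path: str = ".chroma"):
        self._collection = chromadb.PersistentClient(path=path).get_or_create_collection(collection)

    def add(self, ids, texts, embeddings):
        self._collection.add(ids=ids, documents=texts, embeddings=embeddings)

    def query(self, embedding, top_k=10):
        hits = self._collection.query(query_embeddings=[embedding], n_results=top_k)
        return hits["documents"][0]  # documents for the first (only) query embedding
```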
The codebase is designed for modularity and scalability.
- `requirements.txt`: Dependencies for the API, ML tracking, and testing framework.
- `.env` (user-created): Environment file for storing `GEMINI_API_KEY`.
- `src/api.py`: FastAPI endpoints for health monitoring and claim adjudication.
- `src/adjudicator.py`: The core orchestration script that pulls vectors, executes Gemini with Pydantic structured schemas, and tracks metrics in MLflow.
- `src/reranker.py`: Employs a cross-encoder model to re-rank chunks and improve context precision.
- `src/vector_store.py`: Abstract base class and ChromaDB implementation for local storage of Gemini text embeddings.
- `src/mock_data.py`: Generates synthetic PDS markdown files and claim JSONs.
- `tests/`: Contains `pytest` files for API validation, LLM-as-a-judge quantitative validation, and mocking tools for timeout handling.
- `notebooks/evaluation_dashboard.ipynb`: Analyzes the business value and calculates the 85% Escalate threshold.
- `Dockerfile` / `docker-compose.yml` / `Makefile`: Containerization and orchestration scripts for running the API server.
- `run.py`: The Uvicorn entry point that starts the local API server and initializes dummy data.
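To make the adjudicator's output contract concrete, here is a minimal sketch of what the Pydantic decision schema and the MLflow logging step might look like. The field names (`decision`, `confidence`, `cited_clauses`, `rationale`) and the helper `log_adjudication` are illustrative assumptions, not the actual contents of `src/adjudicator.py`:

```python
from typing import Literal

import mlflow
from pydantic import BaseModel, Field

class AdjudicationDecision(BaseModel):
    """Structured output the LLM must conform to (field names are illustrative)."""
    decision: Literal["Approve", "Deny", "Escalate"]
    confidence: float = Field(ge=0.0, le=1.0)
    cited_clauses: list[str]  # PDS clause identifiers supporting the decision
    rationale: str

def log_adjudication(prompt: str, decision: AdjudicationDecision, model_name: str) -> None:
    """Record the prompt and the structured result as MLflow artifacts for auditability."""
    with mlflow.start_run():
        mlflow.log_param("model", model_name)
        mlflow.log_metric("confidence", decision.confidence)
        mlflow.log_text(prompt, "prompt.txt")                     # raw LLM prompt
        mlflow.log_dict(decision.model_dump(), "decision.json")   # structured JSON result
```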
```bash
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
source .env  # ensure GEMINI_API_KEY is exported
python run.py
```

After running the script, the Uvicorn server will host the API on port 8000.
You can test the endpoint using `curl`:

```bash
curl -X POST http://localhost:8000/adjudicate \
  -H "Content-Type: application/json" \
  -d @data/claim_1.json
```
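The same call can be made from Python. The payload below is a hypothetical claim shape for illustration only; the real fields are whatever `src/mock_data.py` writes to `data/claim_1.json`:

```python
import requests

# Hypothetical claim payload; the actual schema comes from src/mock_data.py.
claim = {
    "claim_id": "CLM-0001",
    "policy_number": "POL-12345",
    "description": "Water damage to kitchen flooring after a burst pipe.",
    "amount": 4200.00,
}

response = requests.post("http://localhost:8000/adjudicate", json=claim, timeout=60)
response.raise_for_status()
print(response.json())  # e.g. {"decision": "Escalate", ...}
```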
To run the application using Docker, ensure that you have provided your `GEMINI_API_KEY` in a local `.env` file. You can then use the provided Makefile commands for simplified orchestration:

```bash
# Build the Docker image
make build

# Start the Docker containers in the background
make up

# View the application logs
make logs

# Stop and remove the Docker containers
make down
```

For the enterprise target-state architecture, production deployment roadmap, and security/compliance guardrails, see docs/FUTURE.md.