Surgical middleware for LLMs. Reduces token waste by 40-60% using Triple-Tier Memory (Anchor + Fact Sheet + Buffer) and Semantic Caching. Built for production-grade branding & cost-recovery.
cost-optimization anthropic anthropic-claude token-reduction llm-ops context-management ai-infrastructure semantic-caching ai-middleware
-
Updated
Apr 26, 2026 - JavaScript