Feature/video gen by cryptopoly · Pull Request #20 · cryptopoly/ChaosEngineAI

cryptopoly · 2026-05-01T06:30:25Z

No description provided.

The warm-pool eviction policy was a fixed count (MAX_WARM_MODELS = 2), so loading three 17-18 GB models in succession on a 64 GB Mac kept all three resident: 17 + 17 + 18 + 21 GB llama-server = ~73 GB, swap, OS crash.

Replace the count cap with a memory-aware policy:

- _model_resident_bytes(info): on-disk weight size as a proxy for RAM, works for both mlx-lm (mmap) and llama.cpp (full-load).
- _memory_budget_bytes(): live psutil snapshot of available RAM minus WARM_POOL_MEMORY_HEADROOM_BYTES (6 GB, mirrors spareHeadroomGb).
- _evict_warm_pool(incoming_bytes=N): apply the count cap first (defensive, handles psutil-unavailable hosts), then evict oldest until pool + incoming fits under the budget.
- _park_active_engine_or_unload now passes the parked model's resident bytes so eviction targets the right footprint.

Chat picker also defaulted to a catalog-only model (nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF) on a fresh thread, then 500'd on Load with 'isn't downloaded on this machine'. Two fixes:

- src/App.tsx threadModelOptions drops the catalog branch entirely. Discover tab is the place to pull a new model; the picker shouldn't surface entries that don't exist locally.
- backend_service/state.py _default_session_model now prefers the first available text library entry over _default_chat_variant() when no model is loaded. Falls back to the catalog default only if the library is empty (true first-launch case).
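The memory-aware warm-pool eviction described above can be sketched roughly as follows. This is an illustrative sketch, not the code from this PR: the function names, the OrderedDict pool (model name mapped to resident bytes, oldest first), and the available_bytes parameter are assumptions made for the example. The real _memory_budget_bytes() reads a live psutil snapshot; here the available-RAM figure is passed in as a fixed number (None standing in for a host where psutil is unavailable) so the logic can be followed in isolation.

```python
from __future__ import annotations

from collections import OrderedDict
from typing import Optional

GB = 1024 ** 3
MAX_WARM_MODELS = 2                        # count cap named in the commit message
WARM_POOL_MEMORY_HEADROOM_BYTES = 6 * GB   # 6 GB headroom named in the commit message


def memory_budget_bytes(available_bytes: Optional[int]) -> Optional[int]:
    """Budget = available RAM minus headroom; None when no reading exists."""
    if available_bytes is None:
        return None
    return max(0, available_bytes - WARM_POOL_MEMORY_HEADROOM_BYTES)


def evict_warm_pool(
    pool: OrderedDict[str, int],
    incoming_bytes: int,
    available_bytes: Optional[int],
) -> list[str]:
    """Evict oldest entries in place and return the evicted model names.

    `pool` maps model name -> resident bytes, ordered oldest first
    (a hypothetical structure chosen for this sketch).
    """
    evicted: list[str] = []

    # Step 1: count cap first, so hosts without a memory reading stay bounded.
    while len(pool) > MAX_WARM_MODELS:
        name, _ = pool.popitem(last=False)
        evicted.append(name)

    # Step 2: memory cap. Skip it when no budget can be computed.
    budget = memory_budget_bytes(available_bytes)
    if budget is None:
        return evicted

    # Evict oldest until pool + incoming fits under the budget.
    while pool and sum(pool.values()) + incoming_bytes > budget:
        name, _ = pool.popitem(last=False)
        evicted.append(name)

    return evicted
```

With two 17 GB models already warm, an 18 GB model incoming, and 46 GB reported as available, the budget in this sketch is 40 GB, so only the oldest 17 GB entry is evicted. A pure count cap of 2 would have kept both warm entries.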

- Convert image/video model catalog views to denser sortable rows with filters
- Replace model actions/status controls with icon buttons and tooltips
- Add shared video component delete handling and filter non-chat models out of chat selectors
- Add collapsed install logs for launch-modal custom runtime strategies
- Calibrate image/video RAM estimates with runtime and device-specific footprint metadata
- Cover catalog payloads, discover sorting, library filtering, and safety estimators with tests

cryptopoly added 4 commits April 30, 2026 00:06

Update media model management

91565e5

Fix TurboQuant install detection and warm pool memory guard

495ac8a

cryptopoly merged commit 8076c40 into main May 1, 2026
1 check failed

cryptopoly mentioned this pull request May 1, 2026

Pin imageDiscoverMemoryEstimate tests to Mac host #24

Merged

2 tasks

cryptopoly merged 4 commits into main from feature/video-gen
