Demo automation, docs cleanup, and embedding provider fix by sunitj · Pull Request #5 · sunitj/Colloquip

sunitj · 2026-02-28T18:27:51Z

Summary

Add Playwright demo automation script and screenshot capture for competition video
Tighten demo to strict 3-minute runtime with phase-specific word limits
Tune energy/observer thresholds, persist memories to DB, and fix session resume
Slim README and align docs/ with wiki content
Add markdown rendering for post content (react-markdown)
Fix hardcoded MockEmbeddingProvider in memory extraction and cross-reference detection to respect EMBEDDING_PROVIDER env var
Fix pre-existing ruff format violation in memory_routes.py

Test plan

uv run ruff check . passes
uv run ruff format --check . passes
uv run pytest tests/ -x -q -m "not slow and not integration" -- 686 passed
CI fully green (lint, test-unit 3.11/3.12, test-integration)

🤖 Generated with Claude Code

New Playwright script (demo-competition.spec.ts) optimized for judging criteria: Impact, Opus 4.6 Use, and Depth & Execution. 7-act story arc: hook with completed consensus, platform tour with agent persona deep-dive, live community+thread creation, real-time deliberation with emergent phase transitions, human intervention with energy spike, institutional memory knowledge graph, and closing contrast with a second completed thread. Accompanying voiceover script (VOICEOVER_SCRIPT.md) with per-act cues, adaptable narrator lines, recording tips, and key phrases mapped to each judging criterion. https://claude.ai/code/session_01QezAFNoTPMs4Q1KaQqCb4M

Cut Act 2 expertise tab and separate members view. Reduced deliberation loop from 8 to 5 iterations. Removed Act 5 continuation loop and cost summary spotlight. Shortened Act 6 graph interaction. Eliminated Act 7 enzyme engineering detour — go straight to closing shot. Compressed all pause durations. Voiceover script trimmed to match tighter timing. https://claude.ai/code/session_01QezAFNoTPMs4Q1KaQqCb4M

Shorter, phase-appropriate agent responses via prompt word limits (EXPLORE=250, DEBATE=300, DEEPEN=250, CONVERGE=150) backed by configurable per-phase max_tokens as a hard API cap. Fix content duplication in parse_agent_response by actually stripping structured sections. Render post content as markdown in the frontend using react-markdown with theme-aware component mappings. Update wiki (Agent-System, Deliberation-Engine) and add wiki TOC to README. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Reduce README from 459 to 203 lines by replacing verbose sections (API table, project structure tree, institutional memory deep-dive, demo seeding, testing categories) with concise summaries linking to the wiki. Rewrite docs/README.md as a spec index with wiki cross-references. Add wiki pointer headers to all 5 spec files. Fix stale content: Next.js -> React 19, 6 agents -> 10, add phase_max_tokens to EngineConfig and engine.yaml examples. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Relax observer thresholds and double phase token limits for faster phase progression and richer agent responses - Propagate full YAML config (energy + observer) through SessionManager instead of loading only phase_max_tokens - Persist extracted memories to DB and detect cross-references after deliberation completion - Add DB fallback in memory list and graph endpoints when in-memory store is empty - Fix seed script to handle paginated posts response format and auto- detect Docker Compose postgres - Render consensus content as markdown in ConsensusReveal - Reconnect WebSocket on session resume when deliberation is running - Update tests for new default config values Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- Add screenshots.spec.ts for automated wiki screenshot capture (requires headed mode for WebGL graph rendering) - Update OBSERVER_SPEC.md and SYSTEM_DESIGN.md with relaxed thresholds (hysteresis 2, window 5, doubled token limits) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Use create_embedding_provider() so memory extraction and cross-reference detection respect the EMBEDDING_PROVIDER env var instead of always using mock embeddings. Also remove unused extract_entities import. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

claude and others added 8 commits February 16, 2026 19:44

Fix ruff format violation in memory_routes.py

f1071e6

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

sunitj merged commit f1b4a1e into main Feb 28, 2026
7 checks passed

sunitj deleted the claude/playwright-demo-automation-XqnMV branch February 28, 2026 18:29

sunitj mentioned this pull request Apr 10, 2026

Add Phase 6: mission directives, per-agent budgets, autoresearch, approval queue, dashboards #6

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Demo automation, docs cleanup, and embedding provider fix#5

Demo automation, docs cleanup, and embedding provider fix#5
sunitj merged 8 commits intomainfrom
claude/playwright-demo-automation-XqnMV

sunitj commented Feb 28, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sunitj commented Feb 28, 2026

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants