Updated: 2026-04-02 | Version 4.14.0
The front door to one of the most ambitious open-source AI architecture projects ever built.
Ask-RuvNet is a prescriptive, RAG-powered knowledge interface for the RuVector ecosystem. It makes 200+ interconnected repositories, 114 Rust crates across 17 capability areas, and years of engineering decisions searchable, explainable, and actionable -- powered by 507 expert-curated knowledge entries (including 5 Golden Path implementation guides), HNSW vector search, GitNexus structural intelligence across 13 repos (652K code symbols, 1.2M dependency edges), and Claude Sonnet 4.6.
New in v4.14.0: The system is now prescriptive, not encyclopedic. When you ask "how do I store vectors?", it leads with the one best path (RVF), copy-paste code, and explicit anti-patterns -- not a balanced menu of options. Backed by structural analysis of module cohesion scores across the entire monorepo.
Production: https://ask-ruvnet.up.railway.app
Ask-RuvNet has no PostgreSQL dependency. The knowledge base lives in a single flat file (kb-master.json) and is compiled into RVF format for production.
kb-master.json (507 entries, source of truth)
|
+-> build-lean-rvf.mjs -----> knowledge.rvf (HNSW-indexed, content embedded)
| + content-sidecar.json.gz
| + .ruvector/knowledge-base/
|
+-> build-quantized-rvf.mjs -> SQ8 browser assets (knowledge-sq8.bin, knowledge-meta.json)
|
+-> export-mcp-kb.mjs ------> MCP server format (kb-data/)
|
+-> git push main -----------> Railway auto-deploys
| Metric | Value |
|---|---|
| Knowledge entries | 507 (expert-curated, avg quality 97/100) |
| Golden Path guides | 5 (prescriptive implementation paths) |
| Categories | 17 (including new implementation-path) |
| Rust crates covered | 114 across 17 capability areas |
| Technologies covered | 200+ |
| GitNexus structural data | 652K symbols, 1.2M edges across 13 repos |
| Embedding model | ONNX Xenova/all-MiniLM-L6-v2 |
| Embedding dimensions | 384 |
| Index type | HNSW (M=16, efConstruction=200, cosine) |
| Query latency | ~0.3ms (HNSW native) |
| Storage format | RVF cognitive container |
| External DB required | None |
kb-master.json (507 gold entries)
|
+---> knowledge.rvf ---> Railway server (read-only, /api/search)
|
+---> SQ8 browser assets ---> Browser client (WASM HNSW, ~5ms search)
|
+---> MCP embedded KB ---> Claude Code MCP tools (kb_search, etc.)
Every consumer reads the same data compiled from the same source.
Every question goes through a multi-stage pipeline before reaching the LLM:
| Stage | Module | What It Does |
|---|---|---|
| 1. Expand | QueryExpander |
Rewrites terse queries into richer search terms |
| 2. Search | HybridSearch |
Combines BM25 keyword matching with semantic HNSW vector search |
| 3. Rank | ReRanker |
Scores results using 5 weighted factors (semantic similarity 40%, intent alignment 20%, source authority 15%, quality score 15%, usefulness 10%) |
| 4a. Gold boost | Stage 4c | Curated entries get 4x multiplicative boost over raw docs |
| 4b. Intent detect | Stage 4d | Detects implementation intent ("how do I...", "build", "create") |
| 4c. Golden Path boost | Stage 4d | Implementation-path entries get 5x boost when intent detected |
| 5. Filter | Relevance floor | Discards anything below 0.30 similarity + anti-noise domain filter |
| 6. Anti-pattern | Stage 6b | Annotates deprecated paths when golden path alternatives exist |
| 7. Compress | ContextCompressor |
Trims context to fit the token window, preserving code blocks |
| 8. Retrieve | MultiHopRetriever |
Follows cross-references for multi-step questions |
Prescriptive Mode: When golden path entries appear in context, the system prompt shifts to prescriptive mode -- lead with code, state anti-patterns explicitly, include "Graduate When" thresholds. See docs/adr/ADR-002-prescriptive-intelligence.md.
Ask-RuvNet automatically detects configured API keys and builds a resilient fallback chain:
| Priority | Provider | Model | Notes |
|---|---|---|---|
| 1 | OpenRouter | claude-sonnet-4.6 | Multi-model gateway, primary |
| 2 | Anthropic | claude-sonnet-4.6 | Direct API |
| 3 | Groq | llama-3.3-70b-versatile | Free tier available |
| 4 | OpenAI | gpt-4o | Paid |
| 5 | DeepSeek | deepseek-chat | Last resort |
Set LLM_PROVIDER to override the primary.
- Chat-first hero with prominent search input above the fold (autofocus, placeholder examples)
- Intent-based quick pills: "Store Vectors", "Coordinate Agents", "Self-Learning AI", "Compare Alternatives"
- Use-case on-ramp cards: "Search Documents", "Coordinate AI Agents", "Shared AI Memory" (organized by what you BUILD, not product names)
- Progressive disclosure: DIKW stack, explore tiles, resources, and latest repos behind "Explore more" toggle
- Mermaid diagram generation in responses
- Source citations with relevance scores and direct links
- Follow-up suggestion pills for deeper exploration
- Light and dark mode with proper theming
- Mobile-responsive layout with fullscreen canvas overlay
Interactive explorer at /ruvector-catalog.html showing all 114 crates across 17 capability areas. 47 detailed catalog entries covering every RuVector crate, with descriptions, use cases, and integration guidance.
Users can install the ruvector-catalog skill to get RuVector technology recommendations directly in their own Claude Code sessions. The skill proactively activates when tasks could benefit from vector search, graph intelligence, self-learning, attention mechanisms, or other RuVector capabilities.
Knowledge is packaged in the RVF (RuVector Format) cognitive container format:
- Crypto file identity and provenance tracking
- Segmented binary with vectors, HNSW index, and metadata
- Forward-compatible (unknown segments preserved)
- Self-contained -- no external database needed at runtime
- Node.js 22+
- At least one LLM API key (Groq free tier works)
git clone https://github.com/stuinfla/Ask-Ruvnet.git
cd Ask-Ruvnet
# Install backend dependencies
npm install
# Install frontend dependencies
cd src/ui && npm install && cd ../..
# Build the knowledge base from kb-master.json (no PostgreSQL needed)
node scripts/build-lean-rvf.mjs
node scripts/build-quantized-rvf.mjs
# Build the frontend
cd src/ui && npm run build && cd ../..
# Create .env with your credentials
cp .env.example .env
# Edit .env: set at least one LLM API key
# Start the server
node src/server/app.jsThe app runs at http://localhost:3000.
The knowledge base source of truth is kb-master.json -- a flat JSON file with 507 expert-curated entries. No PostgreSQL required.
# Summary of KB contents
node scripts/kb-inspect.mjs
# Search the KB
node scripts/kb-inspect.mjs --search "attention mechanisms"
# Category breakdown
node scripts/kb-inspect.mjs --categories
# Add catalog entries
node scripts/ingest-catalog.mjsnode scripts/build-lean-rvf.mjs # kb-master.json -> .ruvector/ + knowledge.rvf + sidecar
node scripts/build-quantized-rvf.mjs # .ruvector/ -> SQ8 browser assets
node scripts/export-mcp-kb.mjs --output kb-data/ # MCP server format
cd src/ui && npm run build && cd ../.. # Frontend with quantized assetsteaching (165), videos (50), agents (48), architecture (38), neural (37), wasm-local-llm (36), general (35), algorithms (25), vector-db (22), security (20), deployment (8), performance (6), implementation-path (5), swarms (5), memory (3), reinforcement-learning (2), sparc (2).
Five prescriptive implementation guides backed by GitNexus structural analysis:
| Entry | What It Prescribes | Anti-Pattern It Prevents |
|---|---|---|
| Vector Storage | Use RVF (RvfDatabase.create()) |
Don't use postgres for new apps (cohesion: 0.21) |
| Agent Coordination | Use Ruflo hierarchical, 3-5 agents | Don't use mesh with <10 agents (cohesion: 0.59) |
| Self-Learning | Use IntelligenceEngine wrapper |
Don't use SONA directly (7 cross-crate deps) |
| MCP Integration | Use rvf-mcp-server or mcp-brain-server | Don't write raw MCP handlers |
| Security | Use AIMDS middleware (2 lines) | Don't build custom validators (cohesion: 0.18) |
| Item | Value |
|---|---|
| Platform | Railway |
| Container | Docker (node:22-bookworm-slim) |
| Auto-deploy | Yes (push to main) |
| URL | https://ask-ruvnet.up.railway.app |
bash scripts/deployment/deploy.sh patch # Bug fix: bump, build, push
bash scripts/deployment/deploy.sh minor # New feature
bash scripts/deployment/deploy.sh major # Breaking change
# Railway auto-deploys from main# Health check
curl https://ask-ruvnet.up.railway.app/health
# KB status
curl https://ask-ruvnet.up.railway.app/api/kb-stats
# LLM providers
curl https://ask-ruvnet.up.railway.app/api/providerscurl https://ask-ruvnet.up.railway.app/health{ "status": "ok", "version": "4.14.0" }Submit a question and receive a grounded answer with source citations.
curl -X POST https://ask-ruvnet.up.railway.app/api/chat \
-H "Content-Type: application/json" \
-d '{"message": "What swarm topologies does ruflo support?"}'| Field | Type | Required | Description |
|---|---|---|---|
message |
string | Yes | The user's question |
Response: JSON object with answer (string) and sources (array).
Generate an architectural visualization using Gemini.
| Field | Type | Required | Description |
|---|---|---|---|
concept |
string | Yes | What to visualize |
style |
string | No | Visual style hint |
resolution |
string | No | "1K" or "2K" (default: "1K") |
Perform special actions: simplify, code, diagram, or visualize.
| Field | Type | Required | Description |
|---|---|---|---|
action |
string | Yes | simplify, code, diagram, or visualize |
content |
string | Yes | The content to act on |
| Endpoint | Description |
|---|---|
GET /api/providers |
Active LLM fallback chain |
GET /api/kb-stats |
Knowledge base status and entry counts |
GET /api/ecosystem-stats |
Aggregated RuVector ecosystem statistics |
GET /api/latest-repos |
Live npm version data for ecosystem packages |
POST /api/learn |
Submit feedback to improve answer quality |
| Variable | Description |
|---|---|
GROQ_API_KEY |
Groq (free tier: 1M tokens/day) |
OPENAI_API_KEY |
OpenAI (gpt-4o) |
ANTHROPIC_API_KEY |
Anthropic (Claude Sonnet 4.6). Also accepts CLAUDE_API_KEY. |
OPENROUTER_API_KEY |
OpenRouter multi-model gateway |
DEEPSEEK_API_KEY |
DeepSeek |
| Variable | Default | Description |
|---|---|---|
LLM_PROVIDER |
auto-detect | Override the primary provider |
GEMINI_API_KEY |
-- | Gemini API key for visualization |
NODE_ENV |
development |
Set to production on Railway |
PORT |
3000 |
Server port |
Getting Started (triggers Prescriptive Mode with Golden Paths)
- "How do I store vectors?" -- leads with RVF code, warns against postgres
- "How do I coordinate AI agents?" -- leads with Ruflo hierarchical setup
- "How do I build a self-learning app?" -- leads with IntelligenceEngine
- "How do I add AI security?" -- leads with AIMDS 2-line middleware
Architecture and Design
- "What swarm topologies does Ruflo support?"
- "How does SONA achieve sub-millisecond adaptation?"
- "Explain the RVF cognitive container format"
Comparison and Evaluation
- "How does RuVector compare to pgvector and Pinecone?"
- "What are the tradeoffs between RuVector-WASM and RuVector-Postgres?"
- "Is RuVector production ready?"
Deep Technical
- "How does the ReasoningBank implement self-learning with LoRA?"
- "Explain EWC++ and how it prevents catastrophic forgetting"
- "What Rust crates does the ecosystem depend on?"
Ruben Cohen (rUv) has built an ecosystem of interlocking AI systems spanning agent orchestration, self-learning vector databases, graph neural networks, cognitive containers, distributed swarm intelligence, and neuromorphic computing.
Across 200+ repositories in three GitHub organizations (ruvnet, openclaw, VibiumDev), the RuVector ecosystem covers:
- Agent orchestration -- Ruflo, ruv-swarm, agentic-flow
- Vector databases -- RuVector (114 Rust crates: PostgreSQL-native, WASM, neuromorphic)
- Self-learning AI -- SONA, AIMDS, ReasoningBank, LoRA adapters
- Cognitive containers -- RVF format (24-segment, self-booting, 5.5KB WASM)
- Graph intelligence -- GNN, min-cut analysis, Louvain community detection
- Distributed consensus -- Byzantine, Raft, CRDT, Gossip protocols
Ask-RuvNet is the knowledge interface for all of it.
Ask-Ruvnet/
+-- kb-master.json # Source of truth (507 entries, flat file)
+-- knowledge.rvf # HNSW-indexed binary KB
+-- content-sidecar.json.gz # Full-text content for RVF entries
+-- docs/
| +-- adr/
| +-- ADR-001-eliminate-postgresql.md
| +-- ADR-002-prescriptive-intelligence.md # GitNexus + Golden Path design
+-- src/
| +-- server/
| | +-- app.js # Express server, RAG pipeline (ADR-002 stages 4d, 6b)
| | +-- RuvPersona.js # LLM system prompt and persona
| +-- core/ # RAG pipeline modules
| | +-- RvfStore.js # Primary KB backend (RVF HNSW search)
| | +-- HybridSearch.js # BM25 + semantic fusion
| | +-- QueryExpander.js # Query expansion
| | +-- ReRanker.js # 5-factor result reranking
| | +-- ContextCompressor.js # Context length management
| | +-- MultiHopRetriever.js # Multi-step query handling
| +-- ui/
| +-- src/ # React source (Vite)
| +-- dist/ # Built frontend
+-- scripts/
| +-- build-lean-rvf.mjs # kb-master.json -> RVF
| +-- build-quantized-rvf.mjs # RVF -> SQ8 browser assets
| +-- export-mcp-kb.mjs # RVF -> MCP format
| +-- kb-inspect.mjs # KB inspection and search
| +-- ingest-catalog.mjs # Add catalog entries
| +-- deployment/
| +-- deploy.sh # Version bump + build + push
+-- bin/
| +-- mcp-server.js # MCP server (embedded-only mode)
+-- kb-data/ # MCP server data (generated)
+-- Dockerfile # Railway Docker build
+-- package.json # Version source of truth
+-- README.md # This file
Chat returns generic answers or no results
Check /api/kb-stats -- it should show "connected": true with a non-zero vector count. If the count is 0, the RVF knowledge base was not loaded. Run node scripts/build-lean-rvf.mjs to rebuild.
Frontend not loading
Run cd src/ui && npm run build and check for errors. The dist/ directory must exist at src/ui/dist/.
All LLM providers failing
Check /api/providers to see which are configured. At least one API key must be set.
Visualization not generating
The /api/visualize endpoint requires a Gemini API key. Check that GEMINI_API_KEY is set.
| Date | Version | Change |
|---|---|---|
| 2026-04-02 | 4.14.0 | Prescriptive Intelligence (ADR-002). 507 entries (5 Golden Path implementation guides), GitNexus integration (13 repos, 652K symbols, 1.2M edges), intent-aware RAG pipeline (implementation intent detection, 5x golden path boost, prescriptive system prompt injection, anti-pattern annotations), chat-first hero design (search input above fold, intent-based cards replacing product cards, progressive disclosure) |
| 2026-03-28 | 4.12.0 | 502 entries, kb-master.json flat-file architecture (no PG dependency), RuVector Catalog (114 crates, 17 capability areas), installable Claude Code skill, 47 catalog entries |
| 2026-03-19 | 4.10.0 | 434 gold entries, auto-curation pipeline, verification suite (36 checks, 7 layers) |
| 2026-03-14 | 4.0.1 | Ground-up v4 redesign, DIKW framework, PaperBanana illustrations, RVF-first pipeline |
| 2026-03-07 | 3.5.0 | NotebookLM studio pipeline, Ruflo rebrand |
| 2026-03-03 | 3.3.0 | RVF Cognitive Container with Transformers.js semantic search |
| 2026-03-02 | 3.2.0 | RVF-first architecture, zero external DB dependency |
| 2026-03-01 | 3.0.0 | Complete visual overhaul: glassmorphism, aurora background, stats bar |
| 2026-02-27 | 2.2.0 | Rich responses with source citations, evolutionary knowledge |
| 2026-02-23 | 2.1.2 | Multi-provider LLM fallback chain |
| 2026-01-14 | 2.0.0 | PostgreSQL KB wired to chat engine |
Contributions are welcome -- especially around knowledge coverage, RAG pipeline improvements, and frontend UX.
- Fork the repository
- Create a feature branch
- Run
cd src/ui && npm run buildand verify no errors before submitting - Open a pull request against
main
Ruben Cohen (rUv)
Maintained by: @stuinfla Production: https://ask-ruvnet.up.railway.app
MIT