Ask-RuvNet

Updated: 2026-04-02 | Version 4.14.0

The front door to one of the most ambitious open-source AI architecture projects ever built.

Ask-RuvNet is a prescriptive, RAG-powered knowledge interface for the RuVector ecosystem. It makes 200+ interconnected repositories, 114 Rust crates across 17 capability areas, and years of engineering decisions searchable, explainable, and actionable -- powered by 507 expert-curated knowledge entries (including 5 Golden Path implementation guides), HNSW vector search, GitNexus structural intelligence across 13 repos (652K code symbols, 1.2M dependency edges), and Claude Sonnet 4.6.

New in v4.14.0: The system is now prescriptive, not encyclopedic. When you ask "how do I store vectors?", it leads with the one best path (RVF), copy-paste code, and explicit anti-patterns -- not a balanced menu of options. Backed by structural analysis of module cohesion scores across the entire monorepo.

Production: https://ask-ruvnet.up.railway.app

Architecture

Ask-RuvNet has no PostgreSQL dependency. The knowledge base lives in a single flat file (kb-master.json) and is compiled into RVF format for production.

kb-master.json (507 entries, source of truth)
  |
  +-> build-lean-rvf.mjs -----> knowledge.rvf (HNSW-indexed, content embedded)
  |                              + content-sidecar.json.gz
  |                              + .ruvector/knowledge-base/
  |
  +-> build-quantized-rvf.mjs -> SQ8 browser assets (knowledge-sq8.bin, knowledge-meta.json)
  |
  +-> export-mcp-kb.mjs ------> MCP server format (kb-data/)
  |
  +-> git push main -----------> Railway auto-deploys

Key Stats

Metric	Value
Knowledge entries	507 (expert-curated, avg quality 97/100)
Golden Path guides	5 (prescriptive implementation paths)
Categories	17 (including new `implementation-path`)
Rust crates covered	114 across 17 capability areas
Technologies covered	200+
GitNexus structural data	652K symbols, 1.2M edges across 13 repos
Embedding model	ONNX Xenova/all-MiniLM-L6-v2
Embedding dimensions	384
Index type	HNSW (M=16, efConstruction=200, cosine)
Query latency	~0.3ms (HNSW native)
Storage format	RVF cognitive container
External DB required	None

Three Consumers, One Source

kb-master.json (507 gold entries)
       |
       +---> knowledge.rvf         ---> Railway server (read-only, /api/search)
       |
       +---> SQ8 browser assets    ---> Browser client (WASM HNSW, ~5ms search)
       |
       +---> MCP embedded KB       ---> Claude Code MCP tools (kb_search, etc.)

Every consumer reads the same data compiled from the same source.

Features

RAG Pipeline (ADR-002: Prescriptive Intelligence)

Every question goes through a multi-stage pipeline before reaching the LLM:

Stage	Module	What It Does
1. Expand	`QueryExpander`	Rewrites terse queries into richer search terms
2. Search	`HybridSearch`	Combines BM25 keyword matching with semantic HNSW vector search
3. Rank	`ReRanker`	Scores results using 5 weighted factors (semantic similarity 40%, intent alignment 20%, source authority 15%, quality score 15%, usefulness 10%)
4a. Gold boost	Stage 4c	Curated entries get 4x multiplicative boost over raw docs
4b. Intent detect	Stage 4d	Detects implementation intent ("how do I...", "build", "create")
4c. Golden Path boost	Stage 4d	Implementation-path entries get 5x boost when intent detected
5. Filter	Relevance floor	Discards anything below 0.30 similarity + anti-noise domain filter
6. Anti-pattern	Stage 6b	Annotates deprecated paths when golden path alternatives exist
7. Compress	`ContextCompressor`	Trims context to fit the token window, preserving code blocks
8. Retrieve	`MultiHopRetriever`	Follows cross-references for multi-step questions

Prescriptive Mode: When golden path entries appear in context, the system prompt shifts to prescriptive mode -- lead with code, state anti-patterns explicitly, include "Graduate When" thresholds. See docs/adr/ADR-002-prescriptive-intelligence.md.

5-Model LLM Fallback Chain

Ask-RuvNet automatically detects configured API keys and builds a resilient fallback chain:

Priority	Provider	Model	Notes
1	OpenRouter	claude-sonnet-4.6	Multi-model gateway, primary
2	Anthropic	claude-sonnet-4.6	Direct API
3	Groq	llama-3.3-70b-versatile	Free tier available
4	OpenAI	gpt-4o	Paid
5	DeepSeek	deepseek-chat	Last resort

Set LLM_PROVIDER to override the primary.

Interactive UI (v4.14.0: Intent-First Design)

Chat-first hero with prominent search input above the fold (autofocus, placeholder examples)
Intent-based quick pills: "Store Vectors", "Coordinate Agents", "Self-Learning AI", "Compare Alternatives"
Use-case on-ramp cards: "Search Documents", "Coordinate AI Agents", "Shared AI Memory" (organized by what you BUILD, not product names)
Progressive disclosure: DIKW stack, explore tiles, resources, and latest repos behind "Explore more" toggle
Mermaid diagram generation in responses
Source citations with relevance scores and direct links
Follow-up suggestion pills for deeper exploration
Light and dark mode with proper theming
Mobile-responsive layout with fullscreen canvas overlay

RuVector Catalog

Interactive explorer at /ruvector-catalog.html showing all 114 crates across 17 capability areas. 47 detailed catalog entries covering every RuVector crate, with descriptions, use cases, and integration guidance.

Installable Claude Code Skill

Users can install the ruvector-catalog skill to get RuVector technology recommendations directly in their own Claude Code sessions. The skill proactively activates when tasks could benefit from vector search, graph intelligence, self-learning, attention mechanisms, or other RuVector capabilities.

RVF Cognitive Containers

Knowledge is packaged in the RVF (RuVector Format) cognitive container format:

Crypto file identity and provenance tracking
Segmented binary with vectors, HNSW index, and metadata
Forward-compatible (unknown segments preserved)
Self-contained -- no external database needed at runtime

Getting Started

Prerequisites

Node.js 22+
At least one LLM API key (Groq free tier works)

Setup

git clone https://github.com/stuinfla/Ask-Ruvnet.git
cd Ask-Ruvnet

# Install backend dependencies
npm install

# Install frontend dependencies
cd src/ui && npm install && cd ../..

# Build the knowledge base from kb-master.json (no PostgreSQL needed)
node scripts/build-lean-rvf.mjs
node scripts/build-quantized-rvf.mjs

# Build the frontend
cd src/ui && npm run build && cd ../..

# Create .env with your credentials
cp .env.example .env
# Edit .env: set at least one LLM API key

# Start the server
node src/server/app.js

The app runs at http://localhost:3000.

KB Management

The knowledge base source of truth is kb-master.json -- a flat JSON file with 507 expert-curated entries. No PostgreSQL required.

# Summary of KB contents
node scripts/kb-inspect.mjs

# Search the KB
node scripts/kb-inspect.mjs --search "attention mechanisms"

# Category breakdown
node scripts/kb-inspect.mjs --categories

# Add catalog entries
node scripts/ingest-catalog.mjs

Build Commands (Run in Order)

node scripts/build-lean-rvf.mjs       # kb-master.json -> .ruvector/ + knowledge.rvf + sidecar
node scripts/build-quantized-rvf.mjs   # .ruvector/ -> SQ8 browser assets
node scripts/export-mcp-kb.mjs --output kb-data/  # MCP server format
cd src/ui && npm run build && cd ../.. # Frontend with quantized assets

Entry Categories (507 entries, 17 categories)

teaching (165), videos (50), agents (48), architecture (38), neural (37), wasm-local-llm (36), general (35), algorithms (25), vector-db (22), security (20), deployment (8), performance (6), implementation-path (5), swarms (5), memory (3), reinforcement-learning (2), sparc (2).

Golden Path Entries (New in v4.14.0)

Five prescriptive implementation guides backed by GitNexus structural analysis:

Entry	What It Prescribes	Anti-Pattern It Prevents
Vector Storage	Use RVF (`RvfDatabase.create()`)	Don't use postgres for new apps (cohesion: 0.21)
Agent Coordination	Use Ruflo hierarchical, 3-5 agents	Don't use mesh with <10 agents (cohesion: 0.59)
Self-Learning	Use `IntelligenceEngine` wrapper	Don't use SONA directly (7 cross-crate deps)
MCP Integration	Use rvf-mcp-server or mcp-brain-server	Don't write raw MCP handlers
Security	Use AIMDS middleware (2 lines)	Don't build custom validators (cohesion: 0.18)

Deployment

Railway (Production)

Item	Value
Platform	Railway
Container	Docker (node:22-bookworm-slim)
Auto-deploy	Yes (push to `main`)
URL	https://ask-ruvnet.up.railway.app

Deploy

bash scripts/deployment/deploy.sh patch   # Bug fix: bump, build, push
bash scripts/deployment/deploy.sh minor   # New feature
bash scripts/deployment/deploy.sh major   # Breaking change
# Railway auto-deploys from main

Verifying a Deployment

# Health check
curl https://ask-ruvnet.up.railway.app/health

# KB status
curl https://ask-ruvnet.up.railway.app/api/kb-stats

# LLM providers
curl https://ask-ruvnet.up.railway.app/api/providers

API Reference

GET /health

curl https://ask-ruvnet.up.railway.app/health

{ "status": "ok", "version": "4.14.0" }

POST /api/chat

Submit a question and receive a grounded answer with source citations.

curl -X POST https://ask-ruvnet.up.railway.app/api/chat \
  -H "Content-Type: application/json" \
  -d '{"message": "What swarm topologies does ruflo support?"}'

Field	Type	Required	Description
`message`	string	Yes	The user's question

Response: JSON object with answer (string) and sources (array).

POST /api/visualize

Generate an architectural visualization using Gemini.

Field	Type	Required	Description
`concept`	string	Yes	What to visualize
`style`	string	No	Visual style hint
`resolution`	string	No	`"1K"` or `"2K"` (default: `"1K"`)

POST /api/special

Perform special actions: simplify, code, diagram, or visualize.

Field	Type	Required	Description
`action`	string	Yes	`simplify`, `code`, `diagram`, or `visualize`
`content`	string	Yes	The content to act on

Other Endpoints

Endpoint	Description
`GET /api/providers`	Active LLM fallback chain
`GET /api/kb-stats`	Knowledge base status and entry counts
`GET /api/ecosystem-stats`	Aggregated RuVector ecosystem statistics
`GET /api/latest-repos`	Live npm version data for ecosystem packages
`POST /api/learn`	Submit feedback to improve answer quality

Environment Variables

LLM Providers (at least one required)

Variable	Description
`GROQ_API_KEY`	Groq (free tier: 1M tokens/day)
`OPENAI_API_KEY`	OpenAI (gpt-4o)
`ANTHROPIC_API_KEY`	Anthropic (Claude Sonnet 4.6). Also accepts `CLAUDE_API_KEY`.
`OPENROUTER_API_KEY`	OpenRouter multi-model gateway
`DEEPSEEK_API_KEY`	DeepSeek

Optional

Variable	Default	Description
`LLM_PROVIDER`	auto-detect	Override the primary provider
`GEMINI_API_KEY`	--	Gemini API key for visualization
`NODE_ENV`	`development`	Set to `production` on Railway
`PORT`	`3000`	Server port

What You Can Ask

Getting Started (triggers Prescriptive Mode with Golden Paths)

"How do I store vectors?" -- leads with RVF code, warns against postgres
"How do I coordinate AI agents?" -- leads with Ruflo hierarchical setup
"How do I build a self-learning app?" -- leads with IntelligenceEngine
"How do I add AI security?" -- leads with AIMDS 2-line middleware

Architecture and Design

"What swarm topologies does Ruflo support?"
"How does SONA achieve sub-millisecond adaptation?"
"Explain the RVF cognitive container format"

Comparison and Evaluation

"How does RuVector compare to pgvector and Pinecone?"
"What are the tradeoffs between RuVector-WASM and RuVector-Postgres?"
"Is RuVector production ready?"

Deep Technical

"How does the ReasoningBank implement self-learning with LoRA?"
"Explain EWC++ and how it prevents catastrophic forgetting"
"What Rust crates does the ecosystem depend on?"

The Big Picture

Ruben Cohen (rUv) has built an ecosystem of interlocking AI systems spanning agent orchestration, self-learning vector databases, graph neural networks, cognitive containers, distributed swarm intelligence, and neuromorphic computing.

Across 200+ repositories in three GitHub organizations (ruvnet, openclaw, VibiumDev), the RuVector ecosystem covers:

Agent orchestration -- Ruflo, ruv-swarm, agentic-flow
Vector databases -- RuVector (114 Rust crates: PostgreSQL-native, WASM, neuromorphic)
Self-learning AI -- SONA, AIMDS, ReasoningBank, LoRA adapters
Cognitive containers -- RVF format (24-segment, self-booting, 5.5KB WASM)
Graph intelligence -- GNN, min-cut analysis, Louvain community detection
Distributed consensus -- Byzantine, Raft, CRDT, Gossip protocols

Ask-RuvNet is the knowledge interface for all of it.

Project Structure

Ask-Ruvnet/
+-- kb-master.json                       # Source of truth (507 entries, flat file)
+-- knowledge.rvf                        # HNSW-indexed binary KB
+-- content-sidecar.json.gz              # Full-text content for RVF entries
+-- docs/
|   +-- adr/
|       +-- ADR-001-eliminate-postgresql.md
|       +-- ADR-002-prescriptive-intelligence.md  # GitNexus + Golden Path design
+-- src/
|   +-- server/
|   |   +-- app.js                       # Express server, RAG pipeline (ADR-002 stages 4d, 6b)
|   |   +-- RuvPersona.js                # LLM system prompt and persona
|   +-- core/                            # RAG pipeline modules
|   |   +-- RvfStore.js                  # Primary KB backend (RVF HNSW search)
|   |   +-- HybridSearch.js              # BM25 + semantic fusion
|   |   +-- QueryExpander.js             # Query expansion
|   |   +-- ReRanker.js                  # 5-factor result reranking
|   |   +-- ContextCompressor.js         # Context length management
|   |   +-- MultiHopRetriever.js         # Multi-step query handling
|   +-- ui/
|       +-- src/                         # React source (Vite)
|       +-- dist/                        # Built frontend
+-- scripts/
|   +-- build-lean-rvf.mjs              # kb-master.json -> RVF
|   +-- build-quantized-rvf.mjs         # RVF -> SQ8 browser assets
|   +-- export-mcp-kb.mjs              # RVF -> MCP format
|   +-- kb-inspect.mjs                  # KB inspection and search
|   +-- ingest-catalog.mjs             # Add catalog entries
|   +-- deployment/
|       +-- deploy.sh                   # Version bump + build + push
+-- bin/
|   +-- mcp-server.js                   # MCP server (embedded-only mode)
+-- kb-data/                             # MCP server data (generated)
+-- Dockerfile                           # Railway Docker build
+-- package.json                         # Version source of truth
+-- README.md                            # This file

Troubleshooting

Chat returns generic answers or no results Check /api/kb-stats -- it should show "connected": true with a non-zero vector count. If the count is 0, the RVF knowledge base was not loaded. Run node scripts/build-lean-rvf.mjs to rebuild.

Frontend not loading Run cd src/ui && npm run build and check for errors. The dist/ directory must exist at src/ui/dist/.

All LLM providers failing Check /api/providers to see which are configured. At least one API key must be set.

Visualization not generating The /api/visualize endpoint requires a Gemini API key. Check that GEMINI_API_KEY is set.

Deployment History

Date	Version	Change
2026-04-02	4.14.0	Prescriptive Intelligence (ADR-002). 507 entries (5 Golden Path implementation guides), GitNexus integration (13 repos, 652K symbols, 1.2M edges), intent-aware RAG pipeline (implementation intent detection, 5x golden path boost, prescriptive system prompt injection, anti-pattern annotations), chat-first hero design (search input above fold, intent-based cards replacing product cards, progressive disclosure)
2026-03-28	4.12.0	502 entries, kb-master.json flat-file architecture (no PG dependency), RuVector Catalog (114 crates, 17 capability areas), installable Claude Code skill, 47 catalog entries
2026-03-19	4.10.0	434 gold entries, auto-curation pipeline, verification suite (36 checks, 7 layers)
2026-03-14	4.0.1	Ground-up v4 redesign, DIKW framework, PaperBanana illustrations, RVF-first pipeline
2026-03-07	3.5.0	NotebookLM studio pipeline, Ruflo rebrand
2026-03-03	3.3.0	RVF Cognitive Container with Transformers.js semantic search
2026-03-02	3.2.0	RVF-first architecture, zero external DB dependency
2026-03-01	3.0.0	Complete visual overhaul: glassmorphism, aurora background, stats bar
2026-02-27	2.2.0	Rich responses with source citations, evolutionary knowledge
2026-02-23	2.1.2	Multi-provider LLM fallback chain
2026-01-14	2.0.0	PostgreSQL KB wired to chat engine

Contributing

Contributions are welcome -- especially around knowledge coverage, RAG pipeline improvements, and frontend UX.

Fork the repository
Create a feature branch
Run cd src/ui && npm run build and verify no errors before submitting
Open a pull request against main

Author

Ruben Cohen (rUv)

GitHub: @ruvnet
Ecosystem: 200+ repositories across ruvnet, openclaw, VibiumDev

Maintained by: @stuinfla Production: https://ask-ruvnet.up.railway.app

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 368 Commits
.claude		.claude
.github/workflows		.github/workflows
.ruvector		.ruvector
agents		agents
assets		assets
bin		bin
config		config
data_ingestion_ruv_coaching		data_ingestion_ruv_coaching
docs		docs
kb-data		kb-data
packages/nlm-pipeline-mcp		packages/nlm-pipeline-mcp
public		public
scripts		scripts
src		src
tests/storage		tests/storage
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.railwayignore		.railwayignore
.ruvnet-monitor-state.json		.ruvnet-monitor-state.json
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
README.md		README.md
content-sidecar.json.gz		content-sidecar.json.gz
kb-master.json		kb-master.json
knowledge.rvf		knowledge.rvf
nixpacks.toml		nixpacks.toml
package-lock.json		package-lock.json
package.json		package.json

Folders and files

Latest commit

History

Repository files navigation

Ask-RuvNet

Architecture

Key Stats

Three Consumers, One Source

Features

RAG Pipeline (ADR-002: Prescriptive Intelligence)

5-Model LLM Fallback Chain

Interactive UI (v4.14.0: Intent-First Design)

RuVector Catalog

Installable Claude Code Skill

RVF Cognitive Containers

Getting Started

Prerequisites

Setup

KB Management

Build Commands (Run in Order)

Entry Categories (507 entries, 17 categories)

Golden Path Entries (New in v4.14.0)

Deployment

Railway (Production)

Deploy

Verifying a Deployment

API Reference

GET /health

POST /api/chat

POST /api/visualize

POST /api/special

Other Endpoints

Environment Variables

LLM Providers (at least one required)

Optional

What You Can Ask

The Big Picture

Project Structure

Troubleshooting

Deployment History

Contributing

Author

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages