[knowledge-rag] - Drop docs, search instantly from Claude Code — 12 MCP tools, 20 format parsers, hybrid search + reranking. Zero servers, zero API keys, 100% local.
Updated May 26, 2026 - Python
SQL-like query language and CLI for Qdrant vector search engine
The highest-scoring AI memory system ever benchmarked that doesn't rely on LLM reranking. It's also free and uses fewer tokens.
Chat with Lex! A RAG app using HyDE, with Milvus as the vector store, vLLM for LLM inference, and FastEmbed for embeddings.
FastAPI backend for archive.ire.org
A Polars plugin for embedding DataFrames
A fast and simple local API for generating text embeddings.
This project showcases how to use AWS Lambda Managed Instances with AI/ML capabilities for real-time customer analytics
🧠 Universal long-term memory for AI agents. GraphRAG-powered knowledge base with vector search + graph traversal. Privacy-first, local-only, MCP-compatible. Connect Claude, Copilot, or any AI assistant.
Hub repo for Claude Code related tools
Hub repo for Claude Code related tools (Japanese distributions)
In-memory vector store with FastEmbed integration for Python applications.
Semantic search and Q&A Telegram bot for Google Drive, backed by Qdrant and an optional OpenAI-compatible LLM.
Local AI memory daemon — persistent memory for Claude Code, Cursor, Windsurf, Cline. Reference implementation of the Open Memory Protocol. Apache 2.0.