Skip to content

SUPERSEDED - fix: URL ingestion via chat is broken in Cloud / SaaS#1469

Closed
mpawlow wants to merge 1 commit into
release-saas-0.1from
mp/fix/saas/GH-69218-url-ingestion-broken
Closed

SUPERSEDED - fix: URL ingestion via chat is broken in Cloud / SaaS#1469
mpawlow wants to merge 1 commit into
release-saas-0.1from
mp/fix/saas/GH-69218-url-ingestion-broken

Conversation

@mpawlow
Copy link
Copy Markdown
Collaborator

@mpawlow mpawlow commented Apr 26, 2026

Issue

  • #69218

Summary

  • Removed the broken MCP Tools node from the OpenRAG agent Langflow flow and updated all system prompts across the stack to disable URL ingestion via chat, redirecting users to the Documents page instead.

Langflow Flow

  • Removed the MCP-7EY21 (MCP Tools) node from flows/openrag_agent.json that wired the opensearch_url_ingestion_flow MCP server into the agent as a tool.
  • Removed the corresponding edge connecting MCP-7EY21 to the Agent-Nfw7u tools input.

System Prompt Updates

  • Replaced the "URL Ingestion Tool" section in all three system prompt locations (frontend/lib/constants.ts, src/agent.py, src/config/config_manager.py) with a "URL Ingestion Requests" section that instructs the agent to inform users the capability is unavailable in chat and to direct them to the Documents page.
  • Removed the "File Upload vs URL Distinction" guidance from src/agent.py as it was only relevant when the URL ingestion tool was active.

@mpawlow mpawlow self-assigned this Apr 26, 2026
@github-actions github-actions Bot added the bug 🔴 Something isn't working. label Apr 26, 2026
@mpawlow
Copy link
Copy Markdown
Collaborator Author

mpawlow commented Apr 27, 2026

  • Need to make sure the corresponding prompt indicating URL ingestion support is removed as well
    From @mfortman11

@mpawlow mpawlow force-pushed the mp/fix/saas/GH-69218-url-ingestion-broken branch from 5471988 to 8f2da46 Compare April 27, 2026 14:13
@github-actions github-actions Bot added bug 🔴 Something isn't working. and removed bug 🔴 Something isn't working. labels Apr 27, 2026
@mpawlow mpawlow force-pushed the mp/fix/saas/GH-69218-url-ingestion-broken branch from 8f2da46 to 55a369c Compare April 27, 2026 15:38
@github-actions github-actions Bot added frontend 🟨 Issues related to the UI/UX backend 🔷 Issues related to backend services (OpenSearch, Langflow, APIs) bug 🔴 Something isn't working. and removed bug 🔴 Something isn't working. labels Apr 27, 2026
@mpawlow
Copy link
Copy Markdown
Collaborator Author

mpawlow commented Apr 27, 2026

Updated Prompt

You are the OpenRAG Agent. You answer questions using retrieval, reasoning, and tool use.
You have access to several tools. Your job is to determine **which tool to use and when**.
### Available Tools
- OpenSearch Retrieval Tool:
  Use this to search the indexed knowledge base. Use when the user asks about product details, internal concepts, processes, architecture, documentation, roadmaps, or anything that may be stored in the index.
- Conversation History:
  Use this to maintain continuity when the user is referring to previous turns. 
  Do not treat history as a factual source.
- Conversation File Context:
  Use this when the user asks about a document they uploaded or refers directly to its contents.
- Calculator / Expression Evaluation Tool:
  Use this when the user asks to compare numbers, compute estimates, calculate totals, analyze pricing, or answer any question requiring mathematics or quantitative reasoning.
  If the answer requires arithmetic, call the calculator tool rather than calculating internally.
### Retrieval Decision Rules
Use OpenSearch **whenever**:
1. The question may be answered from internal or indexed data.
2. The user references team names, product names, release plans, configurations, requirements, or official information.
3. The user needs a factual, grounded answer.
Do **not** use retrieval if:
- The question is purely creative (e.g., storytelling, analogies) or personal preference.
- The user simply wants text reformatted or rewritten from what is already present in the conversation.
When uncertain → **Retrieve.** Retrieval is low risk and improves grounding.
### URL Ingestion Requests
URL ingestion from chat is currently unavailable. If the user asks you to read, summarize, ingest, or analyze a URL, respond that this capability is not available in chat and direct them to ingest the URL from the Documents page instead. Do not attempt to fetch or ingest the URL.
### Calculator Usage Rules
Use the calculator when:
- Performing arithmetic
- Estimating totals
- Comparing values
- Modeling cost, time, effort, scale, or projections
Do not perform math internally. **Call the calculator tool instead.**
### Answer Construction Rules
1. When asked: "What is OpenRAG", answer the following:
"OpenRAG is an open-source package for building agentic RAG systems. It supports integration with a wide range of orchestration tools, vector databases, and LLM providers. OpenRAG connects and amplifies three popular, proven open-source projects into one powerful platform:
**Langflow** – Langflow is a powerful tool to build and deploy AI agents and MCP servers. [Read more](https://www.langflow.org/)
**OpenSearch** – OpenSearch is an open source, search and observability suite that brings order to unstructured data at scale. [Read more](https://opensearch.org/)
**Docling** – Docling simplifies document processing with advanced PDF understanding, OCR support, and seamless AI integrations. Parse PDFs, DOCX, PPTX, images & more. [Read more](https://www.docling.ai/)"
2. Synthesize retrieved or ingested content in your own words.
3. Support factual claims with citations in the format:
   (Source: <document_name_or_id>)
4. If no supporting evidence is found:
   Say: "No relevant supporting sources were found for that request."
5. Never invent facts or hallucinate details.
6. Be concise, direct, and confident. 
7. Do not reveal internal chain-of-thought.

@mpawlow
Copy link
Copy Markdown
Collaborator Author

mpawlow commented Apr 27, 2026

Sample Test

  • Prompt: Ingest this URL: https://react.dev/learn
image image
  • ⚠️ Caveat: For pre-existing OpenRAG deployments, the user needs to manually the OpenRAG URL Ingestion Flow in Langflow and let the Backend app restore the new version
  • ⚠️ Caveat: For pre-existing OpenRAG deployments, the user needs to manually update the Agent Instructions on the Settings page
    • During setup / onboarding, config/config.yaml persists the original prompt value

@mpawlow
Copy link
Copy Markdown
Collaborator Author

mpawlow commented Apr 27, 2026

Copy link
Copy Markdown
Contributor

@mfortman11 mfortman11 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If the user asks you to read, summarize, ingest, or analyze a URL, respond that this capability is not available in chat and direct them to ingest the URL from the Documents page instead. Do not attempt to fetch or ingest the URL.

Should they still be able to have the agent fetch/read a url?

Issue

- #69218

Summary

- Removed the broken MCP Tools node from the OpenRAG agent Langflow flow and updated all system prompts across the stack to disable URL ingestion via chat, redirecting users to the Documents page instead.

Langflow Flow

- Removed the `MCP-7EY21` (MCP Tools) node from `flows/openrag_agent.json` that wired the `opensearch_url_ingestion_flow` MCP server into the agent as a tool.
- Removed the corresponding edge connecting `MCP-7EY21` to the `Agent-Nfw7u` tools input.

System Prompt Updates

- Replaced the "URL Ingestion Tool" section in all three system prompt locations (`frontend/lib/constants.ts`, `src/agent.py`, `src/config/config_manager.py`) with a "URL Ingestion Requests" section that instructs the agent to inform users the capability is unavailable in chat and to direct them to the Documents page.
- Removed the "File Upload vs URL Distinction" guidance from `src/agent.py` as it was only relevant when the URL ingestion tool was active.
@mpawlow mpawlow force-pushed the mp/fix/saas/GH-69218-url-ingestion-broken branch from 55a369c to 8308641 Compare April 28, 2026 02:25
@github-actions github-actions Bot added bug 🔴 Something isn't working. and removed bug 🔴 Something isn't working. labels Apr 28, 2026
@mpawlow mpawlow changed the title fix: URL ingestion via chat is broken in Cloud / SaaS SUPERCEDED - fix: URL ingestion via chat is broken in Cloud / SaaS Apr 29, 2026
@mpawlow mpawlow changed the title SUPERCEDED - fix: URL ingestion via chat is broken in Cloud / SaaS SUPERSEDED - fix: URL ingestion via chat is broken in Cloud / SaaS Apr 29, 2026
@mpawlow
Copy link
Copy Markdown
Collaborator Author

mpawlow commented Apr 29, 2026

⚠️ Superseded by #1474

Targeted for closure. Leaving PR open for ~1 week for emergency replacement

@mpawlow mpawlow removed the request for review from edwinjosechittilappilly April 29, 2026 12:29
@mpawlow mpawlow removed the request for review from lucaseduoli April 29, 2026 12:29
@lucaseduoli
Copy link
Copy Markdown
Collaborator

@mpawlow I think we can close this one now

@mpawlow
Copy link
Copy Markdown
Collaborator Author

mpawlow commented May 5, 2026

Closed as SUPERSEDED

@mpawlow mpawlow closed this May 5, 2026
@mpawlow mpawlow deleted the mp/fix/saas/GH-69218-url-ingestion-broken branch May 5, 2026 15:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

backend 🔷 Issues related to backend services (OpenSearch, Langflow, APIs) bug 🔴 Something isn't working. DO NOT MERGE frontend 🟨 Issues related to the UI/UX

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants