security Priority 4/5 5/28/2026, 11:05:47 AM

MemMorph Research Reveals Tool Hijacking Vulnerabilities in LLM Agents via Memory Poisoning

A new research paper titled MemMorph: Tool Hijacking in LLM Agents via Memory Poisoning highlights a critical vulnerability in how large language model agents utilize external tools. While previous security research focused primarily on prompt injection and data leakage, MemMorph demonstrates that an agent's memory or dialogue history can be intentionally corrupted. This poisoning causes the agent to select incorrect tools or execute unauthorized actions without direct command interference.

Related tools

Comparison

Aspect	Before / Alternative	After / This
Attack Vector	Direct prompt injection into current input	Long-term memory or dialogue history contamination
Impact Scope	Immediate session-based command hijacking	Persistent manipulation of tool selection logic
Detection Difficulty	Relatively high via input filtering and sanitization	Low due to the subtle influence of historical context
Primary Target	The immediate model response generation	The agent's internal tool-calling decision mechanism

Action Checklist

Implement strict validation for tool-calling arguments Ensure parameters align with expected schemas regardless of historical context.
Sanitize and monitor long-term memory storage Regularly audit dialogue histories for anomalous or repetitive patterns.
Enforce human-in-the-loop for high-privilege actions Require manual approval for tools that handle sensitive data or infrastructure.
Apply context-window filtering techniques Use secondary models to summarize or verify the integrity of past interactions.

Source: arXiv

This page summarizes the original source. Check the source for full details.

More English news Open source

MemMorph Research Reveals Tool Hijacking Vulnerabilities in LLM Agents via Memory Poisoning

Recommended tools for this topic

Comparison

Action Checklist

Related