MemMorph Research Reveals Tool Hijacking Vulnerabilities in LLM Agents via Memory Poisoning

A new research paper titled MemMorph: Tool Hijacking in LLM Agents via Memory Poisoning highlights a critical vulnerability in how large language model agents utilize external tools. While previous security research focused primarily on prompt injection and data leakage, MemMorph demonstrates that an agent's memory or dialogue history can be intentionally corrupted. This poisoning causes the agent to select incorrect tools or execute unauthorized actions without direct command interference.
Related tools
Recommended tools for this topic
These picks prioritize high-intent tools relevant to this topic. Some links may include partner or affiliate tracking.
A strong security and edge platform match across CDN, Zero Trust, and app protection.
View CloudflareA strong fit for readers comparing Claude-class models, safety, and long-context workflows.
View AnthropicA high-relevance security pick for identity, secret management, and team access control.
View 1PasswordComparison
| Aspect | Before / Alternative | After / This |
|---|---|---|
| Attack Vector | Direct prompt injection into current input | Long-term memory or dialogue history contamination |
| Impact Scope | Immediate session-based command hijacking | Persistent manipulation of tool selection logic |
| Detection Difficulty | Relatively high via input filtering and sanitization | Low due to the subtle influence of historical context |
| Primary Target | The immediate model response generation | The agent's internal tool-calling decision mechanism |
Action Checklist
- Implement strict validation for tool-calling arguments Ensure parameters align with expected schemas regardless of historical context.
- Sanitize and monitor long-term memory storage Regularly audit dialogue histories for anomalous or repetitive patterns.
- Enforce human-in-the-loop for high-privilege actions Require manual approval for tools that handle sensitive data or infrastructure.
- Apply context-window filtering techniques Use secondary models to summarize or verify the integrity of past interactions.
Source: arXiv
This page summarizes the original source. Check the source for full details.


