ATLASAML.T0067
ATLAS index
AML.T0067

LLM Trusted Output Components Manipulation

Adversaries may utilize prompts to a large language model (LLM) which manipulate various components of its response in order to make it appear trustworthy to the user. This helps the adversary continue to operate in the victim's environment and evade detection by the users it interacts with. The LLM may be instructed t

Framework
MITRE ATLAS
Maturity
Demonstrated
Platforms
Generative AI, Agentic AI
Release
2026.05

Overview

Adversaries may utilize prompts to a large language model (LLM) which manipulate various components of its response in order to make it appear trustworthy to the user. This helps the adversary continue to operate in the victim's environment and evade detection by the users it interacts with.

The LLM may be instructed to tailor its language to appear more trustworthy to the user or attempt to manipulate the user to take certain actions. Other response components that could be manipulated include links, recommended follow-up actions, retrieved document metadata, and Citations.

Sources

  1. MITRE ATLAS AML.T0067: LLM Trusted Output Components Manipulation — MITRE