ガスライトマルウェア、38の誤ったエラーメッセージでAI分析を毒殺

A 3.5 KB payload of 38 fabricated system messages is the most interesting part of a new macOS backdoor called Gaslight. SentinelOne attributes the Rust binary to a North Korean-linked threat actor, but the info-stealing and backdoor capabilities are standard. The attack on LLM-assisted analysis is not.

How Gaslight Attacks LLM Analysis, Not Sandboxes

Most malware tries to evade sandbox execution by detecting virtual environments or sleeping. Gaslight does something different. It embeds a cascade of fake system messages designed to make an LLM-powered triage agent doubt its own session. SentinelOne calls it an attack on the agent's perception, not the sandbox.

The fake messages use Markdown formatting and template placeholders to mimic legitimate developer logs, crash reports, and debugging output. Examples include fabricated memory dumps, token expiration warnings, Redis connection failures, build-pipeline errors, and SQL injection alerts. The goal is to push an LLM agent into aborting, truncating, or refusing to continue analysis.

The Fake Error Catalog

SentinelOne provides specific examples of the embedded strings. One claims "Token expiration handling" with a dump placeholder. Another says "Crash: Worker node OOM" with a memory dump template. There are logs about excessive logging filling disk space, static analysis flags for SQL injection, and JSON parsing errors. None of these relate to the malware's actual behavior.

These are essentially prompt injection strings baked into the binary. The scaffold contains fake warnings about injection vulnerabilities and static-analysis flags. The attacker's aim is to exploit the way LLM agents evaluate context: if they see repeated errors and warnings, they may decide the session is broken or the data is corrupted.

What This Means for AI-Powered Security Tools

SentinelOne did not demonstrate this technique successfully bypassing their own or any other AI analysis platform. But the existence of this malware proves threat actors are actively experimenting with anti-analysis methods tailored to LLM-assisted pipelines. The technique is novel: it exploits the trust an LLM agent has in its own context, not the runtime environment.

Security teams relying on AI-driven malware triage need to test how their tools handle deliberately misleading error messages. A 3.5 KB payload of fake crash logs is cheap to generate and could cause expensive false negatives in automated incident response.

Source: New macOS malware embeds fake errors to confuse AI analysis tools
Domain: bleepingcomputer.com

ガスライトマルウェア、38の誤ったエラーメッセージでAI分析を毒殺

How Gaslight Attacks LLM Analysis, Not Sandboxes

The Fake Error Catalog

What This Means for AI-Powered Security Tools

More in Cybersecurity