Should we treat the OWASP LLM Top 10 as a compliance checklist?

No. It is a risk catalogue, not a control framework. Treat each entry as a question to answer for your specific architecture.

Do all ten categories apply to every LLM application?

Rarely. Scope to your architecture. A read-only assistant with no tools usually does not need LLM06 testing.

Is this in addition to the regular OWASP Top 10 or a replacement?

In addition. LLM applications are still web applications and need conventional AppSec testing plus LLM-specific coverage.

How often is the list updated?

Major revisions every 18 to 24 months. Minor updates published continuously by the OWASP GenAI Security Project working group.

Do industry verticals add categories?

Yes. Regulated industries layer their own concerns (PII disclosure for healthcare, model-output advice for finance) on top of the OWASP categories.

OWASP LLM Top 10 (2025): Every Risk Explained

AI Security · LearnAI Penetration Testing Download PDF

TL;DR

OWASP (the Open Worldwide Application Security Project) is the non-profit behind the most-used security risk lists for software. The LLM Top 10 is their list for products built on large language models, the AI behind chatbots and assistants. The 2025 version sharpened the categories and added new ones for attacks on retrieval systems and attacks that drain resources. Most security teams use it as the first checklist for AI security work.

By Rohit Hatagale, AI Security Lead, SecureLayer7Updated June 9, 2026

What is the OWASP LLM Top 10 and why does it exist?

The OWASP LLM Top 10 is the community-maintained list of risks for apps built on large language models. The first list came out in mid-2023 to give security teams shared words for a problem the regular OWASP Top 10 did not cover. The 2025 version is the second major update: it sharpened the categories, added new ones (notably Vector and Embedding Weaknesses), and rewrote the agent-related entries to match what shows up in real products.

The OWASP GenAI Security Project working group maintains it. They publish a detail page for each risk, fixes, and a shared vocabulary that other frameworks (MITRE ATLAS, NIST AI 600-1) point to.

What are the ten risks in the 2025 list?

Each risk links to the OWASP detail page and to the SecureLayer7 deep-dive where one exists.

LLM01: Prompt Injection: untrusted input overrides the operator's instructions. SL7 explainer.
LLM02: Sensitive Information Disclosure: the model reveals secrets it can see or was trained on.
LLM03: Supply Chain: tampered model weights, plugins, or training data.
LLM04: Data and Model Poisoning: bad training data changes how the model behaves.
LLM05: Improper Output Handling: downstream systems trust the model's output without checking it.
LLM06: Excessive Agency: an agent can take more actions than the job needs.
LLM07: System Prompt Leakage: an attacker pulls out the operator's hidden instructions.
LLM08: Vector and Embedding Weaknesses: attacks on the RAG store and the embedding pipeline. SL7 RAG poisoning explainer.
LLM09: Misinformation: the model states false things with confidence, and other systems act on them.
LLM10: Unbounded Consumption: resource-draining and model-theft attacks. SL7 model-extraction explainer.

What changed between the 2023 and 2025 lists?

A few real shifts:

Vector and Embedding Weaknesses (LLM08) is new. The 2023 list lumped these into prompt injection. 2025 splits them out, because the defenses and the attackers are different.
System Prompt Leakage (LLM07) got its own entry. Pulling out the operator's hidden instructions was split off from prompt injection, because the damage (leaked IP and policy) is its own thing.
Excessive Agency (LLM06) was rewritten for real agent products. The 2023 version was about functions the agent should not have. The 2025 version is about an agent that can take more actions than its job calls for.
Unbounded Consumption (LLM10) replaces Model Denial of Service. Wider scope: model theft, resource drain, and runaway cost, not just downtime.
Training Data Poisoning (LLM04) absorbed model poisoning, since both shape the same risk.

How does SecureLayer7 use the list when scoping an engagement?

As a coverage map, not a tick-box list. Every engagement starts with one question: which of these ten actually apply to your setup? A read-only assistant with no tools rarely needs LLM06 testing. A RAG pipeline that takes user uploads almost always needs LLM01, LLM05, and LLM08.

For each category in scope, we run a payload library against your exact configuration, then hand-build follow-up attacks wherever one half-works. The report gives per-category notes, what we tested, what we found, what we advise, so an auditor can see which risks were covered.

References

[1]OWASP LLM Top 10 (2025)(OWASP)
[2]OWASP GenAI Security Project(OWASP)
[3]MITRE ATLAS(MITRE)
[4]NIST AI 600-1 (Generative AI Profile)(NIST)

Related terms

If your application integrates a language model, this list is the floor for what to think about. Talk to a security expert above to scope an engagement against the categories that apply.

OWASP LLM Top 10 (2025), explained.

The industry standard list of security risks for products built on large language models. Ten categories, updated in 2025, each pointing at a real failure mode that shows up in production systems.

What is the OWASP LLM Top 10 and why does it exist?

What are the ten risks in the 2025 list?

What changed between the 2023 and 2025 lists?

How does SecureLayer7 use the list when scoping an engagement?

References

OWASP LLM Top 10, asked often

Map your application to the OWASP LLM Top 10.

OWASP LLM Top 10 (2025), explained.

The industry standard list of security risks for products built on large language models. Ten categories, updated in 2025, each pointing at a real failure mode that shows up in production systems.

What is the OWASP LLM Top 10 and why does it exist?

What are the ten risks in the 2025 list?

What changed between the 2023 and 2025 lists?

How does SecureLayer7 use the list when scoping an engagement?

How does the OWASP LLM Top 10 relate to MITRE ATLAS and NIST AI 600-1?

References

Should we treat the OWASP LLM Top 10 as a compliance checklist?

Do all ten categories apply to every LLM application?

Is this in addition to the regular OWASP Top 10 or a replacement?

How often is the list updated?

Do industry verticals (fintech, healthcare) add categories?

Map your application to the OWASP LLM Top 10.