AI Security · Learn

OWASP LLM Top 10 (2025), explained.

The industry standard list of security risks for products built on large language models. Ten categories, updated in 2025, each pointing at a real failure mode that shows up in production systems.

AI Security · LearnAI Penetration Testing
TL;DR

OWASP (the Open Worldwide Application Security Project) is the non-profit behind the most-used security risk lists for software. The LLM Top 10 is their list for products built on large language models, the AI behind chatbots and assistants. The 2025 version sharpened the categories and added new ones for attacks on retrieval systems and attacks that drain resources. Most security teams use it as the first checklist for AI security work.

By Rohit Hatagale, AI Security Lead, SecureLayer7Updated

What is the OWASP LLM Top 10 and why does it exist?

The OWASP LLM Top 10 is the community-maintained list of risks for apps built on large language models. The first list came out in mid-2023 to give security teams shared words for a problem the regular OWASP Top 10 did not cover. The 2025 version is the second major update: it sharpened the categories, added new ones (notably Vector and Embedding Weaknesses), and rewrote the agent-related entries to match what shows up in real products.

The OWASP GenAI Security Project working group maintains it. They publish a detail page for each risk, fixes, and a shared vocabulary that other frameworks (MITRE ATLAS, NIST AI 600-1) point to.

What are the ten risks in the 2025 list?

Each risk links to the OWASP detail page and to the SecureLayer7 deep-dive where one exists.

What changed between the 2023 and 2025 lists?

A few real shifts:

  • Vector and Embedding Weaknesses (LLM08) is new. The 2023 list lumped these into prompt injection. 2025 splits them out, because the defenses and the attackers are different.
  • System Prompt Leakage (LLM07) got its own entry. Pulling out the operator's hidden instructions was split off from prompt injection, because the damage (leaked IP and policy) is its own thing.
  • Excessive Agency (LLM06) was rewritten for real agent products. The 2023 version was about functions the agent should not have. The 2025 version is about an agent that can take more actions than its job calls for.
  • Unbounded Consumption (LLM10) replaces Model Denial of Service. Wider scope: model theft, resource drain, and runaway cost, not just downtime.
  • Training Data Poisoning (LLM04) absorbed model poisoning, since both shape the same risk.

How does SecureLayer7 use the list when scoping an engagement?

As a coverage map, not a tick-box list. Every engagement starts with one question: which of these ten actually apply to your setup? A read-only assistant with no tools rarely needs LLM06 testing. A RAG pipeline that takes user uploads almost always needs LLM01, LLM05, and LLM08.

For each category in scope, we run a payload library against your exact configuration, then hand-build follow-up attacks wherever one half-works. The report gives per-category notes, what we tested, what we found, what we advise, so an auditor can see which risks were covered.

References

  1. [1]OWASP LLM Top 10 (2025)(OWASP)
  2. [2]OWASP GenAI Security Project(OWASP)
  3. [3]MITRE ATLAS(MITRE)
  4. [4]NIST AI 600-1 (Generative AI Profile)(NIST)
Related terms

If your application integrates a language model, this list is the floor for what to think about. Talk to a security expert above to scope an engagement against the categories that apply.

Common questions

OWASP LLM Top 10, asked often

Want a coverage checklist for your application?

Scope an engagement

Map your application to the OWASP LLM Top 10.

We scope AI security engagements against the OWASP LLM Top 10 categories that actually apply to your specific product, run real attacks against each, and deliver a per-category report your security and audit teams can use.

See our AI testing methodology30-min scoping call, fixed-price proposal in 48 hours.