Protecting AI systems (model, data, prompts, plugins, integrations) against attacks, leaks and misuse — covering both attacks that target AI (prompt injection, model theft) and attacks amplified by AI (deepfakes, phishing).

What is the OWASP LLM Top 10?

OWASP's list of the ten most critical vulnerabilities for LLM applications: prompt injection, insecure output handling, training data poisoning, DoS, supply chain, sensitive information disclosure, insecure plugins, excessive agency, overreliance and model theft.

How do I defend against prompt injection?

Separate the system prompt from user input and treat input as data, not commands. Restrict output and actions, sandbox untrusted content, and add human approval plus guardrails for critical operations.

Which standards should I follow?

NIST AI Risk Management Framework, ISO/IEC 42001 and the OWASP LLM Top 10 are solid starting frameworks, combined with KVKK/GDPR compliance.

English hub

AI Security: Risks, Defenses and Tools

AI systems are both targets and tools of attacks. This guide covers the OWASP LLM Top 10, threat types and — layer by layer — what to do and which tools to use.

Türkçe sürüm: Yapay Zeka Güvenliği

Secure AI lifecycle

Governance

Data

Access

Model

Supply chain

Monitoring

OWASP LLM Top 10

LLM01

Prompt Injection

Hidden instructions in input hijack the model's behavior (direct or indirect via web pages/documents). Separate system prompt from input; treat input as data, not commands.

LLM02

Insecure Output Handling

Passing model output to other systems (browser, DB, shell) unvalidated can cause XSS, SQL injection or RCE. Always validate and escape output.

LLM03

Training Data Poisoning

Malicious content in training/fine-tuning data installs backdoors or bias. Verify sources, track provenance, scan for anomalies.

LLM04

Model Denial of Service

Resource-heavy requests slow, crash or inflate cost. Apply input limits, rate limiting and token budgets.

LLM05

Supply Chain Vulnerabilities

Flaws in third-party models, libraries and datasets. Keep an SBOM, verify signatures, use trusted sources.

LLM06

Sensitive Information Disclosure

The model leaks PII/secrets from training data or context. Mask sensitive input, use enterprise plans, add output filtering/DLP.

LLM07

Insecure Plugin Design

Plugins/tools with weak validation or excessive access. Strictly type and validate inputs, apply least privilege.

LLM08

Excessive Agency

An agent has too much permission or autonomy. Apply least privilege; gate irreversible actions with human approval.

LLM09

Overreliance

Accepting model output without verification despite hallucination risk. Verify with sources (RAG), add human review.

LLM10

Model Theft

Unauthorized access or query-based copying of the model. Access control, encryption, rate limits, monitoring.

What to do: layered defense and tools

1. Governance & Policy

AI usage policy and approved-tools list
Data classification: what can/can't go to AI
KVKK/GDPR and sector compliance

NIST AI RMFISO/IEC 42001OWASP LLM Top 10

2. Data Security & Privacy

Mask PII/secrets, data minimization
Use enterprise plans (no training on your data)
Encrypt data, add DLP

ChatGPT EnterpriseMicrosoft Purview DLPPresidio

3. Access & Identity

Role-based access (RBAC), least privilege
Secure and rotate API keys
MFA; scope agent/plugin permissions

SSO / IdPSecrets ManagerAPI Gateway

4. Application & Model Security

Separate system prompt from user input
Validate/escape output; don't run as command
Guardrails; human approval for critical actions

Guardrails AINeMo GuardrailsLlama GuardRebuff

5. Supply Chain

Keep an SBOM/AIBOM
Verify signatures/integrity
Pin versions, scan dependencies

SBOM (CycloneDX)SigstoreSnyk

6. Monitoring & Response

Prompt/output logging and audit trail
Anomaly and abuse detection
Incident plan and red teaming

SIEMLangfuseGarak (red teaming)

FAQ

What is AI security?: Protecting AI systems (model, data, prompts, plugins, integrations) against attacks, leaks and misuse — covering both attacks that target AI (prompt injection, model theft) and attacks amplified by AI (deepfakes, phishing).
What is the OWASP LLM Top 10?: OWASP's list of the ten most critical vulnerabilities for LLM applications: prompt injection, insecure output handling, training data poisoning, DoS, supply chain, sensitive information disclosure, insecure plugins, excessive agency, overreliance and model theft.
How do I defend against prompt injection?: Separate the system prompt from user input and treat input as data, not commands. Restrict output and actions, sandbox untrusted content, and add human approval plus guardrails for critical operations.
Which standards should I follow?: NIST AI Risk Management Framework, ISO/IEC 42001 and the OWASP LLM Top 10 are solid starting frameworks, combined with KVKK/GDPR compliance.