System Prompt Leakage

Disclosure of hidden instructions, policy text, credentials, tool schemas, or internal logic that can weaken controls or reveal sensitive implementation details. While prompts alone are not security boundaries, leakage can expose internal policies, tool names, business logic, and attack surfaces that enable further compromise.

Examples

1.
OWASP LLM07:2025 System Prompt Leakage was added to the 2025 OWASP Top 10 for LLM Applications.
2.
Multiple researchers extracted Bing Chat's system prompt in February 2023, including its internal codename Sydney and its rules.
3.
GPT Store custom GPTs were widely shown in 2023 and 2024 to leak their system prompts and uploaded files via simple repeat-back attacks.

Related terms

Back to glossary

Examples

Related terms

Loading…

Examples

Related terms