📚

Insights on Claude's Leaked System Prompt

May 8, 2025,

Lecture Notes: Claude's Leaked System Prompt

Introduction

Discussion of Claude's leaked system prompt revealing its internal workings.
Exploration of how Claude knows specific answers like election outcomes, attributed to information embedded in the system prompt.
System prompt is comprehensive: 24,000 tokens, including tool usage, guidelines, and more.

Election Information

Claude's Knowledge of 2024 Election:
- Encoded by Anthropic in the system prompt.
- Line 1073 states Donald Trump won over Kamala Harris.
- Highlights potential bias in system prompts used by AI corporations.

Web Search Guidelines

Core behaviors for web search:
- Avoid unnecessary tool calls.
- Respond normally if uncertain and suggest tools.
- Match tool use to query complexity.
Specific instructions on tool usability and user notification.

Counting Instructions

Explicit instructions for counting words, letters, or characters:
- Think step-by-step before answering.
- Assign numbers explicitly before responding.

Wellbeing and Ethics

Claude's guidelines to care for users' wellbeing:
- Avoid encouraging self-destructive behaviors.
- Avoid creating harmful content.

Handling Preferences

Responses to Preference Questions:
- Engage hypothetically without claiming personal preference.

Web Search Responses

Citation requirements for web search results.
Introduction of NML (Anthropic Markup Language) for tool calling and information passing.

Artifact Usage

Instructions on when to use artifacts:
- For text over 20 lines or specific creative requests.
- Examples of how Claude manages user requests regarding song lyrics and other copyrighted material.

Prompt Engineering and Rationales

Prompt Examples:
- Inclusion of rationale with responses for better understanding.

Mental Math and Tool Use

Guidelines for mathematical problem-solving:
- Use of analysis tool (JavaScript REPL) based on problem complexity.
- Clarification on what constitutes 'mental math'.
Examples of Claude's behavior with different math problems.

Conclusion

System prompt intricacies reveal Claude's functioning and limitations.
Offers insights for GenAI enthusiasts and practitioners.
Encouragement to explore the full prompt for deeper understanding.

Note: Lecture highlights system prompt learning opportunities for those working with large language models.

Full transcript