How System Prompts control model behavior: Token sequence, Attention weights and practical examples
System Prompts are the invisible hand behind ChatGPT and Claude. This demo shows how different System Prompt styles (short vs. long, vague vs. precise) influence model behavior – from tonality to factual accuracy.
Practical complement to the Attention heatmap. Shows how theory (attention distribution) translates into practice (prompt formulation).
Anthropic publishes Claude's System Prompt; OpenAI keeps GPT-4's secret. Both are core parts of the product. Good System Prompts can dramatically improve quality.
How much does the model attend to different positions (System vs. User)?