GPT Image 2.
The prompting
guide.
OpenAI's new image model reasons before it generates — planning layout, composition, and constraints. Near-perfect text rendering. Up to 2K resolution. Here's how to get the most out of it.
GPT Image 2 reads natural language. It does not respond to keyword spam.
Words like stunning, hyper-realistic, 8k, masterpiece do nothing — they dilute the prompt. Describe facts the model can draw, not adjectives about quality.
The model's strongest feature — but it still needs explicit instruction:
- Write the exact text you want in quotes or ALL CAPS
- Specify font style, weight, colour, and placement explicitly
- Add verbatim — no extra characters, no substitutions when accuracy is critical
- End with no duplicate text, no extra words
Always tell it what changes AND what stays locked. If you don't provide a preserve list, the model drifts on faces, logos, and text you wanted to keep.
Label inputs by role and reference those labels in the instruction.
Don't solve everything in one prompt. First generation is a baseline. Then refine conversationally with short follow-up messages: warm up the sky, move the logo to the bottom right, make her expression more relaxed.
Restate your invariants every turn. Without it the model quietly redesigns things you wanted to keep.
| Mistake | Fix |
|---|---|
| Stacking adjectives — "stunning", "epic", "8k" | Replace with visual facts — lens, light source, surface texture |
| Vague text instruction — "add a title" | Specify exactly: EXACT TEXT: SUMMER COLLECTION, bold sans-serif, white, centred |
| No constraints at the end | Always close with what must NOT appear |
| Assuming 4K is always better | OpenAI flags above 2K as experimental — generate at high quality, upscale separately |
| Forgetting preserve list on edits | List every element that must stay locked |
Get free access to all our tools
Join our newsletter to unlock this page. No spam, just new tools, automations, and shortcuts when we drop them.