Gemini 3 Pro Image
Provider: Google
OpenRouter ID: google/gemini-3-pro-image-preview
Capabilities
- Text rendering: Industry-leading — best in class for long text, multilingual, detailed layout
- Image input: Yes — multimodal reasoning with identity preservation for up to 5 subjects. Ideal for passing logo + product shot and composing a complete ad in one request.
- Sizes: 2K / 4K, flexible aspect ratios
- Context window: 65K tokens
Pricing
| Component | Cost |
|---|---|
| Input | $2 / M tokens |
| Output | $12 / M tokens |
Best For
- Final production ads requiring precise text and logo integration
- Consistent brand identity preservation across multi-subject compositions
- Long-form text in image (product descriptions, specs, multilingual copy)
- Passing Clarity Diamonds logo + product shot in a single request
Notes
- Most reliable model for logo placement accuracy
- Can maintain identity of up to 5 distinct subjects (logo, product, person, etc.)
See Also
- Gemini 3.1 Flash Image — faster, cheaper iteration version
- Gemini 2.5 Flash Image — budget option
- AI Image Model Capabilities — OpenRouter — full comparison