1 min read 191 words Updated Mar 17, 2026 Created Mar 17, 2026
#google#image-gen#openrouter#tools

Gemini 3 Pro Image

Provider: Google
OpenRouter ID: google/gemini-3-pro-image-preview

Capabilities

  • Text rendering: Industry-leading — best in class for long text, multilingual, detailed layout
  • Image input: Yes — multimodal reasoning with identity preservation for up to 5 subjects. Ideal for passing logo + product shot and composing a complete ad in one request.
  • Sizes: 2K / 4K, flexible aspect ratios
  • Context window: 65K tokens

Pricing

ComponentCost
Input$2 / M tokens
Output$12 / M tokens

Best For

  • Final production ads requiring precise text and logo integration
  • Consistent brand identity preservation across multi-subject compositions
  • Long-form text in image (product descriptions, specs, multilingual copy)
  • Passing Clarity Diamonds logo + product shot in a single request

Notes

  • Most reliable model for logo placement accuracy
  • Can maintain identity of up to 5 distinct subjects (logo, product, person, etc.)

See Also