Nano Banana Prompt Templates: The Gemini Image Generation Patterns That Actually Work


The difference between burning through tokens and getting perfect images? Using the proven patterns Google already tested.

I just discovered Google’s official Nano Banana image generation documentation, and it’s a goldmine of battle-tested prompt templates. Not theory - these are the exact patterns they use in their examples, complete with working code samples across Python, JavaScript, Go, Java, and REST.

Here are the 7 core templates that actually exist in their docs:

1. The Photorealistic Photography Formula

Template:

A photorealistic [shot type] of [subject], [action or expression], set in [environment].
The scene is illuminated by [lighting description], creating a [mood] atmosphere.
Captured with a [camera/lens details], emphasizing [key textures and details].
The image should be in a [aspect ratio] format.

Real working example:

A photorealistic close-up portrait of an elderly Japanese ceramicist with deep, sun-etched wrinkles and a warm, knowing smile. He is carefully inspecting a freshly glazed tea bowl. The setting is his rustic, sun-drenched workshop. The scene is illuminated by soft, golden hour light streaming through a window, highlighting the fine texture of the clay. Captured with an 85mm portrait lens, resulting in a soft, blurred background (bokeh). The overall mood is serene and masterful. Vertical portrait orientation.

This isn’t just descriptive fluff - Google’s docs specifically mention using photography terms like “85mm portrait lens” and “golden hour light” to guide the model toward photorealistic results.

2. The Stylized Illustration Pattern

Template:

A [style] sticker/illustration of a [subject], featuring [key characteristics] and a [color palette].
The design should have [line style] and [shading style].
The background must be [transparent/specific color].

Working example:

A kawaii-style sticker of a happy red panda wearing a tiny bamboo hat. It's munching on a green bamboo leaf. The design features bold, clean outlines, simple cel-shading, and a vibrant color palette. The background must be white.

The key here is being explicit about style elements - “bold outlines,” “cel-shading,” “vibrant color palette” - rather than just saying “make it cute.”

3. The Text Rendering Template (This One’s Huge)

Template:

Create a [image type] for [brand/concept] with the text "[exact text]" in a [font style description].
The design should be [style description], with a [color scheme].

Working example:

Create a modern, minimalist logo for a coffee shop called 'The Daily Grind'. The text should be in a clean, bold, sans-serif font. The color scheme is black and white. Put the logo in a circle. Use a coffee bean in a clever way.

Text rendering is where most image models fail, but Gemini handles it well when you’re specific about font styles and placement.

4. The Product Photography Formula

Template:

A high-resolution, studio-lit product photograph of a [product description] on a [background surface].
The lighting is a [lighting setup] to [lighting purpose].
The camera angle is a [angle type] to showcase [specific feature].
Ultra-realistic, with sharp focus on [key detail]. [Aspect ratio].

Working example:

A high-resolution, studio-lit product photograph of a minimalist ceramic coffee mug in matte black, presented on a polished concrete surface. The lighting is a three-point softbox setup designed to create soft, diffused highlights and eliminate harsh shadows. The camera angle is a slightly elevated 45-degree shot to showcase its clean lines. Ultra-realistic, with sharp focus on the steam rising from the coffee. Square image.

This pattern consistently produces clean, professional product shots perfect for e-commerce.

5. The Minimalist Design Template

Template:

A minimalist composition featuring a single [subject] positioned in the [location] of the frame.
The background is a vast, empty [color] canvas, creating significant negative space.
Soft, subtle lighting. [Aspect ratio].

Working example:

A minimalist composition featuring a single, delicate red maple leaf positioned in the bottom-right of the frame. The background is a vast, empty off-white canvas, creating significant negative space for text. Soft, diffused lighting from the top left. Square image.

Perfect for website backgrounds, presentation slides, or anywhere you need space for text overlay.

6. The Sequential Art Pattern

Template:

Make a [number] panel comic in a [style description]. Put the character in a [type of scene].

Working example:

Make a 3 panel comic in a gritty, noir art style with high-contrast black and white inks. Put the character in a humorous scene.

This one’s surprisingly effective for creating storyboards or comic sequences while maintaining style consistency.

7. The Real-time Information Template

Template:

Make a simple but stylish graphic of [recent event/topic]

Working example:

Make a simple but stylish graphic of last night's Arsenal game in the Champion's League

This works because Gemini can ground its generation with Google Search to get current information.

The Editing Templates (These Are Game-Changers)

Google’s docs include several patterns for editing existing images:

Adding/Removing Elements

Using the provided image of [subject], please [add/remove/modify] [element] to/from the scene.
Ensure the change is [description of how the change should integrate].

Inpainting (Changing Specific Parts)

Using the provided image, change only the [specific element] to [new description].
Keep everything else in the image exactly the same, preserving the original style, lighting, and composition.

Style Transfer

Transform the provided photograph of [subject] into the artistic style of [artist/style].
Preserve the original composition but render it with [description of stylistic elements].

Multi-Image Composition

Create a new image by combining the elements from the provided images.
Take the [element from image 1] and place it with/on the [element from image 2].
The final image should be a [description of the final scene].

The Resolution Reality Check

Here’s what Google’s docs actually specify for token costs:

Gemini 2.5 Flash: 1024x1024 (1290 tokens flat rate) Gemini 3 Pro: 1K/2K/4K options

  • 1K: 1120 tokens
  • 2K: 1120 tokens (same price as 1K - this is your sweet spot)
  • 4K: 2000 tokens

The docs explicitly state: “2K images: 1120 tokens (same price as 1K)” - so 2K is literally free resolution upgrade.

The Implementation Reality

Every template works across all their SDKs. Here’s the Python version for the photography template:

from google import genai
from google.genai import types

client = genai.Client()

response = client.models.generate_content(
    model="gemini-2.5-flash-image",
    contents="A photorealistic close-up portrait of an elderly Japanese ceramicist with deep, sun-etched wrinkles and a warm, knowing smile. He is carefully inspecting a freshly glazed tea bowl. The setting is his rustic, sun-drenched workshop. The scene is illuminated by soft, golden hour light streaming through a window, highlighting the fine texture of the clay. Captured with an 85mm portrait lens, resulting in a soft, blurred background (bokeh). The overall mood is serene and masterful.",
)

for part in response.parts:
    if part.text is not None:
        print(part.text)
    elif part.inline_data is not None:
        image = part.as_image()
        image.save("generated_image.png")

The Bottom Line

These aren’t theoretical patterns - they’re the exact templates Google uses in their official documentation, complete with working code examples and proven results. The key insight: describe scenes like a photographer or designer would, not like you’re keyword stuffing for SEO.

Start with these proven templates, then iterate using Gemini’s conversational editing features. Way cheaper than guessing.