
Nano Banana Pro Getting Started — 5 steps for text rendering, A+B compositing, and editing
Five quickest steps to try Nano Banana Pro: text-on-image, A+B multi-image compositing (up to 14 inputs), and local edits. Pro renders multilingual text accurately, keeps appearance consistency, supports 2K/4K output, and embeds SynthID watermarks.
Nano Banana Pro Getting Started — 5 steps for text rendering, A+B compositing, and editing
- Published: 2025/11/23
- Summary: This quickstart shows the fastest path to Nano Banana Pro: making text-on-image, A+B multi-image composites (up to 14 inputs), and local edits (background swap/lighting). Pro renders multilingual in-image text with high fidelity, supports complex composites with up to 5 consistent people, outputs 2K/4K, and embeds SynthID invisible watermarks.
- Try for free now
- Prompt set (copy/paste)
Table of Contents
Step 0: Try for free (login & launch) Step 1: Text-on-image (sign/poster/thumbnail) Step 2: Swap backgrounds (ID/EC) Step 3: A+B multi-image compositing (try-on/face swap) Step 4: Photographic controls (DOF/light/day→night/resolution) Step 5: Fix failures (common errors and repair prompts) Templates to copy-and-run FAQ
Step 0: Try for free (login & launch)
- Sign in with Google
- Go to the generator. Paste prompts or click “Try in generator” on templates to auto-fill.
- Start with standard resolution (1K). After success, retry with 2K/4K.
[Video placeholder] [VIDEO: Sign-in → Open generator → Paste prompt — 20s]
Step 1: Text-on-image (sign/poster/thumbnail)
Pro strength: multilingual text inside images stays legible. Layout instructions from headline to body copy are reliable.
Example 1 | Restaurant opening poster (text left, image right)
- Canvas: 3:4
- Text: "GRAND OPENING" (top-left, bold sans-serif, uppercase)
- Subline: "Noodle House · 123 Main St · Sat 6 PM"
- Subject: steaming bowl of hand-pulled beef noodles; chopsticks lifting noodles
- Layout: text left, food right; 24px margins; clean grid
- Style: modern; warm red accents; high contrast
- Pro settings: text rendering ON; detail high
- Negatives: warped letters, heavy texture, cluttered background

Example 2 | YouTube thumbnail (versus style)
- Canvas: 16:9
- Headline: "M4 vs M2" (center, extra bold, white with outline)
- Subjects: Mac mini M4 (left) vs M2 (right); speedometer icon center
- Composition: rule of thirds; cutouts with soft shadow; high contrast
- Pro settings: text rendering ON; subject cutout ON

Tips (clean text)
- Keep headlines short and bold; specify center/left align.
- Limit to 3 lines; use generous padding.
- For multilingual, localize step by step (e.g., EN → JA → KO) for stability.
Step 2: Swap backgrounds (ID/EC)
Pro strength: local edits keep hair edges and shadows natural.
Example 1 | Gray solid for ID portrait
- Mode: edit
- Task: replace background with seamless light gray for an ID portrait
- Keep: original lighting and skin tone; natural hair edges; no halo
- Crop: shoulders-up
- Pro settings: detail high; local edit ON
- Negatives: posterization, plastic skin, jagged hair
Example 2 | Pure white for EC
- Mode: edit
- Subject: stainless steel water bottle, 30° profile
- Task: pure white background (RGB 255); keep a soft ground shadow
- Pro settings: shadow soft; reflections control ON

Step 3: A+B multi-image compositing (outfit/face)
Pro strength: up to 14 inputs while keeping up to 5 people consistent. Less breakage on outfit swaps, face swaps, and logo overlays.
Example 1 | Outfit try-on (A=subject to keep / B=outfit)
- Image A: subject to keep (face and body)
- Image B: black tuxedo reference
- Compose: apply tuxedo from B to A; align lapels/collar; match sleeve length; seamless color match
- Blend: medium
- Region: torso
- Pro settings: multi-image A+B; detail high
- Negatives: double collars, warped bow tie, neck tone mismatch
Example 2 | Face swap (the “3rd from left” in group)
- Task: swap face from A onto B
- Keep: B’s pose, lighting, background; natural tone match
- Eyes: sharp and aligned; avoid duplicate eyebrows
- Region: the “third from left” person in the group photo
- Pro settings: tone match ON; blend medium

Step 4: Photographic controls (DOF/light/day→night/resolution)
What you can do:
- Depth of field: shallow depth, subject in focus, soft bokeh
- Lighting: softbox key from left, subtle rim light, neutral white balance
- Day→night: convert daylight scene to cinematic night with practicals
- Composition: move camera angle; adjust aspect ratio (1:1/4:5/16:9); keep grid clean
- Resolution: start 1K; then 2K/4K once prompt is stable
Tips to avoid artifacts:
- Keep prompt concise; add shot/lighting cues first, then style.
- If hands/faces warp, lower detail or reduce blend strength and re-run.
- For text+photo mixes, lock layout first (grid/margins), then specify text.

Step 5: Fix failures (common errors and repair prompts)
- Warped text: shorten headline; specify align (center/left); set 2–3 lines max; use “clean background, high contrast”.
- Plastic skin: add “natural skin texture, no plastic sheen, balanced highlights”.
- Jagged edges after edit: raise resolution slightly and add “anti-aliased edges, soft feather”.
- Color shifts: “keep original color palette; no tint shift; neutral WB”.
- Double logos/extra limbs: reduce blend to medium-low; add “no duplicates”; constrain region.
Templates (copy and run)
Text-on-image poster
Task: ecommerce poster
Text: "BLACK FRIDAY -20%" (top-left, bold sans-serif), CTA "Shop Now →" (bottom-left)
Layout: text left, product right; high contrast; grid-based
Locale: en/ja/ko variants
Negative: warped letters, clutterA+B outfit try-on
Image A: subject to keep (face and body)
Image B: black tuxedo reference
Compose: apply tuxedo from B to A; align lapels and collar; match sleeve length; seamless color match
Blend: medium
Negative: double collars, warped bow tieID/EC background swap
Mode: edit
Task: replace background with seamless light gray (ID) / pure white (EC)
Keep: hair edges and natural shadow
Negative: halo, jagged hairInfographic
Task: infographic
Content: [topic text or facts]
Ask: accurate, context-rich layout; bilingual labels; source note area
Negative: messy grid, low contrastFAQ
Q. Where do I set 2K/4K output?
A. First succeed at 1K, then re-run the same prompt at 2K/4K for stability.
Q. A+B composite keeps breaking
A. Clarify roles of Image A/B and adjust blend from medium; add region if needed.
Q. Can I verify provenance of generated images?
A. Pro outputs embed SynthID invisible watermarks; visible watermarking is also possible.
Q. Commercial use?
A. Depends on the product/contract (Ads, Workspace, API). Check each platform’s policy.
Author

Categories
Newsletter
Join the community
Subscribe to our newsletter for the latest news and updates

























































