Skip to content

Google

Modes: image · video · audio · Models: 13

Vendor: Google AI for Developers · Vertex AI · Official API docs: Image · Video (Veo) · Music (Lyria)

Google contributes across all three modes: the Veo video family, the Nano Banana (Gemini Image) family, Gemini TTS, and Lyria music.

Models

idNameModeInput
veo-3.1Veo 3.1videot2v
veo-3.1-fastVeo 3.1 Fastvideot2v
veo-3.1-liteVeo 3.1 Litevideot2v
gemini-3-pro-imageNano Banana Proimaget2i
gemini-3.1-flash-imageNano Banana 2imaget2i
gemini-2.5-flash-imageNano Bananaimaget2i
gemini-2.5-flash-ttsGemini 2.5 Flash TTSaudiotts
gemini-2.5-pro-ttsGemini 2.5 Pro TTSaudiotts
lyria-3-clipLyria 3 Clipaudiomusic
lyria-3-proLyria 3 Proaudiomusic

gen-ai models --provider google lists the current set (13 models).

Veo 3.1 (video)

bash
gen-ai generate -m veo-3.1 -p "a drone shot over a snowy ridge at golden hour" \
  --ar 16:9 -r 1080p -d 8 --audio-gen
json
{ "name": "picsart_generate",
  "arguments": { "model": "veo-3.1", "prompt": "a drone shot over a snowy ridge", "duration": 8, "resolution": "1080p", "generateAudio": true } }

Veo clips are chainable with gen-ai extend (+7s per segment). Full params for every Veo / Gemini / Imagen / Lyria model are in Parameters below.

Nano Banana Pro (image)

bash
gen-ai generate -m gemini-3-pro-image -p "a cinematic product render of a smart speaker" --ar 16:9 -r 4K

Gemini TTS & Lyria (audio)

bash
gen-ai generate -m gemini-2.5-pro-tts -p "Here is your daily briefing."   # speech
gen-ai generate -m lyria-3-pro -p "uplifting cinematic orchestral score"  # music

Parameters

Full parameter surface for every model, sourced from gen-ai models info <id> --json. CLI flags show the primary short form; the canonical --kebab-case long form always works too.

gemini-3.1-flash-image — Nano Banana 2

Try gemini-3.1-flash-image in Playground ↗

Input type: t2i

ParamCLI flagTypeValues
prompt-ptextrequired
aspectRatio--arenum1:1 · 16:9 · 9:16 · 3:4 · 4:3 · 3:2 · 2:3 · 4:5 · 5:4 · 4:1 · 1:4 · 8:1 · 1:8 · 21:9 (default 1:1)
resolution-renum0.5K · 1K · 2K · 4K (default 1K)
count-nenum1 · 2 · 4 · 6 · 8 · 10 (default 1)
thinkingLevel--thinkingenumminimal (Minimal (faster)) · high (High (more reasoning)) (default minimal)
imageUrls-ifileimage (up to 14)

veo-3.1 — Veo 3.1

Try veo-3.1 in Playground ↗

Input type: t2v

ParamCLI flagTypeValues
prompt-ptextrequired
aspectRatio--arenum16:9 · 9:16 (default 16:9)
duration-denum4 · 6 · 8 (default 8)
resolution-renum720p · 1080p · 4k (default 720p)
imageUrls-ifileimage (up to 3)
generateAudio--audio-genbooleantrue · false (default true)
negativePrompt--negtextfree text
startFrame--start-framefileimage
endFrame--end-framefileimage

veo-3.1-fast — Veo 3.1 Fast

Try veo-3.1-fast in Playground ↗

Input type: t2v

ParamCLI flagTypeValues
prompt-ptextrequired
aspectRatio--arenum16:9 · 9:16 (default 16:9)
duration-denum4 · 6 · 8 (default 8)
resolution-renum720p · 1080p · 4k (default 720p)
imageUrls-ifileimage (up to 3)
generateAudio--audio-genbooleantrue · false (default true)
negativePrompt--negtextfree text
startFrame--start-framefileimage
endFrame--end-framefileimage

gemini-3-pro-image — Nano Banana Pro

Try gemini-3-pro-image in Playground ↗

Input type: t2i

ParamCLI flagTypeValues
prompt-ptextrequired
aspectRatio--arenum1:1 · 16:9 · 9:16 · 3:4 · 4:3 · 2:3 · 21:9 (default 1:1)
resolution-renum1K · 2K · 4K (default 2K)
count-nenum1 · 2 · 4 · 6 · 8 · 10 (default 1)
thinkingBudget--thinking-budgetinteger12832768, step 128, default 128
imageUrls-ifileimage (up to 14)

gemini-2.5-flash-image — Nano Banana

Try gemini-2.5-flash-image in Playground ↗

Input type: t2i

ParamCLI flagTypeValues
prompt-ptextrequired
aspectRatio--arenum1:1 · 16:9 · 9:16 · 3:4 · 4:3 · 2:3 · 21:9 (default 16:9)
count-nenum1 · 2 · 4 · 6 · 8 · 10 (default 1)
imageUrls-ifileimage (up to 14)

veo-3.1-lite — Veo 3.1 Lite

Try veo-3.1-lite in Playground ↗

Input type: t2v

ParamCLI flagTypeValues
prompt-ptextrequired
aspectRatio--arenum16:9 · 9:16 (default 16:9)
duration-denum4 · 6 · 8 (default 8)
resolution-renum720p · 1080p (default 720p)
startFrame--start-framefileimage

gemini-2.5-flash-tts — Gemini 2.5 Flash TTS

Try gemini-2.5-flash-tts in Playground ↗

Input type: tts

ParamCLI flagTypeValues
language--languagetextfree text
accent--accenttextfree text
prompt-ptextrequired (≤5000 chars)
voiceId--voiceenumAoede · Charon · Fenrir · Kore · Leda · Orus · Puck · Zephyr · Achernar · Achird · Algenib · Algieba · Alnilam · Autonoe · Despina · Enceladus · Erinome · Gacrux · Iapetus · Laomedeia · Pulcherrima · Rasalgethi · Sadachbia · Sadaltager · Schedar · Sulafat · Umbriel · Vindemiatrix · Zubenelgenubi (default Kore)

gemini-2.5-pro-tts — Gemini 2.5 Pro TTS

Try gemini-2.5-pro-tts in Playground ↗

Input type: tts

ParamCLI flagTypeValues
language--languagetextfree text
accent--accenttextfree text
prompt-ptextrequired (≤5000 chars)
voiceId--voiceenumAoede · Charon · Fenrir · Kore · Leda · Orus · Puck · Zephyr · Achernar · Achird · Algenib · Algieba · Alnilam · Autonoe · Despina · Enceladus · Erinome · Gacrux · Iapetus · Laomedeia · Pulcherrima · Rasalgethi · Sadachbia · Sadaltager · Schedar · Sulafat · Umbriel · Vindemiatrix · Zubenelgenubi (default Kore)

imagen-4.0 — Imagen 4.0

Try imagen-4.0 in Playground ↗

Input type: t2i

ParamCLI flagTypeValues
prompt-ptextrequired
aspectRatio--arenum1:1 · 16:9 · 9:16 · 3:4 · 4:3 (default 1:1)
count-nenum1 · 2 · 4 (default 1)
enhancePrompt--enhance-promptbooleantrue · false (default true)
negativePrompt--negtextfree text

imagen-4.0-ultra — Imagen 4.0 Ultra

Try imagen-4.0-ultra in Playground ↗

Input type: t2i

ParamCLI flagTypeValues
prompt-ptextrequired
aspectRatio--arenum1:1 · 16:9 · 9:16 · 3:4 · 4:3 (default 1:1)
count-nenum1 · 2 · 4 (default 1)
enhancePrompt--enhance-promptbooleantrue · false (default true)
negativePrompt--negtextfree text

imagen-4.0-fast — Imagen 4.0 Fast

Try imagen-4.0-fast in Playground ↗

Input type: t2i

ParamCLI flagTypeValues
prompt-ptextrequired
aspectRatio--arenum1:1 · 16:9 · 9:16 · 3:4 · 4:3 (default 1:1)
count-nenum1 · 2 · 4 (default 1)
enhancePrompt--enhance-promptbooleantrue · false (default true)
negativePrompt--negtextfree text

lyria-3-clip — Lyria 3 Clip

Try lyria-3-clip in Playground ↗

Input type: music

ParamCLI flagTypeValues
prompt-ptextrequired
imageUrls-ifileimage (up to 1)

lyria-3-pro — Lyria 3 Pro

Try lyria-3-pro in Playground ↗

Input type: music

ParamCLI flagTypeValues
prompt-ptextrequired
imageUrls-ifileimage (up to 1)

Notes: Veo audio is native (generateAudio); Imagen and Gemini image models differ in resolution and reasoning controls (thinkingLevel / thinkingBudget). TTS voiceId values are Gemini voice presets.

Built on @picsart/ai-sdk · gen-ai CLI · Picsart MCP · Skills