OpenAI

Modes: video · image · audio · Models: 8

Vendor: OpenAI · Official API docs: OpenAI Developer Docs

OpenAI spans three media families on the Playground: Sora 2 text-to-video (sora-2 and sora-2-pro, plus a chainable sora-2-extend workflow), GPT Image text-to-image (gpt-image-2, gpt-image-1.5, gpt-image-1), and text-to-speech (openai-tts-1, openai-tts-1-hd). Sora renders at 720p (up to 1080p on sora-2-pro) with native audio and optional first-frame image conditioning.

Models

| id | Name | Input type |
| --- | --- | --- |
| sora-2 | Sora 2 | t2v |
| sora-2-pro | Sora 2 Pro | t2v |
| sora-2-extend | Sora 2 Extend | v2v |
| gpt-image-2 | GPT Image 2 | t2i |
| gpt-image-1.5 | GPT Image 1.5 | t2i |
| gpt-image-1 | GPT Image 1 | t2i |
| openai-tts-1 | OpenAI TTS-1 | tts |
| openai-tts-1-hd | OpenAI TTS-1 HD | tts |

CLI

```bash
# text-to-video
gen-ai generate -m sora-2 \
  -p "a paper boat drifting down a rain-soaked city gutter, cinematic, slow motion" \
  --ar 16:9 -d 8

# image-anchored video (first-frame conditioning)
gen-ai generate -m sora-2-pro \
  -p "the boat sets sail into open water" \
  -i ./boat.jpg --ar 16:9 -d 12

# text-to-image
gen-ai generate -m gpt-image-2 \
  -p "an isometric cutaway of a cozy bookshop, warm lighting" \
  --ar 16:9 -n 2
```

MCP

```json
{
  "name": "picsart_generate",
  "arguments": {
    "model": "sora-2",
    "prompt": "a paper boat drifting down a rain-soaked city gutter, cinematic, slow motion",
    "aspectRatio": "16:9",
    "duration": 8
  }
}
```

```json
{
  "name": "picsart_generate",
  "arguments": {
    "model": "gpt-image-2",
    "prompt": "an isometric cutaway of a cozy bookshop, warm lighting",
    "aspectRatio": "16:9",
    "count": 2
  }
}
```
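
Both payloads share one shape: a tool name plus an arguments object. A minimal Python sketch of assembling that object follows. `build_generate_call` is a hypothetical helper, not part of the Picsart MCP server, and it only mirrors the fields shown in the examples above.

```python
import json


def build_generate_call(model, prompt, **options):
    """Assemble a picsart_generate tool call shaped like the examples above.

    Hypothetical helper: options set to None are dropped, nothing is validated.
    """
    arguments = {"model": model, "prompt": prompt}
    arguments.update({k: v for k, v in options.items() if v is not None})
    return {"name": "picsart_generate", "arguments": arguments}


call = build_generate_call(
    "gpt-image-2",
    "an isometric cutaway of a cozy bookshop, warm lighting",
    aspectRatio="16:9",
    count=2,
)
print(json.dumps(call, indent=2))
```

Keeping the option names identical to the Param column avoids a separate mapping layer between the tables and the payload.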

Parameters

Full parameter surface for every model, sourced from gen-ai models info <id> --json. CLI flags show the primary short form; the canonical --kebab-case long form always works too.
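
As a rough illustration of the long-form naming, the Python sketch below derives a `--kebab-case` flag from a camelCase parameter name. It assumes the long form is built directly from the Param name, which matches `videoId` and `--video-id` in the tables but is not confirmed for every parameter (the tables list `--format` for `outputFormat`, for example), so treat the output as a guess to check against the CLI. `long_flag` is a hypothetical helper.

```python
import re


def long_flag(param):
    """Convert a camelCase parameter name into a --kebab-case flag (assumed mapping)."""
    # Insert a hyphen before each capital that follows a lowercase letter or digit.
    kebab = re.sub(r"(?<=[a-z0-9])([A-Z])", r"-\1", param).lower()
    return "--" + kebab


for name in ("videoId", "aspectRatio", "imageUrls"):
    print(name, "->", long_flag(name))
```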

sora-2-pro — Sora 2 Pro

Try sora-2-pro in Playground ↗

Input type: t2v

| Param | CLI flag | Type | Values |
| --- | --- | --- | --- |
| prompt | -p | text | required |
| imageUrls | -i | file | image (up to 1) |
| aspectRatio | --ar | enum | 16:9 · 9:16 (default 16:9) |
| resolution | -r | enum | 720p · 1024p · 1080p (default 720p) |
| duration | -d | enum | 4 · 8 · 12 · 16 · 20 (default 4) |

sora-2 — Sora 2

Try sora-2 in Playground ↗

Input type: t2v

| Param | CLI flag | Type | Values |
| --- | --- | --- | --- |
| prompt | -p | text | required |
| imageUrls | -i | file | image (up to 1) |
| aspectRatio | --ar | enum | 16:9 · 9:16 (default 16:9) |
| duration | -d | enum | 4 · 8 · 12 · 16 · 20 (default 4) |

sora-2-extend — Sora 2 Extend

Try sora-2-extend in Playground ↗

Input type: v2v

| Param | CLI flag | Type | Values |
| --- | --- | --- | --- |
| prompt | -p | text | required |
| videoId | --video-id | enum | dynamic value (no fixed list) |
| duration | -d | enum | 4 · 8 · 12 · 16 · 20 (default 8) |
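
Because every call takes a duration from a fixed list, reaching a longer clip means chaining one initial render with one or more extend steps. The Python sketch below splits a target length into such steps. It assumes, without confirmation from this page, that each sora-2-extend call adds its own `duration` to the clip it continues; `plan_chain` is a hypothetical helper.

```python
ALLOWED_SECONDS = (20, 16, 12, 8, 4)  # duration values listed in the tables above


def plan_chain(target_seconds):
    """Split a target length into an initial render followed by extend steps.

    Assumption: each extend call appends its own duration to the existing clip.
    """
    if target_seconds < 4 or target_seconds % 4 != 0:
        raise ValueError("target must be a positive multiple of 4 seconds")
    steps, remaining = [], target_seconds
    while remaining > 0:
        # Greedily take the longest listed duration that still fits.
        step = next(s for s in ALLOWED_SECONDS if s <= remaining)
        steps.append(step)
        remaining -= step
    return steps


print(plan_chain(28))  # [20, 8]
```

The first entry would be the duration of the initial sora-2 or sora-2-pro render and each later entry one sora-2-extend call, passing along the video id of the previous result.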

gpt-image-2 — GPT Image 2

Try gpt-image-2 in Playground ↗

Input type: t2i

| Param | CLI flag | Type | Values |
| --- | --- | --- | --- |
| prompt | -p | text | required |
| aspectRatio | --ar | enum | 1:1 · 3:2 · 2:3 · 16:9 · 9:16 · 4:3 · 3:4 · auto (default 1:1) |
| quality | --quality | enum | high · medium · low (default high) |
| outputFormat | --format | enum | png · jpeg · webp (default png) |
| count | -n | enum | 1 · 2 · 4 · 6 · 8 · 10 (default 1) |
| imageUrls | -i | file | image (up to 5) |

gpt-image-1.5 — GPT Image 1.5

Try gpt-image-1.5 in Playground ↗

Input type: t2i

| Param | CLI flag | Type | Values |
| --- | --- | --- | --- |
| prompt | -p | text | required |
| aspectRatio | --ar | enum | 1:1 · 3:2 · 2:3 · 16:9 · 9:16 · 4:3 · 3:4 (default 1:1) |
| quality | --quality | enum | high · medium · low (default high) |
| background | --background | enum | opaque · transparent (default opaque) |
| outputFormat | --format | enum | png · jpeg · webp (default png) |
| count | -n | enum | 1 · 2 · 4 · 6 · 8 · 10 (default 1) |
| imageUrls | -i | file | image (up to 5) |

gpt-image-1 — GPT Image 1

Try gpt-image-1 in Playground ↗

Input type: t2i

| Param | CLI flag | Type | Values |
| --- | --- | --- | --- |
| prompt | -p | text | required |
| aspectRatio | --ar | enum | 1:1 · 3:2 · 2:3 · 16:9 · 9:16 · 4:3 · 3:4 (default 1:1) |
| quality | --quality | enum | high · medium · low (default high) |
| background | --background | enum | opaque · transparent (default opaque) |
| outputFormat | --format | enum | png · jpeg · webp (default png) |
| count | -n | enum | 1 · 2 · 4 · 6 · 8 · 10 (default 1) |
| imageUrls | -i | file | image (up to 5) |

openai-tts-1 — OpenAI TTS-1

Try openai-tts-1 in Playground ↗

Input type: tts

| Param | CLI flag | Type | Values |
| --- | --- | --- | --- |
| prompt | -p | text | required (≤4096 chars) |
| voiceId | --voice | enum | alloy (Alloy) · ash (Ash) · ballad (Ballad) · coral (Coral) · echo (Echo) · fable (Fable) · nova (Nova) · onyx (Onyx) · sage (Sage) · shimmer (Shimmer) · verse (Verse) (default alloy) |

openai-tts-1-hd — OpenAI TTS-1 HD

Try openai-tts-1-hd in Playground ↗

Input type: tts

| Param | CLI flag | Type | Values |
| --- | --- | --- | --- |
| prompt | -p | text | required (≤4096 chars) |
| voiceId | --voice | enum | alloy (Alloy) · ash (Ash) · ballad (Ballad) · coral (Coral) · echo (Echo) · fable (Fable) · nova (Nova) · onyx (Onyx) · sage (Sage) · shimmer (Shimmer) · verse (Verse) (default alloy) |

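Notes: sora-2-pro adds a resolution parameter with 1024p and 1080p output; sora-2-extend continues an existing video identified by videoId. gpt-image-2 is opaque-only (it has no background parameter) and is the only GPT Image model that accepts an auto aspect ratio; gpt-image-1.5 and gpt-image-1 both accept a transparent background.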
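
The per-model differences are easy to trip over when switching models, so a client-side check before sending a request can help. The Python sketch below transcribes a subset of the enum values from the tables above and flags arguments a model does not list. `ALLOWED` and `check_arguments` are hypothetical names, and the real service may validate differently.

```python
# Enum values transcribed from the parameter tables above (subset of models).
IMAGE_RATIOS = {"1:1", "3:2", "2:3", "16:9", "9:16", "4:3", "3:4"}
ALLOWED = {
    "sora-2": {"aspectRatio": {"16:9", "9:16"}, "duration": {4, 8, 12, 16, 20}},
    "sora-2-pro": {
        "aspectRatio": {"16:9", "9:16"},
        "resolution": {"720p", "1024p", "1080p"},
        "duration": {4, 8, 12, 16, 20},
    },
    "gpt-image-2": {
        "aspectRatio": IMAGE_RATIOS | {"auto"},
        "quality": {"high", "medium", "low"},
        "outputFormat": {"png", "jpeg", "webp"},
        "count": {1, 2, 4, 6, 8, 10},
    },
    "gpt-image-1.5": {
        "aspectRatio": IMAGE_RATIOS,
        "quality": {"high", "medium", "low"},
        "background": {"opaque", "transparent"},
        "outputFormat": {"png", "jpeg", "webp"},
        "count": {1, 2, 4, 6, 8, 10},
    },
}


def check_arguments(model, arguments):
    """Return a list of problems; an empty list means nothing was flagged."""
    spec = ALLOWED.get(model)
    if spec is None:
        return [f"no table transcribed for {model}"]
    problems = []
    for key, value in arguments.items():
        if key in ("prompt", "imageUrls"):
            continue  # free-form inputs, not enum-checked in this sketch
        if key not in spec:
            problems.append(f"{model} does not list {key}")
        elif value not in spec[key]:
            problems.append(f"{key}={value!r} is not a listed value for {model}")
    return problems


print(check_arguments("gpt-image-2", {"prompt": "a logo", "background": "transparent"}))
print(check_arguments("sora-2", {"prompt": "waves", "resolution": "1080p"}))
```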

Pricing

```bash
gen-ai pricing sora-2 -d 8 --ar 16:9
gen-ai pricing gpt-image-2 --quality high -n 2
```

Sora cost scales with duration, resolution (720p vs 1080p on Pro), and tier (sora-2 vs sora-2-pro). GPT Image cost scales with quality, output size, and count.
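
To compare how cost moves with duration, one option is to generate a pricing command per duration and run them in turn. The Python sketch below only builds the command lines, using the flags shown in the pricing examples above, and executes nothing. `pricing_commands` is a hypothetical helper, and no prices are assumed.

```python
import shlex


def pricing_commands(model, durations, aspect_ratio="16:9"):
    """Build one `gen-ai pricing` command line per duration (nothing is run)."""
    return [
        shlex.join(["gen-ai", "pricing", model, "-d", str(d), "--ar", aspect_ratio])
        for d in durations
    ]


for line in pricing_commands("sora-2", [4, 8, 12]):
    print(line)
```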

Built on @picsart/ai-sdk · gen-ai CLI · Picsart MCP · Skills