AI Jupyter logo
AI JupyterAI developer tool intelligence
API pricing snapshot

AI Model API Pricing for Text, Image, and Video

Compare representative API prices from mainstream paid model providers including OpenAI, Claude, Gemini, Qwen, DeepSeek, xAI, Mistral, and Runway. This page is a planning index, not a billing guarantee.

Last checked

June 11, 2026

Providers change model names, regional availability, free quotas, and discount tiers frequently. Follow the source links before committing production spend.

Prices are copied or normalized from official provider pages only.

Token prices use USD per 1M tokens unless the provider uses per-image, per-second, per-credit, or page units.

For tiered pricing, the table keeps the common entry tier visible and explains the higher tiers in notes.

Discount programs such as batch, cache hits, flex, priority, enterprise commitments, and regional processing can materially change final spend.

Modality coverage

Which providers publish text, image, and video API prices?

ProviderTextImage generationVideo generationPlanning note
OpenAIYesYesYesToken-billed GPT models, GPT Image models, and Sora video models.
Anthropic ClaudeYesNo generation API listedNo generation API listedClaude supports multimodal understanding, but this page covers generation prices only.
Google GeminiYesYesYesGemini, Imagen, and Veo prices are all listed on the Gemini API pricing page.
Alibaba Qwen/WanYesYesYesPrices vary by deployment region and model family.
DeepSeekYesNo generation API listedNo generation API listedOfficial USD table covers chat and reasoning token prices.
xAI GrokYesYesYesGrok Imagine covers image and video generation/editing.
MistralYesConsumer plan feature; API rate not itemized hereNo generation API listedThe public API table primarily lists text, OCR, voice, and related endpoints.
RunwayNo general LLM chat APIYesYesIncluded as a mainstream creative API rather than a general text LLM provider.

Text APIs

Text and reasoning model prices

Use input and output token prices to model chat, coding, extraction, search, agents, and reasoning workloads. Cached input, batch, and regional tiers can change total cost more than the headline model price.

Estimate token spend
ProviderModelRepresentative priceBilling unitNotesSource
OpenAIGPT-5.5$5.00 input, $0.50 cached input, $30.00 outputPer 1M tokens, short contextLong-context requests and priority processing cost more; batch and flex are lower than standard.OpenAI
OpenAIGPT-5.4 mini$0.75 input, $0.075 cached input, $4.50 outputPer 1M tokensRepresentative lower-cost OpenAI frontier-family option on standard processing.OpenAI
AnthropicClaude Opus 4.8$5.00 input, $0.50 cache hit, $25.00 outputPer 1M tokensCache writes are billed separately at $6.25 per 1M tokens for 5 minutes and $10.00 for 1 hour.Anthropic
AnthropicClaude Sonnet 4.6$3.00 input, $0.30 cache hit, $15.00 outputPer 1M tokensBalanced Claude tier with prompt caching and batch pricing options.Anthropic
GoogleGemini 2.5 Pro$1.25 input and $10.00 output up to 200k prompt tokensPer 1M tokensPrompts above 200k tokens are listed at $2.50 input and $15.00 output.Google
GoogleGemini 2.5 Flash$0.30 input, $2.50 outputPer 1M tokensApplies to text, image, and video input; audio input is priced separately.Google
Alibaba Qwenqwen3-max, Global$0.359-$1.004 input, $1.434-$4.014 outputPer 1M tokens, tiered by prompt sizeGlobal deployment uses US Virginia or Germany Frankfurt endpoints and has no free quota.Alibaba Cloud
Alibaba Qwenqwen-plus, US/EU/HK$0.40 input, $1.20 non-thinking output, $4.00 thinking outputPer 1M tokens, up to 256k prompt tokensThe 256k-1M prompt tier is higher at $1.20 input, $3.60 non-thinking output, and $12 thinking output.Alibaba Cloud
DeepSeekdeepseek-chat$0.07 cache-hit input, $0.27 cache-miss input, $1.10 outputPer 1M tokens64k context and 8k max output in the official USD pricing detail table.DeepSeek
DeepSeekdeepseek-reasoner$0.14 cache-hit input, $0.55 cache-miss input, $2.19 outputPer 1M tokens64k context, 32k max CoT tokens, and 8k max output in the official USD table.DeepSeek
xAIgrok-4.3$1.25 input, $0.20 cached input, $2.50 outputPer 1M tokensxAI lists a 1M token context window for grok-4.3 on the Chat API pricing table.xAI
MistralMistral Large 3$0.50 input, $1.50 outputPer 1M tokensMistral also lists Medium 3.5 at $1.50 input and $7.50 output, and Small 4 at $0.10 input and $0.30 output.Mistral

Image APIs

Image generation prices

Image pricing is less standardized than text pricing. Some providers charge by output image, some by output tokens, and creative API platforms may use credits.

ProviderModelRepresentative priceBilling unitNotesSource
OpenAIgpt-image-2Image: $8.00 input, $2.00 cached input, $30.00 output; text input: $5.00Per 1M tokensOpenAI image generation is token-billed; use the official calculator for per-image estimates by quality and size.OpenAI
GoogleGemini 2.5 Flash Image$0.039 per image, plus $0.30 input per 1M text/image tokensPer image and per 1M input tokensBatch and flex output are $0.0195 per image; priority output is $0.0702 per image.Google
GoogleImagen 4$0.02 Fast, $0.04 Standard, $0.06 UltraPer imagePaid Gemini API tier pricing for Imagen 4 image generation.Google
Alibaba Qwenqwen-image-plus and qwen-image-2.0$0.03 per qwen-image-plus image; $0.035 per qwen-image-2.0 imagePer successfully generated imageInternational deployment bills only successful image outputs; qwen-image-2.0-pro and qwen-image-max are $0.075 per image.Alibaba Cloud
xAIGrok Imagine Image$0.02 output per standard image; $0.05-$0.07 output for quality modePer imageInput image pricing is listed separately at $0.002 per image for standard and $0.01 per image for quality mode.xAI
RunwayGen-4 Image and Gen-4 Image Turbo$0.05 per 720p image, $0.08 per 1080p image, or $0.02 per turbo imageCredits converted at $0.01 per creditRunway also resells third-party image models such as gpt_image_2 and gemini_2.5_flash through its API credit system.Runway

Video APIs

Video generation prices

Video models usually bill per generated second, with price differences for resolution, audio, speed tier, model quality, and batch processing.

ProviderModelRepresentative priceBilling unitNotesSource
OpenAISora 2$0.10 per second at 720pPer generated secondBatch pricing is listed at $0.05 per second.OpenAI
OpenAISora 2 Pro$0.30 per second at 720p, $0.50 at 1024p, $0.70 at 1080pPer generated secondBatch prices are half of standard prices on the OpenAI table.OpenAI
GoogleVeo 3.1$0.40 per second standard with audio; $0.10-$0.30 fast; $0.05-$0.08 litePer generated secondGoogle lists separate 720p, 1080p, and 4K prices where available.Google
Alibaba Wanwan2.6-t2v, Global$0.086012 per second at 720p, $0.143353 per second at 1080pPer generated secondGlobal deployment uses US Virginia or Germany Frankfurt endpoints and has no free quota.Alibaba Cloud
Alibaba Wanwan2.6-i2v-flash, International$0.05 per second at 720p with audio, $0.075 at 1080p with audioPer generated secondNo-audio international pricing is $0.025 per second at 720p and $0.0375 at 1080p.Alibaba Cloud
xAIGrok Imagine Video$0.05 per second at 480p, $0.07 per second at 720pPer generated secondInput video is listed at $0.01 per second and image input at $0.002 per image.xAI
RunwayGen-4 Turbo and Gen-4.5$0.05 per second for Gen-4 Turbo, $0.12 per second for Gen-4.5Credits converted at $0.01 per creditRunway also lists Seedance, Veo, HappyHorse, Aleph, and Act-Two video models with model-specific credit rates.Runway

Source list

Official provider pages used for this snapshot

These links should be checked again before budgeting or publishing a production price comparison because model catalogs and discount tiers change often.

FAQ

API pricing questions that affect estimates

Why are some image prices listed per token and others per image?

Providers use different billing units. OpenAI image generation is token-priced, while Google Imagen, Qwen-Image, xAI Imagine, and Runway image endpoints publish per-image or credit-based prices.

Why does the same provider show different regional prices?

Alibaba Model Studio, OpenAI data residency, and other regional deployment options can use different endpoints, data residency rules, and compute pools. Always match the price to the region you actually call.

Is this page a replacement for provider calculators?

No. It is a quick planning index. Before production launch, verify the provider page, run a small workload, and compare actual billing export data with your estimate.