gpt-audio Pricing - Cost Calculator

Model Details

Owner

OpenAI

Released

Aug 2025

Status

Active

Context Length

128K

Max Output

16K

Provider

OpenAI

Where is gpt-audio a perfect fit?

GPT-Audio is OpenAI’s specialized model for speech recognition and generation, offering high-quality transcription, translation, and natural voice synthesis. It integrates real-time audio processing capabilities suitable for both developers and enterprises.

Perfect Fit For:
- Live transcription and captioning systems
- Voice assistants and AI call agents
- Podcast or meeting summarization tools
- Multilingual translation and voice cloning

Quick Model Estimate

Tokens Characters Words

Model

Provider

Modality

Tier

Context Size

Input Tokens

(USD 32.0000 per 1M tokens)

Output Tokens

(USD 64.0000 per 1M tokens)

No. of API Calls

Your GPT-5 Cost Estimate

💰 Total Cost

USD 3.00

for 1000 input + 1000 output tokens

📥 Input (1000 × $32.000000) USD 1.5000

📤 Output (1000 × $64.000000) USD 1.5000

Cost Breakdown

📥 Input 50% 📤 Output 50%

Prices updated daily from official provider data.

Other Models in the gpt-audio Family

gpt-audio-mini

Active

Multimodal · 128K context

Pricing

Provider ↕	Modality ↕	Tier ↕	Input Price (per 1M tokens) ↕	Output Price (per 1M tokens) ↕	Context Window ↕	View
OpenAI	Audio	Standard	$32.0000	$64.0000	128,000 tokens	→
OpenAI	Text	Standard	$2.5000	$10.0000	128,000 tokens	→