gpt-4o-audio-preview Pricing - Cost Calculator

Model Details

Owner

OpenAI

Released

Oct 2024

Status

Active

Context Length

128K

Max Output

Provider

OpenAI

Where is gpt-4o-audio-preview a perfect fit?

GPT-4o-Audio-Preview is a multimodal audio model built for speech comprehension and generation. It enables live transcription, language translation, and human-like voice output—optimized for natural, low-latency conversational experiences.

Perfect Fit For:
- Real-time transcription and translation systems
- Conversational AI with natural voice responses
- Audio-based accessibility and assistive tools
- Content narration, dubbing, and media automation

Quick Model Estimate

Tokens Characters Words

Model

Provider

Modality

Tier

Context Size

Input Tokens

(USD 40.0000 per 1M tokens)

Output Tokens

(USD 80.0000 per 1M tokens)

No. of API Calls

Your GPT-5 Cost Estimate

💰 Total Cost

USD 3.00

for 1000 input + 1000 output tokens

📥 Input (1000 × $40.000000) USD 1.5000

📤 Output (1000 × $80.000000) USD 1.5000

Cost Breakdown

📥 Input 50% 📤 Output 50%

Prices updated daily from official provider data.

Provider ↕	Modality ↕	Tier ↕	Input Price (per 1M tokens) ↕	Output Price (per 1M tokens) ↕	Context Window ↕	View
OpenAI	Audio	Standard	$40.0000	$80.0000	128,000 tokens	→
OpenAI	Text	Standard	$2.5000	$10.0000	128,000 tokens	→

Model Details

Where is gpt-4o-audio-preview a perfect fit?

Quick Model Estimate

Your GPT-5 Cost Estimate

Other Models in the gpt-4o Family

chatgpt-4o-latest

gpt-4o

gpt-4o-2024-05-13

gpt-4o-2024-08-06

gpt-4o-mini

gpt-4o-mini-2024-07-18

gpt-4o-mini-audio-preview

gpt-4o-mini-realtime-preview

gpt-4o-mini-search-preview

gpt-4o-mini-transcribe

gpt-4o-mini-tts

gpt-4o-realtime-preview

gpt-4o-search-preview

gpt-4o-transcribe

gpt-4o-transcribe-diarize

Pricing