Model Details
Released
Aug 2025
Status
Active
Context Length
128K
Max Output
16K
Provider
OpenAI
Where is gpt-audio a perfect fit?
GPT-Audio is OpenAIβs specialized model for speech recognition and generation, offering high-quality transcription, translation, and natural voice synthesis. It integrates real-time audio processing capabilities suitable for both developers and enterprises.
Perfect Fit For:
- Live transcription and captioning systems
- Voice assistants and AI call agents
- Podcast or meeting summarization tools
- Multilingual translation and voice cloning
Perfect Fit For:
- Live transcription and captioning systems
- Voice assistants and AI call agents
- Podcast or meeting summarization tools
- Multilingual translation and voice cloning
Quick Model Estimate
Your GPT-5 Cost Estimate
π° Total Cost
USD 3.00
for 1000 input + 1000 output tokens
π₯ Input (1000 Γ $32.000000)
USD 1.5000
π€ Output (1000 Γ $64.000000)
USD 1.5000
Cost Breakdown
π₯ Input 50%
π€ Output 50%
Prices updated daily from official provider data.
Pricing
|
Provider
β
|
Modality
β
|
Input Price
(per 1M tokens) β |
Output Price
(per 1M tokens) β |
Context Window
β
|
Last Updated
β
|
View
|
|---|---|---|---|---|---|---|
OpenAI
|
Audio | $32.0000 | $64.0000 | 128,000 tokens | 2026-03-21 | β |
OpenAI
|
Text | $2.5000 | $10.0000 | 128,000 tokens | 2025-11-18 | β |
