Discover OpenAI's GPT Audio Model

Experience natural-sounding audio with advanced voice consistency.

Input: text · audio Output: text · audio Context: 128,000 tokens Release: 2026-01-19
OpenAI's GPT Audio model, released on January 19, 2026, is the first audio model from OpenAI that enables seamless interaction between text and audio. With a context length of 128,000 tokens, it supports both text and audio input/output modalities, making it versatile for various applications. The model features an upgraded decoder that produces more natural-sounding voices while ensuring better voice consistency. This capability allows users to generate high-quality audio outputs from text and vice versa, enhancing the overall user experience in audio processing tasks.

Use Cases

Here are a few ways teams apply OpenAI: GPT Audio in practice—from fast drafting to multimodal understanding. Adapt these ideas to your workflow.

Generate audio from text with high fidelity

Transcribe audio to text accurately

Create engaging voiceovers for content

Develop interactive audio applications

Key Features

A quick look at the capabilities that make this model useful in real projects.

Upgraded decoder for natural-sounding voices

Supports text and audio input/output

Context length of 128,000 tokens

Enhanced voice consistency

Versatile for various audio applications

Specs

Overview
Vendor
openai
Model ID
openai/gpt-audio
Release
2026-01-19
Modalities & context
Input
text · audio
Output
text · audio
Context
128,000 tokens
Parameters & defaults

Supported parameters: frequency_penalty, logit_bias, logprobs, max_tokens, presence_penalty, response_format, seed, stop, structured_outputs, temperature, top_logprobs, top_p

Defaults: temperature 0.2, top_p 0.95

Benchmark tests: OpenAI: GPT Audio

We ran this model against a few representative prompts to show its range. Review the outputs below and be the judge.

Ready to try OpenAI: GPT Audio?

Chat with GPT Audio
up.end
/ˌəpˈend/
verb

To “upend” means to completely disrupt, overturn, or drastically change the established order or structure of something. It implies a significant shift or alteration that can potentially have far-reaching consequences. When something is upended, it is turned upside down or transformed in a way that challenges conventional norms or expectations. The term often carries a sense of innovation, transformation, and sometimes even a hint of upheaval, indicating that the changes are not just minor adjustments but rather a fundamental reimagining of the status quo.