Explore OpenAI: GPT Audio

Openai: OpenAI: GPT Audio

Input: text · audio Output: text · audio Context: 128,000 tokens Release: 2026-01-19

Use Cases

Here are a few ways teams apply OpenAI: GPT Audio in practice—from fast drafting to multimodal understanding. Adapt these ideas to your workflow.

Meeting transcription & notes

Multilingual audio translation

Audio understanding

Reliable drafting

Specs

Overview
Vendor
openai
Model ID
openai/gpt-audio
Release
2026-01-19
Modalities & context
Input
text · audio
Output
text · audio
Context
128,000 tokens
Parameters & defaults

Supported parameters: frequency_penalty, logit_bias, logprobs, max_tokens, presence_penalty, response_format, seed, stop, structured_outputs, temperature, top_logprobs, top_p

Defaults: temperature 0.2, top_p 0.95

Benchmark tests: OpenAI: GPT Audio

We ran this model against a few representative prompts to show its range. Review the outputs below and be the judge.

Ready to try OpenAI: GPT Audio?

Chat with OpenAI: GPT Audio
up.end
/ˌəpˈend/
verb

To “upend” means to completely disrupt, overturn, or drastically change the established order or structure of something. It implies a significant shift or alteration that can potentially have far-reaching consequences. When something is upended, it is turned upside down or transformed in a way that challenges conventional norms or expectations. The term often carries a sense of innovation, transformation, and sometimes even a hint of upheaval, indicating that the changes are not just minor adjustments but rather a fundamental reimagining of the status quo.