Discover Mistral: Voxtral Small 24B 2507

Efficient text and audio processing with advanced capabilities.

Input: text · audio Output: text Context: 32,000 tokens Release: 2025-10-30
Mistral: Voxtral Small 24B 2507, released by mistralai on October 30, 2025, is designed for both text and audio processing. With a context window of 32,000 tokens, it handles text-to-text tasks efficiently while integrating advanced audio input capabilities. This model is particularly adept at speech transcription, translation, and understanding audio inputs. It supports various parameters like frequency penalty and temperature, offering flexibility for different applications. Whether you need to process text or audio, Voxtral Small provides a robust solution.

Use Cases

Here are a few ways teams apply Mistral: Voxtral Small 24B 2507 in practice—from fast drafting to multimodal understanding. Adapt these ideas to your workflow.

Transcribe lengthy audio recordings

Translate spoken language accurately

Understand complex audio data

Generate text responses from audio input

Key Features

A quick look at the capabilities that make this model useful in real projects.

Supports text and audio inputs

32,000 token context window

Excels in speech transcription

Advanced audio understanding

Flexible parameter support

Specs

Overview
Vendor
mistralai
Model ID
mistralai/voxtral-small-24b-2507
Release
2025-10-30
Modalities & context
Input
text · audio
Output
text
Context
32,000 tokens
Parameters & defaults

Supported parameters: frequency_penalty, max_tokens, presence_penalty, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_p

Defaults: temperature 0.2, top_p 0.95

Benchmark tests: Mistral: Voxtral Small 24B 2507

We ran this model against a few representative prompts to show its range. Review the outputs below and be the judge.

Text

Prompt:
Write 150 words on how AI might positively upend work, leisure and creativity
AI has the potential to significantly enhance work, leisure, and creativity. In the workplace, AI can automate repetitive tasks, freeing up human time for more strategic and creative work. It can also provide data-driven insights, improving decision-making and productivity. For leisure, AI can personalize entertainment, from recommending movies to creating tailored playlists. It can also facilitate social connections, with AI-driven platforms helping people find like-minded individuals or groups. In terms of creativity, AI can serve as a tool for artists, musicians, and writers, generating ideas, creating drafts, or even collaborating on projects. It can also democratize creativity, making tools and resources more accessible. However, it's crucial to ensure that AI is used ethically and responsibly, with a focus on augmenting human capabilities rather than replacing them.
Run this prompt on Upend.AI

Ready to try Mistral: Voxtral Small 24B 2507?

Chat with Mistral: Voxtral Small 24B 2507
up.end
/ˌəpˈend/
verb

To “upend” means to completely disrupt, overturn, or drastically change the established order or structure of something. It implies a significant shift or alteration that can potentially have far-reaching consequences. When something is upended, it is turned upside down or transformed in a way that challenges conventional norms or expectations. The term often carries a sense of innovation, transformation, and sometimes even a hint of upheaval, indicating that the changes are not just minor adjustments but rather a fundamental reimagining of the status quo.