Discover THUDM: GLM 4.1V 9B Thinking

Enhance reasoning with THUDM's GLM 4.1V 9B Thinking model.

Input: image · text Output: text Context: 65,536 tokens Release: 2025-07-11
The THUDM: GLM 4.1V 9B Thinking model, released in July 2025, is a 9 billion parameter vision-language model designed to enhance multimodal reasoning and complex problem solving. It features a long-context understanding of up to 64K tokens, making it suitable for tasks that require processing large amounts of text and images. The model accepts both text and image inputs and outputs text, offering flexibility in various applications. Key capabilities include improved reasoning through reinforcement learning, making it a strong choice for tasks involving complex reasoning and long-context comprehension.

Use Cases

Here are a few ways teams apply THUDM: GLM 4.1V 9B Thinking in practice—from fast drafting to multimodal understanding. Adapt these ideas to your workflow.

Tackle complex problem solving tasks

Process large text and image datasets

Improve multimodal reasoning capabilities

Enhance long-context comprehension

Key Features

A quick look at the capabilities that make this model useful in real projects.

9 billion parameter vision-language model

Long-context understanding up to 64K tokens

Enhanced multimodal reasoning

Reinforcement learning for improved reasoning

Text and image input, text output

Specs

Overview
Vendor
thudm
Model ID
thudm/glm-4.1v-9b-thinking
Release
2025-07-11
Modalities & context
Input
image · text
Output
text
Context
65,536 tokens
Parameters & defaults

Supported parameters: frequency_penalty, include_reasoning, max_tokens, presence_penalty, reasoning, repetition_penalty, seed, stop, temperature, top_k, top_p

Defaults: temperature 0.2, top_p 0.95

Benchmark tests: THUDM: GLM 4.1V 9B Thinking

We ran this model against a few representative prompts to show its range. Review the outputs below and be the judge.

Text

Prompt:
Write 150 words on how AI might positively upend work, leisure and creativity
The THUDM: GLM 4.1V 9B Thinking model is a generative language model designed for a variety of natural language processing tasks. It is capable of generating coherent text, answering questions, summarizing information, and engaging in conversational dialogue. Typical use cases include content creation, customer support automation, and educational tools. The model operates effectively in environments requiring contextual understanding and language generation. Notable constraints include potential limitations in understanding highly specialized jargon or context-specific nuances, which may affect the accuracy of responses. Additionally, the model's performance can vary based on the complexity of the input and the specificity of the prompts provided.
Run this prompt on Upend.AI

Image

Prompt:
Generate an SVG of a pelican riding a bicycle.
Benchmark image
Run this prompt on Upend.AI

Ready to try THUDM: GLM 4.1V 9B Thinking?

Chat with THUDM: GLM 4.1V 9B Thinking
up.end
/ˌəpˈend/
verb

To “upend” means to completely disrupt, overturn, or drastically change the established order or structure of something. It implies a significant shift or alteration that can potentially have far-reaching consequences. When something is upended, it is turned upside down or transformed in a way that challenges conventional norms or expectations. The term often carries a sense of innovation, transformation, and sometimes even a hint of upheaval, indicating that the changes are not just minor adjustments but rather a fundamental reimagining of the status quo.