Cloud models · Ollama

glm-5.2

GLM-5.2 is Z.ai’s flagship model for the era of long-horizon tasks.

11.8K Pulls 1 Tag Updated yesterday

kimi-k2.7-code

Kimi K2.7 Code is Moonshot AI's coding-focused agentic model built upon Kimi K2.6, with substantial improvements on real-world long-horizon coding tasks and roughly 30% lower thinking-token usage.

vision tools thinking cloud

15.7K Pulls 1 Tag Updated 4 days ago

minimax-m3

MiniMax M3: Coding & Agentic Frontier. 1M context window. Native Multimodality.

vision tools thinking cloud

55.5K Pulls 1 Tag Updated 2 weeks ago

nemotron-3-ultra

NVIDIA Nemotron 3 Ultra is built for high-throughput reasoning and long-running agent workflows.

tools thinking cloud

13K Pulls 1 Tag Updated 1 week ago

gemma4

Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.

vision tools thinking audio cloud e2b e4b 12b 26b 31b

14.4M Pulls 48 Tags Updated 3 days ago

qwen3.5

Qwen 3.5 is a family of open-source multimodal models that delivers exceptional utility and performance.

vision tools thinking cloud 0.8b 2b 4b 9b 27b 35b 122b

13.8M Pulls 64 Tags Updated 3 weeks ago

glm-5.1

GLM-5.1 is our next-generation flagship model for agentic engineering, with significantly stronger coding capabilities than its predecessor. It achieves state-of-the-art performance on SWE-Bench Pro and leads GLM-5 by a wide margin.

tools thinking cloud

2.2M Pulls 1 Tag Updated 2 months ago

minimax-m2.7

MiniMax's M2-series model for coding, agentic workflows, and professional productivity.

tools thinking cloud

2.2M Pulls 1 Tag Updated 3 months ago

nemotron-3-super

NVIDIA Nemotron 3 Super is a 120B open MoE model activating just 12B parameters to deliver maximum compute efficiency and accuracy for complex multi-agent applications.

tools thinking cloud 120b

2.4M Pulls 7 Tags Updated 3 months ago

glm-5

A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.

tools thinking cloud

2.3M Pulls 1 Tag Updated 4 months ago

minimax-m2.5

MiniMax-M2.5 is a state-of-the-art large language model designed for real-world productivity and coding tasks.

tools thinking cloud

2.3M Pulls 1 Tag Updated 4 months ago

glm-4.7

Advancing the Coding Capability

tools thinking cloud

2.2M Pulls 1 Tag Updated 5 months ago

minimax-m2.1

Exceptional multilingual capabilities to elevate code engineering

tools cloud

2.1M Pulls 1 Tag Updated 5 months ago

kimi-k2.6

Kimi K2.6 is an open-source, native multimodal agentic model that advances practical capabilities in long-horizon coding, coding-driven design, proactive autonomous execution, and swarm-based task orchestration.

vision tools thinking cloud

305.2K Pulls 1 Tag Updated 1 month ago

deepseek-v4-pro

DeepSeek-V4-Pro is a frontier Mixture-of-Experts model with a 1M-token context window and three reasoning modes.

tools thinking cloud

127.1K Pulls 1 Tag Updated 1 month ago

deepseek-v4-flash

DeepSeek-V4-Flash is a preview of the DeepSeek-V4 series, a Mixture-of-Experts model with 284B total parameters and 13B activated, built for efficient reasoning across a 1M-token context window.

tools thinking cloud

121.5K Pulls 1 Tag Updated 1 month ago

kimi-k2.5

Kimi K2.5 is an open-source, native multimodal agentic model that seamlessly integrates vision and language understanding with advanced agentic capabilities, instant and thinking modes, as well as conversational and agentic paradigms.

vision tools thinking cloud

324.3K Pulls 1 Tag Updated 4 months ago

gpt-oss

OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.

tools thinking cloud 20b 120b

10.2M Pulls 5 Tags Updated 8 months ago

qwen3-coder

Alibaba's performant long context models for agentic and coding tasks.

tools cloud 30b 480b

6.4M Pulls 10 Tags Updated 8 months ago

gemini-3-flash-preview

Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.

vision tools thinking cloud

2.2M Pulls 2 Tags Updated 5 months ago