List of All LLM Models

Discover and compare 500+ large language models with real-time rankings, benchmarks, and community votes.

Google: Veo 3.1 Fast

Google: Veo 3.1 Fast

By Google

Google's mid-tier video generation model balancing speed and quality. Veo 3.1 Fast generates high-quality video from text or image prompts with native synchronized audio, offering faster turnaround than Veo 3.1 at lower cost. Supports first-frame and last-frame conditioning, multiple resolutions and aspect ratios, and SynthID watermarking.

Release Date

24 Apr 2026

Context Size

0

Zyphra: Zonos v0.1 Transformer

Zyphra: Zonos v0.1 Transformer

By zyphra

Zonos v0.1 Transformer is a text-to-speech model from Zyphra built on a pure transformer architecture. It offers the same American and British English voice coverage as the Hybrid variant, and is suited for deployments where a transformer-only inference stack is preferred.

Release Date

23 Apr 2026

Context Size

4.10K

Zyphra: Zonos v0.1 Hybrid

Zyphra: Zonos v0.1 Hybrid

By zyphra

Zonos v0.1 Hybrid is a text-to-speech model from Zyphra built on a hybrid architecture. It produces English speech output with coverage across American and British accents in male and female voices. It is suited for English-language voice applications requiring accent and gender variety.

Release Date

23 Apr 2026

Context Size

4.10K

Sesame: CSM 1B

Sesame: CSM 1B

By sesame

CSM 1B is a conversational speech model from Sesame. It accepts text input and produces English speech output, with voice options spanning conversational and read-speech styles. At 1B parameters, it is suited for dialogue-oriented applications such as voice assistants and interactive agents.

Release Date

23 Apr 2026

Context Size

4.10K

Canopy Labs: Orpheus 3B

Canopy Labs: Orpheus 3B

By canopylabs

Orpheus 3B is an English text-to-speech model from Canopy Labs, fine-tuned for natural prosody and expressive delivery. It offers 7 preset voices and is suited for narration, voice assistants, and interactive applications where naturalistic speech is a priority.

Release Date

23 Apr 2026

Context Size

4.10K

hexgrad: Kokoro 82M

hexgrad: Kokoro 82M

By hexgrad

Kokoro 82M is a lightweight, open-weight text-to-speech model from hexgrad. It converts text to speech across 8 languages (American and British English, Spanish, French, Hindi, Italian, Japanese, Portuguese, and Chinese) using 54 preset voices organized by language and gender. At 82M parameters, it is well-suited for multilingual TTS deployments where footprint and cost efficiency matter.

Release Date

23 Apr 2026

Context Size

4.10K

Google: Veo 3.1 Lite

Google: Veo 3.1 Lite

By Google

Google's most cost-effective video generation model, designed for high-volume applications and rapid iteration. Veo 3.1 Lite generates 720p and 1080p video from text or image prompts with native synchronized audio at less than 50% of the cost of Veo 3.1 Fast. Supports 4–8 second clips in landscape (16:9) and portrait (9:16) formats, with SynthID watermarking. Ideal for content platforms, short-form video creation, and automated media generation.

Release Date

23 Apr 2026

Context Size

0

inclusionAI: Ling-2.6-1T

inclusionAI: Ling-2.6-1T

By inclusionai

Ling-2.6-1T is an instant (instruct) model from inclusionAI and the company’s trillion-parameter flagship, designed for real-world agents that require fast execution and high efficiency at scale. It uses a “fast thinking” approach to reduce costs to roughly a quarter of comparable models while maintaining top-tier performance. The model achieves state-of-the-art results on benchmarks such as AIME26 and SWE-bench Verified, and is well suited for advanced coding, complex reasoning, and large-scale agent workflows where both capability and efficiency are critical.

Release Date

23 Apr 2026

Context Size

262.14K

Tencent: Hy3 preview

Tencent: Hy3 preview

By tencent

Hy3 preview is a high-efficiency Mixture-of-Experts model from Tencent designed for agentic workflows and production use. It supports configurable reasoning levels across disabled, low, and high modes, allowing it to balance speed and depth depending on the task, while delivering strong code generation and reliable performance across multi-step, real-world workflows.

Release Date

22 Apr 2026

Context Size

262.14K

Xiaomi: MiMo-V2.5-Pro

Xiaomi: MiMo-V2.5-Pro

By Xiaomi

MiMo-V2.5-Pro is Xiaomi’s flagship model, delivering strong performance in general agentic capabilities, complex software engineering, and long-horizon tasks, with top rankings on benchmarks such as ClawEval, GDPVal, and SWE-bench Pro. It can independently and autonomously complete professional tasks that would take human experts days or weeks, involving more than a thousand tool calls. Its context length of up to 1M makes it well suited for integration with a wide range of agent frameworks.

Release Date

22 Apr 2026

Context Size

1.05M

Xiaomi: MiMo-V2.5

Xiaomi: MiMo-V2.5

By Xiaomi

MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding tasks. Its 1M context window supports complete documents, extended conversations, and complex task contexts in a single pass, making it ideal for integration with agent frameworks where strong reasoning, rich perception, and cost efficiency all matter.

Release Date

22 Apr 2026

Context Size

1.05M

OpenAI: GPT-5.4 Image 2

OpenAI: GPT-5.4 Image 2

By OpenAI

[GPT-5.4](https://openrouter.ai/openai/gpt-5.4) Image 2 combines OpenAI's GPT-5.4 model with state-of-the-art image generation capabilities from GPT Image 2. It enables rich multimodal workflows, allowing users to seamlessly move between reasoning, coding, and visual generation within the same interaction.

Release Date

21 Apr 2026

Context Size

272K

inclusionAI: Ling-2.6-flash

inclusionAI: Ling-2.6-flash

By inclusionai

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency. It delivers performance comparable to state-of-the-art models at a similar scale while significantly reducing token usage across coding, document processing, and lightweight agent workflows.

Release Date

21 Apr 2026

Context Size

262.14K

Pareto Code Router

Pareto Code Router

By OpenRouter

The Pareto Router maintains a tiered shortlist of strong coding models, ranked by [Artificial Analysis](https://artificialanalysis.ai/) coding percentiles. Set min_coding_score between 0 and 1 on the [pareto-router plugin](https://openrouter.ai/docs/guides/routing/routers/pareto-router#the-min_coding_score-parameter) to control how strong a coder you need; higher scores select stronger (and typically more expensive) models. If you omit min_coding_score, the router defaults to the High tier. Selecting Nitro from the variant dropdown ranks the models in your tier by measured throughput and routes each request to the fastest one, so you trade some model variety for lower latency. Read the [Pareto Router docs](https://openrouter.ai/docs/guides/routing/routers/pareto-router) for the full selection logic, fallback behavior, and how to customize routing. For another way to route, see the [Auto Router](/openrouter/auto).

Release Date

21 Apr 2026

Context Size

2M

Baidu: Qianfan-OCR-Fast

Baidu: Qianfan-OCR-Fast

By baidu

Qianfan-OCR-Fast is a domain-specific multimodal large model purpose-built for OCR. By leveraging specialized OCR training data while preserving versatile multimodal intelligence, it provides a powerful performance upgrade over Qianfan-OCR.

Release Date

20 Apr 2026

Context Size

65.54K

Baidu: Qianfan-OCR-Fast (free)

Baidu: Qianfan-OCR-Fast (free)

By baidu

Qianfan-OCR-Fast is a domain-specific multimodal large model purpose-built for OCR. By leveraging specialized OCR training data while preserving versatile multimodal intelligence, it provides a powerful performance upgrade over Qianfan-OCR.

Release Date

20 Apr 2026

Context Size

65.54K

Kling: Video O1

Kling: Video O1

By kwaivgi

Kling Video O1 is a video generation model from Kuaishou. It supports text and image inputs with video output, enabling text-to-video and image-to-video workflows. It is suited for cinematic content production, with first-frame and last-frame control for precise scene composition. It generates 5 or 10 second clips in 16:9, 9:16, or 1:1 aspect ratios.

Release Date

20 Apr 2026

Context Size

0

MiniMax: Hailuo 2.3

MiniMax: Hailuo 2.3

By MiniMax

Hailuo 2.3 is a video generation model from MiniMax. It accepts text prompts and reference images as input and generates video output, supporting both text-to-video and image-to-video workflows. It is suited for creative content production, cinematic scene generation, and character animation, with a focus on realistic motion and expressive character rendering.

Release Date

20 Apr 2026

Context Size

0

MoonshotAI: Kimi K2.6 (free)

MoonshotAI: Kimi K2.6 (free)

By moonshotai

Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and can convert prompts and visual inputs into production-ready interfaces. Its agent swarm architecture scales to hundreds of parallel sub-agents for autonomous task decomposition - delivering documents, websites, and spreadsheets in a single run without human oversight.

Release Date

20 Apr 2026

Context Size

262.14K

MoonshotAI: Kimi K2.6 (free)

MoonshotAI: Kimi K2.6 (free)

By moonshotai

Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and...

Release Date

20 Apr 2026

Context Size

262.14K

Mistral: Voxtral Mini TTS

Mistral: Voxtral Mini TTS

By Mistral AI

Voxtral Mini TTS is Mistral's text-to-speech model featuring zero-shot voice cloning and multilingual support. It converts text input into natural-sounding audio output.

Release Date

19 Apr 2026

Context Size

4.10K

OpenAI: GPT-4o Mini TTS

OpenAI: GPT-4o Mini TTS

By OpenAI

GPT-4o Mini TTS is OpenAI's cost-efficient text-to-speech model. It converts text input into natural-sounding audio output, supporting a variety of voices and tones.

Release Date

18 Apr 2026

Context Size

4.10K

Google: Gemini Embedding 2 Preview

Google: Gemini Embedding 2 Preview

By Google

Gemini Embedding 2 Preview is Google's first multimodal embedding model. We currently support mapping text and images into a unified vector space for semantic search and retrieval-augmented generation (RAG). It supports input context up to 8,192 tokens and flexible output dimensions from 128 to 3,072 (recommended: 768, 1536, or 3,072). Designed for cross-modal similarity — you can embed a text query and retrieve the most relevant images, or vice versa — making it well-suited for multimodal search, recommendation, and document understanding pipelines.

Release Date

17 Apr 2026

Context Size

8.19K

Anthropic: Claude Opus 4.7

Anthropic: Claude Opus 4.7

By Anthropic

Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on complex, multi-step tasks and more reliable agentic execution across extended workflows. It is especially effective for asynchronous agent pipelines where tasks unfold over time - large codebases, multi-stage debugging, and end-to-end project orchestration. Beyond coding, Opus 4.7 brings improved knowledge work capabilities - from drafting documents and building presentations to analyzing data. It maintains coherence across very long outputs and extended sessions, making it a strong default for tasks that require persistence, judgment, and follow-through. For users upgrading from earlier Opus versions, see our [official migration guide here](https://openrouter.ai/docs/guides/evaluate-and-optimize/model-migrations/claude-4-7)

Release Date

16 Apr 2026

Context Size

1M

ByteDance: Seedance 2.0

ByteDance: Seedance 2.0

By bytedance

Seedance 2.0 is a video generation model from ByteDance. It supports text-to-video, image-to-video with first and last frame control, and multimodal reference-to-video. It is particularly strong at preserving character consistency, visual style, and camera movement from reference material. The number of tokens is given by (height of output video * width of output video * duration * 24) / 1024

Release Date

15 Apr 2026

Context Size

0

ByteDance: Seedance 2.0 Fast

ByteDance: Seedance 2.0 Fast

By bytedance

Seedance 2.0 Fast is a video generation model from ByteDance. It supports text-to-video, image-to-video with first and last frame control, and multimodal reference-to-video. It prioritizes generation speed and lower cost over maximum output quality. The number of tokens is given by (height of output video * width of output video * duration * 24) / 1024

Release Date

15 Apr 2026

Context Size

0

Alibaba: Wan 2.7

Alibaba: Wan 2.7

By alibaba

Wan 2.7 is a video generation model from Alibaba. It supports text-to-video, image-to-video with first and last frame control, and reference-to-video, where multiple reference images guide the style and content of the generated scene.

Release Date

15 Apr 2026

Context Size

0

Elephant Alpha

Elephant Alpha

By OpenRouter

Elephant Alpha is a 100B-parameter text model focused on intelligence efficiency, delivering strong performance while minimizing token usage. It supports a 256K context window with up to 32K output tokens, function calling, structured output, and prompt caching. It is particularly well-suited for code completion and debugging, rapid document processing, and lightweight agent interactions. Note: Prompts and completions may be logged by the provider and used to improve the model.

Release Date

13 Apr 2026

Context Size

262.14K

Anthropic: Claude Opus 4.6 (Fast)

Anthropic: Claude Opus 4.6 (Fast)

By Anthropic

Fast-mode variant of [Opus 4.6](/anthropic/claude-opus-4.6) - identical capabilities with higher output speed at premium 6x pricing. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

Release Date

07 Apr 2026

Context Size

1M

Z.ai: GLM 5.1

Z.ai: GLM 5.1

By Z.ai

GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on a single task for more than 8 hours, autonomously planning, executing, and improving itself throughout the process, ultimately delivering complete, engineering-grade results.

Release Date

07 Apr 2026

Context Size

202.75K

Showing page 3 of 26 with 762 models total