Browse all available AI models across providers
50 of 50 models
qwen3.7-plus
qwen3-coder-free
Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-con
doubao-seed-2.0-lite
256K, balanced performance+cost
doubao-seed-2.0-pro
256K, vision+tools+reasoning, flagship
gpt-audio
The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural-sounding voices and maintains better voice consistency. Audio is p
gpt-audio-mini
A cost-efficient version of GPT Audio. The new snapshot features an upgraded decoder for more natural-sounding voices and maintains better voice consistency. Input is priced at $0.60 per million...
deepseek-r1
Reasoning, chain-of-thought, math/code/science
deepseek-v4-pro
Latest MoE flagship, 1M context
lyria-3-pro-preview
Full-length songs are priced at $0.08 per song. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate high-quality, 48kHz...
lyria-3-pro
AI music generation, composition+arrangement
kimi-k2.5
200K context, 100-agent cluster, previous flagship
voxtral-small-24b-2507
Voxtral Small is an enhancement of Mistral Small 3, incorporating state-of-the-art audio input capabilities while retaining best-in-class text performance. It excels at speech transcription, translati
grok-4.20
Latest, multi-agent capable
grok-4.20-multi-agent
Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesi
command-r
Efficient RAG LLM
command-r-plus
Enterprise RAG LLM, tool use, multilingual
ernie-speed
Fast, cost-effective
pangu-5.5
Domestic-chip sovereignty, industrial verticals
sensenova-v6
CV leader, embodied AI, 30+ industrial scenes
spark-x2
Voice interaction leader, education/medical/office
yi-vision
Chinese+English bilingual multimodal
baichuan-m3
Chinese LLM, medical/legal vertical focus
step-3.5-flash
Step 3.5 Flash is StepFun's most capable open-source foundation model. Built on a sparse Mixture of Experts (MoE) architecture, it selectively activates only 11B of its 196B parameters per token....
step-3.7-flash
Fast, multimodal terminal agent
vidu-q1
VFX+AI sound, 5s 1080p
hunyuan-a13b-instruct
Hunyuan-A13B is a 13B active parameter Mixture-of-Experts (MoE) language model developed by Tencent, with a total parameter count of 80B and support for reasoning via Chain-of-Thought. It offers compe
nova-pro
Balanced performance
wizardlm-2-8x22b
WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to leading proprietary models, and it consistently outperforms all existing state
stable-audio-open
44.1kHz stereo, 3-min music gen
sd3.5-large
8B flagship, open weights
flux-2-pro
Professional
flux-schnell
Fastest, Apache 2.0 open
jamba-large-1.7
Jamba Large 1.7 is the latest model in the Jamba open family, offering improvements in grounding, instruction-following, and overall efficiency. Built on a hybrid SSM-Transformer architecture with a 2
jamba-1.5
Mamba-Transformer hybrid, 262K
sonar-pro
Search-augmented, 200K, vision
sonar-reasoning-pro
Multi-step CoT + search
reka-edge
Reka Edge is an extremely efficient 7B multimodal vision-language model that accepts image/video+text inputs and generates text outputs. This model is optimized specifically to deliver industry-leadin
reka-flash-3
Reka Flash 3 is a general-purpose, instruction-tuned large language model with 21 billion parameters, developed by Reka. It excels at general chat, coding tasks, instruction-following, and function ca
inflection-3-pi
Inflection 3 Pi powers Inflection's [Pi](https://pi.ai) chatbot, including backstory, emotional intelligence, productivity, and safety. It has access to recent news, and excels in scenarios like custo
inflection-3-productivity
Inflection 3 Productivity is optimized for following instructions. It is better for tasks requiring JSON output or precise adherence to provided guidelines. It has access to recent news. For emotional
lfm-2.5-1.2b-instruct-free
LFM2.5-1.2B-Instruct is a compact, high-performance instruction-tuned model built for fast on-device AI. It delivers strong chat quality in a 1.2B parameter footprint, with efficient edge inference an
lfm-2.5-1.2b-thinking-free
LFM2.5-1.2B-Thinking is a lightweight reasoning-focused model optimized for agentic tasks, data extraction, and RAG—while still running comfortably on edge devices. It supports long context (up to 32K
dbrx-instruct
132B MoE open-source
snowflake-arctic
Dense-MoE hybrid
internlm-3
Chinese multimodal open-source flagship
internvl-2.5
Vision-language, image/video/document
miniCPM-4.0
On-device, extreme compression, sparse
granite-4.1-8b
IBM Apache 2.0, 12 languages, FIM
laguna-m.1-free
Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai), optimized for complex software engineering tasks. Designed for agentic coding workflows, it supports tool calling an
laguna-xs.2-free
Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai), their efficient coding agent series. It combines tool calling and reasoning capabilities with a co