AI Model Release Dashboard

Track every major LLM release from OpenAI, Anthropic, Google, and Meta.

Provider
Model
Release Date
Notes
Release Notes
Google
Gemini 3.1 Flash Image
February 25, 2026
Faster image generation with 4K resolution, improved text rendering, and multi-language support. #1 on Artificial Analysis Image Arena at launch.
https://techcrunch.com/2026/02/26/google-launches-nano-banana-2-model-with-faster-image-generation/
Google
Gemini 3 Deep Think
December 3, 2025
Extended-thinking version of Gemini 3 Pro. Rolled out to Ultra subscribers. Designed for the hardest reasoning tasks.
https://deepmind.google/models/gemini/
Mistral
Mistral Small 3.2
June 19, 2025
24B parameter open-weight model (Apache 2.0). Improved instruction following and function calling over v3.1.
https://mistral.ai/news/mistral-small-3-2
Mistral
Magistral Small
June 9, 2025
Mistral's first reasoning model with chain-of-thought capabilities. 24B parameters, open-source under Apache 2.0.
https://mistral.ai/news/magistral
Anthropic
Claude Sonnet 4.5
September 28, 2025
Best coding model at launch with 77.2% on SWE-bench Verified. 30+ hour autonomous task horizon.
https://www.anthropic.com/news/claude-sonnet-4-5
Anthropic
Claude Opus 4.1
August 4, 2025
Drop-in replacement for Opus 4 with superior performance. Handles complex multi-step agentic tasks with more rigor.
https://www.anthropic.com/news/claude-opus-4-1
OpenAI
GPT-5.1
November 11, 2025
Warmer personality, improved instruction-following. Adds shopping research and 8 personality options. Retired March 2026.
https://openai.com/index/gpt-5-1/
OpenAI
GPT-5.2
December 10, 2025
Next upgrade to the GPT-5 series. Smarter and more useful for work and learning. 400K context, 128K output.
https://openai.com/index/introducing-gpt-5-2/
OpenAI
GPT-5.2-Codex
December 17, 2025
Agentic coding model for long-horizon task completion. Context compaction for multi-window workflows. 56.4% on SWE-Bench Pro.
https://openai.com/index/introducing-gpt-5-2-codex/
OpenAI
GPT-5.3-Codex
February 4, 2026
Most capable agentic coding model to date. 25% faster than GPT-5.2-Codex. State-of-the-art on SWE-Bench Pro and Terminal-Bench.
https://openai.com/index/introducing-gpt-5-3-codex/
OpenAI
GPT-5.3 Instant
March 2, 2026
Smoother, more helpful everyday conversations. 400K context. 26.8% fewer hallucinations than GPT-5.2 with web search.
https://openai.com/index/gpt-5-3-instant/
OpenAI
GPT-5.4 nano
March 16, 2026
Smallest and cheapest OpenAI model. Optimized for classification, data extraction, ranking, and coding subagents.
https://openai.com/index/introducing-gpt-5-4-mini-and-nano/
OpenAI
GPT-5.4 mini
March 16, 2026
Most capable small OpenAI model. 2x faster than GPT-5.4. Strong on coding, reasoning, and computer use. Free and Go users.
https://openai.com/index/introducing-gpt-5-4-mini-and-nano/
OpenAI
GPT-5.4
March 4, 2026
Most capable frontier model for professional work. Native computer use, 1M context, consolidated coding+reasoning. 33% fewer hallucinations vs GPT-5.2.
https://openai.com/index/introducing-gpt-5-4/
Mistral
Mistral Small 4
March 16, 2026
Unified model combining text, vision, reasoning, and coding. MoE 119B total / 6B active params. 256K context. Apache 2.0.
https://mistral.ai/news/mistral-small-4
Google
Gemini 3.1 Flash Lite
March 2, 2026
Ultra-efficient Gemini 3.1 model for developers. Fast and cost-effective for high-volume API workloads.
https://ai.google.dev/gemini-api/docs/models
Anthropic
Claude Sonnet 4.6
February 16, 2026
Default model for free and Pro users. 1M token context (beta). Improved computer use, coding, and large-scale data processing.
https://www.anthropic.com/news/claude-sonnet-4-6
Google
Gemini 3.1 Pro
February 18, 2026
Advanced reasoning over complex problems. 77.1% on ARC-AGI-2. Available via Gemini API, Vertex AI, and NotebookLM.
https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-1-pro/
Anthropic
Claude Opus 4.6
February 4, 2026
1M token context window. 14.5-hour task completion horizon. Top Finance Agent benchmark. Stronger coding and planning.
https://www.anthropic.com/news/claude-opus-4-6
Google
Gemini 3 Flash
December 16, 2025
Efficient Gemini 3 model. Includes Gemini 3 Pro reasoning in a faster, cheaper package. Default model in the Gemini app.
https://techcrunch.com/2025/12/17/google-launches-gemini-3-flash-makes-it-the-default-model-in-the-gemini-app/
Mistral
Ministral 3 (3B/8B/14B)
December 1, 2025
Family of dense small models in 3 sizes × 3 variants (Base/Instruct/Reasoning). Runs on 4GB VRAM. Apache 2.0 license.
https://mistral.ai/news/mistral-3
Mistral
Mistral Large 3
December 1, 2025
MoE architecture: 41B active / 675B total parameters. Multimodal, 256K context. Apache 2.0. #2 on LMArena OSS leaderboard.
https://mistral.ai/news/mistral-3
Google
Gemini 3 Pro
November 17, 2025
Best multimodal understanding model. Most powerful agentic and coding model from Google. Replaces Gemini 2.5 Pro.
https://blog.google/products/gemini/gemini-3/
Anthropic
Claude Opus 4.5
November 23, 2025
Most robustly aligned Claude model. Reclaimed coding crown with 80.9% SWE-bench Verified. Significantly reduced pricing.
https://www.anthropic.com/news/claude-opus-4-5
Anthropic
Claude Haiku 4.5
October 14, 2025
Fast and affordable Claude 4 family model. Targets latency-sensitive applications. Available on Claude.ai and mobile.
https://www.anthropic.com/news/claude-haiku-4-5
Google
Gemini 2.5 Flash Image
August 25, 2025
Dedicated image generation model from Google. Better instruction following and text rendering than previous image models.
https://blog.google/products/gemini/
Google
Gemini 2.5 Flash-Lite
June 16, 2025
Ultra-efficient Gemini 2.5 variant optimized for speed and cost. Lowest latency in the Gemini 2.5 family.
https://ai.google.dev/gemini-api/docs/models
OpenAI
o3-pro
June 9, 2025
Extended-thinking version of o3. Thinks longer for more reliable responses on the hardest tasks. ChatGPT Pro and API.
https://openai.com/blog/o3-pro
OpenAI
GPT-4.1 mini
April 13, 2025
Fast, capable small model. Replaces GPT-4o mini for paid users. Significant improvements in coding and instruction-following.
https://openai.com/blog/gpt-4-1
OpenAI
GPT-4.1
April 13, 2025
Specialized for coding and instruction-following. Stronger than GPT-4o at precise web dev tasks. API-only release.
https://openai.com/blog/gpt-4-1
Google
Gemini 2.5 Flash
June 16, 2025
Efficient version of Gemini 2.5 (GA). Controllable thinking budget. Best price-performance in Gemini family.
https://developers.googleblog.com/en/gemini-2-5-flash-updates/
OpenAI
GPT-5
August 6, 2025
Unifies reasoning and standard model capabilities. Automatically switches between fast and deep thinking modes based on task needs.
https://openai.com/blog/gpt-5
Anthropic
Claude Opus 4
July 8, 2025
Most intelligent Claude model. Designed for complex, long-horizon tasks and deep reasoning.
https://www.anthropic.com/news/claude-opus-4
Anthropic
Claude Sonnet 4
July 8, 2025
Next generation Claude model. Enhanced coding, reasoning, and agentic task performance.
https://www.anthropic.com/news/claude-sonnet-4
Meta
Llama 4
April 4, 2025
MoE architecture models: Scout (17Bx16E) and Maverick (17Bx128E). Native multimodal, 10M context window.
https://ai.meta.com/blog/llama-4-multimodal-intelligence/
OpenAI
o4-mini
April 15, 2025
Cost-efficient reasoning model with vision. Faster than o3, strong coding performance. Tool use support.
https://openai.com/blog/introducing-o3-and-o4-mini
OpenAI
o3
April 15, 2025
Flagship reasoning model. Top ARC-AGI scores. Available in API with high/medium/low reasoning effort settings.
https://openai.com/blog/introducing-o3-and-o4-mini
Mistral
Mistral Small 3.1
March 16, 2025
Updated small model with vision capabilities added. 24B parameters, 128K context. Apache 2.0 license.
https://mistral.ai/news/mistral-small-3-1/
Google
Gemini 2.5 Pro (Preview)
March 24, 2025
State-of-the-art reasoning model. 1M context, native audio/video/image understanding. Top LMArena scores.
https://blog.google/technology/google-deepmind/gemini-model-thinking-updates-march-2025/
OpenAI
GPT-4.5
February 26, 2025
Largest and most capable GPT model. Improved emotional intelligence and world knowledge. 128K context.
https://openai.com/blog/gpt-4-5
Google
Gemini 2.0 Pro (Experimental)
February 4, 2025
Most capable Gemini 2.0 model for complex tasks. 2M context window. Code execution and Google Search grounding.
https://blog.google/products/gemini/gemini-2-flash-family/
Google
Gemini 2.0 Flash
February 4, 2025
Generally available release. Upgraded image generation, improved multilingual audio. 1M context. Free tier available.
https://blog.google/products/gemini/gemini-2-flash-family/
Mistral
Mistral Saba
February 16, 2025
Specialized for Arabic and South Asian languages. Strong Middle East/North Africa cultural context understanding.
https://mistral.ai/news/mistral-saba/
Anthropic
Claude 3.7 Sonnet
February 23, 2025
First hybrid reasoning model. Switchable extended thinking mode. Top scores on SWE-bench Verified (70.3%).
https://www.anthropic.com/news/claude-3-7-sonnet
OpenAI
o3-mini
January 30, 2025
Cost-efficient small reasoning model. Three effort levels (low/medium/high). Outperforms o1 on AIME and SWE-bench.
https://openai.com/blog/openai-o3-mini
Mistral
Mistral Small 3
January 14, 2025
24B parameter model with 81% MMLU. Apache 2.0 license. Best-in-class small model for latency-sensitive tasks.
https://mistral.ai/news/mistral-small-3/
Mistral
Codestral 25.01
January 12, 2025
Updated code generation model. 256K context window, supports 80+ programming languages. Available via API.
https://mistral.ai/news/codestral-2501/
Meta
Llama 3.3 70B
December 5, 2024
Updated 70B model with performance matching Llama 3.1 405B. More compute-efficient. Open weights.
https://www.llama.com/docs/model-cards-and-prompt-formats/llama3_3/
Google
Gemini 2.0 Flash (Experimental)
December 10, 2024
First Gemini 2.0 model. Multimodal I/O, native tool use, 1M context. Faster than 1.5 Pro. Agentic capabilities.
https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/
OpenAI
o1
December 4, 2024
Full reasoning model release. Includes vision support, 200K context. Available in ChatGPT and API.
https://openai.com/blog/o1-and-new-tools-for-developers
Anthropic
Claude 3.5 Haiku
November 3, 2024
Fastest Claude model. Matches Claude 3 Opus on many benchmarks. 200K context. API and Bedrock availability.
https://www.anthropic.com/news/claude-3-5-haiku
Mistral
Pixtral Large
November 6, 2024
124B parameter multimodal model. Strong document understanding, charts, and scientific diagrams.
https://mistral.ai/news/pixtral-large/
Anthropic
Claude 3.5 Sonnet (v2)
October 21, 2024
Upgraded Sonnet with improved coding (SWE-bench 49%), vision, and computer use capability (beta).
https://www.anthropic.com/news/3-5-models-and-computer-use
Google
Gemini 1.5 Flash-8B
October 2, 2024
Smallest Gemini model for high-frequency, low-latency tasks. 1M token context. Generally available via API.
https://developers.googleblog.com/en/gemini-15-flash-8b-is-now-generally-available-for-use/
Meta
Llama 3.2
September 24, 2024
Multimodal models (11B, 90B) with vision. Lightweight text models (1B, 3B) for edge deployment. Open weights.
https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/
OpenAI
o1-mini
September 11, 2024
Smaller, faster reasoning model. Cost-effective for coding and STEM tasks without broad world knowledge needs.
https://openai.com/blog/openai-o1-mini-advancing-cost-efficient-reasoning
OpenAI
o1-preview
September 11, 2024
Early access reasoning model with extended chain-of-thought. Designed for complex science, coding, and math tasks.
https://openai.com/blog/learning-to-reason-with-llms
Mistral
Pixtral 12B
September 10, 2024
First multimodal model from Mistral. 12B parameters with vision capabilities. Apache 2.0 license.
https://mistral.ai/news/pixtral-12b/
Mistral
Mistral Large 2
July 23, 2024
128K context, strong coding and multilingual capabilities. 123B parameters. Apache 2.0 license.
https://mistral.ai/news/mistral-large-2407/
Meta
Llama 3.1
July 22, 2024
Open-weights models: 8B, 70B, 405B. 128K context window. First open model competitive with frontier closed models.
https://ai.meta.com/blog/meta-llama-3-1/
OpenAI
GPT-4o mini
July 17, 2024
Affordable, fast frontier model. 128K context, supports vision. Lower cost alternative to GPT-4o.
https://openai.com/blog/gpt-4o-mini-advancing-cost-efficient-intelligence