MiniMax M2.5: A New Chinese Open-Source Model That Claims to Beat Claude & GPT at Coding
MiniMax M2.5 scores 80.2% on SWE-Bench Verified, runs at 100 tokens/sec, and costs $1/hour. Here’s what we know — and what’s still unverified.
On February 5, 2026, Anthropic released Claude Opus 4.6 — the latest and most capable model in its Claude lineup. Arriving just three months after Opus 4.5, this release brings a 1-million-token context window to the Opus family for the first time, introduces collaborative agent teams in Claude Code, and delivers benchmark results that put […]
On January 27, 2026, Chinese AI company Moonshot AI released Kimi K2.5 — and the tech world took notice. Within days, independent evaluations confirmed what the company claimed: Kimi K2.5 performs on par with the best AI models from OpenAI, Google, and Anthropic on many key benchmarks. But unlike ChatGPT, Claude, or Gemini, Kimi K2.5 […]
Today, November 25, 2025, Black Forest Labs released FLUX.2, a new family of image-generation models aimed directly at the high-end creative, marketing, and product-visualization markets. The company, founded in 2024 and known for its open-core approach to multimodal research, positions FLUX.2 as both a frontier-level image generator and a model that can actually hold up in real production […]
November 24, 2025. Anthropic has officially launched Claude Opus 4.5, a major refresh of its top-tier model and the company’s strongest push yet in the fight for AI leadership. Coming just days after Google’s Gemini 3 debut, the new Opus arrives with a sharp focus on professional coding, long-running agents, and desk-work automation—and Anthropic is backing the […]
On Wednesday, November 12, 2025, OpenAI unveiled GPT-5.1, a major mid-cycle update to its flagship ChatGPT models. The update introduces two new variants, GPT-5.1 Instant and GPT-5.1 Thinking, both designed to make conversations faster, smarter, and more natural. According to OpenAI, GPT-5.1 is designed to be warmer and more human, while improving instruction following and reasoning. The Instant model [...]
On November 6, 2025, Alibaba-backed Moonshot AI released Kimi K2 Thinking, its most advanced open-source model to date. It is the first reasoning-focused variant of the Kimi K2 family, marking a major step forward in long-context, multi-step reasoning and autonomous tool use. Kimi K2 Thinking immediately made headlines for its performance: it set new scores on several open benchmarks, including Humanity's Last [...]
On September 29, 2025, Anthropic released Claude Sonnet 4.5, its latest frontier AI model, positioning it as the most capable and most aligned model the company has developed to date. The launch builds on the foundation of previous Claude Sonnet and Opus releases, with a focus on sustained autonomous performance, coding ability, tool use, and […]
Alibaba’s AI stack just crossed a symbolic frontier. On September 5, 2025, the company’s cloud division unveiled Qwen-3-Max-Preview (Instruct), a 1-trillion-parameter large language model that immediately takes its place among the most complex and most powerful AI systems ever deployed. The rollout began quietly inside Qwen Chat and Alibaba Cloud Model Studio. Coming soon to Fello AI. Only […]
Moonshot AI, a Chinese startup backed by Alibaba and Tencent, is rolling out an upgraded version of its open-weight large language model, Kimi K2. The new build, Kimi K2-0905, introduces a 256,000-token context window and improved coding performance, and retains its permissive modified MIT license. Although early beta access was briefly mentioned on the company’s Discord […]