MiniMax M2.5: A New Chinese Open-Source Model That Claims to Beat Claude & GPT at Coding
MiniMax M2.5 scores 80.2% on SWE-Bench Verified, runs at 100 tokens/sec, and costs $1/hour. Here’s what we know — and what’s still unverified.
MiniMax M2.5 scores 80.2% on SWE-Bench Verified, runs at 100 tokens/sec, and costs $1/hour. Here’s what we know — and what’s still unverified.
On February 5, 2026, Anthropic released Claude Opus 4.6 — the latest and most capable model in its Claude lineup. Arriving just three months after Opus 4.5, this release brings a 1-million-token context window to the Opus family for the first time, introduces collaborative agent teams in Claude Code, and delivers benchmark results that put […]
On January 27, 2026, Chinese AI company Moonshot AI released Kimi K2.5 — and the tech world took notice. Within days, independent evaluations confirmed what the company claimed: Kimi K2.5 performs on par with the best AI models from OpenAI, Google, and Anthropic on many key benchmarks. But unlike ChatGPT, Claude, or Gemini, Kimi K2.5 […]
Today, November 25, 2025, Black Forest Labs released FLUX.2, a new family of image-generation models aimed directly at the high-end creative, marketing, and product-visualization markets. The company, founded in 2024 and known for its open-core approach to multimodal research, positions FLUX.2 as both a frontier-level image generator and a model that can actually hold up in real production […]
November 24, 2025. Anthropic has officially launched Claude Opus 4.5, a major refresh of its top-tier model and the company’s strongest push yet in the fight for AI leadership. Coming just days after Google’s Gemini 3 debut, the new Opus arrives with a sharp focus on professional coding, long-running agents, and desk-work automation—and Anthropic is backing the […]
On Wednesday, November 12, 2025, OpenAI unveiled GPT-5.1, a major mid-cycle upgrade to its flagship ChatGPT models. The update introduces two new variants — GPT-5.1 Instant and GPT-5.1 Thinking — both designed to make conversations faster, smarter, and more natural. According to OpenAI, GPT-5.1 is built to feel “warmer” and more human-like in tone, while also improving instruction-following and reasoning. The Instant model […]
On November 6, 2025, Alibaba-backed Moonshot AI released Kimi K2 Thinking, its most advanced open-source model yet. It’s the first reasoning-focused variant in the Kimi K2 family and marks a major step forward in long-context, multi-step reasoning and autonomous tool use. Kimi K2 Thinking immediately made headlines for its performance: it set new state-of-the-art scores on several open benchmarks, including Humanity’s Last […]
On September 29, 2025, Anthropic has released Claude Sonnet 4.5, its latest frontier AI model, positioning it as the most capable and most aligned model the company has developed to date. The launch builds on the foundation of previous Claude Sonnet and Opus releases, with a focus on sustained autonomous performance, coding ability, tool use, and […]
Alibaba’s AI stack just crossed a symbolic frontier. On September 5 2025, the company’s cloud division unveiled Qwen-3-Max-Preview (Instruct), a 1-trillion-parameter large-language model that immediately takes its place among the most complex systems ever deployed and the most powerful AI models. The rollout began quietly inside Qwen Chat and Alibaba Cloud Model Studio. Soon coming to Fello AI. Only […]
Moonshot AI, Chinese startup backed by Alibaba and Tencent, is rolling out an upgraded version of its open‑weight open-source large language model, Kimi K2. The new build Kimi K2‑0905, introduces a 256,000-token context window, improved coding performance, and retains its permissive modified MIT license. Although early beta access was briefly mentioned on the company’s Discord […]
Junte-se a milhares de fãs e profissionais de IA que beneficiam de dicas exclusivas e de conhecimentos dos líderes do sector.