Newsletter image

Subscribe to the Newsletter

Join 10k+ people to get notified about new posts, news and tips.

Do not worry we don't spam!

By pressing the Subscribe button, you confirm that you have read and are agreeing to our Privacy Policy and Terms of Use

Search

GDPR Compliance

We use cookies to ensure you get the best experience on our website. By continuing to use our site, you accept our use of cookies, Privacy Policy, and Terms of Service.

AI2 OLMo 3.1

The Allen Institute for AI (AI2) has released OLMo 3.1, a family of fully open 32 billion parameter reasoning models. Unlike most open-weight releases, OLMo 3.1 ships with the complete training data, training code, evaluation scripts, and intermediate checkpoints, making it the most transparent frontier model ever published. The Think 32B variant gains 5+ points on AIME, 4+ on ZebraLogic, 4+ on IFEval, and over 20 points on IFBench compared to OLMo 3. AI2 calls the Instruct 32B variant the most capable fully open chat model to date.

Alibaba Qwen3.5

Alibaba has released Qwen3.5, headlined by a 397 billion parameter Mixture-of-Experts model with 17 billion active parameters per token. Shipped under Apache 2.0 on Hugging Face, Qwen3.5 scores 93.3% on AIME 2026, 85.0 on LiveCodeBench v6, and 76.8% on SWE-Bench Verified, putting it in frontier territory for math, coding, and agent tasks. The broader Qwen3.5 family spans dense models from sub-1 billion up to 32 billion parameters, plus sparse MoE variants, giving developers open-weight options at every scale.

Moonshot Kimi K2.5

Moonshot AI has released Kimi K2.5, a 1 trillion parameter Mixture-of-Experts language model with 32 billion active parameters per request. The model uses 61 layers with 384 experts and sparse 8-expert activation, natively trained on roughly 15 trillion mixed vision and text tokens for a 256K context window. Kimi K2.5 outperforms GPT-5.2 on MMMU Pro (78.5%), BrowseComp (74.9%), and AIME 2025 (96.1%), and the Agent Swarm configuration reaches 50.2% on Humanity Last Exam at 76% lower cost than Claude Opus 4.5. Weights ship on Hugging Face under a Modified MIT license.