Newsletter image

Subscribe to the Newsletter

Join 10k+ people to get notified about new posts, news and tips.

Do not worry we don't spam!

By pressing the Subscribe button, you confirm that you have read and are agreeing to our Privacy Policy and Terms of Use

Search

GDPR Compliance

We use cookies to ensure you get the best experience on our website. By continuing to use our site, you accept our use of cookies, Privacy Policy, and Terms of Service.

ZAYA1-8B

Zyphra released ZAYA1-8B, an 8.4B-total / 760M-active Mixture-of-Experts reasoning model under Apache 2.0. It is the first large MoE pretrained, midtrained, and fine-tuned end-to-end on AMD Instinct MI300X, with no Nvidia in the loop. The single-pass math and code scores are strong, the agentic numbers are not, and every benchmark is Zyphra-reported.

HY-MT1.5

HY-MT1.5 is Tencent open-source translation family with 33 languages and 5 dialects, from 7B server down to a 440MB GGUF that runs on phones.