Newsletter image

Subscribe to the Newsletter

Join 10k+ people to get notified about new posts, news and tips.

Do not worry we don't spam!

By pressing the Subscribe button, you confirm that you have read and are agreeing to our Privacy Policy and Terms of Use

Search

GDPR Compliance

We use cookies to ensure you get the best experience on our website. By continuing to use our site, you accept our use of cookies, Privacy Policy, and Terms of Service.

Bagel AI

In May 2025, ByteDance introduced BAGEL, an open-source multimodal AI model with 7 billion active parameters that excels in text understanding, image generation, video processing, and reasoning, outperforming leading open-source models. BAGEL uses a unified, decoder-only architecture with a Mixture-of-Transformer-Experts (MoT) and dual encoders, making it efficient across diverse modalities. It is trained on a large dataset of interleaved multimodal tokens and is available under the Apache 2.0 license. BAGEL surpasses competitors in benchmarks for multimodal tasks and is praised for its performance and accessibility. It holds potential for applications in creative industries, robotics, and research. Despite facing challenges like dependency requirements, BAGEL is set to drive innovation in AI. Explore its capabilities on GitHub or Hugging Face.

Cogito v1

DeepCogito, based in San Francisco, has introduced the Cogito v1 Preview series of open-source AI models, available in various parameter sizes from 3B to 70B. These models are designed for diverse tasks, from lightweight to heavy-duty challenges, and are freely accessible for commercial use on platforms like Hugging Face and Ollama. They claim to outperform competitors like LLaMA and Qwen, though specific benchmark scores are not disclosed. The models feature hybrid reasoning modes, enabling them to switch between standard and reasoning tasks, and are optimized for coding, STEM tasks, and agentic applications. Trained on over 30 languages with a 128k context length, they are versatile and globally applicable. While these models are early versions, larger ones are anticipated. DeepCogito encourages innovators to explore and utilize these models to unlock AI's potential and drive future advancements.

HiDream-L1

HiDream-L1 is the latest AI model from HiDream AI, launched on April 7, 2025. This 17-billion-parameter model is designed to generate high-quality visual content quickly and is open-source under the MIT license, making it free for commercial use. The company, founded in 2023 in China, aims to democratize AI across various media forms. HiDream-L1 surpasses competitors in performance metrics, offering photorealistic images with prompt accuracy. To run it, you'll need a powerful system with specific hardware and software requirements. HiDream AI is fostering an open-source movement, encouraging creators to utilize and innovate with their tools. The model is accessible on platforms like GitHub and Hugging Face.