Newsletter image

Subscribe to the Newsletter

Join 10k+ people to get notified about new posts, news and tips.

Do not worry we don't spam!

By pressing the Subscribe button, you confirm that you have read and are agreeing to our Privacy Policy and Terms of Use

Search

GDPR Compliance

We use cookies to ensure you get the best experience on our website. By continuing to use our site, you accept our use of cookies, Privacy Policy, and Terms of Service.

OpenAI’s gpt-oss Models: A New Era for Open-Source AI

OpenAI has released two open-weight models, gpt-oss-120b and gpt-oss-20b, under Apache 2.0, marking a shift towards open-source AI. These models are designed for efficiency and performance, with the 120b model featuring 120 billion parameters suitable for a single 80GB GPU and the 20b model running on 16GB of VRAM, accessible for consumer hardware. They excel in reasoning, code generation, and tool integration, competing with leading AI models. Despite not releasing training data due to legal concerns, OpenAI emphasizes the models' performance and encourages community feedback through a Red Teaming Challenge to enhance safety. These models offer developers powerful tools for AI applications without proprietary constraints, fostering innovation in areas like code assistance, customer service, and research tools.

Google Gemma 3

Google's Gemma 3 models are making significant strides in the open-source AI community. These models, ranging from 270M to 27B parameters, are designed for high efficiency and versatility, capable of running on devices from smartphones to consumer GPUs. Key features include multimodal capabilities, an expanded context window, multilingual support, and efficient fine-tuning. The models offer high performance with a smaller footprint, making them ideal for on-device applications. With open-source availability, developers can use, modify, and distribute the models for commercial purposes. Safety measures ensure responsible usage, and Gemma 3's accessibility democratizes AI development, enabling innovative and cost-effective solutions.

Bagel AI

In May 2025, ByteDance introduced BAGEL, an open-source multimodal AI model with 7 billion active parameters that excels in text understanding, image generation, video processing, and reasoning, outperforming leading open-source models. BAGEL uses a unified, decoder-only architecture with a Mixture-of-Transformer-Experts (MoT) and dual encoders, making it efficient across diverse modalities. It is trained on a large dataset of interleaved multimodal tokens and is available under the Apache 2.0 license. BAGEL surpasses competitors in benchmarks for multimodal tasks and is praised for its performance and accessibility. It holds potential for applications in creative industries, robotics, and research. Despite facing challenges like dependency requirements, BAGEL is set to drive innovation in AI. Explore its capabilities on GitHub or Hugging Face.