Mistral Large 3: the 675B flagship Mistral finally shipped under Apache 2.0

Subscribe to the Newsletter

Join 10k+ people to get notified about new posts, news and tips.

Do not worry we don't spam!

By pressing the Subscribe button, you confirm that you have read and are agreeing to our Privacy Policy and Terms of Use

Log in

Have no account yet? Sign up

Create an account

Already have an account? Log in

Reset password

Remember your password? Log in

Terms of use

SingularityByte.com values the privacy of our users. Therefore, this privacy policy explains in detail how we use and protect the information we collect when you visit our website.. Read this privacy policy completely. Please refrain from visiting the site if the terms outlined below are not satisfactory to you. We reserve the right to change this policy at any time and will list these changes in the updates section of the policy. By reading this notice and visiting the site, you agree that you understand that customers will not be personally notified when this policy changes. Therefore, we advise our customers to frequently review our privacy policy so that they remain aware of its updates. By using the site, you accept that the posted policy and all its changes apply to your interaction with SingularityByte.com.

Information Collected by SingularityByte.com

Personal information may be collected by this site in many ways. This information includes:

Personal identifying information like your name, address, email, phone number, age, gender, and other personal data
Server data related to the IP address you used to visit our website, which includes your address, browser, OS, access time, and site activity.
Financial information related to your orders including your payment method and identifying payment information. We rarely store financial information collected on our site for transaction purposes. That information gets sent directly to our payment processor.
Social network data including Facebook permissions and user information from other networks, provided you log onto our site using one of these media sites.
Mobile device information such as your device ID, model, and location, if you use our site by accessing trough our website.

How We Use This Information

Our website uses information collected to:
• Manage your account information
• Customize ads
• Deliver promotions
• Email your account confirmation
• Manage purchases and payments
• Increase site efficiency
• Notify you of updates
• Offer new products
• Monitor and prevent theft
• Request your customer feedback
• Resolve account disputes
• Respond to your service requests

Information Disclosure

Normally, your information stays on our site. However, below we have listed the situations that may
require us to share the information we collect from you:
• When required by law, such as for fraud protection
• With our third-party providers for payment processing and hosting
• With your consent for marketing purposes
• When you post comments on the site
• To our advertisers, affiliates, and partners
• If this site goes bankrupt and data must be transferred

Cookies, Trackers, and Online Ads

We may use cookies, trackers, web beacons, and other technology to customize our website to improve your experience. We may customize the site using this information. These trackers do not have access to your personal information and can be removed from your browser options. In addition, third-party software provides ads for our site for marketing campaigns. These programs have access to tracking technology to optimize your ad experience. For more information about these
ads, visit [link to the privacy policies of affiliate advertisers]. Website analytics such as through Google Analytics may also be used to track users
and remarket our website. We do not give these vendors access to your personal information.

Other Sites

Our website may contain links to third-party websites in the form of policies, ads, and other non-affiliated links. Once you leave our site, we are no longer responsible for how your information is collected and disclosed. Please refer to the privacy policies of those third-party sites for more information.

Information Security

We take technical and administrative precautions to protect your data, but we cannot guarantee its safety against all types of fraud or misuse. If you provide personal information, we cannot verify its total security against all types of interception.

Do-Not-Track

Some browsers offer Do-Not-Track settings to prevent any information from being distributed. Since these settings have not been legally established as standard practice, we do acknowledge these settings.

Additional Options

At any time, you may opt to review or change your account settings, including contact information. If you wish to delete your account, you may do so to remove most of your information, however, some identifying information will be retained to prevent fraud.
You may also opt-out of emails and other correspondences from our site at any time.

Microsoft Clarity

We partner with Microsoft Clarity and Microsoft Advertising to capture how you use and interact with our website through behavioral metrics, heatmaps, and session replay to improve and market our products/services. Website usage data is captured using first and third-party cookies and other tracking technologies to determine the popularity of products/services and online activity. Additionally, we use this information for site optimization, fraud/security purposes, and advertising. For more information about how Microsoft collects and uses your data, visit the Microsoft Privacy Statement.

Contact Us

If you have questions or concerns about this privacy policy, please feel free to contact us at: desk@SingularityByte.com

Do you agree to our terms? Sign up

License Apache 2.0

TL;DR

Mistral AI's flagship: a 675B-parameter MoE (about 41B active), multimodal and multilingual, with a 256K context.
Released December 2, 2025 under Apache 2.0, moving the whole Mistral 3 family off the research-only Mistral Research License.
A non-reasoning model: strong for chat and assistant work, behind reasoning-tuned open models on hard coding and math. Not a laptop model.

☍ Announcement ⬇ Download Model

System Requirements

RAM	192GB+ (FP8 node)
GPU	8x H200/B200
VRAM	FP8 single node

✓ Ollama

Table of Contents

Mistral did the thing the open-source community had been asking for since the Mixtral days: it put a flagship-class model under a license you can actually build a business on. Mistral Large 3, released December 2, 2025, is a 675-billion-parameter Mixture-of-Experts model under Apache 2.0. No research-only clause, no commercial license to buy, no asterisk. For a European lab that spent two years gating its best weights behind a restrictive license, that is the story. The model itself is good, not frontier, and we will be straight about which is which.

What changed: the license, mostly

Start with the part that matters to builders. Mistral's previous flagship, Mistral Large 2, shipped under the Mistral Research License: free to study, but you needed a paid commercial license to ship anything with it. Mistral Large 3 drops that entirely. The whole Mistral 3 family, including this 675B flagship, is Apache 2.0: use it commercially, self-host it, fine-tune it, redistribute it, no permission and no fee.

That puts Mistral back on the same open footing as DeepSeek, Qwen, and Llama, after a stretch where its openness story had quietly eroded. For anyone who wants a frontier-adjacent European model with clean commercial terms and EU data residency, this is the one.

What it actually is

Mistral Large 3 is a sparse Mixture-of-Experts model, Mistral's first big MoE since Mixtral. (An MoE routes each token to a few "expert" subnetworks, so you store all the parameters but only run a fraction per token.) The numbers: 675B total, about 41B active per token, which breaks down as a 673B language model plus a 2.5B vision encoder. It takes text and images, handles 40-plus languages, and carries a 256K-token context window. Mistral says it was trained from scratch on 3,000 NVIDIA H200 GPUs.

One thing it is not: a reasoning model. There is no long chain-of-thought mode baked in. That single fact explains most of the benchmark picture below.

Benchmarks: strong chat, not a leaderboard-topper

Be clear-eyed here. As a non-reasoning instruct model, Mistral Large 3 competes on general knowledge and chat, not on the hard reasoning and agentic-coding tests that reasoning-tuned models dominate. Mistral's own framing puts it at parity with the best instruction-tuned open models, second among open non-reasoning models on LMArena. Independent trackers are cooler.

Model	Type	Active params	License	AA Intelligence Index
GLM-5.2	reasoning	~40B	MIT	51
MiniMax M3	reasoning	~23B	open	44
DeepSeek V4 Pro	reasoning	n/p	open	44
Mistral Large 3	non-reasoning	41B	Apache 2.0	16

Artificial Analysis Intelligence Index (independent). The index leans heavily on reasoning and agentic tasks, which Mistral Large 3 deliberately does not do, so it understates the model's value for plain chat and assistant work. Treat it as a reasoning-weighted ranking, not a verdict on general usefulness.

On GPQA-Diamond and SWE-Bench it trails the reasoning models badly, because those reward the step-by-step thinking it skips. It lands above Llama 4 Maverick and OLMo 3 on the same index. The takeaway: this is a competent, fast, multilingual assistant model, not the thing you reach for to solve a competition math problem or drive an autonomous coding agent.

Limitations and gotchas

Not a reasoning model. If your task needs multi-step reasoning, agentic coding, or hard math, the reasoning-tuned open models beat it outright.
675B is not local. All experts must be resident; FP8 fits a single H200 or B200 node, NVFP4 a single H100 or A100 node. For your laptop, Mistral points to the smaller Ministral 3 family (14B, 8B, 3B).
No GGUF or llama.cpp build at launch, and transformers support lagged. vLLM is the path; Ollama exposes it only as a cloud tag.
Apache 2.0 governs the weights, though the model card adds a standard acceptable-use note.

Who should use it

Use it if you want a permissively licensed, multilingual, multimodal model for chat, drafting, extraction, and general assistant work, and you value EU provenance and clean commercial terms over leaderboard position. Reach for the API ($0.50 in, $1.50 out per million tokens) or a rented node; self-host only if you have the hardware. If you need reasoning or agentic coding, pair it with, or swap it for, a reasoning model like GLM-5.2 or DeepSeek V4. For local use, the smaller Mistral Small family is the better fit.

Run it in about 10 minutes

The fastest path is the hosted API. Self-hosting means a multi-GPU server.

# Quickest: the hosted API (la Plateforme). Set MISTRAL_API_KEY first.
curl https://api.mistral.ai/v1/chat/completions \
  -H "Authorization: Bearer $MISTRAL_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"mistral-large-2512","messages":[{"role":"user","content":"Summarize this clause in plain English."}]}'

# Self-host FP8 on a single H200/B200 node with vLLM
vllm serve mistralai/Mistral-Large-3-675B-Instruct-2512 --tensor-parallel-size 8

If you only want to feel the licensing freedom, the more useful 10-minute move is to pull a small family member you can actually run, the Ministral 3 3B or Mistral Small, and confirm the Apache 2.0 license file is right there in the repo. That is the part you could not do with Large 2.

Sources and further reading

Tested on: not independently tested. Mistral Large 3 is a 675B MoE that needs a single H200/B200-class node even at FP8, beyond our bench. Mistral's own benchmark charts ship as images, so the comparison here uses independent Artificial Analysis figures, flagged as such. Sources linked above.
Date checked: 2026-06-26

Subscribe to the Newsletter

Search

GDPR Compliance