mistralai/Mistral-Large-3-675B-Instruct-2512
A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture.
Available variants:
- FP8: a lossless FP8 quantization that reduces resource requirements; it can be deployed on a single node of B200s or H200s (see the serving sketch after this list).
- NVFP4: a high-quality NVFP4 quantization that reduces resource requirements; it can be deployed on a node of H100s or A100s.
- EAGLE speculator: an EAGLE draft model for the FP8 checkpoint, used for speculative decoding (see the second sketch after this list).
- Base BF16 weights: the unquantized weights, intended for fine-tuning.
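
For reference, a minimal offline-serving sketch with vLLM for the FP8 variant. This is a sketch under assumptions, not an official recipe: the repository id below is hypothetical (substitute the actual FP8 checkpoint name from this collection), and `tensor_parallel_size=8` assumes one node of 8 B200s/H200s, per the note above.

```python
# Minimal vLLM sketch for serving the FP8 variant on one 8-GPU node.
# The repo id below is hypothetical; use the real FP8 checkpoint name.
from vllm import LLM, SamplingParams

llm = LLM(
    model="mistralai/Mistral-Large-3-675B-Instruct-2512-FP8",  # hypothetical repo id
    tensor_parallel_size=8,  # shard the model across the node's 8 GPUs
)

params = SamplingParams(temperature=0.7, max_tokens=256)
outputs = llm.generate(
    ["Summarize what a mixture-of-experts layer does."], params
)
print(outputs[0].outputs[0].text)
```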
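
And a sketch of pairing the FP8 model with the EAGLE speculator for speculative decoding. Everything here is an assumption to be checked against your setup: both repo ids are hypothetical, and the `speculative_config` dict reflects recent vLLM releases; older versions expose different flags, so consult the docs for your version.

```python
# Hedged sketch: EAGLE speculative decoding in vLLM. Both repo ids are
# hypothetical, and speculative_config's exact schema varies by vLLM version.
from vllm import LLM, SamplingParams

llm = LLM(
    model="mistralai/Mistral-Large-3-675B-Instruct-2512-FP8",  # hypothetical
    tensor_parallel_size=8,
    speculative_config={
        "method": "eagle",
        "model": "mistralai/Mistral-Large-3-675B-Instruct-2512-Eagle",  # hypothetical
        "num_speculative_tokens": 3,  # draft tokens proposed per decoding step
    },
)

outputs = llm.generate(["Hello!"], SamplingParams(max_tokens=64))
print(outputs[0].outputs[0].text)
```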