Qwen3-VL Collection Qwen's new multimodal vision models in GGUF, safetensor, and dynamic Unsloth formats. ⢠56 items ⢠Updated 4 days ago ⢠17
MMLU Pro benchmark for GGUFs (1 shot) Collection "Not all quantized model perform good", serving framework ollama uses NVIDIA gpu, llama.cpp uses CPU with AVX & AMX ⢠13 items ⢠Updated Aug 15 ⢠10
gpt-oss Collection OpenAI's gpt-oss-20b and gpt-oss-120b is here! The powerful open models are available in GGUF, original & 4-bit formats. ⢠18 items ⢠Updated 4 days ago ⢠36
Llama 4 Collection Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! ⢠15 items ⢠Updated 4 days ago ⢠53
Gemma 3 Collection All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. ⢠55 items ⢠Updated 4 days ago ⢠96
DeepSeek R1 (All Versions) Collection DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. ⢠37 items ⢠Updated 4 days ago ⢠261
Phi-4 (All Versions) Collection Microsoft's Phi-4 models including Reasoning + Reasoning Plus & mini. Includes Dynamic 2.0 GGUF, 4-bit & 16-bit versions. Includes Unsloth's bug fixes ⢠20 items ⢠Updated 4 days ago ⢠76
Deepseek V3 (All Versions) Collection Deepseek-V3-0324 and V3 - available in original, and Dynamic GGUF formats, with support for 2-8-bit quantized versions. ⢠7 items ⢠Updated 4 days ago ⢠39
Eule Collection A series of English/Russian reasoning models. ⢠2 items ⢠Updated Dec 12, 2024 ⢠2
Llama 3.3 (All Versions) Collection Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. ⢠3 items ⢠Updated 4 days ago ⢠37
Unsloth 4-bit Dynamic Quants Collection Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit ⢠28 items ⢠Updated 4 days ago ⢠91
view article Article Unleashing the Power of Unsloth and QLora:Redefining Language Model Fine-Tuning Jan 19, 2024 ⢠18
Vision/multimodal Models Collection Collection of the most popular vision models including Llama 3.2, LlaVa, Qwen2 VL, Pixtral, PaliGemma and more! ⢠25 items ⢠Updated 4 days ago ⢠20
Llama 3.2 Vision Collection Meta's Llama 3.2 vision models 11B and 90B. Include 4-bit bnb and original versions. ⢠8 items ⢠Updated 4 days ago ⢠7
Qwen 2.5 Coder Collection Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats. ⢠35 items ⢠Updated 4 days ago ⢠35