Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

1,060

Full-text search

Active filters: llama.cpp

akshathmangudi/llama3.1-8b-gguf

Updated Jul 26, 2024

dahara1/llama-translate-gguf

8B • Updated Aug 14, 2024 • 789 • 16

jhilburn/gemma-inference

Text Generation • Updated Aug 7, 2024

ghost-x/ghost-8b-beta-1608-gguf

Text Generation • 8B • Updated Aug 26, 2024 • 318 • 6

PaulJusst/codegemma-7b-it-GGUF

Text Generation • 9B • Updated Sep 13, 2024

TheCluster/Llama-3.2-3B-Instruct-GGUF

Text Generation • 3B • Updated Sep 25, 2024 • 4

v000000/Typhon-Mixtral-v1-imatrix-v2.Q6_K-GGUF

Updated Sep 26, 2024 • 72 • 1

LPN64/LongCite-llama3.1-8b-GGUF

Text Generation • 8B • Updated Oct 1, 2024 • 381 • 6

cstr/Ministral-8B-Instruct-2410-GGUF

8B • Updated Oct 17, 2024 • 49 • 1

mrcuddle/Lumimaid-v0.2-12B-Q4_K_M-GGUF

Text Generation • 12B • Updated Oct 20, 2024 • 19

Manel/Llama-3.1-8B-Instruct-Q4_K_M-GGUF

8B • Updated Nov 3, 2024 • 9

Manel/Llama-2-13b-chat-hf-Q4_0-GGUF

Text Generation • 13B • Updated Nov 3, 2024 • 170

dumb-dev/flan-t5-xxl-gguf

11B • Updated Oct 29, 2024 • 1.15k • 18

Manel/gemma-2-9b-Q4_0-GGUF

9B • Updated Nov 3, 2024 • 15

DiYaZeN/aya-sl-biz-8b

Text Generation • 8B • Updated Oct 31, 2024 • 2

shreyasmeher/ConflLlama

Text Classification • 8B • Updated Jul 8 • 104 • 4

dwikitheduck/gen-try1-Q4_K_M-GGUF

15B • Updated Nov 11, 2024 • 3

real-jiakai/Arxiver-Llama-GGUF

8B • Updated Nov 15, 2024 • 27

shreyasmeher/ConflLlama-Alt

Text Classification • 8B • Updated Nov 19, 2024 • 71 • 1

XeAI/LLaMa_3.2_3B_Instruct_Text2SQL-Q4_K_M-GGUF

Text Generation • 3B • Updated Nov 17, 2024 • 12

dwikitheduck/gen-sql-1-Q4_K_M-GGUF

8B • Updated Nov 18, 2024 • 1 • 1

jsjeon/SummLlama3.2-3B-Q4_K_M-GGUF

Updated Nov 19, 2024

dwikitheduck/gen-inst-1-Q4_K_M-GGUF

15B • Updated Nov 25, 2024 • 3

Vikhrmodels/Vikhr-Qwen-2.5-1.5B-Instruct-GGUF

2B • Updated Nov 26, 2024 • 441 • 4

McaTech/Nonet

Text Generation • 0.1B • Updated Jun 30 • 361 • 3

lianghsun/Llama-3.2-Taiwan-3B-Instruct-GGUF

Text Generation • 4B • Updated Jan 15 • 862 • 10

phymbert/Phi-3.5-MoE-instruct-GGUF

Text Generation • 42B • Updated Dec 29, 2024 • 71 • 1

carsenk/llama3.2_3b_122824_uncensored

Text Generation • 3B • Updated Dec 31, 2024 • 136 • 2

carsenk/llama3.2_1b_2025_uncensored

Text Generation • 1B • Updated Jan 1 • 21 • • 3

bullerwins/DeepSeek-V3-GGUF

Text Generation • 671B • Updated Feb 19 • 783 • 102