Why there isn't even a single Quantized Version for this model ?
1
#11 opened 29 days ago
by
kalashshah19
gguf q4 version ?
1
#5 opened 5 months ago
by
exclusif20
none flash_attention_2 version with the same architecture?
🚀
3
#1 opened 5 months ago
by
Hugodonotexit