Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nm-testing
/
Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Tensor
like
0
Follow
NM Testing
92
Transformers
kv-cache
fp8
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
3b8571a
Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Tensor
1.52 kB
1 contributor
History:
3 commits
krishnateja95
Upload README.md with huggingface_hub
3b8571a
verified
about 1 month ago
.gitattributes
1.52 kB
initial commit
about 1 month ago
README.md
0 Bytes
Upload README.md with huggingface_hub
about 1 month ago