view article Article Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face +2 Oct 16 • 18
view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs +7 Apr 29 • 43
view article Article Introducing HELMET: Holistically Evaluating Long-context Language Models +5 Apr 16 • 40
view article Article Benchmarking Language Model Performance on 5th Gen Xeon at GCP +1 Dec 17, 2024 • 7
view article Article Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon +6 May 9, 2024 • 12
view article Article Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon +6 May 9, 2024 • 12