SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation • arXiv:2410.03960 • Published Oct 4, 2024
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference • arXiv:2401.08671 • Published Jan 9, 2024
DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models • arXiv:2309.14509 • Published Sep 25, 2023
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention • arXiv:2309.14327 • Published Sep 25, 2023