35 29 18

Denis Kuznedelev

SpiridonSunRotator

https://github.com/Godofnothing

Godofnothing

AI & ML interests

Model compression, computer vision, NLP

Recent Activity

upvoted a paper 5 days ago

WUSH: Near-Optimal Adaptive Transforms for LLM Quantization

updated a model 17 days ago

ISTA-DASLab/Kimi-K2-Thinking-GPTQ-2b-32g-experts

upvoted a paper 21 days ago

TiDAR: Think in Diffusion, Talk in Autoregression

View all activity

Organizations

upvoted a paper 5 days ago

WUSH: Near-Optimal Adaptive Transforms for LLM Quantization

Paper • 2512.00956 • Published 8 days ago • 17

upvoted a paper 21 days ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published 26 days ago • 111

upvoted an article 25 days ago

Article

A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons

Feb 4

•

upvoted an article about 1 month ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Jul 29

•

202

upvoted a paper 2 months ago

Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization

Paper • 2509.23202 • Published Sep 27 • 27

upvoted a paper 4 months ago

The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

Paper • 2507.18553 • Published Jul 24 • 40

upvoted 2 papers 5 months ago

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17 • 124

MADrive: Memory-Augmented Driving Scene Modeling

Paper • 2506.21520 • Published Jun 26 • 36

upvoted 5 papers 6 months ago

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Paper • 2506.06751 • Published Jun 7 • 71

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27 • 140

Unified Scaling Laws for Compressed Representations

Paper • 2506.01863 • Published Jun 2 • 19

Alchemist: Turning Public Text-to-Image Data into Generative Gold

Paper • 2505.19297 • Published May 25 • 84

Position of Uncertainty: A Cross-Linguistic Study of Positional Bias in Large Language Models

Paper • 2505.16134 • Published May 22 • 18

upvoted a paper 7 months ago

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published May 20 • 78

upvoted an article 7 months ago

Article

Vision Language Models (Better, faster, stronger)

May 12

•

568

upvoted a paper 8 months ago

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published Apr 8 • 110

upvoted 2 papers 9 months ago

One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation

Paper • 2503.13358 • Published Mar 17 • 95

Scale-wise Distillation of Diffusion Models

Paper • 2503.16397 • Published Mar 20 • 41

upvoted an article 9 months ago

Article

Digest of models based on YandexGPT 5 Lite

Mar 19

•

upvoted a paper 9 months ago

RuCCoD: Towards Automated ICD Coding in Russian

Paper • 2502.21263 • Published Feb 28 • 133

Denis Kuznedelev

AI & ML interests

Recent Activity

Organizations

SpiridonSunRotator's activity

A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Vision Language Models (Better, faster, stronger)

Digest of models based on YandexGPT 5 Lite