Article: We’re open-sourcing our text-to-image model and the process behind it • 25 days ago • 73
KTO: Model Alignment as Prospect Theoretic Optimization Paper • 2402.01306 • Published Feb 2, 2024 • 20
💧LFM2-8B-A1B-MoE Collection Best-in-class MoE, better than Qwen3. Optimised for smaller devices with sub-16 GB (M1/2/3/4) Apple Silicon. • 7 items • Updated Oct 9 • 4
ServiceNow-Apriel Collection Apriel-1.5-15b-Thinker is a multimodal reasoning model in ServiceNow’s Apriel SLM series that achieves competitive performance against models 10 times its size. • 6 items • Updated Oct 5 • 1
MedGemma Collection Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 14 items • Updated Jul 10 • 5
Qwen3-Coder-MoE Collection 💻 Significant performance among open models on Agentic Coding, Agentic Browser-Use, and other foundational coding tasks, achieving results comparable to Claude Sonnet. • 6 items • Updated Oct 4 • 1
Open LLM Leaderboard best models ❤️🔥 Collection A daily updated list of models with the best evaluations on the LLM leaderboard. • 65 items • Updated Mar 20 • 653
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22, 2024 • 134