The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper • 2509.26507 • Published Sep 30 • 535
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6 • 497
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21 • 256
LongLive: Real-time Interactive Long Video Generation Paper • 2509.22622 • Published Sep 26 • 184
Bark Collection Bark is a transformer-based text-to-audio model created by Suno. Currently, two checkpoints are supported: a small and a large version. • 3 items • Updated Sep 14, 2023 • 20
MultiSlav Collection Multilingual Machine Translation Open-Source Slavic models • 19 items • Updated Mar 7 • 9
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated Oct 30 • 77
Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated about 11 hours ago • 54
DeepSeek R1 (All Versions) Collection DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 9 days ago • 261
GTE models Collection General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 21 items • Updated Jan 21 • 32