Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory Paper • 2504.19413 • Published Apr 28 • 36
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14 • 122
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Paper • 2512.16093 • Published 12 days ago • 84
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning Paper • 2512.20605 • Published 6 days ago • 54
— Long-context post-training 🧶 — Collection Resources for post-training LLMs with long-context samples • 5 items • Updated Sep 14 • 6
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language Paper • 2512.10942 • Published 18 days ago • 11
V-JEPA 2 Collection A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13 • 177
Article AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems 6 days ago • 31
Article The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator 12 days ago • 34
Google Gemma Scope 2 - Neuronpedia Collection Google Gemma Scope 2: JumpReLU SAEs for Gemma 2 interpretability. 270M PT/IT, 1B PT variants. Neuronpedia integration. Mechanistic analysis. • 11 items • Updated 1 day ago • 1