Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9 • 129
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning Paper • 2510.25992 • Published Oct 29 • 45
Temporal Alignment Guidance: On-Manifold Sampling in Diffusion Models Paper • 2510.11057 • Published Oct 13 • 30
Temporal Alignment Guidance: On-Manifold Sampling in Diffusion Models Paper • 2510.11057 • Published Oct 13 • 30 • 2
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published Oct 6 • 124
Coreset Sampling from Open-Set for Fine-Grained Self-Supervised Learning Paper • 2303.11101 • Published Mar 20, 2023 • 1
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding Paper • 2310.05424 • Published Oct 9, 2023 • 1
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA Paper • 2410.20672 • Published Oct 28, 2024 • 6
Why In-Context Learning Transformers are Tabular Data Classifiers Paper • 2405.13396 • Published May 22, 2024
Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models Paper • 2410.10166 • Published Oct 14, 2024
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation Paper • 2507.10524 • Published Jul 14 • 70