-
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
Paper • 2511.16334 • Published • 92 -
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
Paper • 2509.07980 • Published • 101 -
ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute
Paper • 2509.04475 • Published • 3 -
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
Paper • 2512.01374 • Published • 93
Collections
Discover the best community collections!
Collections including paper arxiv:2512.17260
-
A Survey of Data Agents: Emerging Paradigm or Overstated Hype?
Paper • 2510.23587 • Published • 65 -
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI
Paper • 2510.05684 • Published • 141 -
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
Paper • 2510.08673 • Published • 125 -
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds
Paper • 2511.08892 • Published • 201
-
Contrastive Learning for Many-to-many Multilingual Neural Machine Translation
Paper • 2105.09501 • Published -
Cross-modal Contrastive Learning for Speech Translation
Paper • 2205.02444 • Published -
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs
Paper • 2210.03052 • Published -
Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning
Paper • 2212.10240 • Published • 1
-
TiDAR: Think in Diffusion, Talk in Autoregression
Paper • 2511.08923 • Published • 117 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 128 -
What Makes Diffusion Language Models Super Data Learners?
Paper • 2510.04071 • Published -
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 77
-
Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving
Paper • 2507.02726 • Published • 14 -
Towards Solving More Challenging IMO Problems via Decoupled Reasoning and Proving
Paper • 2507.06804 • Published • 16 -
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving
Paper • 2507.23726 • Published • 114 -
Model-Based and Sample-Efficient AI-Assisted Math Discovery in Sphere Packing
Paper • 2512.04829 • Published • 11
-
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
Paper • 2511.16334 • Published • 92 -
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
Paper • 2509.07980 • Published • 101 -
ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute
Paper • 2509.04475 • Published • 3 -
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
Paper • 2512.01374 • Published • 93
-
TiDAR: Think in Diffusion, Talk in Autoregression
Paper • 2511.08923 • Published • 117 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 128 -
What Makes Diffusion Language Models Super Data Learners?
Paper • 2510.04071 • Published -
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 77
-
A Survey of Data Agents: Emerging Paradigm or Overstated Hype?
Paper • 2510.23587 • Published • 65 -
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI
Paper • 2510.05684 • Published • 141 -
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation
Paper • 2510.08673 • Published • 125 -
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds
Paper • 2511.08892 • Published • 201
-
Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving
Paper • 2507.02726 • Published • 14 -
Towards Solving More Challenging IMO Problems via Decoupled Reasoning and Proving
Paper • 2507.06804 • Published • 16 -
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving
Paper • 2507.23726 • Published • 114 -
Model-Based and Sample-Efficient AI-Assisted Math Discovery in Sphere Packing
Paper • 2512.04829 • Published • 11
-
Contrastive Learning for Many-to-many Multilingual Neural Machine Translation
Paper • 2105.09501 • Published -
Cross-modal Contrastive Learning for Speech Translation
Paper • 2205.02444 • Published -
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs
Paper • 2210.03052 • Published -
Diffusion Glancing Transformer for Parallel Sequence to Sequence Learning
Paper • 2212.10240 • Published • 1