UserRL: Training Interactive User-Centric Agent via Reinforcement Learning Paper • 2509.19736 • Published Sep 24 • 12
GTA1 Collection A collection of GUI grounding models trained with GRPO. • 5 items • Updated Oct 31 • 4
FLEX: Continuous Agent Evolution via Forward Learning from Experience Paper • 2511.06449 • Published 28 days ago • 11
IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction Paper • 2511.07327 • Published 27 days ago • 74
The Path Not Taken: RLVR Provably Learns Off the Principals Paper • 2511.08567 • Published 26 days ago • 31
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published 27 days ago • 104
LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls Paper • 2511.09148 • Published 25 days ago • 16
WMPO: World Model-based Policy Optimization for Vision-Language-Action Models Paper • 2511.09515 • Published 25 days ago • 17
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds Paper • 2511.08892 • Published 26 days ago • 194
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents Paper • 2511.07685 • Published 27 days ago • 9
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation Paper • 2511.09057 • Published 26 days ago • 75
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism Paper • 2511.11373 • Published 23 days ago • 12
Simulating the Visual World with Artificial Intelligence: A Roadmap Paper • 2511.08585 • Published 26 days ago • 29
NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards Paper • 2511.14659 • Published 19 days ago • 12
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published 23 days ago • 158
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning Paper • 2511.14460 • Published 19 days ago • 17
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning Paper • 2511.16043 • Published 18 days ago • 104