Ziyang Luo

Ziyang

https://chiyeunglaw.github.io/

AI & ML interests

Agents, LLMs, Multimodal ML

Recent Activity

upvoted a paper 9 days ago

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning

Paper • 2509.19736 • Published Sep 24 • 12

upvoted 2 collections 10 days ago

GTA1

A collection of GUI grounding models trained with GRPO. • 5 items • Updated Oct 31 • 4

Elastic-Reasoning

5 items • Updated Oct 31 • 7

upvoted 17 papers 11 days ago

FLEX: Continuous Agent Evolution via Forward Learning from Experience

Paper • 2511.06449 • Published 28 days ago • 11

IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction

Paper • 2511.07327 • Published 27 days ago • 74

The Path Not Taken: RLVR Provably Learns Off the Principals

Paper • 2511.08567 • Published 26 days ago • 31

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published 27 days ago • 104

Adapting Web Agents with Synthetic Supervision

Paper • 2511.06101 • Published 29 days ago • 6

LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls

Paper • 2511.09148 • Published 25 days ago • 16

WMPO: World Model-based Policy Optimization for Vision-Language-Action Models

Paper • 2511.09515 • Published 25 days ago • 17

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published 26 days ago • 194

ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents

Paper • 2511.07685 • Published 27 days ago • 9

PAN: A World Model for General, Interactable, and Long-Horizon World Simulation

Paper • 2511.09057 • Published 26 days ago • 75

MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism

Paper • 2511.11373 • Published 23 days ago • 12

Simulating the Visual World with Artificial Intelligence: A Roadmap

Paper • 2511.08585 • Published 26 days ago • 29

NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards

Paper • 2511.14659 • Published 19 days ago • 12

UFO^3: Weaving the Digital Agent Galaxy

Paper • 2511.11332 • Published 23 days ago • 18

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published 23 days ago • 158

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Paper • 2511.14460 • Published 19 days ago • 17

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published 18 days ago • 104