1 13 6

Jingming Zhuo

JingmingZ

AI & ML interests

Large Language Models

Recent Activity

authored a paper 11 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

upvoted a paper 12 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

upvoted a collection 17 days ago

DR Tulu

View all activity

Organizations

authored a paper 11 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 12 days ago • 54

upvoted a paper 12 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 12 days ago • 54

upvoted a collection 17 days ago

DR Tulu

Collection

Models and data associated with DR Tulu, http://allenai-web/papers/drtulu • 5 items • Updated 12 days ago • 30

liked a dataset 17 days ago

rl-research/dr-tulu-sft-data

Viewer • Updated 12 days ago • 13.1k • 725 • 24

upvoted a paper 2 months ago

Alignment through Meta-Weighted Online Sampling: Bridging the Gap between Data Generation and Preference Optimization

Paper • 2509.23371 • Published Sep 27 • 5

updated a dataset 2 months ago

rl-rag/hle_rlvr_no_prompt

Viewer • Updated Sep 28 • 500 • 18

published a dataset 2 months ago

rl-rag/hle_rlvr_no_prompt

Viewer • Updated Sep 28 • 500 • 18

upvoted a paper 3 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8 • 78

updated a dataset 3 months ago

rl-rag/verified_miro_trajectories

Viewer • Updated Aug 31 • 9.88k • 25

published a dataset 3 months ago

rl-rag/verified_miro_trajectories

Viewer • Updated Aug 31 • 9.88k • 25

updated a dataset 3 months ago

rl-rag/bc_synthetic_v_2

Viewer • Updated Aug 30 • 3.99k • 19

published a dataset 3 months ago

rl-rag/bc_synthetic_v_2

Viewer • Updated Aug 30 • 3.99k • 19

upvoted a paper 4 months ago

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

Paper • 2508.11987 • Published Aug 16 • 71

upvoted a paper 5 months ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22 • 63

upvoted a paper 6 months ago

MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning

Paper • 2506.05331 • Published Jun 5 • 13

liked a dataset 6 months ago

xy06/MINT-CoT-Dataset

Viewer • Updated Jun 10 • 100 • 193 • 7

liked a model 6 months ago

xy06/MINT-CoT-7B

8B • Updated Jun 4 • 27 • 6

upvoted a paper 8 months ago

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published Apr 3 • 68

upvoted a paper 10 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

liked a Space 12 months ago

Open LMM Reasoning Leaderboard

🥇

A Leaderboard that demonstrates LMM reasoning capabilities

Jingming Zhuo

AI & ML interests

Recent Activity

Organizations

JingmingZ's activity

Open LMM Reasoning Leaderboard