71 120 72

Ge Zhang

zhangysk

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

How Far Are We from Genuinely Useful Deep Research Agents?

authored a paper 5 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

upvoted a paper 5 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

View all activity

Organizations

authored a paper 5 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published 13 days ago • 238

authored 5 papers 18 days ago

MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity

Paper • 2511.03146 • Published Nov 5 • 7

RLoop: An Self-Improving Framework for Reinforcement Learning with Iterative Policy Initialization

Paper • 2511.04285 • Published about 1 month ago • 7

authored 11 papers about 1 month ago

IFEvalCode: Controlled Code Generation

Paper • 2507.22462 • Published Jul 30

VideoScore2: Think before You Score in Generative Video Evaluation

Paper • 2509.22799 • Published Sep 26 • 25

Towards Personalized Deep Research: Benchmarks and Evaluations

Paper • 2509.25106 • Published Sep 29 • 29

Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution

Paper • 2509.25301 • Published Sep 29 • 19

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Paper • 2509.25849 • Published Sep 30 • 47

OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs

Paper • 2510.10689 • Published Oct 12 • 46

ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems

Paper • 2510.11652 • Published Oct 13 • 28

Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures

Paper • 2510.14616 • Published Oct 16 • 11

A$^2$FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning

Paper • 2510.12838 • Published Oct 13 • 24

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29 • 219

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29 • 219

authored 3 papers 3 months ago

FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning

Paper • 2509.13160 • Published Sep 16 • 29

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7 • 149

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2 • 124

Ge Zhang

AI & ML interests

Recent Activity

Organizations

zhangysk's activity