Wentian Zhao

zwt123home123

[email protected]

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Schoenfeld's Anatomy of Mathematical Reasoning by Language Models

upvoted a paper 10 days ago

Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction

upvoted a paper about 2 months ago

Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs

View all activity

Organizations

None yet

upvoted a paper 6 days ago

Schoenfeld's Anatomy of Mathematical Reasoning by Language Models

Paper • 2512.19995 • Published 11 days ago • 14

upvoted a paper 10 days ago

Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction

Paper • 2512.18880 • Published 12 days ago • 23

upvoted a paper about 2 months ago

Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs

Paper • 2511.07419 • Published Nov 10, 2025 • 26

upvoted 2 papers 3 months ago

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

Paper • 2509.25541 • Published Sep 29, 2025 • 140

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 134

upvoted 2 papers 6 months ago

Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs

Paper • 2507.07996 • Published Jul 10, 2025 • 34

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Paper • 2506.09501 • Published Jun 11, 2025 • 19

published a model 7 months ago

zwt123home123/code_log_3

Updated May 28, 2025

updated a model 8 months ago

zwt123home123/reproduce_log

Updated May 19, 2025

published a model 8 months ago

zwt123home123/reproduce_log

Updated May 19, 2025

updated a model 8 months ago

zwt123home123/code_log_2

Updated May 12, 2025

published a model 8 months ago

zwt123home123/code_log_2

Updated May 12, 2025

published a dataset 8 months ago

zwt123home123/code_log_2

Updated May 12, 2025 • 3

updated a dataset 8 months ago

zwt123home123/code_log

Updated May 12, 2025 • 8

published a dataset 8 months ago

zwt123home123/code_log

Updated May 12, 2025 • 8

authored a paper 9 months ago

DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training

Paper • 2504.09710 • Published Apr 13, 2025 • 19

upvoted a paper 9 months ago

DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training

Paper • 2504.09710 • Published Apr 13, 2025 • 19

updated a model 9 months ago

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-4ppl_largebs_global_step_320_actor

8B • Updated Apr 3, 2025 • 8

published a model 9 months ago

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-4ppl_largebs_global_step_320_actor

8B • Updated Apr 3, 2025 • 8

updated a model 9 months ago

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-4ppl_largebs_global_step_203_actor

8B • Updated Apr 3, 2025 • 7

Wentian Zhao

AI & ML interests

Recent Activity

Organizations

zwt123home123's activity