Haiquan Zhao's picture

4 7 13

Haiquan Zhao

haidequanbu

·

https://haidequanbu.github.io

haidequanbu

AI & ML interests

Natural Language Processing, LLM safety

Recent Activity

upvoted a paper about 7 hours ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

liked a model 4 days ago

Tongyi-MAI/Z-Image-Turbo

liked a dataset about 2 months ago

Qwen/Qwen3GuardTest

View all activity

Organizations

upvoted a paper about 7 hours ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 8 days ago • 83

upvoted a paper about 2 months ago

Qwen3Guard Technical Report

Paper • 2510.14276 • Published Oct 16 • 14

upvoted a collection 2 months ago

Qwen3Guard

7 items • Updated Sep 30 • 59

upvoted a collection 3 months ago

Qwen3-VL

37 items • Updated Nov 1 • 497

upvoted a paper 3 months ago

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Paper • 2502.05163 • Published Feb 7 • 23

upvoted a paper 10 months ago

Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance

Paper • 2502.16944 • Published Feb 24 • 10

upvoted a collection 11 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 666