Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Tianqi Liu's picture
3 11

Tianqi Liu

TianqiLiuAI
shahzad4894's profile picture shuyuej's profile picture kashif's profile picture
·

AI & ML interests

None yet

Organizations

Google's profile picture

commented a paper about 1 year ago

RRM: Robust Reward Model Training Mitigates Reward Hacking

Paper • 2409.13156 • Published Sep 20, 2024 • 5 •
2
commented a paper over 1 year ago

Building Math Agents with Multi-Turn Iterative Preference Learning

Paper • 2409.02392 • Published Sep 4, 2024 • 16 •
2
commented a paper almost 2 years ago

LiPO: Listwise Preference Optimization through Learning-to-Rank

Paper • 2402.01878 • Published Feb 2, 2024 • 20 •
6
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs