Conversational AI (CoAI) group from Tsinghua University

university

http://coai.cs.tsinghua.edu.cn/

thu-coai

Activity Feed Request to join this org

AI & ML interests

Dialogue Systems, Language Generation

Recent Activity

chujiezheng authored a paper 5 days ago

Soft Adaptive Policy Optimization

chujiezheng authored a paper 5 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

JinfengZhou authored a paper about 2 months ago

SocialEval: Evaluating Social Intelligence of Large Language Models

View all activity

chujiezheng

authored 2 papers 5 days ago

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published 12 days ago • 33

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published 6 days ago • 77

JinfengZhou

authored 2 papers about 2 months ago

SocialEval: Evaluating Social Intelligence of Large Language Models

Paper • 2506.00900 • Published Jun 1

Think Socially via Cognitive Reasoning

Paper • 2509.22546 • Published Sep 26

JinfengZhou

updated a dataset about 2 months ago

thu-coai/CogFlow

Preview • Updated Oct 12 • 127 • 3

JinfengZhou

published a dataset 2 months ago

thu-coai/CogFlow

Preview • Updated Oct 12 • 127 • 3

yangjunxiao2021

authored a paper 3 months ago

Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers

Paper • 2509.03059 • Published Sep 3 • 24

yida-lu

updated a dataset 4 months ago

thu-coai/Agent-SafetyBench

Updated Aug 11 • 43 • 3

yida-lu

published a dataset 4 months ago

thu-coai/Agent-SafetyBench

Updated Aug 11 • 43 • 3

chujiezheng

authored a paper 5 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 313

aliezlo

updated a model 5 months ago

thu-coai/ShieldVLM-7B-qwen

8B • Updated Jul 20 • 8 • 1

aliezlo

published a model 5 months ago

thu-coai/ShieldVLM-7B-qwen

8B • Updated Jul 20 • 8 • 1

aliezlo

updated a dataset 5 months ago

thu-coai/ShieldVLM

Viewer • Updated Jul 11 • 4.24k • 356 • 1

aliezlo

published a dataset 5 months ago

thu-coai/ShieldVLM

Viewer • Updated Jul 11 • 4.24k • 356 • 1

chujiezheng

authored a paper 5 months ago

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 93

le-qi

authored 4 papers 5 months ago

SafetyBench: Evaluating the Safety of Large Language Models with Multiple Choice Questions

Paper • 2309.07045 • Published Sep 13, 2023

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

Paper • 2502.16776 • Published Feb 24 • 6

SocialEval: Evaluating Social Intelligence of Large Language Models

Paper • 2506.00900 • Published Jun 1

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 240

Jiann

authored a paper 6 months ago

CPM: A Large-scale Generative Chinese Pre-trained Language Model

Paper • 2012.00413 • Published Dec 1, 2020