STAIR - a thu-ml Collection

Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

thu-ml 's Collections

STAIR

STAIR

updated Feb 26

Datasets and Models for STAIR (Improving Safety Alignment with Introspective Reasoning)

thu-ml/STAIR-Llama-3.1-8B-SFT

Text Generation • 8B • Updated Feb 25 • 33
thu-ml/STAIR-Qwen2-7B-SFT

Text Generation • 8B • Updated Feb 25 • 20 • 1
thu-ml/STAIR-SFT

Viewer • Updated Feb 25 • 20k • 64
thu-ml/STAIR-Prompts

Viewer • Updated Feb 25 • 63k • 31
STAIR: Improving Safety Alignment with Introspective Reasoning

Paper • 2502.02384 • Published Feb 4
thu-ml/STAIR-Qwen2-7B-DPO-3

Text Generation • 8B • Updated Feb 26 • 24 • 1
thu-ml/STAIR-Llama-3.1-8B-DPO-3

Text Generation • 8B • Updated Feb 26 • 71

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs