Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
thu-ml
's Collections
STAIR
STAIR
updated
Feb 26
Datasets and Models for STAIR (Improving Safety Alignment with Introspective Reasoning)
Upvote
1
thu-ml/STAIR-Llama-3.1-8B-SFT
Text Generation
•
8B
•
Updated
Feb 25
•
33
thu-ml/STAIR-Qwen2-7B-SFT
Text Generation
•
8B
•
Updated
Feb 25
•
20
•
1
thu-ml/STAIR-SFT
Viewer
•
Updated
Feb 25
•
20k
•
64
thu-ml/STAIR-Prompts
Viewer
•
Updated
Feb 25
•
63k
•
31
STAIR: Improving Safety Alignment with Introspective Reasoning
Paper
•
2502.02384
•
Published
Feb 4
thu-ml/STAIR-Qwen2-7B-DPO-3
Text Generation
•
8B
•
Updated
Feb 26
•
24
•
1
thu-ml/STAIR-Llama-3.1-8B-DPO-3
Text Generation
•
8B
•
Updated
Feb 26
•
71
Upvote
1
Share collection
View history
Collection guide
Browse collections