tatsu-lab/linguistic-calibration-lc-sft-wdiff
Text Generation
•
7B
•
Updated
•
10
tatsu-lab/linguistic-calibration-factuality-sft-wdiff
Text Generation
•
7B
•
Updated
•
11
tatsu-lab/linguistic-calibration-claude-distill-wdiff
Text Generation
•
7B
•
Updated
•
12
tatsu-lab/linguistic-calibration-extract-answers
Text Generation
•
3B
•
Updated
•
6
tatsu-lab/linguistic-calibration-lc-rl-wdiff
Text Generation
•
7B
•
Updated
•
8
tatsu-lab/linguistic-calibration-factuality-rl-wdiff
Text Generation
•
7B
•
Updated
•
8
tatsu-lab/linguistic-calibration-reward-model-forecastprobs-wdiff
7B
•
Updated
•
5
tatsu-lab/linguistic-calibration-reward-model-factuality-wdiff
7B
•
Updated
•
6
tatsu-lab/alpaca-farm-ppo-human-wdiff
Text Generation
•
Updated
•
70
•
1
tatsu-lab/alpaca-farm-expiter-human-wdiff
Text Generation
•
Updated
•
19
tatsu-lab/alpaca-farm-ppo-sim-gpt4-20k-wdiff
Text Generation
•
Updated
•
76
tatsu-lab/alpaca-farm-ppo-sim-wdiff
Text Generation
•
Updated
•
8
tatsu-lab/alpaca-farm-reward-model-human-wdiff
Updated
•
5
•
1
tatsu-lab/alpaca-farm-feedme-sim-wdiff
Text Generation
•
Updated
•
6
tatsu-lab/alpaca-farm-feedme-human-wdiff
Text Generation
•
Updated
•
9
tatsu-lab/alpaca-farm-reward-condition-sim-wdiff
Text Generation
•
Updated
•
8
tatsu-lab/alpaca-farm-reward-model-sim-wdiff
Updated
•
10
tatsu-lab/alpaca-farm-expiter-sim-wdiff
Text Generation
•
Updated
•
12
tatsu-lab/alpaca-farm-sft10k-wdiff
Text Generation
•
Updated
•
13
tatsu-lab/alpaca-7b-wdiff
Text Generation
•
Updated
•
140
•
57