Tristan/sft-qwen-lambada-es-custom-splits-lr1e-6-wd0.001-ep10 • Text Generation • 0.5B • Updated Sep 4 • 3
Tristan/sft-qwen-lambada-fr-custom-splits-lr1e-6-wd0.0001-ep10 • Text Generation • 0.5B • Updated Sep 4 • 2
Tristan/sft-qwen-lambada-it-custom-splits-lr1e-6-wd0.0001-ep10 • Text Generation • 0.5B • Updated Sep 4 • 5
Tristan/sft-qwen-lambada-de-custom-splits-lr1e-6-wd0.0001-ep10 • Text Generation • 0.5B • Updated Sep 4 • 3
Tristan/sft-qwen-lambada-en-custom-splits-lr1e-6-wd0.001-ep10 • Text Generation • 0.5B • Updated Sep 4 • 3
Tristan/dclm-perplexity-correlations-410m-3-openbookqa-gs4 • Text Generation • 0.4B • Updated Apr 5 • 5