hamishivi/2010_rl_rag_NAR8_testing64_gpt5_sft_31605_no_cite__1__1764018132_step_2450 8B • Updated 7 days ago • 30
hamishivi/2010_rl_rag_NAR8_testing64_gpt5_sft_31605_no_cite__1__1762677729_step_1300 8B • Updated 10 days ago • 26
hamishivi/2010_rl_rag_NAR8_testing64_gpt5_sft_31605__1__1762886037_checkpoints_step_1300 8B • Updated 11 days ago • 22
hamishivi/2010_rl_rag_NAR8_testing64_gpt5_sft_31605_no_cite__1__1762677729_step1900 8B • Updated 13 days ago • 31
hamishivi/2010_rl_rag_NAR8_testing64_gpt5_sft_31605_no_cite__1__1762677729_checkpoints_step_1700 8B • Updated 16 days ago • 121
hamishivi/1011_rl_rag_open_judge_no_citation_1037__1__1762832496_checkpoints_step_850 8B • Updated 16 days ago • 41
hamishivi/Nemotron-Research-Reasoning-Qwen-1.5B-v2-RLVE Text Generation • 2B • Updated 26 days ago • 55 • 2