arxiv:2510.04800
Sangmin Bae
raymin0223
AI & ML interests
None yet
Recent Activity
upvoted a paper 21 days ago
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
upvoted an article 21 days ago
Why Did MiniMax M2 End Up as a Full Attention Model?
Organizations
None yet