Sangmin Bae
raymin0223
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
22 days ago
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise
Reasoning
upvoted
an
article
23 days ago
Why Did MiniMax M2 End Up as a Full Attention Model?
Organizations
None yet