Mangosteen, a 47-billion-token Thai corpus built with a Thai-adapted pipeline, improves language model performance on Thai benchmarks.
Wannaphong Phatthiyaphaibun PRO
wannaphong
Recent Activity
liked a model 5 days ago: deepseek-ai/DeepSeek-V3.2
upvoted a collection 5 days ago: DeepSeek-V3.2
upvoted a paper 11 days ago: DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research