Thai Pretraining Corpus

community
Activity Feed

wannaphong (Wannaphong Phatthiyaphaibun) authored a paper 4 months ago

Mangosteen: An Open Thai Corpus for Language Model Pretraining

Paper • 2507.14664 • Published Jul 19 • 7

wannaphong authored a paper 10 months ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 19

wannaphong authored 2 papers over 1 year ago

PyThaiNLP: Thai Natural Language Processing in Python

Paper • 2312.04649 • Published Dec 7, 2023

Thai Wav2Vec2.0 with CommonVoice V8

Paper • 2208.04799 • Published Aug 9, 2022 • 2