https://ai.meta.com/blog/omnilingual-asr-advancing-automatic-speech-recognition/
Eric Bezzam PRO
bezzam
AI & ML interests
speech, audio, imaging
Recent Activity
commented on
their
article
2 days ago
Open ASR Leaderboard: Trends and Insights with New Multilingual & Long-Form Tracks
upvoted
an
article
3 days ago
We Got Claude to Fine-Tune an Open Source LLM
liked
a model
3 days ago
microsoft/VibeVoice-Realtime-0.5B
Organizations
Multimodel audio
Speech recognition datasets
DigiCam (CelebA)
Models for DigiCam trained on the CelebA 26K dataset.
Omnilingual ASR (1,600+ Languages)
https://ai.meta.com/blog/omnilingual-asr-advancing-automatic-speech-recognition/
VibeVoice
Multimodel audio
Neural codecs
Speech recognition datasets
Text-to-speech datasets
DigiCam (CelebA)
Models for DigiCam trained on the CelebA 26K dataset.
DiffuserCam Mirflickr
Models for the paper "A modular and robust physics-based approach for lensless image reconstruction"