5 89 31

Ha-Yeong Choi

Ha0

https://scholar.google.com/citations?user=Jw3X6UgAAAAJ&hl=ko

hayeong0

AI & ML interests

Speech Synthesis, Voice Conversion, Generative Models

Recent Activity

upvoted a paper 1 day ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

liked a dataset 14 days ago

yenopoya/thousand-voices-trauma

upvoted a paper about 1 month ago

FARMER: Flow AutoRegressive Transformer over Pixels

View all activity

Organizations

None yet

upvoted a paper 1 day ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published 3 days ago • 137

upvoted a paper about 1 month ago

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published Oct 27 • 57

upvoted a paper 3 months ago

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Paper • 2508.20072 • Published Aug 27 • 31

upvoted 2 papers 4 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8 • 192

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29 • 135

upvoted 2 papers 5 months ago

CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models

Paper • 2507.13984 • Published Jul 18 • 25

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 24

upvoted 3 papers 6 months ago

AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models

Paper • 2506.19851 • Published Jun 24 • 60

Discrete Diffusion in Large Language and Multimodal Models: A Survey

Paper • 2506.13759 • Published Jun 16 • 43

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published Jun 5 • 74

upvoted 3 papers 7 months ago

upvoted a paper 8 months ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published Apr 7 • 137

upvoted a collection 9 months ago

Orpheus TTS

Collection

TTS Towards Human-Sounding Speech • 2 items • Updated Mar 18 • 74

upvoted a paper 9 months ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 89

upvoted 3 papers 10 months ago

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published Feb 18 • 86

Region-Adaptive Sampling for Diffusion Transformers

Paper • 2502.10389 • Published Feb 14 • 53

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 123

upvoted a paper 11 months ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 90

Ha-Yeong Choi

AI & ML interests

Recent Activity

Organizations

Ha0's activity