Guo-Hua Wang's picture

Guo-Hua Wang

Flourish

·

https://doctorkey.github.io/

DoctorKey

AI & ML interests

None yet

Recent Activity

liked a model about 11 hours ago

qpqpqpqpqpqp/Ovis_Image_7B_fp8

authored a paper 2 days ago

Ovis-Image Technical Report

updated a collection 2 days ago

View all activity

Organizations

upvoted a paper 3 days ago

Ovis-Image Technical Report

Paper • 2511.22982 • Published 8 days ago • 4

upvoted a collection 7 days ago

Ovis-Image

Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering under stringent computational constraints. • 7 items • Updated 2 days ago • 5

upvoted a collection 25 days ago

Diffusion-SDPO

2 items • Updated 25 days ago • 1

upvoted a paper 25 days ago

Diffusion-SDPO: Safeguarded Direct Preference Optimization for Diffusion Models

Paper • 2511.03317 • Published Nov 5 • 6

upvoted a paper 4 months ago

Ovis2.5 Technical Report

Paper • 2508.11737 • Published Aug 15 • 111

upvoted 4 collections 4 months ago

Ovis2.5

Our next-generation MLLMs for native-resolution vision and advanced reasoning • 5 items • Updated Aug 19 • 56

Ovis-U1

4 items • Updated 5 days ago • 1

TeEFusion

2 items • Updated 25 days ago • 1

CHATS

3 items • Updated 5 days ago • 1

upvoted a paper 4 months ago

TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance

Paper • 2507.18192 • Published Jul 24 • 7

upvoted a collection 5 months ago

Ovis-U1

An unified model for multimodal understanding, text-to-image generation, and image editing. • 3 items • Updated Jul 2 • 6

upvoted a paper 5 months ago

Ovis-U1 Technical Report

Paper • 2506.23044 • Published Jun 29 • 62

upvoted a paper 6 months ago

CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation

Paper • 2502.12579 • Published Feb 18 • 1

upvoted a paper 7 months ago

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published May 5 • 80

upvoted a collection 10 months ago

Ovis2

Our latest advancement in multi-modal large language models (MLLMs) • 15 items • Updated Mar 25 • 65