-
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
Paper ⢠2404.19427 ⢠Published ⢠74 -
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
Paper ⢠2404.16771 ⢠Published ⢠19 -
Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control
Paper ⢠2405.12970 ⢠Published ⢠25 -
FlashFace: Human Image Personalization with High-fidelity Identity Preservation
Paper ⢠2403.17008 ⢠Published ⢠22
Collections
Discover the best community collections!
Collections including paper arxiv:2404.16771
-
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Paper ⢠2401.09048 ⢠Published ⢠10 -
Improving fine-grained understanding in image-text pre-training
Paper ⢠2401.09865 ⢠Published ⢠18 -
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Paper ⢠2401.10891 ⢠Published ⢠62 -
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
Paper ⢠2401.13627 ⢠Published ⢠77
-
VideoBooth: Diffusion-based Video Generation with Image Prompts
Paper ⢠2312.00777 ⢠Published ⢠24 -
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation
Paper ⢠2312.03641 ⢠Published ⢠22 -
GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Paper ⢠2312.04557 ⢠Published ⢠13 -
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
Paper ⢠2312.04433 ⢠Published ⢠10
-
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper ⢠2402.17485 ⢠Published ⢠195 -
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
Paper ⢠2312.01841 ⢠Published ⢠1 -
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Paper ⢠2311.16498 ⢠Published ⢠1 -
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper ⢠2312.02134 ⢠Published ⢠2
-
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning
Paper ⢠2306.07967 ⢠Published ⢠25 -
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Paper ⢠2306.07954 ⢠Published ⢠111 -
TryOnDiffusion: A Tale of Two UNets
Paper ⢠2306.08276 ⢠Published ⢠74 -
Seeing the World through Your Eyes
Paper ⢠2306.09348 ⢠Published ⢠33
-
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation
Paper ⢠2404.19427 ⢠Published ⢠74 -
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
Paper ⢠2404.16771 ⢠Published ⢠19 -
Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control
Paper ⢠2405.12970 ⢠Published ⢠25 -
FlashFace: Human Image Personalization with High-fidelity Identity Preservation
Paper ⢠2403.17008 ⢠Published ⢠22
-
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper ⢠2402.17485 ⢠Published ⢠195 -
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
Paper ⢠2312.01841 ⢠Published ⢠1 -
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Paper ⢠2311.16498 ⢠Published ⢠1 -
GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
Paper ⢠2312.02134 ⢠Published ⢠2
-
Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Paper ⢠2401.09048 ⢠Published ⢠10 -
Improving fine-grained understanding in image-text pre-training
Paper ⢠2401.09865 ⢠Published ⢠18 -
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Paper ⢠2401.10891 ⢠Published ⢠62 -
Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild
Paper ⢠2401.13627 ⢠Published ⢠77
-
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning
Paper ⢠2306.07967 ⢠Published ⢠25 -
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Paper ⢠2306.07954 ⢠Published ⢠111 -
TryOnDiffusion: A Tale of Two UNets
Paper ⢠2306.08276 ⢠Published ⢠74 -
Seeing the World through Your Eyes
Paper ⢠2306.09348 ⢠Published ⢠33
-
VideoBooth: Diffusion-based Video Generation with Image Prompts
Paper ⢠2312.00777 ⢠Published ⢠24 -
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation
Paper ⢠2312.03641 ⢠Published ⢠22 -
GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Paper ⢠2312.04557 ⢠Published ⢠13 -
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
Paper ⢠2312.04433 ⢠Published ⢠10