Architecture Decoupling Is Not All You Need For Unified Multimodal Model Paper • 2511.22663 • Published 9 days ago • 28
Video Generation Models Are Good Latent Reward Models Paper • 2511.21541 • Published 10 days ago • 44
Plan-X: Instruct Video Generation via Semantic Planning Paper • 2511.17986 • Published 14 days ago • 16
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation Paper • 2511.19320 • Published 12 days ago • 39
WithAnyone: Towards Controllable and ID Consistent Image Generation Paper • 2510.14975 • Published Oct 16 • 84
Learning an Image Editing Model without Image Editing Pairs Paper • 2510.14978 • Published Oct 16 • 8
UniFusion: Vision-Language Model as Unified Encoder in Image Generation Paper • 2510.12789 • Published Oct 14 • 18
Stable Video Infinity: Infinite-Length Video Generation with Error Recycling Paper • 2510.09212 • Published Oct 10 • 16
IVEBench: Modern Benchmark Suite for Instruction-Guided Video Editing Assessment Paper • 2510.11647 • Published Oct 13 • 3
Visual Autoregressive Models Beat Diffusion Models on Inference Time Scaling Paper • 2510.16751 • Published Oct 19 • 20
UltraGen: High-Resolution Video Generation with Hierarchical Attention Paper • 2510.18775 • Published Oct 21 • 17
MoGA: Mixture-of-Groups Attention for End-to-End Long Video Generation Paper • 2510.18692 • Published Oct 21 • 40
InstructMix2Mix: Consistent Sparse-View Editing Through Multi-View Model Personalization Paper • 2511.14899 • Published 18 days ago • 11
First Frame Is the Place to Go for Video Content Customization Paper • 2511.15700 • Published 17 days ago • 52
A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space Paper • 2511.10555 • Published 23 days ago • 60
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation Paper • 2511.09057 • Published 24 days ago • 75
SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control Paper • 2511.09715 • Published 24 days ago • 8
LiteAttention: A Temporal Sparse Attention for Diffusion Transformers Paper • 2511.11062 • Published 22 days ago • 30
Back to Basics: Let Denoising Generative Models Denoise Paper • 2511.13720 • Published 19 days ago • 63