FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation Paper • 2512.24724 • Published 2 days ago • 1 • 2
Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow Paper • 2512.24766 • Published 2 days ago • 1 • 2
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models Paper • 2512.24165 • Published 3 days ago • 14 • 3
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling Paper • 2512.23959 • Published 4 days ago • 36 • 2
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space Paper • 2512.24617 • Published 3 days ago • 23 • 2
Video-BrowseComp: Benchmarking Agentic Video Research on Open Web Paper • 2512.23044 • Published 5 days ago • 9 • 3
YOLO-Master: MOE-Accelerated with Specialized Transformers for Enhanced Real-time Detection Paper • 2512.23273 • Published 4 days ago • 12 • 4
OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding Paper • 2512.23646 • Published 4 days ago • 14 • 3
Nested Browser-Use Learning for Agentic Information Seeking Paper • 2512.23647 • Published 4 days ago • 15 • 3
Self-Evaluation Unlocks Any-Step Text-to-Image Generation Paper • 2512.22374 • Published 7 days ago • 14 • 3
DiRL: An Efficient Post-Training Framework for Diffusion Language Models Paper • 2512.22234 • Published 10 days ago • 18 • 3
Act2Goal: From World Model To General Goal-conditioned Policy Paper • 2512.23541 • Published 4 days ago • 21 • 3
GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models Paper • 2512.15560 • Published 16 days ago • 23 • 3
SpotEdit: Selective Region Editing in Diffusion Transformers Paper • 2512.22323 • Published 7 days ago • 36 • 4
SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents Paper • 2512.22322 • Published 7 days ago • 35 • 4
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone Paper • 2512.22615 • Published 6 days ago • 37 • 3
Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion Paper • 2512.23709 • Published 4 days ago • 40 • 3