-
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Paper • 2512.16093 • Published • 87 -
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Paper • 2511.22699 • Published • 217 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 194 -
Sharp Monocular View Synthesis in Less Than a Second
Paper • 2512.10685 • Published • 21
Collections
Discover the best community collections!
Collections including paper arxiv:2512.19693
-
Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding
Paper • 2512.17532 • Published • 64 -
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding
Paper • 2512.19693 • Published • 61 -
SemanticGen: Video Generation in Semantic Space
Paper • 2512.20619 • Published • 88 -
EgoX: Egocentric Video Generation from a Single Exocentric Video
Paper • 2512.08269 • Published • 115
-
MMGR: Multi-Modal Generative Reasoning
Paper • 2512.14691 • Published • 114 -
KlingAvatar 2.0 Technical Report
Paper • 2512.13313 • Published • 40 -
SemanticGen: Video Generation in Semantic Space
Paper • 2512.20619 • Published • 88 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 194
-
Nuclear Norm Regularization for Deep Learning
Paper • 2405.14544 • Published • 1 -
Token embeddings violate the manifold hypothesis
Paper • 2504.01002 • Published • 1 -
Approximate Nullspace Augmented Finetuning for Robust Vision Transformers
Paper • 2403.10476 • Published • 1 -
ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning
Paper • 2504.00254 • Published • 1
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 361 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield
Paper • 2511.22677 • Published • 28 -
DiP: Taming Diffusion Models in Pixel Space
Paper • 2511.18822 • Published • 28 -
What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards
Paper • 2512.00425 • Published • 49 -
Learning Eigenstructures of Unstructured Data Manifolds
Paper • 2512.01103 • Published • 5
-
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Paper • 2512.16093 • Published • 87 -
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Paper • 2511.22699 • Published • 217 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 194 -
Sharp Monocular View Synthesis in Less Than a Second
Paper • 2512.10685 • Published • 21
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 361 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding
Paper • 2512.17532 • Published • 64 -
The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding
Paper • 2512.19693 • Published • 61 -
SemanticGen: Video Generation in Semantic Space
Paper • 2512.20619 • Published • 88 -
EgoX: Egocentric Video Generation from a Single Exocentric Video
Paper • 2512.08269 • Published • 115
-
MMGR: Multi-Modal Generative Reasoning
Paper • 2512.14691 • Published • 114 -
KlingAvatar 2.0 Technical Report
Paper • 2512.13313 • Published • 40 -
SemanticGen: Video Generation in Semantic Space
Paper • 2512.20619 • Published • 88 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 194
-
Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield
Paper • 2511.22677 • Published • 28 -
DiP: Taming Diffusion Models in Pixel Space
Paper • 2511.18822 • Published • 28 -
What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards
Paper • 2512.00425 • Published • 49 -
Learning Eigenstructures of Unstructured Data Manifolds
Paper • 2512.01103 • Published • 5
-
Nuclear Norm Regularization for Deep Learning
Paper • 2405.14544 • Published • 1 -
Token embeddings violate the manifold hypothesis
Paper • 2504.01002 • Published • 1 -
Approximate Nullspace Augmented Finetuning for Robust Vision Transformers
Paper • 2403.10476 • Published • 1 -
ElaLoRA: Elastic & Learnable Low-Rank Adaptation for Efficient Model Fine-Tuning
Paper • 2504.00254 • Published • 1