new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Jan 1

Submitted by

AdinaY

mHC: Manifold-Constrained Hyper-Connections

deepseek-ai

Submitted by

taesiri

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

·
38 authors

Submitted by

taesiri

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

·
88 authors

Submitted by

yulunliu

GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction

·
5 authors

Submitted by

NasirzadehMoh

A unified framework for detecting point and collective anomalies in operating system logs via collaborative transformers

alarmif

Submitted by

shash42

Scaling Open-Ended Reasoning to Predict the Future

Intelligent-Systems

Max Planck Institute for Intelligent Systems

Submitted by

CaiYuanhao

PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation

meta-ai-for-media-research

Meta AI for Media Research

Submitted by

kugwzk

AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents

·
15 authors

Submitted by

yusufma555

GR-Dexter Technical Report

ByteDance-Seed

Submitted by

xingyu-zhou

Guiding a Diffusion Transformer with the Internal Dynamics of Itself

CVLUESTC

Submitted by

taesiri

Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process

deepmind

Submitted by

lllyasviel

Pretraining Frame Preservation in Autoregressive Video Memory Compression

·
9 authors

Submitted by

taesiri

SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time

·
7 authors

Submitted by

songw-zju

Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems

zju

Zhejiang University

Submitted by

taesiri

Figure It Out: Improving the Frontier of Reasoning with Active Visual Thinking

·
3 authors

Submitted by

kkail8

JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

JavisVerse

Submitted by

Atakanisik

Geometry-Aware Optimization for Respiratory Sound Classification: Enhancing Sensitivity with SAM-Optimized Audio Spectrogram Transformers

·
4 authors

Submitted by

henry12348

BEDA: Belief Estimation as Probabilistic Constraints for Performing Strategic Dialogue Acts

·
11 authors

Submitted by

wenzhengzeng

Factorized Learning for Temporally Grounded Video-Language Models

NationalUniversityofSingapore

National University of Singapore

Submitted by

varam17

Valori: A Deterministic Memory Substrate for AI Systems

·
1 authors