Something random

caferemix

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 12 days ago

General Agentic Memory Via Deep Research

Paper • 2511.18423 • Published 13 days ago • 155

upvoted 2 papers 15 days ago

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24 • 99

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28 • 96

upvoted a paper 16 days ago

The Principles of Diffusion Models

Paper • 2510.21890 • Published Oct 24 • 58

upvoted a paper 3 months ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1 • 75

upvoted a paper 4 months ago

Story2Board: A Training-Free Approach for Expressive Storyboard Generation

Paper • 2508.09983 • Published Aug 13 • 68

upvoted 3 papers 5 months ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19 • 127

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23 • 78

PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for Realistic Articulated Object Modeling

Paper • 2506.20936 • Published Jun 26 • 12

upvoted 2 papers 6 months ago

SpatialLM: Training Large Language Models for Structured Indoor Modeling

Paper • 2506.07491 • Published Jun 9 • 50

ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

Paper • 2506.09790 • Published Jun 11 • 53

upvoted 4 papers 7 months ago

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Paper • 2505.04512 • Published May 7 • 36

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

Paper • 2505.07747 • Published May 12 • 61

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 82

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 153

upvoted a paper 9 months ago

LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Paper • 2503.10625 • Published Mar 13 • 33

upvoted 4 papers about 1 year ago

OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published Nov 22, 2024 • 61

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Paper • 2411.05738 • Published Nov 8, 2024 • 15

MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D

Paper • 2411.02336 • Published Nov 4, 2024 • 24

Foundation Models for Music: A Survey

Paper • 2408.14340 • Published Aug 26, 2024 • 44