siyeng feng's picture

1045 267

siyeng feng

siyengfeng

·

AI & ML interests

None yet

Recent Activity

liked a model 22 days ago

lerobot/pi05_base

upvoted an article 22 days ago

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

liked a model 22 days ago

moonshotai/Kimi-K2-Thinking

View all activity

Organizations

None yet

upvoted an article 22 days ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

+2

Feb 4

•

185

upvoted 13 papers 4 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 263

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Paper • 2507.22827 • Published Jul 30 • 99

Deep Researcher with Test-Time Diffusion

Paper • 2507.16075 • Published Jul 21 • 67

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28 • 82

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 158

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again

Paper • 2507.22058 • Published Jul 29 • 39

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29 • 135

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Paper • 2507.22448 • Published Jul 30 • 66

Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training

Paper • 2508.00414 • Published Aug 1 • 93

CellForge: Agentic Design of Virtual Cell Models

Paper • 2508.02276 • Published Aug 4 • 39

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4 • 132

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published Jul 31 • 114

Phi-Ground Tech Report: Advancing Perception in GUI Grounding

Paper • 2507.23779 • Published Jul 31 • 44

upvoted 6 papers 5 months ago

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17 • 124

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 315

Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning

Paper • 2507.17512 • Published Jul 23 • 36

Yume: An Interactive World Generation Model

Paper • 2507.17744 • Published Jul 23 • 87

The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs

Paper • 2507.11097 • Published Jul 15 • 64

Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR

Paper • 2507.15778 • Published Jul 21 • 20