Chenyang Li

MorningsunLee

AI & ML interests

None yet

Recent Activity

upvoted a paper 27 days ago

MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity

upvoted a paper 27 days ago

Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts

upvoted a paper 27 days ago

Contamination Detection for VLMs using Multi-Modal Semantic Perturbation

View all activity

Organizations

None yet

upvoted 4 papers 27 days ago

MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity

Paper • 2511.03146 • Published Nov 5 • 7

Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts

Paper • 2511.04655 • Published 30 days ago • 7

Contamination Detection for VLMs using Multi-Modal Semantic Perturbation

Paper • 2511.03774 • Published about 1 month ago • 12

NVIDIA Nemotron Nano V2 VL

Paper • 2511.03929 • Published about 1 month ago • 26

upvoted 4 papers 28 days ago

V-Thinker: Interactive Thinking with Images

Paper • 2511.04460 • Published 30 days ago • 96

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published 30 days ago • 208

GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents

Paper • 2511.04307 • Published 30 days ago • 14

HoneyBee: Data Recipes for Vision-Language Reasoners

Paper • 2510.12225 • Published Oct 14 • 10

upvoted 7 papers about 1 month ago

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28 • 96

LongCat-Video Technical Report

Paper • 2510.22200 • Published Oct 25 • 25

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

Paper • 2510.23393 • Published Oct 27 • 20

A Survey of Data Agents: Emerging Paradigm or Overstated Hype?

Paper • 2510.23587 • Published Oct 27 • 65

Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation

Paper • 2510.21583 • Published Oct 24 • 30

A Definition of AGI

Paper • 2510.18212 • Published Oct 21 • 34

UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning

Paper • 2510.20286 • Published Oct 23 • 23

upvoted a collection about 1 month ago

Qwen3-VL

Collection

37 items • Updated Nov 1 • 487

upvoted a paper about 1 month ago

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Paper • 2510.20822 • Published Oct 23 • 39

upvoted a paper about 2 months ago

Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures

Paper • 2510.14616 • Published Oct 16 • 11

upvoted 2 papers 3 months ago

AToken: A Unified Tokenizer for Vision

Paper • 2509.14476 • Published Sep 17 • 36

LLM-I: LLMs are Naturally Interleaved Multimodal Creators

Paper • 2509.13642 • Published Sep 17 • 8

Chenyang Li

AI & ML interests

Recent Activity

Organizations

MorningsunLee's activity