Leonardo Molina

leo-mv

AI & ML interests

None yet

Recent Activity

liked a model 17 days ago

Qwen/Qwen3-Reranker-0.6B

liked a model 18 days ago

nvidia/NVIDIA-Nemotron-Parse-v1.1

liked a Space 18 days ago

merve/SAM3-video-segmentation

View all activity

Organizations

None yet

liked a model 17 days ago

Qwen/Qwen3-Reranker-0.6B

Text Ranking • 0.6B • Updated Jun 9 • 561k • 273

liked a model 18 days ago

nvidia/NVIDIA-Nemotron-Parse-v1.1

Image-Text-to-Text • Updated 12 days ago • 14.7k • 112

liked a Space 18 days ago

SAM3 Video Segmentation

🐠

Track and label objects in videos using text prompts or clicks

liked a model 19 days ago

WeiboAI/VibeThinker-1.5B

Text Generation • 2B • Updated 14 days ago • 27.9k • 498

upvoted an article about 1 month ago

Article

Vision Language Models (Better, faster, stronger)

May 12

•

568

upvoted 2 collections about 1 month ago

Nemotron RAG

Collection

12 items • Updated 4 days ago • 47

NVIDIA Nemotron V2

Collection

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 4 days ago • 92

liked a model about 2 months ago

Qwen/Qwen3-VL-4B-Instruct

Image-Text-to-Text • 4B • Updated Oct 15 • 795k • 255

liked a model 3 months ago

Qwen/Qwen3-VL-235B-A22B-Instruct

Image-Text-to-Text • 236B • Updated 12 days ago • 68.5k • • 325

liked a model 4 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18 • 302k • • 2.26k

upvoted an article 5 months ago

Article

Introducing ColQwen-Omni: Retrieve in every modality

Jul 17

•

liked 4 models 5 months ago

upvoted an article 5 months ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5, 2024

•

303

upvoted a paper about 2 years ago

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

Paper • 2311.05437 • Published Nov 9, 2023 • 51

Leonardo Molina

AI & ML interests

Recent Activity

Organizations

leo-mv's activity

SAM3 Video Segmentation

Vision Language Models (Better, faster, stronger)

Introducing ColQwen-Omni: Retrieve in every modality

ColPali: Efficient Document Retrieval with Vision Language Models 👀