Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark Paper • 2510.26802 • Published 3 days ago
Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity Paper • 2510.01171 • Published Oct 1
RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training Paper • 2510.06710 • Published 25 days ago
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published Sep 2
ThinkGrasp: A Vision-Language System for Strategic Part Grasping in Clutter Paper • 2407.11298 • Published Jul 16, 2024
BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities Paper • 2510.08759 • Published 24 days ago
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper • 2510.05684 • Published 26 days ago
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Paper • 2502.09621 • Published Feb 13
VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning Paper • 2506.09049 • Published Jun 10