Rui Sun's picture

1 26 1

Rui Sun PRO

ThreeSR

·

https://threesr.github.io/

AI & ML interests

Vision and Language Multimodal Learning, CV, NLP, LLM

Recent Activity

upvoted a paper 21 days ago

Paper2Video: Automatic Video Generation from Scientific Papers

upvoted a paper about 1 month ago

Video models are zero-shot learners and reasoners

upvoted a paper about 1 month ago

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

View all activity

Organizations

authored a paper about 1 month ago

Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Paper • 2506.15677 • Published Jun 18 • 23

authored a paper 8 months ago

Scaling Autonomous Agents via Automatic Reward Modeling And Planning

Paper • 2502.12130 • Published Feb 17 • 2

authored a paper almost 2 years ago

GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs

Paper • 2311.04901 • Published Nov 8, 2023 • 11