Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Rui Sun's picture
1 26 1

Rui Sun PRO

ThreeSR
qywu's profile picture
·
https://threesr.github.io/
  • RuiSun94013021
  • ThreeSR
  • rui-sun-three

AI & ML interests

Vision and Language Multimodal Learning, CV, NLP, LLM

Recent Activity

upvoted a paper 21 days ago
Paper2Video: Automatic Video Generation from Scientific Papers
upvoted a paper about 1 month ago
Video models are zero-shot learners and reasoners
upvoted a paper about 1 month ago
MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
View all activity

Organizations

Columbia NLP's profile picture MemGuiAgent's profile picture

authored a paper about 1 month ago

Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Paper • 2506.15677 • Published Jun 18 • 23
authored a paper 8 months ago

Scaling Autonomous Agents via Automatic Reward Modeling And Planning

Paper • 2502.12130 • Published Feb 17 • 2
authored a paper almost 2 years ago

GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs

Paper • 2311.04901 • Published Nov 8, 2023 • 11
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs