Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shuangrui Ding's picture
3 24 16

Shuangrui Ding

Mar2Ding
russelljohnson's profile picture ChrisDing1105's profile picture Fayaz's profile picture
·
https://mark12ding.github.io/
  • ShuangruiDing
  • mark12ding

AI & ML interests

None yet

Organizations

None yet

authored a paper 8 months ago

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Paper • 2502.13128 • Published Feb 18 • 41
authored 2 papers 10 months ago

OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Paper • 2501.05510 • Published Jan 9 • 43

Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction

Paper • 2501.03218 • Published Jan 6 • 36
authored a paper 11 months ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 98
authored a paper about 1 year ago

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21, 2024 • 69
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs