Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
25
1
Xiaohan Wang
nicholswang
Follow
eMiLNeTo's profile picture
Anselmehacklab's profile picture
l3nux's profile picture
18 followers
·
4 following
https://wxh1996.github.io/
XiaohanWang96
AI & ML interests
Video Understanding, Vision-Language Models
Recent Activity
authored
a paper
8 days ago
Closing the Modality Gap for Mixed Modality Search
authored
a paper
8 days ago
SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models
authored
a paper
8 days ago
FineVision: Open Data Is All You Need
View all activity
Organizations
nicholswang
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
published
an
article
3 months ago
view article
Article
TimeScope: How Long Can Your Video Large Multimodal Model Go?
Jul 23
•
46