arxiv:2507.22062
Shang-Wen Daniel Li
swdanielli
AI & ML interests
Large foundation models, vision and language multimodal, and pretraining and self-supervised training
Recent Activity
liked
a model
28 days ago
facebook/DepthLM
upvoted
a
paper
28 days ago
DepthLM: Metric Depth From Vision Language Models
upvoted
a
collection
about 2 months ago
Meta CLIP