2 3 2

Yushuo Guan

UnnamedWatcher

AI & ML interests

None yet

Recent Activity

upvoted a paper 21 days ago

AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration

upvoted a paper 5 months ago

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios

authored a paper 7 months ago

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

View all activity

Organizations

None yet

upvoted a paper 21 days ago

AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration

Paper • 2510.10395 • Published 23 days ago • 28

upvoted a paper 5 months ago

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios

Paper • 2505.21333 • Published May 27 • 38

authored a paper 7 months ago

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Paper • 2504.10068 • Published Apr 14 • 30

upvoted a paper 7 months ago

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Paper • 2504.10068 • Published Apr 14 • 30

New activity in OpenGVLab/VideoChat-Flash-Qwen2-7B_res448 10 months ago

Error when run the demo code.

#1 opened 10 months ago by

UnnamedWatcher

liked a Space 12 months ago

609

Kolors Portrait With Flux

🤗

Kolors Portrait to keep face identity developed with Flux

liked a Space about 1 year ago

9.84k

Kolors Virtual Try-On

👕

Try on clothes virtually by uploading images

updated a model about 1 year ago

Kwai-Kolors/Kolors-ControlNet-Pose

Updated Aug 5, 2024 • 121 • 10

Yushuo Guan

AI & ML interests

Recent Activity

Organizations

UnnamedWatcher's activity

Error when run the demo code.

Kolors Portrait With Flux

Kolors Virtual Try-On