Rui Sun PRO
ThreeSR
		AI & ML interests
Vision and Language Multimodal Learning, CV, NLP, LLM
		Recent Activity
						upvoted 
								a
								paper
							
						22 days ago
						
					
						
						
						Paper2Video: Automatic Video Generation from Scientific Papers
						
						upvoted 
								a
								paper
							
						about 1 month ago
						
					
						
						
						Video models are zero-shot learners and reasoners
						
						upvoted 
								a
								paper
							
						about 1 month ago
						
					
						
						
						MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid
  Vision Tokenizer
						

