Ron Zhu
RzZ
AI & ML interests
None yet
Organizations
None yet

VLM
- UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces • Paper • 2312.15715 • Published • 21
- Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence • Paper • 2505.23747 • Published • 68
- VideoPrism: A Foundational Visual Encoder for Video Understanding • Paper • 2402.13217 • Published • 37
- Scaling RL to Long Videos • Paper • 2507.07966 • Published • 157

Robotic
- Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control • Paper • 2506.01943 • Published • 25
- LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks • Paper • 2506.00411 • Published • 31
- SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics • Paper • 2506.01844 • Published • 140

Models 11
- RzZ/Qwen2.5-VL-3B-GGUF • 3B • Updated • 21
- RzZ/Qwen2.5-VL-32B-Instruct-GGUF • 0.7B • Updated • 4
- RzZ/sd-v1-4-adapter-seg • Updated • 1
- RzZ/sd-v1-4-adapter-depth • Updated • 2
- RzZ/sd-v1-4-adapter-keypose • Updated • 4
- RzZ/sd-v1-4-adapter-color • Updated • 1
- RzZ/sd-v1-4-adapter-canny • Updated • 1
- RzZ/sd-v1-4-adapter-sketch • Updated • 1
- RzZ/sd-v1-4-adapter-openpose • Updated • 2
- RzZ/sd-v1-4-adapter-keypose-depth • Updated • 1