- 
	
	
	Tracking Anything with Decoupled Video SegmentationPaper • 2309.03903 • Published • 28
- 
	
	
	City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the WebPaper • 2312.16457 • Published • 15
- 
	
	
	A Recipe for Scaling up Text-to-Video Generation with Text-free VideosPaper • 2312.15770 • Published • 15
william cody stanford
williamcstanford
		AI & ML interests
None yet
		
		Organizations
None yet
diffusion
			
			
	
	- 
	
	
	A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image GenerationPaper • 2310.16656 • Published • 50
- 
	
	
	CommonCanvas: An Open Diffusion Model Trained with Creative-Commons ImagesPaper • 2310.16825 • Published • 36
- 
	
	
	Matryoshka Diffusion ModelsPaper • 2310.15111 • Published • 43
- 
	
	
	I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion ModelsPaper • 2311.04145 • Published • 35
video segmentation
			
			
	
	- 
	
	
	Tracking Anything with Decoupled Video SegmentationPaper • 2309.03903 • Published • 28
- 
	
	
	City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the WebPaper • 2312.16457 • Published • 15
- 
	
	
	A Recipe for Scaling up Text-to-Video Generation with Text-free VideosPaper • 2312.15770 • Published • 15
diffusion
			
			
	
	- 
	
	
	A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image GenerationPaper • 2310.16656 • Published • 50
- 
	
	
	CommonCanvas: An Open Diffusion Model Trained with Creative-Commons ImagesPaper • 2310.16825 • Published • 36
- 
	
	
	Matryoshka Diffusion ModelsPaper • 2310.15111 • Published • 43
- 
	
	
	I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion ModelsPaper • 2311.04145 • Published • 35
			models
			0
		
			
	None public yet
			datasets
			0
		
			
	None public yet
 
								