[NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
			
	
	- 
	
	
	
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
Paper • 2506.18898 • Published • 33 - 
	
	
	47
Tar
🚀Unified MLLM with Text-Aligned Representations
 - 
	
	
	3
Tar
🚀Unified MLLM with Text-Aligned Representations
 - 
	
	
	60
Tar
🚀Unified MLLM with Text-Aligned Representations