---
license: other
license_name: qwen
license_link: https://huggingface.co/Qwen/Qwen2.5-72B-Instruct/blob/main/LICENSE
language:
- zho
- eng
- fra
- spa
- por
- deu
- ita
- rus
- jpn
- kor
- vie
- tha
- ara
pipeline_tag: text-generation
base_model:
- Qwen/Qwen2.5-72B
- Qwen/Qwen2.5-72B-Instruct
base_model_relation: merge
tags:
- chat
library_name: transformers
---
# Qwen2.5-72B-0.6x-Instruct

This is a linear merge of [Qwen/Qwen2.5-72B-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct) at weight `0.6` and [Qwen/Qwen2.5-72B](https://huggingface.co/Qwen/Qwen2.5-72B) at weight `0.4`.

The resulting model is 60% Instruct and 40% base model, hence the name **`0.6x-Instruct`**.
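
For readers unfamiliar with linear merging, here is a minimal sketch of the arithmetic it implies: each merged parameter is a weighted average of the matching parameters in the two checkpoints. This is an illustration only, not the tooling used to build this model (the card does not say which tool was used). The function `linear_merge` and the toy parameter dictionaries are hypothetical, and the sketch assumes both checkpoints share identical parameter names and shapes.

```python
def linear_merge(state_dicts, weights):
    """Return the weighted average of several parameter dictionaries.

    Hypothetical helper for illustration: each dictionary maps a parameter
    name to a flat list of floats. Real checkpoints hold tensors, but the
    per-element arithmetic is the same.
    """
    if len(state_dicts) != len(weights):
        raise ValueError("need exactly one weight per state dict")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive number")

    names = state_dicts[0].keys()
    for sd in state_dicts[1:]:
        if sd.keys() != names:
            raise ValueError("all state dicts must have the same parameter names")

    merged = {}
    for name in names:
        columns = zip(*(sd[name] for sd in state_dicts))
        # Normalize by the weight total so the result is a true average.
        merged[name] = [
            sum(w * v for w, v in zip(weights, values)) / total
            for values in columns
        ]
    return merged


# Toy stand-ins for the two checkpoints (made-up values, not real weights).
instruct = {"layer.weight": [1.0, 2.0]}
base = {"layer.weight": [0.0, 4.0]}

# 0.6 * Instruct + 0.4 * base, matching the ratio described above.
merged = linear_merge([instruct, base], [0.6, 0.4])
print(merged)
```

With these toy values, each merged element sits 60% of the way toward the Instruct value and 40% toward the base value.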

The goal of the merge was to make the Instruct model more flexible and less rigid. After some initial testing, I think the resulting model meets this goal, and I find it useful and interesting enough to warrant publishing.