๐ŸŒˆ Qwen-Image-Edit-MeiTu

This model โ€” Qwen-Image-Edit-MeiTu โ€” is an improved variant of Qwen/Qwen-Image-Edit, built with DiT-based architecture fine-tuning to enhance visual consistency, aesthetic quality, and structural alignment in complex edits.

Developed by Valiant Cat AI Lab, this version aims to further close the gap between high-fidelity semantic editing and coherent artistic rendering, achieving a more natural and professional output across a wide range of prompts and subjects.


โœจ Key Improvements

  • Enhanced Consistency:
    Utilizes DiT (Diffusion Transformer) fine-tuning to ensure structural stability between input and edited regions, maintaining global spatial coherence.

  • Aesthetic Optimization:
    Trained with aesthetic discriminators and curated aesthetic score datasets, producing more pleasing colors, contrast, and light balance.

  • Better Detail Preservation:
    Improved low-level reconstruction for fine details such as textures, faces, and typography.

  • Broader Scene Adaptability:
    Performs well on portraits, environments, product photos, and illustrations, supporting both semantic and appearance-based editing.


๐Ÿ–ผ๏ธ Showcase

Below are examples of consistency and aesthetic improvement in complex editing scenarios:

Input & Output

๐Ÿ’ฌ Recommended Prompts

Try these prompts to explore the modelโ€™s strengths:

  • โ€œmake the lighting soft and cinematic with better balanceโ€
  • โ€œenhance the photoโ€™s composition and maintain realismโ€
  • โ€œrefine skin tone and texture consistencyโ€
  • โ€œimprove the global color tone and aesthetic harmonyโ€
  • โ€œincrease photo realism and clarity without changing contentโ€

๐Ÿงฉ Integration with ComfyUI

This model works seamlessly with a modified ComfyUI Qwen-Image-Edit workflow.
Just use this model in the Unet node to workflow for edit image.


๐Ÿ“ฅ Download Model

Weights available in Safetensors format:

๐Ÿ‘‰ Download Qwen-Image-Edit-MeiTu


๐Ÿง  Training

This model was trained and optimized by the
AI Laboratory of Chongqing Valiant Cat Technology Co., LTD.
Visit https://vvicat.com/ for business collaborations or research partnerships.


๐Ÿ“œ License

Licensed under Apache 2.0.


๐Ÿ’ผ Join Us

We are hiring research engineers and creative ML practitioners at
Chongqing Valiant Cat Technology Co., LTD โ€” reach out via
๐Ÿ“ง tommy@vvicat.com

Downloads last month
6,654
GGUF
Model size
20B params
Architecture
flux
Hardware compatibility
Log In to view the estimation

4-bit

5-bit

6-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for valiantcat/Qwen-Image-Edit-MeiTu

Quantized
(8)
this model
Quantizations
1 model

Collection including valiantcat/Qwen-Image-Edit-MeiTu