Questions about distillation algorithms

#25

by min123456 - opened Sep 29

Sep 29

The distillation models related to qwen are all distilled based on the DMD2 algorithm. The distillation models related to video generation are based on improved training of self-forcing. I don’t know if this is correct.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment