Less and less useful
Hi,
First of all, you would say that this model is impressive.
The possibility of using a single model for everything is an beautiful promise.
It was with this model that I generated my first videos, and I found it very good at the beginning.
But I realized how much of a dead end this model currently is.
The problem is that the model struggles to follow complex instructions, and it only gets worse.
Since I had never used the original Wan 2.2 model, I didn't realize this before, but even Wan AIO with the v* series had difficulties.
But since Mega, it's no longer really usable in my opinion.
Animating a character, asking a character or and object to leave the scene, or causing a significant change seems impossible, the model is simply deaf!
There's a constant feeling that we are just short of being able to create anything we want but never being understood, which is very frustrating.
The thing is, if one has never used the original Wan 2.2 (or even with lightx2v), they don't realize what they are missing from this perspective.
I don't blame the author, his initiative is useful for a wide range of use cases.
But I think a warning message would be useful, because it feels like I've wasted time trying to do fine things with a bulldozer.
I mean, version v10 is not perfect, but it is already more usable than Mega as long as one sticks to the T2V or I2V mode.
So, is it really necessary to unify everything?
There has been a disclaimer on the model card for a long time with this "warning", perhaps you missed it.
If you want to find a recent discussion talking specifically about motion and prompt adherence with potential solutions, see this discussion: https://huggingface.co/Phr00t/WAN2.2-14B-Rapid-AllInOne/discussions/150
It isn't necessary to unify everything, but it isn't necessary to run "original WAN 2.2" either for many use cases.
You claim v10 is more usable than Mega, but only Mega can do "first to last frame" which often is the solution for "asking a character or object to leave the scene".
Use whatever model you want. It doesn't have to be Mega. It doesn't even have to be mine. This is just another option that I find quite useful.