Uncoupling, sounds good. thank you.
As for standard initialization, would it be reasonable to give inputs based on a few first timesteps solved via DPM instead of DDPM? This way, provided that divergence is due to bad initialization, I might come closer to a better start.