You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am working on an action-controlled Wan2.2 TI2V (Image-to-Video) model. When converting the model from bidirectional flow to an autoregressive teacher for distillation, I encountered the following issue: Training predictions: Look normal and reasonable. Inference results: Are significantly worse, with poor quality.
I have attached the following for reference:
I am working on an action-controlled Wan2.2 TI2V (Image-to-Video) model. When converting the model from bidirectional flow to an autoregressive teacher for distillation, I encountered the following issue:
Training predictions: Look normal and reasonable.
Inference results: Are significantly worse, with poor quality.
I have attached the following for reference:
Training prediction samples
train_pred_step_001300.mp4
Inference output samples:
ar_VID_20251229_164739_147_slice_295_back_180_step_001300.mp4
Training loss curves

What could be causing this discrepancy between training and inference quality?