Temporal inconsistency when decoding chunk-wise ODE latents

Hi, thank you for your great work!

I’m generating ODE data with the chunk-wise model:
```
torchrun --nproc_per_node=8 \
  get_causal_ode_data_chunkwise.py \
  --generator_ckpt checkpoints/chunkwise/ar_diffusion.pt \
  --rawdata_path dataset/clean_data \
  --output_folder dataset/ODE6KCausal_chunkwise_latents
```
During generation, I decoded the latents into RGB for visualization and observed noticeable frame-to-frame jumps.

https://github.com/user-attachments/assets/2fb5ce3c-8f4e-4497-807a-a826c7764982

https://github.com/user-attachments/assets/dfb6a25f-b3f1-4911-88c9-669ea534410c

My understanding is that this may stem from a mismatch between conditioning on ground-truth frames (teacher forcing) versus model-generated frames during autoregressive rollout.

I would like to kindly ask:

1. Is this behavior expected?
2. Would it affect the subsequent ODE training?

Thank you very much for your help!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Temporal inconsistency when decoding chunk-wise ODE latents #31

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Temporal inconsistency when decoding chunk-wise ODE latents #31

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions