Skip to content
Merged
Show file tree
Hide file tree
Changes from 2 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 8 additions & 0 deletions text_to_image/README.md

@ShriyaRishab ShriyaRishab Jul 23, 2025

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To avoid confusion, can we update the pseudocode in Quality metric section so t is not computed but is obtained directly from the dataset? It will help submitters to see pseudocode for what they need to implement instead of how the validation dataset was originally generated.

You can add an appendix in the end with the pseudocode used to generate the timestamps for the validation dataset as an FYI

Original file line number Diff line number Diff line change
Expand Up @@ -78,6 +78,8 @@ The filter file is included in this repository. It was generated using `torchtit

### COCO-2014 subset

For validation purposes, each sample of the dataset is associated with a timestep that is used to evaluate it.
For more details, consult the [evaluation algorithm](#quality-metric)
To download the cleaned data, run the following:

**Note:** We reccomend training directly on preprocessed embeddings. To do that, skip [here](#preprocessing).
Expand Down Expand Up @@ -288,6 +290,9 @@ For precise details, we encourage the consultation of the code at `torchtitan/ex
# 5. Quality
### Quality metric
Validation loss averaged over 8 equidistant time steps [0, 7/8], as described in [Scaling Rectified Flow Transformers for High-Resolution Image Synthesis](https://arxiv.org/pdf/2403.03206).
The validation dataset is prepared in advance so that each sample is associated with a timestep.
This is an integer from 0 to 7 inclusive, and thus should be divided by `8.0` to obtain the timestep.

The algorithm is as follows:

```pseudocode
Expand All @@ -314,6 +319,9 @@ validation_loss = mean(mean_per_timestep)
RETURN validation_loss
```

As we ensure that the validation set has an equal number of samples per timestep,
a simple average of all loss values is equivalent to the above.

### Quality target
0.586
### Evaluation frequency
Expand Down