We will release the synthetic dataset soon.
The dataset will be released in three formats, MX `rec` files, image folder tarified (i.e., for usage with ImageTar dataloader) or uncompressed image folder hierarchy.
@article{rahimi2025synthetic,
title={AugGen: Synthetic Augmentation Can Improve Discriminative Models},
author={Rahimi, Parsa and Teney, Damien and Marcel, Sebastien},
journal={arXiv preprint arXiv:2503.11544},
year={2025}
}