We will release the synthetic dataset soon.
The dataset will be released in three formats, MX `rec` files, image folder tarified (i.e., for usage with ImageTar dataloader) or uncompressed image folder hierarchy.
@inproceedings{
rahimi2025auggen,
title={AugGen: Synthetic Augmentation using Diffusion Models Can Improve Recognition},
author={Parsa Rahimi and Damien Teney and S{\'e}bastien Marcel},
booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
year={2025},
url={https://openreview.net/forum?id=LuKlBH8DAT}
}