MeVer GAN Detector v1.0
Olga Papadopoulou {olgapapa@iti.gr}
CERTH
This model detects images that have been generated by the StyleGAN2 architecture, as well as GAN images from similar GAN architectures. It is based on a ResNet-50 pretrained on ImageNet. The model is trained using the AdamW optimizer, the learning rate was equal to 10−3, and a step scheduler with 5 epochs was used. Weight decay is also applied with a factor of 5 · 10−5. A drop path rate of 0.1 is employed to prevent overfitting, which randomly drops entire paths (i.e., sequences of layers) in the model during training.
The training set contains images 35.000 real images of FFHQ (human faces) and 35.000 generated images from the corresponding StyleGAN2 pretrained NVIDIA model (that were selected with the GIQA method). The model was trained using strong augmentation schemes in order to achieve high generalization scores in different semantic domains (for e.g. AFHQ).
v1.0, created 15/06/2023
Dogoulis, P., Kordopatis-Zilos, G., Kompatsiaris, I., & Papadopoulos, S. (2023, June). Improving Synthetically Generated Image Detection in Cross-Concept Settings. In Proceedings of the 2nd ACM International Workshop on Multimedia AI against Disinformation (pp. 28-35).