Item

GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing

Islam, Khawar
Zaheer, Muhammad Zaigham
Mahmood, Arif
Nandakumar, Karthik
Akhtar, Naveed
Supervisor
Department
Computer Vision
Embargo End Date
Type
Journal article
Date
License
Language
English
Collections
Research Projects
Organizational Units
Journal Issue
Abstract
Data augmentation is widely used to enhance generalization in visual classification tasks. However, traditional methods struggle when source and target domains differ, as in domain adaptation, due to their inability to address domain gaps. We introduce a novel GenMix, a generalizable prompt-guided generative data augmentation approach that enhances both in-domain and cross-domain image classification. Our technique leverages image editing to generate augmented images based on custom conditional prompts, designed specifically for each problem type. By blending portions of the input image with its edited generative counterpart and incorporating fractal patterns, our approach mitigates unrealistic images and label ambiguity, improving performance and adversarial robustness of the resulting models. Our GenMix further incorporates a faithfulness-aware filtration step that retains only semantically consistent edited images for augmentation. Efficacy of our method is established with extensive experiments on eight public datasets for general and fine-grained classification, in both in-domain and cross-domain settings. Additionally, we demonstrate performance improvements for self-supervised learning, learning with data scarcity, and adversarial robustness. As compared to the existing state-of-the-art methods, our technique achieves stronger performance across the board. The Code is available on https://github.com/khawar-islam/GenMix
Citation
K. Islam, M.Z. Zaheer, A. Mahmood, K. Nandakumar, N. Akhtar, "GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing," Expert Systems with Applications, pp. 132273-132273, 2026, https://doi.org/10.1016/j.eswa.2026.132273.
Source
Expert Systems with Applications
Conference
Keywords
46 Information and Computing Sciences, 4607 Graphics, Augmented Reality and Games, 4611 Machine Learning
Subjects
Source
Publisher
Elsevier
Full-text link