【论文阅读】DaST: Data-free Substitute Training for Adversarial Attacks（2020）-CSDN博客

摘要

Machine learning models（机器学习模型） are vulnerable（容易受到） to adversarial examples（对抗样本）. For the black-box setting（对于黑盒设置）, current substitute attacks（目前替代攻击） need pre-trained models（预训练模型） to generate adversarial examples（生成对抗样本）. However, pre-trained models（预训练模型） are hard to obtain（很难获得） in real-world tasks（在实际任务中）. In this paper, we propose a data-free substitute training method（无数据替代训练方法） (DaST) to obtain substitute models（获得替代模型） for adversarial black-box attacks（对抗性黑盒攻击） without the requirement of any real data（不需要任何真实数据）. To achieve this, DaST utilizes specially designed（专门设计） generative adversarial networks（生成对抗网络） (GANs) to train the substitute models. In particular（特别地）, we design a multi-branch architecture（多分支架构） and label-control loss（标签控制损失） for the generative model to deal with（处理） the uneven distribution（不均匀分布） of synthetic samples（生成样本）. The substitute model is then trained by the synthetic samples（合成样本） generated by the generative model（生成模型