Publications
Selected Publications
This page shows selected publications from the last years. For a detailed list please refer to the Google Scholar or DBLP page of Stefan Schneegass.
Type of Publication: Article in Collected Edition
ExplAInable Pixels: Investigating One-Pixel Attacks on Deep Learning Models with Explainable Visualizations
- Author(s):
- Keppel, Jonas; Liebers, Jonathan; Auda, Jonas; Gruenefeld, Uwe; Schneegass, Stefan
- Title of Anthology:
- Proceedings of the 21st International Conference on Mobile and Ubiquitous Multimedia
- pages:
- 231-242
- Publisher:
- Association for Computing Machinery
- Location(s):
- New York, NY, USA
- Publication Date:
- 2022
- ISBN:
- 9781450398206
- Keywords:
- human-in-the-loop, explainability, adversarial examples, one-pixel attacks
- Digital Object Identifier (DOI):
- doi:10.1145/3568444.3568469
- Citation:
- Download BibTeX
Abstract
Nowadays, deep learning models enable numerous safety-critical applications, such as biometric authentication, medical diagnosis support, and self-driving cars. However, previous studies have frequently demonstrated that these models are attackable through slight modifications of their inputs, so-called adversarial attacks. Hence, researchers proposed investigating examples of these attacks with explainable artificial intelligence to understand them better. In this line, we developed an expert tool to explore adversarial attacks and defenses against them. To demonstrate the capabilities of our visualization tool, we worked with the publicly available CIFAR-10 dataset and generated one-pixel attacks. After that, we conducted an online evaluation with 16 experts. We found that our tool is usable and practical, providing evidence that it can support understanding, explaining, and preventing adversarial examples.