Publikationen
Ausgewählte Publikationen
Hier finden Sie ausgewählte Publikationen aus den letzten Jahren. Eine ausführliche Liste der Publikationen finden Sie auf der Google Scholar oder DBLP Seite von Stefan Schneegaß.
Art der Publikation: Beitrag in Sammelwerk
ExplAInable Pixels: Investigating One-Pixel Attacks on Deep Learning Models with Explainable Visualizations
- Autor(en):
- Keppel, Jonas; Liebers, Jonathan; Auda, Jonas; Gruenefeld, Uwe; Schneegass, Stefan
- Titel des Sammelbands:
- Proceedings of the 21st International Conference on Mobile and Ubiquitous Multimedia
- Seiten:
- 231-242
- Verlag:
- Association for Computing Machinery
- Ort(e):
- New York, NY, USA
- Veröffentlichung:
- 2022
- ISBN:
- 9781450398206
- Schlagworte:
- human-in-the-loop, explainability, adversarial examples, one-pixel attacks
- Digital Object Identifier (DOI):
- doi:10.1145/3568444.3568469
- Zitation:
- Download BibTeX
Kurzfassung
Nowadays, deep learning models enable numerous safety-critical applications, such as biometric authentication, medical diagnosis support, and self-driving cars. However, previous studies have frequently demonstrated that these models are attackable through slight modifications of their inputs, so-called adversarial attacks. Hence, researchers proposed investigating examples of these attacks with explainable artificial intelligence to understand them better. In this line, we developed an expert tool to explore adversarial attacks and defenses against them. To demonstrate the capabilities of our visualization tool, we worked with the publicly available CIFAR-10 dataset and generated one-pixel attacks. After that, we conducted an online evaluation with 16 experts. We found that our tool is usable and practical, providing evidence that it can support understanding, explaining, and preventing adversarial examples.