Code supporting the publication: BAN: detecting backdoors activated by adversarial neuron noise.

Author name

doi:10.4121/e53e4f1f-d51e-48c2-ad94-7fc24e9f4fc3

4 commits. Last commit ≈ 19 months ago.

Description

This repository contains the code for Xu et al., "BAN: detecting backdoors activated by adversarial neuron noise," Advances in Neural Information Processing Systems 37 (2024): 114348-114373. The paper improves backdoor feature inversion for backdoor detection by incorporating extra neuron activation information. We add adversarial noise to a model's weights so that its loss increases; in a backdoored model this activates the backdoor effect, which makes backdoored and clean models easy to tell apart.
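
To make the idea of increasing the loss with respect to the weights concrete, here is a minimal toy sketch in plain Python. It is not the code from this repository or the paper's algorithm: the linear classifier, the single sign-gradient step, the additive noise bound `eps`, and all function names (`logistic_loss`, `loss_gradient`, `adversarial_weight_noise`) are assumptions chosen for illustration only.

```python
import math


def _sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def logistic_loss(weights, inputs, labels):
    """Mean binary cross-entropy of a linear classifier (labels are 0 or 1)."""
    total = 0.0
    for x, y in zip(inputs, labels):
        p = _sigmoid(sum(w * xi for w, xi in zip(weights, x)))
        p = min(max(p, 1e-12), 1.0 - 1e-12)  # avoid log(0)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(inputs)


def loss_gradient(weights, inputs, labels):
    """Gradient of the mean cross-entropy with respect to the weights."""
    grad = [0.0] * len(weights)
    for x, y in zip(inputs, labels):
        p = _sigmoid(sum(w * xi for w, xi in zip(weights, x)))
        for i, xi in enumerate(x):
            grad[i] += (p - y) * xi / len(inputs)
    return grad


def adversarial_weight_noise(weights, inputs, labels, eps=0.3):
    """One sign-gradient ascent step on the weights (hypothetical helper).

    Each weight moves by at most eps in the direction that increases the loss.
    """
    grad = loss_gradient(weights, inputs, labels)
    return [w + eps * ((g > 0) - (g < 0)) for w, g in zip(weights, grad)]


# Toy data: two positive and two negative samples (illustrative values only).
inputs = [[1.0, 2.0], [2.0, 1.0], [-1.0, -1.5], [-2.0, -0.5]]
labels = [1, 1, 0, 0]
weights = [1.0, 1.0]

noisy_weights = adversarial_weight_noise(weights, inputs, labels, eps=0.3)
clean_loss = logistic_loss(weights, inputs, labels)
noisy_loss = logistic_loss(noisy_weights, inputs, labels)
print(f"loss before noise: {clean_loss:.4f}, after noise: {noisy_loss:.4f}")
```

In this sketch the noise is applied once and the resulting loss is simply compared with the clean loss. How the perturbed model's behavior is then used to separate backdoored from clean models is described in the paper and is not reproduced here.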