NAT: Learning to Attack Neurons for Enhanced Adversarial Transferability, WACV 2025

`Krishna Kanth Nakka` and `Alexandre Alahi`

News

[March 2025]: Released best performing generator.
[Jan 2026]: Released 40 pretrained generators on HuggingFace.

Introduction

The generation of transferable adversarial perturbations typically involves training a generator to maximize embedding separation between clean and adversarial images at a single mid-layer of a source model. In this work, we build on this approach and introduce Neuron Attack for Transferability (NAT), a method designed to target specific neuron within the embedding. Our approach is motivated by the observation that previous layer-level optimizations often disproportionately focus on a few neurons representing similar concepts, leaving other neurons within the attacked layer minimally affected. NAT shifts the focus from embeddinglevelseparation to a more fundamental, neuron-specific approach. We find that targeting individual neurons effectively disrupts the core units of the neural network, providing a common basis for transferability across different models. Through extensive experiments on 41 diverse ImageNet models and 9 fine-grained models, NAT achieves fooling rates that surpass existing baselines by over 14% in crossmodel and 4% in cross-domain settings.

For more details, refer to the main paper and supplementary at CVF Website

Setup

The code has been tested on following packages:

conda env create -f environment.yaml

Dataset

For evaluation, we use subset of 5000 images available at the data subfolder. This subset is taken from the LTP paper of NeurIPS 2021.

Trained Models

To download from Hugginface all pretrained generators

from huggingface_hub import hf_hub_download
import os

# ----------------------------------------------------------------
#  to download all generators
# ----------------------------------------------------------------

repo_id = "KKNakka/NAT"

# 2. Download everything to the ./checkpoints folder
local_dir_path = snapshot_download(
    repo_id=repo_id,
    local_dir="./checkpoints",
    local_dir_use_symlinks=False,
)

print(f"All generators downloaded to: {local_dir_path}")

[NEW] We also released all trained 40 generators at Huggingface

Evaluation

To reproduce the results presented in Tables 2, 3, and 4 of the main paper using the query k=1, use the checkpoint with neuron=250
To run the attack on ResNet152, use the following command:
```
python eval.py --nat_attacked_neuron 250
```
Please refer to the Table 1 in the supplementary for the exact versions of the target models.

Training

For training, we use the LTP repository available and change the loss function to choose the single channel instead of all channels. The modified loss function is available in the file loss.py
For generator, we used a slightly modified architecture removing reflectionpad as we found that to be make results non-deterministic even with same seed.

Citation

@InProceedings{Nakka_2025_WACV,
    author    = {Nakka, Krishna Kanth and Alahi, Alexandre},
    title     = {NAT: Learning to Attack Neurons for Enhanced Adversarial Transferability},
    booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV)},
    month     = {February},
    year      = {2025},
    pages     = {7582-7593}
}

Acknowledgements

We would like thank the authors of CDA who inspired me to work in this direction and for releasing their codebase opensource.

❤️ Thanks for visiting ✨ NAT!

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
__pycache__		__pycache__
data		data
images		images
README.md		README.md
dataset.py		dataset.py
environment.yml		environment.yml
envs.sh		envs.sh
eval.py		eval.py
generator.py		generator.py
hubconf.py		hubconf.py
loss.py		loss.py
temp.py		temp.py
temp.py.txt		temp.py.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NAT: Learning to Attack Neurons for Enhanced Adversarial Transferability, WACV 2025

`Krishna Kanth Nakka` and `Alexandre Alahi`

News

Introduction

Setup

Dataset

Trained Models

Evaluation

Training

Citation

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NAT: Learning to Attack Neurons for Enhanced Adversarial Transferability, WACV 2025

Krishna Kanth Nakka and Alexandre Alahi

News

Introduction

Setup

Dataset

Trained Models

Evaluation

Training

Citation

Acknowledgements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`Krishna Kanth Nakka` and `Alexandre Alahi`

Packages