
Prepare target models before running attacks #249

Merged — 9 commits merged into main from mzweilin/prepare_model on May 14, 2024
Conversation

@mzweilin (Contributor) commented on May 2, 2024

What does this PR do?

This PR adds two preparations before running attacks in an external Lightning pipeline.

  1. Turn off the PyTorch inference mode, so that we can create perturbation variables that require gradients.
  2. Switch the target model to training mode, except for BatchNorm and Dropout layers, when we have to borrow training_step() (see the sketch below).
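
As a rough, self-contained sketch of what these two preparations amount to (illustrative only; the function name and the attack loop placeholder are not MART's actual API):

```python
import torch
from torch import nn


def prepare_and_run_attack(target_model: nn.Module, batch):
    # 1. Leave inference mode so the perturbation can require gradients.
    with torch.inference_mode(False):
        was_training = target_model.training
        # 2. Borrow training mode so training_step()-style code returns a loss,
        #    but keep BatchNorm/Dropout in eval mode so running statistics and
        #    dropout masks are unaffected by the attack.
        target_model.train(True)
        for m in target_model.modules():
            if isinstance(m, (nn.BatchNorm1d, nn.BatchNorm2d, nn.BatchNorm3d, nn.Dropout)):
                m.eval()
        try:
            x, _y = batch
            perturbation = torch.zeros_like(x, requires_grad=True)
            # ... run the attack's optimization loop on `perturbation` here ...
        finally:
            target_model.train(was_training)
    return perturbation
```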

Type of change

Please check all relevant options.

  • Improvement (non-breaking)
  • Bug fix (non-breaking)
  • New feature (non-breaking)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

Testing

Please describe the tests that you ran to verify your changes. Consider listing any relevant details of your test configuration.

  • pytest
  • CUDA_VISIBLE_DEVICES=0 python -m mart experiment=CIFAR10_CNN_Adv trainer=gpu trainer.precision=16 reports 70% (21 sec/epoch).
  • CUDA_VISIBLE_DEVICES=0,1 python -m mart experiment=CIFAR10_CNN_Adv trainer=ddp trainer.precision=16 trainer.devices=2 model.optimizer.lr=0.2 trainer.max_steps=2925 datamodule.ims_per_batch=256 datamodule.world_size=2 reports 70% (14 sec/epoch).

Before submitting

  • The title is self-explanatory and the description concisely explains the PR
  • My PR does only one thing, instead of bundling different changes together
  • I list all the breaking changes introduced by this pull request
  • I have commented my code
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes
  • I have run pre-commit hooks with pre-commit run -a command without errors

Did you have fun?

Make sure you had fun coding 🙃

Base automatically changed from mzweilin/add_batch_c15n_instances to main May 13, 2024 20:52
@mzweilin requested a review from dxoigmn on May 14, 2024 18:33
@@ -151,6 +151,8 @@ def configure_gradient_clipping(
         for group in optimizer.param_groups:
             self.gradient_modifier(group["params"])

+    # Turn off the inference mode, so we will create perturbation that requires gradient.
+    @torch.inference_mode(False)
Contributor

Why is this necessary now? I thought PL manages this already?

Contributor Author

Anomalib turns on the inference mode when we run anomalib test.

MART's trainer turns off the inference mode by default, as in

inference_mode: False

But Anomalib has its own trainer.
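
For context, a minimal standalone illustration (not MART's actual code path) of why the attack needs inference mode off: a perturbation cannot require gradients under inference mode, but a function decorated with torch.inference_mode(False) runs with it disabled even when the surrounding evaluation loop enables it.

```python
import torch


@torch.inference_mode(False)
def run_attack(x):
    # Clone the input: tensors produced under inference mode cannot take
    # part in autograd-recorded computations outside of it.
    x = x.clone()
    # With inference mode off, the perturbation can require gradients.
    delta = torch.zeros_like(x, requires_grad=True)
    loss = ((x + delta) ** 2).sum()
    loss.backward()
    return delta.grad


with torch.inference_mode():  # e.g. what an external test loop might enable
    print(run_attack(torch.randn(4)))
```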

self.training = self.module.training
self.module.train(True)
# Set some children modules of "excludes" to eval mode instead.
self.selective_eval_mode("", self.module, self.excludes)
Contributor

What is going on with the empty string?

Contributor Author

We don't know the variable name of the model, so the module path starts with a dot. This is for debug logging only, which prints messages like this:

Set .model.student_model.feature_extractor.layer3[1].bn1: BatchNorm2d to eval mode.
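
A rough sketch of how such a recursive helper could build that dotted path for logging (a hypothetical free-function version; the actual method in this PR may differ):

```python
import logging

from torch import nn

logger = logging.getLogger(__name__)


def selective_eval_mode(path, module, excludes):
    # Walk the module tree; the root is passed as "" because the model's
    # variable name is unknown, which is why logged paths start with a dot.
    for name, child in module.named_children():
        child_path = f"{path}.{name}"
        if isinstance(child, excludes):
            logger.debug("Set %s: %s to eval mode.", child_path, type(child).__name__)
            child.eval()
        else:
            selective_eval_mode(child_path, child, excludes)
```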

with MonkeyPatch(pl_module, "log", lambda *args, **kwargs: None):
    outputs = pl_module.training_step(batch, dataloader_idx)
with training_mode(
Contributor

What is the use case here? Are you seeing train-specific code diverging from eval-specific code in some use case?

Contributor Author

Yes. Many model implementations return the prediction in eval mode and the loss in training mode.

In our use case, anomalib test runs the model in eval mode, so we won't get the loss.
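
A toy example of that divergence (not taken from Anomalib or MART): the same module returns a loss only in training mode, so the attack has to borrow training mode to get something to optimize.

```python
import torch
import torch.nn.functional as F
from torch import nn


class ToyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Linear(8, 2)

    def forward(self, x, y=None):
        logits = self.net(x)
        if self.training:
            # Training mode: return the loss an attack can maximize.
            return F.cross_entropy(logits, y)
        # Eval mode: return predictions only; there is no loss to attack.
        return logits.argmax(dim=-1)


model = ToyModel()
x, y = torch.randn(4, 8), torch.randint(0, 2, (4,))
model.eval()
print(model(x))     # predictions
model.train()
print(model(x, y))  # loss
```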

@mzweilin requested a review from dxoigmn on May 14, 2024 22:25
@mzweilin merged commit c117823 into main on May 14, 2024
5 checks passed
@mzweilin deleted the mzweilin/prepare_model branch on May 14, 2024 23:22