sueszli's picture
Update README.md
9d7f90b verified
---
license: apache-2.0
---
reproducing: "Ensemble everything everywhere: Multi-scale aggregation for adversarial robustness" (https://arxiv.org/abs/2408.05446)
source code and usage examples: https://github.com/ETH-DISCO/self-ensembling
architecture based on Torchvision's Resnet152 default implementation
hyperparameters:
- criterion: `torch.nn.CrossEntropyLoss()`
- optimizer: `torch.optim.AdamW`
- scaler: `GradScaler`
- datasets: `["cifar10", "cirfar100"]`
- lr: `0.0001`
- num_epochs: `16` (higher would be even better, but maybe by <1%)
- crossmax_k: `2` (difference between `crossmax_k=2` and `crossmax_k=3` is about 1-2%, so it's not a big deal)