This is my Master’s thesis, completed at EPFL under the supervision of Princeton’s Vikash Sehwag and Prof. Prateek Mittal, and EPFL’s Prof. Troncoso. We show that Vision Transformers (XCiT in particular) can achieve state-of-the-art results in adversarial training on ImageNet.