Wonjune Kang

wjkang@mit.edu

Scholar Title

MIT | IBM-Watson Undergraduate Research and Innovation Scholar

Research Title

Certified Robustness of Neural Networks for Top-k Predictions

Cohort

2019–2020

Department

EECS

Research Areas

Artificial Intelligence and Machine Learning

Supervisor

Luca Daniel

dluca@mit.edu

Abstract

Deep learning models are fast becoming the method of choice for a variety of machine learning tasks in fields such as vision, language, and audio. As their prominence has increased, it has become increasingly important to examine their vulnerabilities in various adversarial settings. One line of research, called certified robustness, aims to find the largest possible perturbation to a neural network’s input that will guarantee that the model’s output will not change. For classification models, pre-existing methods are able to compute these values for top-1 predictions; that is, they can compute bounds for which a model’s top prediction will not change. However, in many applications, top-k predictions (the set of k most likely labels) are a more relevant evaluation metric. This project aims to extend these baseline methods to the top-k case by computing perturbation bounds for which the model’s original top prediction will continue to stay in its top k predictions. In addition, it will aim to extend baseline methods to other tasks such as object detection.

Quote

“Despite having shown promise in many applications, AI has yet to be universally accepted as a force for good; I believe this is largely due to difficulties in understanding the inner workings of modern deep learning methods. I am participating in SuperUROP because I would like to contribute to a structured, long-term project that works to address these issues, such as robustness, interpretability, and fairness. Through this research, I hope to gain experience and become more familiar with the work in this area.”

Back to Scholars