Behavior of Machine Learning Algorithms in Adversarial Environments

Blaine Nelson

EECS Department, University of California, Berkeley

Technical Report No. UCB/EECS-2010-140

November 23, 2010

http://www2.eecs.berkeley.edu/Pubs/TechRpts/2010/EECS-2010-140.pdf

Machine learning has become a prevalent tool in many computing applications and modern enterprise systems stand to greatly benefit from learning algorithms. However, one concern with learning algorithms is that they may introduce a security fault into the system. The key strengths of learning approaches are their adaptability and ability to infer patterns that can be used for predictions or decision making. However, these assets of learning can potentially be subverted by adversarial manipulation of the learner's environment, which exposes applications that use machine learning techniques to a new class of security vulnerabilities.

I analyze the behavior of learning systems in adversarial environments. My thesis is that learning algorithms are vulnerable to attacks that can transform the learner into a liability for the system they are intended to aid, but by critically analyzing potential security threats, the extent of these threat can be assessed, proper learning techniques can be selected to minimize the adversary's impact, and failures of system can be averted.

I present a systematic approach for identifying and analyzing threats against a machine learning system. I examine real-world learning systems, assess their vulnerabilities, demonstrate real-world attacks against their learning mechanism, and propose defenses that can successful mitigate the effectiveness of such attacks. In doing so, I provide machine learning practitioners with a systematic methodology for assessing a learner's vulnerability and developing defenses to strengthen their system against such threats. Additionally, I also examine and answer theoretical questions about the limits of adversarial contamination and classifier evasion.

Advisors: Anthony D. Joseph

BibTeX citation:

@phdthesis{Nelson:EECS-2010-140,
    Author= {Nelson, Blaine},
    Title= {Behavior of Machine Learning Algorithms in Adversarial Environments},
    School= {EECS Department, University of California, Berkeley},
    Year= {2010},
    Month= {Nov},
    Url= {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2010/EECS-2010-140.html},
    Number= {UCB/EECS-2010-140},
    Abstract= {Machine learning has become a prevalent tool in many computing applications and modern enterprise systems stand to greatly benefit from learning algorithms.  However, one concern with learning algorithms is that they may introduce a security fault into the system.  The key strengths of learning approaches are their adaptability and ability to infer patterns that can be used for predictions or decision making.  However, these assets of learning can potentially be subverted by adversarial manipulation of the learner's environment, which exposes applications that use machine learning techniques to a new class of security vulnerabilities.

I analyze the behavior of learning systems in adversarial environments. My thesis is that learning algorithms are vulnerable to attacks that can transform the learner into a liability for the system they are intended to aid, but by critically analyzing potential security threats, the extent of these threat can be assessed, proper learning techniques can be selected to minimize the adversary's impact, and failures of system can be averted.

I present a systematic approach for identifying and analyzing threats against a machine learning system.  I examine real-world learning systems, assess their vulnerabilities, demonstrate real-world attacks against their learning mechanism, and propose defenses that can successful mitigate the effectiveness of such attacks.  In doing so, I provide machine learning practitioners with a systematic methodology for assessing a learner's vulnerability and developing defenses to strengthen their system against such threats.  Additionally, I also examine and answer theoretical questions about the limits of adversarial contamination and classifier evasion.},
}

EndNote citation:

%0 Thesis
%A Nelson, Blaine 
%T Behavior of Machine Learning Algorithms in Adversarial Environments
%I EECS Department, University of California, Berkeley
%D 2010
%8 November 23
%@ UCB/EECS-2010-140
%U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2010/EECS-2010-140.html
%F Nelson:EECS-2010-140