John F. Canny and Biye Jiang and Ryan Casey and Jian Qiao and Tian Liu and Marc Capelo

EECS Department, University of California, Berkeley

Technical Report No. UCB/EECS-2016-61

May 12, 2016

http://www2.eecs.berkeley.edu/Pubs/TechRpts/2016/EECS-2016-61.pdf

This capstone project seeks to explore the usefulness of interactive visualizations for machine learning. Specifically, we look at online search advertisements, which represent one of the biggest use cases of the recent Big Data boom. We develop a tool aimed to help Ad Operations teams at search engine companies tune parameters for their auction process in an effort to balance tradeoffs between profit, advertiser satisfaction, and user satisfaction. Often times, business decision makers treat machine learning algorithms as a black box, as it is difficult to see what is going on underneath the hood. This project seeks to better inform the user of what is happening by shortening the feedback loop, thus allowing Ad Operations teams to quickly tune models and deploy changes. We accomplish this using a GPU-accelerated machine learning library called BIDMach for click rate prediction, and visualizations developed with a high performance JavaScript library called D3.js. This paper discusses the challenges in using Sparse Factor Analysis for click rate prediction, and how we turned to a Latent Dirichlet Allocation model to achieve better results. We also discuss the system architecture and technological choices for the visualization, and the challenges we faced in connecting it with the backend auction simulation.

Advisors: John F. Canny


BibTeX citation:

@mastersthesis{Canny:EECS-2016-61,
    Author= {Canny, John F. and Jiang, Biye and Casey, Ryan and Qiao, Jian and Liu, Tian and Capelo, Marc},
    Title= {Data Modeling and Interactive Visualization for Advertisement Auction Modeling},
    School= {EECS Department, University of California, Berkeley},
    Year= {2016},
    Month= {May},
    Url= {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2016/EECS-2016-61.html},
    Number= {UCB/EECS-2016-61},
    Abstract= {This capstone project seeks to explore the usefulness of interactive visualizations for machine learning. Specifically, we look at online search advertisements, which represent one of the biggest use cases of the recent Big Data boom. We develop a tool aimed to help Ad Operations teams at search engine companies tune parameters for their auction process in an effort to balance tradeoffs between profit, advertiser satisfaction, and user satisfaction. Often times, business decision makers treat machine learning algorithms as a black box, as it is difficult to see what is going on underneath the hood. This project seeks to better inform the user of what is happening by shortening the feedback loop, thus allowing Ad Operations teams to quickly tune models and deploy changes. We accomplish this using a GPU-accelerated machine learning library called BIDMach for click rate prediction, and visualizations developed with a high performance JavaScript library called D3.js. This paper discusses the challenges in using Sparse Factor Analysis for click rate prediction, and how we turned to a Latent Dirichlet Allocation model to achieve better results. We also discuss the system architecture and technological choices for the visualization, and the challenges we faced in connecting it with the backend auction simulation.},
}

EndNote citation:

%0 Thesis
%A Canny, John F. 
%A Jiang, Biye 
%A Casey, Ryan 
%A Qiao, Jian 
%A Liu, Tian 
%A Capelo, Marc 
%T Data Modeling and Interactive Visualization for Advertisement Auction Modeling
%I EECS Department, University of California, Berkeley
%D 2016
%8 May 12
%@ UCB/EECS-2016-61
%U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2016/EECS-2016-61.html
%F Canny:EECS-2016-61