Abstraction, Generalization, and Embodiment in Neural Program Synthesis
Richard Shin
EECS Department, University of California, Berkeley
Technical Report No. UCB/EECS-2020-164
August 14, 2020
http://www2.eecs.berkeley.edu/Pubs/TechRpts/2020/EECS-2020-164.pdf
Program synthesis, or automatically writing programs from high-level specifications has been a long-standing challenge in computer science and artificial intelligence. Addressing this challenge can help unlock the full power of computing to nontechnical users, assist existing developers on traditional programming tasks, and solve other tasks in artificial intelligence like question answering that are naturally expressible as programs. In recent years, neural methods for program synthesis based on learning have driven significant progress. With this shift, themes like abstraction, generalization, and embodiment that recur in other facets of machine learning provide a natural framework for further improvement. In this dissertation, we present methods to address manifestations of these themes in several concrete instantiations of neural program synthesis.
First, we demonstrate how to better synthesize imperative programs by interacting with the program interpreter environment in the form of predicted execution traces, in a challenging domain for program synthesis called Karel. We also show in empirical studies that generating synthetic data for program synthesis requires significant care to enable models to generalize. In an application of program synthesis from natural language, or semantic parsing, we present attention-based neural architectures that can better encode the natural language specification to enable better generalization to new database domains. In this and other code generation domains, we introduce a method for integrating automatically learned code idioms into the synthesis procedure, learning to automatically switch between multiple levels of abstraction.
Advisors: Dawn Song
BibTeX citation:
@phdthesis{Shin:EECS-2020-164, Author= {Shin, Richard}, Title= {Abstraction, Generalization, and Embodiment in Neural Program Synthesis}, School= {EECS Department, University of California, Berkeley}, Year= {2020}, Month= {Aug}, Url= {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2020/EECS-2020-164.html}, Number= {UCB/EECS-2020-164}, Abstract= {Program synthesis, or automatically writing programs from high-level specifications has been a long-standing challenge in computer science and artificial intelligence. Addressing this challenge can help unlock the full power of computing to nontechnical users, assist existing developers on traditional programming tasks, and solve other tasks in artificial intelligence like question answering that are naturally expressible as programs. In recent years, neural methods for program synthesis based on learning have driven significant progress. With this shift, themes like abstraction, generalization, and embodiment that recur in other facets of machine learning provide a natural framework for further improvement. In this dissertation, we present methods to address manifestations of these themes in several concrete instantiations of neural program synthesis. First, we demonstrate how to better synthesize imperative programs by interacting with the program interpreter environment in the form of predicted execution traces, in a challenging domain for program synthesis called Karel. We also show in empirical studies that generating synthetic data for program synthesis requires significant care to enable models to generalize. In an application of program synthesis from natural language, or semantic parsing, we present attention-based neural architectures that can better encode the natural language specification to enable better generalization to new database domains. In this and other code generation domains, we introduce a method for integrating automatically learned code idioms into the synthesis procedure, learning to automatically switch between multiple levels of abstraction.}, }
EndNote citation:
%0 Thesis %A Shin, Richard %T Abstraction, Generalization, and Embodiment in Neural Program Synthesis %I EECS Department, University of California, Berkeley %D 2020 %8 August 14 %@ UCB/EECS-2020-164 %U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2020/EECS-2020-164.html %F Shin:EECS-2020-164