Rising Stars 2020:

Ana Marasovic

Postdoctoral Researcher

Allen Institute for AI / University of Washington


PhD '19 Heidelberg University, Germany

Areas of Interest

  • Artificial Intelligence

Poster

Natural Language Rationales with Full-Stack Visual Reasoning

Abstract

Natural language rationales could provide intuitive, higher-level explanations that are easily understandable by humans, complementing the more broadly studied lower-level explanations based on gradients or attention weights. We present the first study focused on generating natural language rationales across several complex visual reasoning tasks: visual commonsense reasoning, visual-textual entailment, and visual question answering. The key challenge of accurate rationalization is comprehensive image understanding at all levels: not just their explicit content at the pixel level, but their contextual contents at the semantic and pragmatic levels. We present Rationale^VT Transformer, an integrated model that learns to generate free-text rationales by combining pretrained language models with object recognition, grounded visual semantic frames, and visual commonsense graphs. Our experiments show that the base pretrained language model benefits from visual adaptation and that free-text rationalization is a promising research direction to complement model interpretability for complex visual-textual reasoning tasks.

Bio

Ana Marasovic is a postdoctoral researcher at the Allen Institute for AI (AllenNLP Team) and the University of Washington (Noah's ARK), working with Noah Smith and Yejin Choi. Her research interests include learning with limited data, evaluation across domains, languages, and linguistic phenomena, and explaining reasoning processes with natural language.

Personal home page