First-Order Open-Universe POMDPs: Formulation and Algorithms

Siddharth Srivastava, Xiang Cheng, Stuart J. Russell, and Avi Pfeffer

EECS Department, University of California, Berkeley

Technical Report No. UCB/EECS-2013-243

December 25, 2013

http://www2.eecs.berkeley.edu/Pubs/TechRpts/2013/EECS-2013-243.pdf

Interest in relational and first-order languages for probability models has grown rapidly in recent years, and with it the possibility of extending such languages to handle decision processes---both fully and partially observable. We examine the problem of extending a first-order, open-universe language to describe POMDPs and identify non-trivial representational issues in describing an agent's capability for observation and action---issues that were avoided in previous work only by making strong and restrictive assumptions. We present a method for representing actions and observations that respects formal specifications of the sensors and actuators available to an agent, and show how to handle cases---such as seeing an object and picking it up---that could not previously be represented. Finally, we argue that in many cases open-universe POMDPs require belief-state policies rather than automata policies. We present an algorithm and experimental results for evaluating such policies for open-universe POMDPs.

BibTeX citation:

@techreport{Srivastava:EECS-2013-243,
    Author= {Srivastava, Siddharth and Cheng, Xiang and Russell, Stuart J. and Pfeffer, Avi},
    Title= {First-Order Open-Universe POMDPs: Formulation and Algorithms},
    Year= {2013},
    Month= {Dec},
    Url= {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2013/EECS-2013-243.html},
    Number= {UCB/EECS-2013-243},
    Abstract= { Interest in relational and first-order languages for probability
  models has grown rapidly in recent years, and with it the
  possibility of extending such languages to handle decision
  processes---both fully and partially observable.  We examine the
  problem of extending a first-order, open-universe language to
  describe POMDPs and identify non-trivial representational issues in
  describing an agent's capability for observation and action---issues
  that were avoided in previous work only by making strong and
  restrictive assumptions. We present a method for representing
  actions and observations that respects formal specifications of the
  sensors and actuators available to an agent, and show how to handle
  cases---such as seeing an object and picking it up---that could not
  previously be represented. Finally, we argue that in many cases
  open-universe POMDPs require belief-state policies rather than
  automata policies. We present an algorithm and experimental results
  for evaluating such policies for open-universe POMDPs.},
}

EndNote citation:

%0 Report
%A Srivastava, Siddharth
%A Cheng, Xiang
%A Russell, Stuart J.
%A Pfeffer, Avi
%T First-Order Open-Universe POMDPs: Formulation and Algorithms
%I EECS Department, University of California, Berkeley
%D 2013
%8 December 25
%@ UCB/EECS-2013-243
%U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2013/EECS-2013-243.html
%F Srivastava:EECS-2013-243