Angelic Hierarchical Planning: Optimal and Online Algorithms

Bhaskara Marthi and Stuart J. Russell and Jason Wolfe

EECS Department, University of California, Berkeley

Technical Report No. UCB/EECS-2008-150

December 6, 2008

http://www2.eecs.berkeley.edu/Pubs/TechRpts/2008/EECS-2008-150.pdf

High-level actions (HLAs) are essential tools for coping with the large search spaces and long decision horizons encountered in real-world decision making. In a recent paper, we proposed an "angelic" semantics for HLAs that supports proofs that a high-level plan will (or will not) achieve a goal, without ﬁrst reducing the plan to primitive action sequences. This paper extends the angelic semantics with cost information to support proofs that a high-level plan is (or is not) optimal. We describe the Angelic Hierarchical A* algorithm, which generates provably optimal plans, and show its advantages over alternative algorithms. We also present the Angelic Hierarchical Learning Real-Time A* algorithm for situated agents, one of the ﬁrst algorithms to do hierarchical lookahead in an online setting. Since high-level plans are much shorter, this algorithm can look much farther ahead than previous algorithms (and thus choose much better actions) for a given amount of computational effort. This is an extended version of a paper by the same name appearing in ICAPS '08.

BibTeX citation:

@techreport{Marthi:EECS-2008-150,
    Author= {Marthi, Bhaskara and Russell, Stuart J. and Wolfe, Jason},
    Title= {Angelic Hierarchical Planning: Optimal and Online Algorithms},
    Year= {2008},
    Month= {Dec},
    Url= {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2008/EECS-2008-150.html},
    Number= {UCB/EECS-2008-150},
    Abstract= {High-level actions (HLAs) are essential tools for coping with the large search spaces and long decision 
horizons encountered in real-world decision making. In a recent paper, we proposed an "angelic" semantics 
for HLAs that supports proofs that a high-level plan will (or will not) achieve a goal, without ﬁrst reducing the 
plan to primitive action sequences. This paper extends the angelic semantics with cost information to support 
proofs that a high-level plan is (or is not) optimal. We describe the Angelic Hierarchical A* algorithm, which 
generates provably optimal plans, and show its advantages over alternative algorithms. We also present the 
Angelic Hierarchical Learning Real-Time A* algorithm for situated agents, one of the ﬁrst algorithms to do 
hierarchical lookahead in an online setting. Since high-level plans are much shorter, this algorithm can look 
much farther ahead than previous algorithms (and thus choose much better actions) for a given amount of 
computational effort. This is an extended version of a paper by the same name appearing in ICAPS '08.},
}

EndNote citation:

%0 Report
%A Marthi, Bhaskara 
%A Russell, Stuart J. 
%A Wolfe, Jason 
%T Angelic Hierarchical Planning: Optimal and Online Algorithms
%I EECS Department, University of California, Berkeley
%D 2008
%8 December 6
%@ UCB/EECS-2008-150
%U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2008/EECS-2008-150.html
%F Marthi:EECS-2008-150