THOUGHTSCULPT: Reasoning with Intermediate Revision and Search
Yizhou Chi
EECS Department, University of California, Berkeley
Technical Report No. UCB/EECS-2024-55
May 7, 2024
http://www2.eecs.berkeley.edu/Pubs/TechRpts/2024/EECS-2024-55.pdf
We present THOUGHTSCULPT, a general reasoning and search method for tasks with outputs that can be decomposed into components. THOUGHTSCULPT explores a search tree of potential solutions using Monte Carlo Tree Search (MCTS), building solutions one action at a time and evaluating according to any domain-specific heuristic, which in practice is often simply an LLM evaluator. Critically, our action space includes revision actions: THOUGHTSCULPT may choose to revise part of its previous output rather than continuing to build the rest of its output. Empirically, THOUGHTSCULPT outperforms state-of-the-art reasoning methods across three challenging tasks: Story Outline Improvement (up to +30% interestingness), Mini-Crosswords Solving (up to +16% word success rate), and Constrained Generation (up to +10% concept coverage).
Advisors: Daniel Klein
BibTeX citation:
@mastersthesis{Chi:EECS-2024-55,
Author= {Chi, Yizhou},
Title= {THOUGHTSCULPT: Reasoning with Intermediate Revision and Search},
School= {EECS Department, University of California, Berkeley},
Year= {2024},
Month= {May},
Url= {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2024/EECS-2024-55.html},
Number= {UCB/EECS-2024-55},
Abstract= {We present THOUGHTSCULPT, a general reasoning and search method for tasks with outputs that can be decomposed into components. THOUGHTSCULPT explores a search tree of potential solutions using Monte Carlo Tree Search (MCTS), building solutions one action at a time and evaluating according to any domain-specific heuristic, which in practice is often simply an LLM evaluator. Critically, our action space includes revision actions: THOUGHTSCULPT may choose to revise part of its previous output rather than continuing to build the rest of its output. Empirically, THOUGHTSCULPT outperforms state-of-the-art reasoning methods across three challenging tasks: Story Outline Improvement (up to +30% interestingness), Mini-Crosswords Solving (up to +16% word success rate), and Constrained Generation (up to +10% concept coverage).},
}
EndNote citation:
%0 Thesis %A Chi, Yizhou %T THOUGHTSCULPT: Reasoning with Intermediate Revision and Search %I EECS Department, University of California, Berkeley %D 2024 %8 May 7 %@ UCB/EECS-2024-55 %U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2024/EECS-2024-55.html %F Chi:EECS-2024-55