THOUGHTSCULPT: Reasoning with Intermediate Revision and Search

Yizhou Chi

EECS Department, University of California, Berkeley

Technical Report No. UCB/EECS-2024-55

May 7, 2024

http://www2.eecs.berkeley.edu/Pubs/TechRpts/2024/EECS-2024-55.pdf

We present THOUGHTSCULPT, a general reasoning and search method for tasks with outputs that can be decomposed into components. THOUGHTSCULPT explores a search tree of potential solutions using Monte Carlo Tree Search (MCTS), building solutions one action at a time and evaluating according to any domain-specific heuristic, which in practice is often simply an LLM evaluator. Critically, our action space includes revision actions: THOUGHTSCULPT may choose to revise part of its previous output rather than continuing to build the rest of its output. Empirically, THOUGHTSCULPT outperforms state-of-the-art reasoning methods across three challenging tasks: Story Outline Improvement (up to +30% interestingness), Mini-Crosswords Solving (up to +16% word success rate), and Constrained Generation (up to +10% concept coverage).

Advisors: Daniel Klein

BibTeX citation:

@mastersthesis{Chi:EECS-2024-55,
    Author= {Chi, Yizhou},
    Title= {THOUGHTSCULPT: Reasoning with Intermediate Revision and Search},
    School= {EECS Department, University of California, Berkeley},
    Year= {2024},
    Month= {May},
    Url= {http://www2.eecs.berkeley.edu/Pubs/TechRpts/2024/EECS-2024-55.html},
    Number= {UCB/EECS-2024-55},
    Abstract= {We present THOUGHTSCULPT, a general reasoning and search method for tasks with outputs that can be decomposed into components. THOUGHTSCULPT explores a search tree of potential solutions using Monte Carlo Tree Search (MCTS), building solutions one action at a time and evaluating according to any domain-specific heuristic, which in practice is often simply an LLM evaluator. Critically, our action space includes revision actions: THOUGHTSCULPT may choose to revise part of its previous output rather than continuing to build the rest of its output. Empirically, THOUGHTSCULPT outperforms state-of-the-art reasoning methods across three challenging tasks: Story Outline Improvement (up to +30% interestingness), Mini-Crosswords Solving (up to +16% word success rate), and Constrained Generation (up to +10% concept coverage).},
}

EndNote citation:

%0 Thesis
%A Chi, Yizhou 
%T THOUGHTSCULPT: Reasoning with Intermediate Revision and Search
%I EECS Department, University of California, Berkeley
%D 2024
%8 May 7
%@ UCB/EECS-2024-55
%U http://www2.eecs.berkeley.edu/Pubs/TechRpts/2024/EECS-2024-55.html
%F Chi:EECS-2024-55