This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\affiliations

1Association for the Advancement of Artificial Intelligence
1900 Embarcadero Road, Suite 101
Palo Alto, California 94303-3310 USA
[email protected]

Assessing the Creativity of LLMs in Mathematical Problem Solving

Written by AAAI Press Staff1
AAAI Style Contributions by Pater Patel Schneider, Sunil Issar,
J. Scott Penberthy, George Ferguson, Hans Guesgen, Francisco Cruz\equalcontrib, Marc Pujol-Gonzalez\equalcontrib
With help from the AAAI Publications Committee.
Abstract

This study investigates the creative potential of Large Language Models (LLMs) in mathematical reasoning, an area previously under-explored. We propose a novel framework and benchmark, incorporating problems from middle school to Olympic-level competitions, to evaluate LLMs’ ability to generate novel solutions, employ multi-stage methods, and provide insightful reasoning. Our experiments reveal that while LLMs excel in standard mathematical tasks, their creative problem-solving abilities vary significantly. Notably, the Gemini-1.5-Pro model excelled in producing novel solutions across all tested LLMs. This research pioneers a new direction in assessing AI creativity, highlighting both the strengths and limitations of LLMs in mathematical innovation, and paves the way for future advancements in AI-driven mathematical discovery.