The MathOdyssey dataset was meticulously designed to evaluate the mathematical reasoning capabilities of large language models (LLMs). The creation process involved structured stages, including expert recruitment, problem development, review,…
MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data
