Reverse Operation based Data Augmentation for Solving Math Word Problems
- URL: http://arxiv.org/abs/2010.01556v2
- Date: Wed, 10 Nov 2021 16:12:16 GMT
- Title: Reverse Operation based Data Augmentation for Solving Math Word Problems
- Authors: Qianying Liu, Wenyu Guan, Sujian Li, Fei Cheng, Daisuke Kawahara and
Sadao Kurohashi
- Abstract summary: Recent models have reached their performance bottleneck and require more high-quality data for training.
We propose a novel data augmentation method that reverses the mathematical logic of math word problems.
We apply the augmented data on two SOTA math word problem solving models and compare our results with a strong data augmentation baseline.
- Score: 37.26159426631031
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Automatically solving math word problems is a critical task in the field of
natural language processing. Recent models have reached their performance
bottleneck and require more high-quality data for training. We propose a novel
data augmentation method that reverses the mathematical logic of math word
problems to produce new high-quality math problems and introduce new knowledge
points that can benefit learning the mathematical reasoning logic. We apply the
augmented data on two SOTA math word problem solving models and compare our
results with a strong data augmentation baseline. Experimental results show the
effectiveness of our approach. We release our code and data at
https://github.com/yiyunya/RODA.
Related papers
- ControlMath: Controllable Data Generation Promotes Math Generalist Models [38.0858432336873]
We propose ControlMath, an iterative method involving an equation-generator module and two LLM-based agents.
The module creates diverse equations, which the Problem-Crafter agent then transforms into math word problems.
We collect ControlMathQA, which involves 190k math word problems.
arXiv Detail & Related papers (2024-09-20T03:58:26Z) - AI-Assisted Generation of Difficult Math Questions [78.7547836422727]
Current training positions mathematical reasoning as a core capability.
There is unmet demand for diverse and challenging math questions.
We present a design framework that combines the strengths of LLMs with a human-in-the-loop approach.
arXiv Detail & Related papers (2024-07-30T17:55:36Z) - Lean Workbook: A large-scale Lean problem set formalized from natural language math problems [50.22847430754973]
Large language models are not good at math theorem proving using formal languages like Lean.
A significant challenge in this area is the scarcity of training data available in these formal languages.
We propose a novel pipeline that iteratively generates and filters synthetic data to translate natural language mathematical problems into Lean 4 statements.
arXiv Detail & Related papers (2024-06-06T08:25:43Z) - MathScale: Scaling Instruction Tuning for Mathematical Reasoning [70.89605383298331]
Large language models (LLMs) have demonstrated remarkable capabilities in problem-solving.
However, their proficiency in solving mathematical problems remains inadequate.
We propose MathScale, a simple and scalable method to create high-quality mathematical reasoning data.
arXiv Detail & Related papers (2024-03-05T11:42:59Z) - MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning [2.9104279358536647]
We present MathSensei, a tool-augmented large language model for mathematical reasoning.
We study the complementary benefits of the tools - knowledge retriever (Bing Web Search), program generator + executor (Python), and symbolic equation solver (Wolfram-Alpha API)
arXiv Detail & Related papers (2024-02-27T05:50:35Z) - Solving Math Word Problems by Combining Language Models With Symbolic
Solvers [28.010617102877923]
Large language models (LLMs) can be combined with external tools to perform complex reasoning and calculation.
We propose an approach that combines an LLM that can incrementally formalize word problems as a set of variables and equations with an external symbolic solver.
Our approach achieves comparable accuracy to the original PAL on the GSM8K benchmark of math word problems and outperforms PAL by an absolute 20% on ALGEBRA.
arXiv Detail & Related papers (2023-04-16T04:16:06Z) - Heterogeneous Line Graph Transformer for Math Word Problems [21.4761673982334]
This paper describes the design and implementation of a new machine learning model for online learning systems.
We aim at improving the intelligent level of the systems by enabling an automated math word problem solver.
arXiv Detail & Related papers (2022-08-11T05:27:05Z) - Measuring Mathematical Problem Solving With the MATH Dataset [55.4376028963537]
We introduce MATH, a dataset of 12,500 challenging competition mathematics problems.
Each problem has a full step-by-step solution which can be used to teach models to generate answer derivations and explanations.
We also contribute a large auxiliary pretraining dataset which helps teach models the fundamentals of mathematics.
arXiv Detail & Related papers (2021-03-05T18:59:39Z) - SMART: A Situation Model for Algebra Story Problems via Attributed
Grammar [74.1315776256292]
We introduce the concept of a emphsituation model, which originates from psychology studies to represent the mental states of humans in problem-solving.
We show that the proposed model outperforms all previous neural solvers by a large margin while preserving much better interpretability.
arXiv Detail & Related papers (2020-12-27T21:03:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.