DI-engine/dizoo/tabmwp/README.md · zjowowen/gomoku at db8742405c3f9914b89bccd20dc84c0e3c449e48

TabMWP Env

Dataset

The TabMWP dataset contains 38,431 tabular math word problems. Each question in TabMWP is aligned with a tabular context, which is presented as an image, semi-structured text, and a structured table. There are two types of questions: free-text and multi-choice, and each problem is annotated with gold solutions to reveal the multi-step reasoning process.

The environment is described in the paper Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning by Pan Lu, Liang Qiu, Kai-Wei Chang, Ying Nian Wu, Song-Chun Zhu, Tanmay Rajpurohit, Peter Clark, Ashwin Kalyan, 2023.

You can find more details in Prompt PG

Benchmark

We collect the responses of GPT-3 using a reduced dataset with 80 training samples and 16 candidates. In this way, there is no need for users to interact with GPT-3 using the API-key of openai.
You can directly reproduce the benchmark by running python dizoo/tabmwp/configs/tabmwp_pg_config.py