ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.14610
  4. Cited By
Dynamic Prompt Learning via Policy Gradient for Semi-structured
  Mathematical Reasoning

Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning

29 September 2022
Pan Lu
Liang Qiu
Kai-Wei Chang
Ying Nian Wu
Song-Chun Zhu
Tanmay Rajpurohit
Peter Clark
Ashwin Kalyan
    ReLM
    LRM
ArXivPDFHTML

Papers citing "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning"

50 / 225 papers shown
Title
Adapting Large Language Models for Education: Foundational Capabilities,
  Potentials, and Challenges
Adapting Large Language Models for Education: Foundational Capabilities, Potentials, and Challenges
Qingyao Li
Lingyue Fu
Weiming Zhang
Xianyu Chen
Jingwei Yu
Wei Xia
Weinan Zhang
Ruiming Tang
Yong Yu
AI4Ed
ELM
41
18
0
27 Dec 2023
A Survey of Reasoning with Foundation Models
A Survey of Reasoning with Foundation Models
Jiankai Sun
Chuanyang Zheng
E. Xie
Zhengying Liu
Ruihang Chu
...
Xipeng Qiu
Yi-Chen Guo
Hui Xiong
Qun Liu
Zhenguo Li
ReLM
LRM
AI4CE
27
76
0
17 Dec 2023
Latent Skill Discovery for Chain-of-Thought Reasoning
Latent Skill Discovery for Chain-of-Thought Reasoning
Zifan Xu
Haozhu Wang
Dmitriy Bespalov
Peter Stone
Yanjun Qi
ReLM
LRM
56
2
0
07 Dec 2023
Prompt Optimization via Adversarial In-Context Learning
Prompt Optimization via Adversarial In-Context Learning
Do Xuan Long
Yiran Zhao
Hannah Brown
Yuxi Xie
James Xu Zhao
Nancy F. Chen
Kenji Kawaguchi
Michael Qizhe Xie
Junxian He
72
11
0
05 Dec 2023
Retrieval-augmented Multi-modal Chain-of-Thoughts Reasoning for Large
  Language Models
Retrieval-augmented Multi-modal Chain-of-Thoughts Reasoning for Large Language Models
Bingshuai Liu
Chenyang Lyu
Zijun Min
Zhanyu Wang
Jinsong Su
Longyue Wang
LRM
33
7
0
04 Dec 2023
LANS: A Layout-Aware Neural Solver for Plane Geometry Problem
LANS: A Layout-Aware Neural Solver for Plane Geometry Problem
Zhong-Zhi Li
Ming-Liang Zhang
Fei Yin
Cheng-Lin Liu
21
11
0
25 Nov 2023
DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in
  Understanding Long Documents with Tabular Data
DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in Understanding Long Documents with Tabular Data
Yilun Zhao
Yitao Long
Hongjun Liu
Linyong Nan
Lyuhao Chen
Ryo Kamoi
Yixin Liu
Xiangru Tang
Rui Zhang
Arman Cohan
31
12
0
16 Nov 2023
ToolTalk: Evaluating Tool-Usage in a Conversational Setting
ToolTalk: Evaluating Tool-Usage in a Conversational Setting
Nicholas Farn
Richard Shin
LLMAG
ELM
37
14
0
15 Nov 2023
Just Ask One More Time! Self-Agreement Improves Reasoning of Language
  Models in (Almost) All Scenarios
Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios
Lei Lin
Jiayi Fu
Pengli Liu
Qingyang Li
Yan Gong
Junchen Wan
Fuzheng Zhang
Zhongyuan Wang
Di Zhang
Kun Gai
LRM
27
7
0
14 Nov 2023
TempTabQA: Temporal Question Answering for Semi-Structured Tables
TempTabQA: Temporal Question Answering for Semi-Structured Tables
Vivek Gupta
Pranshu Kandoi
M. Vora
Shuo Zhang
Yujie He
R. Reinanda
Vivek Srikumar
LMTD
26
16
0
14 Nov 2023
Can Knowledge Graphs Reduce Hallucinations in LLMs? : A Survey
Can Knowledge Graphs Reduce Hallucinations in LLMs? : A Survey
Garima Agrawal
Tharindu Kumarage
Zeyad Alghami
Huanmin Liu
37
81
0
14 Nov 2023
BizBench: A Quantitative Reasoning Benchmark for Business and Finance
BizBench: A Quantitative Reasoning Benchmark for Business and Finance
Rik Koncel-Kedziorski
Michael Krumdick
Viet Dac Lai
Varshini Reddy
Charles Lovering
Chris Tanner
AIMat
35
4
0
11 Nov 2023
Exploring the Numerical Reasoning Capabilities of Language Models: A
  Comprehensive Analysis on Tabular Data
Exploring the Numerical Reasoning Capabilities of Language Models: A Comprehensive Analysis on Tabular Data
Mubashara Akhtar
Abhilash Shankarampeta
Vivek Gupta
Arpit Patil
O. Cocarascu
Elena Simperl
LRM
ReLM
LMTD
ELM
39
21
0
03 Nov 2023
Ask more, know better: Reinforce-Learned Prompt Questions for Decision
  Making with Large Language Models
Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models
Xue Yan
Yan Song
Xinyu Cui
Filippos Christianos
Haifeng Zhang
D. Mguni
Jun Wang
LRM
119
6
0
27 Oct 2023
In-Context Ability Transfer for Question Decomposition in Complex QA
In-Context Ability Transfer for Question Decomposition in Complex QA
Venktesh V
Sourangshu Bhattacharya
Avishek Anand
LRM
ReLM
34
4
0
26 Oct 2023
DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning
  in Language Models
DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models
Ge Zheng
Bin Yang
Jiajin Tang
Hong-Yu Zhou
Sibei Yang
LRM
MLLM
35
93
0
25 Oct 2023
Prompt Engineering Through the Lens of Optimal Control
Prompt Engineering Through the Lens of Optimal Control
Yifan Luo
Yiming Tang
Chengfeng Shen
Zhennan Zhou
Bin Dong
OffRL
38
6
0
22 Oct 2023
Auto-Instruct: Automatic Instruction Generation and Ranking for
  Black-Box Language Models
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
Zhihan Zhang
Shuohang Wang
W. Yu
Yichong Xu
Dan Iter
Qingkai Zeng
Yang Liu
Chenguang Zhu
Meng Jiang
SyDa
ALM
22
22
0
19 Oct 2023
KwaiYiiMath: Technical Report
KwaiYiiMath: Technical Report
Jia-Yi Fu
Lei Lin
Xiaoyang Gao
Pengli Liu
Zhengzong Chen
...
Zijia Lin
Fuzheng Zhang
Zhongyuan Wang
Di Zhang
Kun Gai
LRM
ReLM
RALM
51
2
0
11 Oct 2023
SEER : A Knapsack approach to Exemplar Selection for In-Context HybridQA
SEER : A Knapsack approach to Exemplar Selection for In-Context HybridQA
Jonathan Tonglet
Manon Reusens
Philipp Borchert
Bart Baesens
44
5
0
10 Oct 2023
MuggleMath: Assessing the Impact of Query and Response Augmentation on
  Math Reasoning
MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning
Chengpeng Li
Zheng Yuan
Hongyi Yuan
Guanting Dong
Keming Lu
Jiancan Wu
Chuanqi Tan
Xiang Wang
Chang Zhou
LRM
20
21
0
09 Oct 2023
Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on
  Open-Source Model
Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model
Cheng Qian
Chenyan Xiong
Zhenghao Liu
Zhiyuan Liu
LRM
29
12
0
08 Oct 2023
Large Language Model Cascades with Mixture of Thoughts Representations
  for Cost-efficient Reasoning
Large Language Model Cascades with Mixture of Thoughts Representations for Cost-efficient Reasoning
Murong Yue
Jie Zhao
Min Zhang
Liang Du
Ziyu Yao
LRM
32
55
0
04 Oct 2023
MathVista: Evaluating Mathematical Reasoning of Foundation Models in
  Visual Contexts
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
Pan Lu
Hritik Bansal
Tony Xia
Jiacheng Liu
Chun-yue Li
Hannaneh Hajishirzi
Hao Cheng
Kai-Wei Chang
Michel Galley
Jianfeng Gao
LRM
MLLM
43
503
0
03 Oct 2023
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model
  Collaboration
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration
Qiushi Sun
Zhangyue Yin
Xiang Li
Zhiyong Wu
Xipeng Qiu
Lingpeng Kong
LRM
LLMAG
28
44
0
30 Sep 2023
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Minlie Huang
Nan Duan
Weizhu Chen
LRM
AI4CE
LLMAG
58
145
0
29 Sep 2023
CRAFT: Customizing LLMs by Creating and Retrieving from Specialized
  Toolsets
CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets
Lifan Yuan
Yangyi Chen
Xingyao Wang
Yi Ren Fung
Hao Peng
Heng Ji
LLMAG
KELM
27
58
0
29 Sep 2023
NLPBench: Evaluating Large Language Models on Solving NLP Problems
NLPBench: Evaluating Large Language Models on Solving NLP Problems
Linxin Song
Jieyu Zhang
Lechao Cheng
Pengyuan Zhou
Dinesh Manocha
Irene Z Li
ELM
LM&MA
LRM
36
10
0
27 Sep 2023
Navigate through Enigmatic Labyrinth A Survey of Chain of Thought
  Reasoning: Advances, Frontiers and Future
Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future
Zheng Chu
Jingchang Chen
Qianglong Chen
Weijiang Yu
Tao He
Haotian Wang
Weihua Peng
Ming-Yu Liu
Bing Qin
Ting Liu
LRM
AI4CE
31
151
0
27 Sep 2023
Are Human-generated Demonstrations Necessary for In-context Learning?
Are Human-generated Demonstrations Necessary for In-context Learning?
Rui Li
Guoyin Wang
Jiwei Li
LRM
20
12
0
26 Sep 2023
CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection
CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection
Jiajin Tang
Ge Zheng
Jingyi Yu
Sibei Yang
ObjD
21
22
0
03 Sep 2023
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with
  Code-based Self-Verification
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification
Aojun Zhou
Ke Wang
Zimu Lu
Weikang Shi
Sichun Luo
...
Shaoqing Lu
Anya Jia
Linqi Song
Mingjie Zhan
Hongsheng Li
ReLM
LRM
36
145
0
15 Aug 2023
Through the Lens of Core Competency: Survey on Evaluation of Large
  Language Models
Through the Lens of Core Competency: Survey on Evaluation of Large Language Models
Ziyu Zhuang
Qiguang Chen
Longxuan Ma
Mingda Li
Yi Han
Yushan Qian
Haopeng Bai
Zixian Feng
Weinan Zhang
Ting Liu
ELM
26
9
0
15 Aug 2023
Forward-Backward Reasoning in Large Language Models for Mathematical
  Verification
Forward-Backward Reasoning in Large Language Models for Mathematical Verification
Weisen Jiang
Han Shi
L. Yu
Zheng Liu
Yu Zhang
Zhenguo Li
James T. Kwok
LRM
45
25
0
15 Aug 2023
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language
  Models
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Cheng-Yu Hsieh
Sibei Chen
Chun-Liang Li
Yasuhisa Fujii
Alexander Ratner
Chen-Yu Lee
Ranjay Krishna
Tomas Pfister
LLMAG
SyDa
46
41
0
01 Aug 2023
Counterfactual Explanation Policies in RL
Counterfactual Explanation Policies in RL
Shripad Deshmukh
R Srivatsan
Supriti Vijay
Jayakumar Subramanian
Chirag Agarwal
OffRL
35
0
0
25 Jul 2023
CohortGPT: An Enhanced GPT for Participant Recruitment in Clinical Study
CohortGPT: An Enhanced GPT for Participant Recruitment in Clinical Study
Zihan Guan
Zihao Wu
Zheng Liu
Dufan Wu
Hui Ren
Quanzheng Li
Xiang Li
Ninghao Liu
LM&MA
25
25
0
21 Jul 2023
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities
  of Large Language Models
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models
Xiaoxuan Wang
Ziniu Hu
Pan Lu
Yanqiao Zhu
Jieyu Zhang
Satyen Subramaniam
Arjun R. Loomba
Shichang Zhang
Yizhou Sun
Wei Wang
ELM
LRM
30
86
0
20 Jul 2023
AutoHint: Automatic Prompt Optimization with Hint Generation
AutoHint: Automatic Prompt Optimization with Hint Generation
Hong Sun
Xue Li
Yi Xu
Youkow Homma
Qinhao Cao
Min-man Wu
Jian Jiao
Denis Xavier Charles
34
23
0
13 Jul 2023
A Survey on Multimodal Large Language Models
A Survey on Multimodal Large Language Models
Shukang Yin
Chaoyou Fu
Sirui Zhao
Ke Li
Xing Sun
Tong Xu
Enhong Chen
MLLM
LRM
54
556
0
23 Jun 2023
ToolQA: A Dataset for LLM Question Answering with External Tools
ToolQA: A Dataset for LLM Question Answering with External Tools
Yuchen Zhuang
Yue Yu
Kuan-Chieh Jackson Wang
Haotian Sun
Chao Zhang
ELM
LLMAG
24
216
0
23 Jun 2023
Coverage-based Example Selection for In-Context Learning
Coverage-based Example Selection for In-Context Learning
Shivanshu Gupta
Matt Gardner
Sameer Singh
26
40
0
24 May 2023
RetICL: Sequential Retrieval of In-Context Examples with Reinforcement
  Learning
RetICL: Sequential Retrieval of In-Context Examples with Reinforcement Learning
Alexander Scarlatos
Andrew S. Lan
OffRL
LRM
27
20
0
23 May 2023
CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning
  of Large Language Models
CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models
Cheng Qian
Chi Han
Yi Ren Fung
Yujia Qin
Zhiyuan Liu
Heng Ji
LRM
18
30
0
23 May 2023
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with
  Customized Exercise Generation
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation
Zhenwen Liang
W. Yu
Tanmay Rajpurohit
Peter Clark
Xiangliang Zhang
Ashwin Kaylan
32
37
0
22 May 2023
TheoremQA: A Theorem-driven Question Answering dataset
TheoremQA: A Theorem-driven Question Answering dataset
Wenhu Chen
Ming Yin
Max W.F. Ku
Pan Lu
Yixin Wan
Xueguang Ma
Jianyu Xu
Xinyi Wang
Tony Xia
AIMat
38
119
0
21 May 2023
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive
  Critiquing
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Nan Duan
Weizhu Chen
KELM
LRM
36
357
0
19 May 2023
Large Language Model Guided Tree-of-Thought
Large Language Model Guided Tree-of-Thought
Jieyi Long
LM&Ro
LRM
11
185
0
15 May 2023
Read, Diagnose and Chat: Towards Explainable and Interactive
  LLMs-Augmented Depression Detection in Social Media
Read, Diagnose and Chat: Towards Explainable and Interactive LLMs-Augmented Depression Detection in Social Media
Wei Qin
Zetong Chen
Lei Wang
Yunshi Lan
Wei Ren
Richang Hong
AI4MH
30
18
0
09 May 2023
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning
  by Large Language Models
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models
Lei Wang
Wanyu Xu
Yihuai Lan
Zhiqiang Hu
Yunshi Lan
Roy Ka-Wei Lee
Ee-Peng Lim
ReLM
LRM
34
314
0
06 May 2023
Previous
12345
Next