ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.12588
  4. Cited By
Program of Thoughts Prompting: Disentangling Computation from Reasoning
  for Numerical Reasoning Tasks
v1v2v3v4 (latest)

Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks

22 November 2022
Wenhu Chen
Xueguang Ma
Xinyi Wang
William W. Cohen
    ReLMReCodLRM
ArXiv (abs)PDFHTMLGithub (274★)

Papers citing "Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks"

50 / 634 papers shown
Title
IW-Bench: Evaluating Large Multimodal Models for Converting Image-to-Web
IW-Bench: Evaluating Large Multimodal Models for Converting Image-to-Web
Hongcheng Guo
Wei Zhang
Junhao Chen
Yaonan Gu
Jian Yang
...
Binyuan Hui
Tianyu Liu
Jianxin Ma
Chang Zhou
Zhoujun Li
49
4
0
14 Sep 2024
Expediting and Elevating Large Language Model Reasoning via Hidden
  Chain-of-Thought Decoding
Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding
Tianqiao Liu
Zui Chen
Zitao Liu
Mi Tian
Weiqi Luo
LRM
70
3
0
13 Sep 2024
TravelAgent: An AI Assistant for Personalized Travel Planning
TravelAgent: An AI Assistant for Personalized Travel Planning
Aili Chen
Xuyang Ge
Ziquan Fu
Yanghua Xiao
Jiangjie Chen
LLMAG
74
13
0
12 Sep 2024
Self-Harmonized Chain of Thought
Self-Harmonized Chain of Thought
Ziqi Jin
Wei Lu
LRM
90
3
0
06 Sep 2024
VProChart: Answering Chart Question through Visual Perception Alignment Agent and Programmatic Solution Reasoning
VProChart: Answering Chart Question through Visual Perception Alignment Agent and Programmatic Solution Reasoning
Muye Huang
L. Zhang
Lai Han
Wenjun Wu
Xinyu Zhang
Jun Liu
82
1
0
03 Sep 2024
SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large
  Language Models
SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models
Dian Yu
Baolin Peng
Ye Tian
Linfeng Song
Haitao Mi
Dong Yu
ALMLRM
73
3
0
28 Aug 2024
CodeGraph: Enhancing Graph Reasoning of LLMs with Code
CodeGraph: Enhancing Graph Reasoning of LLMs with Code
Qiaolong Cai
Zhaowei Wang
Shizhe Diao
James Kwok
Yangqiu Song
LRM
119
4
0
25 Aug 2024
Path-Consistency: Prefix Enhancement for Efficient Inference in LLM
Path-Consistency: Prefix Enhancement for Efficient Inference in LLM
Jiace Zhu
Yingtao Shen
Jie Zhao
An Zou
LLMAGLRM
113
4
0
25 Aug 2024
Is Functional Correctness Enough to Evaluate Code Language Models?
  Exploring Diversity of Generated Codes
Is Functional Correctness Enough to Evaluate Code Language Models? Exploring Diversity of Generated Codes
Heejae Chon
Seonghyeon Lee
Jinyoung Yeo
Dongha Lee
ALM
69
1
0
24 Aug 2024
Understanding Defects in Generated Codes by Language Models
Understanding Defects in Generated Codes by Language Models
Ali Mohammadi Esfahani
N. Kahani
S. Ajila
92
1
0
23 Aug 2024
Fine-tuning Smaller Language Models for Question Answering over
  Financial Documents
Fine-tuning Smaller Language Models for Question Answering over Financial Documents
Karmvir Singh Phogat
Sai Akhil Puranam
Sridhar Dasaratha
Chetan Harsha
Shashishekar Ramakrishna
LRM
47
4
0
22 Aug 2024
Benchmarking Large Language Models for Math Reasoning Tasks
Benchmarking Large Language Models for Math Reasoning Tasks
Kathrin Seßler
Yao Rong
Emek Gözlüklü
Enkelejda Kasneci
LRM
61
4
0
20 Aug 2024
Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation
Kubrick: Multimodal Agent Collaborations for Synthetic Video Generation
Liu He
Yizhi Song
Hejun Huang
Pinxin Liu
Yunlong Tang
Daniel G. Aliaga
Xin Zhou
DiffMVGen
148
6
0
19 Aug 2024
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering
Xianjie Wu
Jian Yang
Linzheng Chai
Ge Zhang
Jiaheng Liu
...
Xianfu Cheng
Tianzhen Sun
Guanglin Niu
Tongliang Li
Zhoujun Li
LMTDELM
108
41
0
17 Aug 2024
FLEXTAF: Enhancing Table Reasoning with Flexible Tabular Formats
FLEXTAF: Enhancing Table Reasoning with Flexible Tabular Formats
Xuanliang Zhang
Dingzirui Wang
Longxu Dou
Baoxin Wang
Dayong Wu
Qingfu Zhu
Wanxiang Che
LMTDReLM
106
3
0
16 Aug 2024
Math-PUMA: Progressive Upward Multimodal Alignment to Enhance
  Mathematical Reasoning
Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning
Wenwen Zhuang
Xin Huang
Xiantao Zhang
Jin Zeng
LRM
123
31
0
16 Aug 2024
SelectLLM: Query-Aware Efficient Selection Algorithm for Large Language Models
SelectLLM: Query-Aware Efficient Selection Algorithm for Large Language Models
Kaushal Kumar Maurya
KV Aditya Srivatsa
Ekaterina Kochmar
105
2
0
16 Aug 2024
Leveraging Web-Crawled Data for High-Quality Fine-Tuning
Leveraging Web-Crawled Data for High-Quality Fine-Tuning
Jing Zhou
Chenglin Jiang
Wei Shen
Xiao Zhou
Xiaonan He
ALM
90
4
0
15 Aug 2024
Chain of Condition: Construct, Verify and Solve Conditions for
  Conditional Question Answering
Chain of Condition: Construct, Verify and Solve Conditions for Conditional Question Answering
Jiuheng Lin
Yuxuan Lai
Yansong Feng
LRM
70
1
0
10 Aug 2024
1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your
  Language Model Thrives on Quality Data
1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data
Calvin Tan
Jerome Wang
ALM
119
3
0
07 Aug 2024
Unveiling Factual Recall Behaviors of Large Language Models through
  Knowledge Neurons
Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons
Yifei Wang
Yuheng Chen
Wanting Wen
Yu Sheng
Linjing Li
D. Zeng
KELM
103
9
0
06 Aug 2024
Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music
  Understanding and Generation
Can LLMs "Reason" in Music? An Evaluation of LLMs' Capability of Music Understanding and Generation
Ziya Zhou
Yuhang Wu
Zhiyue Wu
Xinyue Zhang
Ruibin Yuan
Yi Ma
Lu Wang
Emmanouil Benetos
Wei Xue
Yi-Ting Guo
LRM
77
3
0
31 Jul 2024
Fine-Tuned Large Language Model for Visualization System: A Study on
  Self-Regulated Learning in Education
Fine-Tuned Large Language Model for Visualization System: A Study on Self-Regulated Learning in Education
Lin Gao
Jing Lu
Zekai Shao
Ziyue Lin
Shengbin Yue
Chio-in Ieong
Yi Sun
Rory James Zauner
Zhongyu Wei
Siming Chen
64
12
0
30 Jul 2024
Enhancing Temporal Understanding in LLMs for Semi-structured Tables
Enhancing Temporal Understanding in LLMs for Semi-structured Tables
Irwin Deng
Kushagra Dixit
Vivek Gupta
Dan Roth
LMTD
86
5
0
22 Jul 2024
Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter?
Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter?
Nemika Tyagi
Mihir Parmar
Mohith Kulkarni
Aswin Rrv
Nisarg Patel
Mutsumi Nakamura
Arindam Mitra
Chitta Baral
LRM
111
7
0
20 Jul 2024
Evaluating Linguistic Capabilities of Multimodal LLMs in the Lens of
  Few-Shot Learning
Evaluating Linguistic Capabilities of Multimodal LLMs in the Lens of Few-Shot Learning
Mustafa Dogan
.Ilker Kesen
Iacer Calixto
Aykut Erdem
Erkut Erdem
LRM
87
1
0
17 Jul 2024
Reliable Reasoning Beyond Natural Language
Reliable Reasoning Beyond Natural Language
Nasim Borazjanizadeh
Steven T Piantadosi
LRMReLM
67
7
0
16 Jul 2024
A Comprehensive Evaluation of Large Language Models on Temporal Event Forecasting
A Comprehensive Evaluation of Large Language Models on Temporal Event Forecasting
He Chang
Chenchen Ye
Zhulin Tao
Jie Wu
Zhengmao Yang
Yunshan Ma
Xianglin Huang
Tat-Seng Chua
AI4TS
91
2
0
16 Jul 2024
Unraveling the Truth: Do LLMs really Understand Charts? A Deep Dive into
  Consistency and Robustness
Unraveling the Truth: Do LLMs really Understand Charts? A Deep Dive into Consistency and Robustness
Srija Mukhopadhyay
Adnan Qidwai
Aparna Garimella
Pritika Ramu
Vivek Gupta
Dan Roth
111
3
0
15 Jul 2024
Key-Point-Driven Mathematical Reasoning Distillation of Large Language
  Model
Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model
Xunyu Zhu
Jian Li
Can Ma
Weiping Wang
LRM
46
0
0
14 Jul 2024
Lean-STaR: Learning to Interleave Thinking and Proving
Lean-STaR: Learning to Interleave Thinking and Proving
Haohan Lin
Zhiqing Sun
Yiming Yang
Sean Welleck
ReLMLRM
197
29
0
14 Jul 2024
Solving for X and Beyond: Can Large Language Models Solve Complex Math
  Problems with More-Than-Two Unknowns?
Solving for X and Beyond: Can Large Language Models Solve Complex Math Problems with More-Than-Two Unknowns?
Kuei-Chun Kao
Ruochen Wang
Cho-Jui Hsieh
ELMLRM
82
4
0
06 Jul 2024
DotaMath: Decomposition of Thought with Code Assistance and
  Self-correction for Mathematical Reasoning
DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning
Chengpeng Li
Guanting Dong
Mingfeng Xue
Ru Peng
Xiang Wang
Dayiheng Liu
LRMReLM
100
13
0
04 Jul 2024
Solving Zebra Puzzles Using Constraint-Guided Multi-Agent Systems
Solving Zebra Puzzles Using Constraint-Guided Multi-Agent Systems
Shmuel Berman
Kathleen McKeown
Baishakhi Ray
LLMAG
82
1
0
04 Jul 2024
STOC-TOT: Stochastic Tree-of-Thought with Constrained Decoding for
  Complex Reasoning in Multi-Hop Question Answering
STOC-TOT: Stochastic Tree-of-Thought with Constrained Decoding for Complex Reasoning in Multi-Hop Question Answering
Zhenyu Bi
Daniel Hajialigol
Zhongkai Sun
Jie Hao
Xuan Wang
LRM
84
1
0
04 Jul 2024
Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition
  and Program of Thought Verification
Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification
Pritish Sahu
Karan Sikka
Ajay Divakaran
MLLMLRM
109
6
0
02 Jul 2024
WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large
  Language Models
WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Models
Kangyun Ning
Yisong Su
Xueqiang Lv
Yuanzhe Zhang
Jian Liu
Kang Liu
Jinan Xu
ELMLLMAG
83
3
0
02 Jul 2024
AutoFlow: Automated Workflow Generation for Large Language Model Agents
AutoFlow: Automated Workflow Generation for Large Language Model Agents
Zelong Li
Shuyuan Xu
Kai Mei
Wenyue Hua
Balaji Rama
Om Raheja
Hao Wang
He Zhu
Yongfeng Zhang
AIFinAI4CELLMAG
100
19
0
01 Jul 2024
We-Math: Does Your Large Multimodal Model Achieve Human-like
  Mathematical Reasoning?
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?
Runqi Qiao
Qiuna Tan
Guanting Dong
Minhui Wu
Chong Sun
...
Yida Xu
Muxi Diao
Zhimin Bao
Chen Li
Honggang Zhang
VLMLRM
113
56
0
01 Jul 2024
FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large
  Language Models
FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large Language Models
Yiyuan Li
Shichao Sun
Pengfei Liu
LRM
144
0
0
01 Jul 2024
Hybrid RAG-empowered Multi-modal LLM for Secure Healthcare Data
  Management: A Diffusion-based Contract Theory Approach
Hybrid RAG-empowered Multi-modal LLM for Secure Healthcare Data Management: A Diffusion-based Contract Theory Approach
Cheng Su
Jinbo Wen
Jiawen Kang
Yonghua Wang
Hudan Pan
M. S. Hossain
MedIm
45
0
0
01 Jul 2024
Step-Controlled DPO: Leveraging Stepwise Error for Enhanced Mathematical
  Reasoning
Step-Controlled DPO: Leveraging Stepwise Error for Enhanced Mathematical Reasoning
Zimu Lu
Aojun Zhou
Ke Wang
Houxing Ren
Weikang Shi
Junting Pan
Mingjie Zhan
Hongsheng Li
LRM
100
25
0
30 Jun 2024
Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language
  Models by Learning from Knowledge Graphs
Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language Models by Learning from Knowledge Graphs
Yifei Zhang
Xintao Wang
Jiaqing Liang
Sirui Xia
Lida Chen
Yanghua Xiao
LRM
142
2
0
30 Jun 2024
H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables
H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables
Nikhil Abhyankar
Vivek Gupta
Dan Roth
Chandan K. Reddy
LMTD
147
4
0
29 Jun 2024
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Tao Ge
Xin Chan
Dian Yu
Haitao Mi
Dong Yu
Dong Yu
SyDa
238
150
0
28 Jun 2024
DEXTER: A Benchmark for open-domain Complex Question Answering using
  LLMs
DEXTER: A Benchmark for open-domain Complex Question Answering using LLMs
Venktesh V. Deepali Prabhu
Avishek Anand
RALMCoGe
68
3
0
24 Jun 2024
Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach
Automatically Generating UI Code from Screenshot: A Divide-and-Conquer-Based Approach
Yuxuan Wan
Chaozheng Wang
Yi Dong
Wenxuan Wang
Shuqing Li
Yintong Huo
Michael R. Lyu
3DV
172
14
0
24 Jun 2024
LOGIC-LM++: Multi-Step Refinement for Symbolic Formulations
LOGIC-LM++: Multi-Step Refinement for Symbolic Formulations
Shashank Kirtania
Priyanshu Gupta
Arjun Radhakirshna
LRM
94
7
0
22 Jun 2024
Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities
Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities
Sachit Menon
Richard Zemel
Carl Vondrick
LRM
97
5
0
20 Jun 2024
SEC-QA: A Systematic Evaluation Corpus for Financial QA
SEC-QA: A Systematic Evaluation Corpus for Financial QA
Viet Dac Lai
Michael Krumdick
Charles Lovering
Varshini Reddy
Craig W. Schmidt
Chris Tanner
96
4
0
20 Jun 2024
Previous
123456...111213
Next