2501.00562
An Overview and Discussion on Using Large Language Models for Implementation Generation of Solutions to Open-Ended Problems
31 December 2024
Hashmath Shaik
Alex Doboli
OffRL
ELM
Papers citing
"An Overview and Discussion on Using Large Language Models for Implementation Generation of Solutions to Open-Ended Problems"
50 / 105 papers shown
Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human Feedback
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Wei Shen
Rui Zheng
Wenyu Zhan
Jun Zhao
Jiajun Sun
Tao Gui
Xuanjing Huang
ALM
272
68
0
08 Oct 2023
Large Language Models as Analogical Reasoners
International Conference on Learning Representations (ICLR), 2023
Michihiro Yasunaga
Xinyun Chen
Yujia Li
Panupong Pasupat
J. Leskovec
Percy Liang
Ed H. Chi
Denny Zhou
ReLM
LRM
236
127
0
03 Oct 2023
Tool-Augmented Reward Modeling
International Conference on Learning Representations (ICLR), 2023
Lei Li
Yekun Chai
Shuohuan Wang
Yu Sun
Hao Tian
Ningyu Zhang
Hua Wu
OffRL
167
22
0
02 Oct 2023
Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic
International Conference on Language Resources and Evaluation (LREC), 2023
Xufeng Zhao
Mengdi Li
Wenhao Lu
C. Weber
Jae Hee Lee
Kun-Mo Chu
S. Wermter
LRM
AI4CE
ReLM
228
56
0
23 Sep 2023
Self-Refined Large Language Model as Automated Reward Function Designer for Deep Reinforcement Learning in Robotics
Yuheng Huang
Zhehua Zhou
Jiawei Liu
Chunrong Fang
Zhan Shu
Lei Ma
199
41
0
13 Sep 2023
PACE: Improving Prompt with Actor-Critic Editing for Large Language Model
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yihong Dong
Kangcheng Luo
Xue Jiang
Zhi Jin
Ge Li
LRM
KELM
208
22
0
19 Aug 2023
Metacognitive Prompting Improves Understanding in Large Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Yuqing Wang
Yun Zhao
ReLM
LRM
216
48
0
10 Aug 2023
Augmenting Language Models with Long-Term Memory
Neural Information Processing Systems (NeurIPS), 2023
Weizhi Wang
Li Dong
Hao Cheng
Xiaodong Liu
Xifeng Yan
Jianfeng Gao
Furu Wei
KELM
RALM
185
131
0
12 Jun 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Neural Information Processing Systems (NeurIPS), 2023
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
711
6,303
0
29 May 2023
HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale Supervision
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Wenting Zhao
Justin T. Chiu
Claire Cardie
Alexander M. Rush
LRM
196
7
0
23 May 2023
Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources
International Conference on Learning Representations (ICLR), 2023
Xingxuan Li
Ruochen Zhao
Yew Ken Chia
Bosheng Ding
Shafiq Joty
Soujanya Poria
Lidong Bing
HILM
BDL
LRM
360
138
0
22 May 2023
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Fangkai Yang
Pu Zhao
Zezhong Wang
Lu Wang
Jue Zhang
Mohit Garg
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
201
68
0
19 May 2023
Reasoning Implicit Sentiment with Chain-of-Thought Prompting
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Hao Fei
Bobo Li
Qian Liu
Lidong Bing
Fei Li
Tat-Seng Chua
ReLM
LRM
224
129
0
18 May 2023
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Neural Information Processing Systems (NeurIPS), 2023
Shunyu Yao
Dian Yu
Jeffrey Zhao
Izhak Shafran
Thomas Griffiths
Yuan Cao
Karthik Narasimhan
LM&Ro
LRM
AI4CE
399
2,934
0
17 May 2023
Structured Chain-of-Thought Prompting for Code Generation
ACM Transactions on Software Engineering and Methodology (TOSEM), 2023
Jia Li
Ge Li
Yongming Li
Zhi Jin
LRM
343
229
0
11 May 2023
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Lei Wang
Wanyu Xu
Yihuai Lan
Zhiqiang Hu
Yunshi Lan
Roy Ka-wei Lee
Ee-Peng Lim
ReLM
LRM
341
512
0
06 May 2023
Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Ruochen Zhao
Xingxuan Li
Shafiq Joty
Chengwei Qin
Lidong Bing
LRM
KELM
183
193
0
05 May 2023
Federated Prompting and Chain-of-Thought Reasoning for Improving LLMs Answering
Knowledge Science, Engineering and Management (KSEM), 2023
Xiangyang Liu
Tianqi Pang
Chenyou Fan
FedML
LRM
194
30
0
27 Apr 2023
Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering
Yan Hu
Iqra Ameer
Jingcheng Du
Xueqing Peng
Vipina Kuttichi Keloth
...
Zehan Li
Xiaoqian Jiang
Yiming Li
Jianfu Li
Hua Xu
LM&MA
193
0
0
29 Mar 2023
MathPrompter: Mathematical Reasoning using Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Shima Imani
Liang Du
H. Shrivastava
KELM
ReLM
LRM
182
259
0
04 Mar 2023
Reward Design with Language Models
International Conference on Learning Representations (ICLR), 2023
Minae Kwon
Sang Michael Xie
Kalesha Bullard
Dorsa Sadigh
LM&Ro
314
276
0
27 Feb 2023
Active Prompting with Chain-of-Thought for Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Shizhe Diao
Pengcheng Wang
Yong Lin
Tong Zhang
ReLM
KELM
LLMAG
LRM
331
178
0
23 Feb 2023
Guiding Pretraining in Reinforcement Learning with Large Language Models
International Conference on Machine Learning (ICML), 2023
Yuqing Du
Olivia Watkins
Zihan Wang
Cédric Colas
Trevor Darrell
Pieter Abbeel
Abhishek Gupta
Jacob Andreas
LM&Ro
247
225
0
13 Feb 2023
Multimodal Chain-of-Thought Reasoning in Language Models
Zhuosheng Zhang
Aston Zhang
Mu Li
Hai Zhao
George Karypis
Alexander J. Smola
LRM
308
671
0
02 Feb 2023
Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based Reasoning
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2023
Yunhu Ye
Binyuan Hui
Min Yang
Binhua Li
Fei Huang
Yongbin Li
LMTD
ReLM
LRM
255
211
0
31 Jan 2023
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks
Wenhu Chen
Xueguang Ma
Xinyi Wang
William W. Cohen
ReLM
ReCod
LRM
933
1,044
0
22 Nov 2022
PAL: Program-aided Language Models
International Conference on Machine Learning (ICML), 2022
Luyu Gao
Aman Madaan
Shuyan Zhou
Uri Alon
Pengfei Liu
Yiming Yang
Jamie Callan
Graham Neubig
ReLM
LRM
410
594
0
18 Nov 2022
Scaling Laws for Reward Model Overoptimization
International Conference on Machine Learning (ICML), 2022
Leo Gao
John Schulman
Jacob Hilton
ALM
265
744
0
19 Oct 2022
Automatic Chain of Thought Prompting in Large Language Models
International Conference on Learning Representations (ICLR), 2022
Zhuosheng Zhang
Aston Zhang
Mu Li
Alexander J. Smola
ReLM
LRM
380
812
0
07 Oct 2022
Binding Language Models in Symbolic Languages
International Conference on Learning Representations (ICLR), 2022
Zhoujun Cheng
Tianbao Xie
Peng Shi
Chengzu Li
Rahul Nadkarni
...
Dragomir R. Radev
Mari Ostendorf
Luke Zettlemoyer
Noah A. Smith
Tao Yu
LMTD
370
262
0
06 Oct 2022
ReAct: Synergizing Reasoning and Acting in Language Models
International Conference on Learning Representations (ICLR), 2022
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAG
ReLM
LRM
1.5K
4,795
0
06 Oct 2022
Decomposed Prompting: A Modular Approach for Solving Complex Tasks
International Conference on Learning Representations (ICLR), 2022
Tushar Khot
H. Trivedi
Matthew Finlayson
Yao Fu
Kyle Richardson
Peter Clark
Ashish Sabharwal
ReLM
LRM
405
572
0
05 Oct 2022
Complexity-Based Prompting for Multi-Step Reasoning
International Conference on Learning Representations (ICLR), 2022
Yao Fu
Hao-Chun Peng
Ashish Sabharwal
Peter Clark
Tushar Khot
ReLM
LRM
492
531
0
03 Oct 2022
Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation
International Conference on Machine Learning (ICML), 2022
Christoph Dann
Yishay Mansour
M. Mohri
Ayush Sekhari
Karthik Sridharan
202
67
0
19 Jun 2022
Maieutic Prompting: Logically Consistent Reasoning with Recursive Explanations
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Jaehun Jung
Lianhui Qin
Sean Welleck
Faeze Brahman
Chandra Bhagavatula
Ronan Le Bras
Yejin Choi
ReLM
LRM
420
217
0
24 May 2022
Least-to-Most Prompting Enables Complex Reasoning in Large Language Models
International Conference on Learning Representations (ICLR), 2022
Denny Zhou
Nathanael Scharli
Le Hou
Jason W. Wei
Nathan Scales
...
Dale Schuurmans
Claire Cui
Olivier Bousquet
Quoc Le
Ed H. Chi
RALM
LRM
AI4CE
509
1,427
0
21 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
International Conference on Learning Representations (ICLR), 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
1.5K
5,238
0
21 Mar 2022
Training language models to follow instructions with human feedback
Neural Information Processing Systems (NeurIPS), 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
1.9K
16,811
0
04 Mar 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Neural Information Processing Systems (NeurIPS), 2022
Robert Meier
Asier Mujika
242
7
0
16 Feb 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Neural Information Processing Systems (NeurIPS), 2022
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
2.0K
13,860
0
28 Jan 2022
Ethical and social risks of harm from Language Models
Laura Weidinger
John F. J. Mellor
Maribeth Rauh
Conor Griffin
J. Uesato
...
Lisa Anne Hendricks
William S. Isaac
Sean Legassick
G. Irving
Iason Gabriel
PILM
424
1,262
0
08 Dec 2021
Learning to summarize from human feedback
Neural Information Processing Systems (NeurIPS), 2020
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan J. Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
ALM
689
2,658
0
02 Sep 2020
Language Models are Few-Shot Learners
Neural Information Processing Systems (NeurIPS), 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
1.9K
50,904
0
28 May 2020
Shortcut Learning in Deep Neural Networks
Nature Machine Intelligence (NMI), 2020
Robert Geirhos
J. Jacobsen
Claudio Michaelis
R. Zemel
Wieland Brendel
Matthias Bethge
Felix Wichmann
770
2,371
0
16 Apr 2020
Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions
International Conference on Machine Learning (ICML), 2020
Rui Wang
Joel Lehman
Aditya Rawal
Jiale Zhi
Yulun Li
Jeff Clune
Kenneth O. Stanley
270
140
0
19 Mar 2020
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
1.4K
2,130
0
18 Sep 2019
Defending Against Neural Fake News
Neural Information Processing Systems (NeurIPS), 2019
Rowan Zellers
Ari Holtzman
Hannah Rashkin
Yonatan Bisk
Ali Farhadi
Franziska Roesner
Yejin Choi
AAML
311
1,138
0
29 May 2019
Parameter-Efficient Transfer Learning for NLP
International Conference on Machine Learning (ICML), 2019
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly
553
5,463
0
02 Feb 2019
Exploration versus exploitation in reinforcement learning: a stochastic control approach
Haoran Wang
T. Zariphopoulou
X. Zhou
242
62
0
04 Dec 2018
Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation
Matt Post
David Vilar
237
336
0
18 Apr 2018