ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2501.00562
  4. Cited By
An Overview and Discussion on Using Large Language Models for Implementation Generation of Solutions to Open-Ended Problems
v1v2 (latest)

An Overview and Discussion on Using Large Language Models for Implementation Generation of Solutions to Open-Ended Problems

31 December 2024
Hashmath Shaik
Alex Doboli
    OffRLELM
ArXiv (abs)PDFHTML

Papers citing "An Overview and Discussion on Using Large Language Models for Implementation Generation of Solutions to Open-Ended Problems"

50 / 105 papers shown
Title
Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning
  from Human Feedback
Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human FeedbackConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Wei Shen
Rui Zheng
Wenyu Zhan
Jun Zhao
Jiajun Sun
Tao Gui
Tao Gui
Xuanjing Huang
ALM
272
68
0
08 Oct 2023
Large Language Models as Analogical Reasoners
Large Language Models as Analogical ReasonersInternational Conference on Learning Representations (ICLR), 2023
Michihiro Yasunaga
Xinyun Chen
Yujia Li
Panupong Pasupat
J. Leskovec
Abigail Z. Jacobs
Ed H. Chi
Denny Zhou
ReLMLRM
236
127
0
03 Oct 2023
Tool-Augmented Reward Modeling
Tool-Augmented Reward ModelingInternational Conference on Learning Representations (ICLR), 2023
Lei Li
Yekun Chai
Shuohuan Wang
Yu Sun
Hao Tian
Ningyu Zhang
Hua Wu
OffRL
167
22
0
02 Oct 2023
Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models
  through Logic
Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through LogicInternational Conference on Language Resources and Evaluation (LREC), 2023
Xufeng Zhao
Mengdi Li
Wenhao Lu
C. Weber
Jae Hee Lee
Kun-Mo Chu
S. Wermter
LRMAI4CEReLM
228
56
0
23 Sep 2023
Self-Refined Large Language Model as Automated Reward Function Designer
  for Deep Reinforcement Learning in Robotics
Self-Refined Large Language Model as Automated Reward Function Designer for Deep Reinforcement Learning in Robotics
Yuheng Huang
Zhehua Zhou
Jiawei Liu
Chunrong Fang
Zhan Shu
Lei Ma
199
41
0
13 Sep 2023
PACE: Improving Prompt with Actor-Critic Editing for Large Language
  Model
PACE: Improving Prompt with Actor-Critic Editing for Large Language ModelAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Yihong Dong
Kangcheng Luo
Xue Jiang
Zhi Jin
Ge Li
LRMKELM
208
22
0
19 Aug 2023
Metacognitive Prompting Improves Understanding in Large Language Models
Metacognitive Prompting Improves Understanding in Large Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023
Yuqing Wang
Yun Zhao
ReLMLRM
216
48
0
10 Aug 2023
Augmenting Language Models with Long-Term Memory
Augmenting Language Models with Long-Term MemoryNeural Information Processing Systems (NeurIPS), 2023
Weizhi Wang
Li Dong
Hao Cheng
Xiaodong Liu
Xifeng Yan
Jianfeng Gao
Furu Wei
KELMRALM
185
131
0
12 Jun 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward
  Model
Direct Preference Optimization: Your Language Model is Secretly a Reward ModelNeural Information Processing Systems (NeurIPS), 2023
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
711
6,303
0
29 May 2023
HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale
  Supervision
HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale SupervisionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Wenting Zhao
Justin T. Chiu
Claire Cardie
Alexander M. Rush
LRM
196
7
0
23 May 2023
Chain-of-Knowledge: Grounding Large Language Models via Dynamic
  Knowledge Adapting over Heterogeneous Sources
Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous SourcesInternational Conference on Learning Representations (ICLR), 2023
Xingxuan Li
Ruochen Zhao
Yew Ken Chia
Bosheng Ding
Shafiq Joty
Soujanya Poria
Lidong Bing
HILMBDLLRM
360
138
0
22 May 2023
Empower Large Language Model to Perform Better on Industrial
  Domain-Specific Question Answering
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Fangkai Yang
Lu Wang
Zezhong Wang
Lu Wang
Jue Zhang
Mohit Garg
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
201
68
0
19 May 2023
Reasoning Implicit Sentiment with Chain-of-Thought Prompting
Reasoning Implicit Sentiment with Chain-of-Thought PromptingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Hao Fei
Bobo Li
Qian Liu
Lidong Bing
Fei Li
Tat-Seng Chua
ReLMLRM
224
129
0
18 May 2023
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Tree of Thoughts: Deliberate Problem Solving with Large Language ModelsNeural Information Processing Systems (NeurIPS), 2023
Shunyu Yao
Dian Yu
Jeffrey Zhao
Izhak Shafran
Thomas Griffiths
Yuan Cao
Karthik Narasimhan
LM&RoLRMAI4CE
399
2,934
0
17 May 2023
Structured Chain-of-Thought Prompting for Code Generation
Structured Chain-of-Thought Prompting for Code GenerationACM Transactions on Software Engineering and Methodology (TOSEM), 2023
Jia Li
Ge Li
Yongming Li
Zhi Jin
LRM
343
229
0
11 May 2023
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning
  by Large Language Models
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Lei Wang
Wanyu Xu
Yihuai Lan
Zhiqiang Hu
Yunshi Lan
Roy Ka-wei Lee
Ee-Peng Lim
ReLMLRM
341
512
0
06 May 2023
Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework
Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought FrameworkAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Ruochen Zhao
Xingxuan Li
Shafiq Joty
Chengwei Qin
Lidong Bing
LRMKELM
183
193
0
05 May 2023
Federated Prompting and Chain-of-Thought Reasoning for Improving LLMs
  Answering
Federated Prompting and Chain-of-Thought Reasoning for Improving LLMs AnsweringKnowledge Science, Engineering and Management (KSEM), 2023
Xiangyang Liu
Tianqi Pang
Chenyou Fan
FedMLLRM
194
30
0
27 Apr 2023
Improving Large Language Models for Clinical Named Entity Recognition
  via Prompt Engineering
Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering
Yan Hu
Iqra Ameer
Jingcheng Du
Xueqing Peng
Vipina Kuttichi Keloth
...
Zehan Li
Xiaoqian Jiang
Yiming Li
Jianfu Li
Hua Xu
LM&MA
193
0
0
29 Mar 2023
MathPrompter: Mathematical Reasoning using Large Language Models
MathPrompter: Mathematical Reasoning using Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Shima Imani
Liang Du
H. Shrivastava
KELMReLMLRM
182
259
0
04 Mar 2023
Reward Design with Language Models
Reward Design with Language ModelsInternational Conference on Learning Representations (ICLR), 2023
Minae Kwon
Sang Michael Xie
Kalesha Bullard
Dorsa Sadigh
LM&Ro
314
276
0
27 Feb 2023
Active Prompting with Chain-of-Thought for Large Language Models
Active Prompting with Chain-of-Thought for Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023
Shizhe Diao
Pengcheng Wang
Yong Lin
Tong Zhang
ReLMKELMLLMAGLRM
331
178
0
23 Feb 2023
Guiding Pretraining in Reinforcement Learning with Large Language Models
Guiding Pretraining in Reinforcement Learning with Large Language ModelsInternational Conference on Machine Learning (ICML), 2023
Yuqing Du
Olivia Watkins
Zihan Wang
Cédric Colas
Trevor Darrell
Pieter Abbeel
Abhishek Gupta
Jacob Andreas
LM&Ro
247
225
0
13 Feb 2023
Multimodal Chain-of-Thought Reasoning in Language Models
Multimodal Chain-of-Thought Reasoning in Language Models
Zhuosheng Zhang
Aston Zhang
Mu Li
Hai Zhao
George Karypis
Alexander J. Smola
LRM
308
671
0
02 Feb 2023
Large Language Models are Versatile Decomposers: Decompose Evidence and
  Questions for Table-based Reasoning
Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based ReasoningAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2023
Yunhu Ye
Binyuan Hui
Min Yang
Binhua Li
Fei Huang
Yongbin Li
LMTDReLMLRM
255
211
0
31 Jan 2023
Program of Thoughts Prompting: Disentangling Computation from Reasoning
  for Numerical Reasoning Tasks
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks
Wenhu Chen
Xueguang Ma
Xinyi Wang
William W. Cohen
ReLMReCodLRM
933
1,044
0
22 Nov 2022
PAL: Program-aided Language Models
PAL: Program-aided Language ModelsInternational Conference on Machine Learning (ICML), 2022
Luyu Gao
Aman Madaan
Shuyan Zhou
Uri Alon
Pengfei Liu
Yiming Yang
Jamie Callan
Graham Neubig
ReLMLRM
410
594
0
18 Nov 2022
Scaling Laws for Reward Model Overoptimization
Scaling Laws for Reward Model OveroptimizationInternational Conference on Machine Learning (ICML), 2022
Leo Gao
John Schulman
Jacob Hilton
ALM
265
744
0
19 Oct 2022
Automatic Chain of Thought Prompting in Large Language Models
Automatic Chain of Thought Prompting in Large Language ModelsInternational Conference on Learning Representations (ICLR), 2022
Zhuosheng Zhang
Aston Zhang
Mu Li
Alexander J. Smola
ReLMLRM
380
812
0
07 Oct 2022
Binding Language Models in Symbolic Languages
Binding Language Models in Symbolic LanguagesInternational Conference on Learning Representations (ICLR), 2022
Zhoujun Cheng
Tianbao Xie
Peng Shi
Chengzu Li
Rahul Nadkarni
...
Dragomir R. Radev
Mari Ostendorf
Luke Zettlemoyer
Noah A. Smith
Tao Yu
LMTD
370
262
0
06 Oct 2022
ReAct: Synergizing Reasoning and Acting in Language Models
ReAct: Synergizing Reasoning and Acting in Language ModelsInternational Conference on Learning Representations (ICLR), 2022
Shunyu Yao
Jeffrey Zhao
Dian Yu
Nan Du
Izhak Shafran
Karthik Narasimhan
Yuan Cao
LLMAGReLMLRM
1.5K
4,795
0
06 Oct 2022
Decomposed Prompting: A Modular Approach for Solving Complex Tasks
Decomposed Prompting: A Modular Approach for Solving Complex TasksInternational Conference on Learning Representations (ICLR), 2022
Tushar Khot
H. Trivedi
Matthew Finlayson
Yao Fu
Kyle Richardson
Peter Clark
Ashish Sabharwal
ReLMLRM
405
572
0
05 Oct 2022
Complexity-Based Prompting for Multi-Step Reasoning
Complexity-Based Prompting for Multi-Step ReasoningInternational Conference on Learning Representations (ICLR), 2022
Yao Fu
Hao-Chun Peng
Ashish Sabharwal
Peter Clark
Tushar Khot
ReLMLRM
492
531
0
03 Oct 2022
Guarantees for Epsilon-Greedy Reinforcement Learning with Function
  Approximation
Guarantees for Epsilon-Greedy Reinforcement Learning with Function ApproximationInternational Conference on Machine Learning (ICML), 2022
Christoph Dann
Yishay Mansour
M. Mohri
Ayush Sekhari
Karthik Sridharan
202
67
0
19 Jun 2022
Maieutic Prompting: Logically Consistent Reasoning with Recursive
  Explanations
Maieutic Prompting: Logically Consistent Reasoning with Recursive ExplanationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Jaehun Jung
Lianhui Qin
Sean Welleck
Faeze Brahman
Chandra Bhagavatula
Ronan Le Bras
Yejin Choi
ReLMLRM
420
217
0
24 May 2022
Least-to-Most Prompting Enables Complex Reasoning in Large Language
  Models
Least-to-Most Prompting Enables Complex Reasoning in Large Language ModelsInternational Conference on Learning Representations (ICLR), 2022
Denny Zhou
Nathanael Scharli
Le Hou
Jason W. Wei
Nathan Scales
...
Dale Schuurmans
Claire Cui
Olivier Bousquet
Quoc Le
Ed H. Chi
RALMLRMAI4CE
509
1,427
0
21 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Self-Consistency Improves Chain of Thought Reasoning in Language ModelsInternational Conference on Learning Representations (ICLR), 2022
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLMBDLLRMAI4CE
1.5K
5,238
0
21 Mar 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedbackNeural Information Processing Systems (NeurIPS), 2022
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLMALM
1.9K
16,811
0
04 Mar 2022
Open-Ended Reinforcement Learning with Neural Reward Functions
Open-Ended Reinforcement Learning with Neural Reward FunctionsNeural Information Processing Systems (NeurIPS), 2022
Robert Meier
Asier Mujika
242
7
0
16 Feb 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language ModelsNeural Information Processing Systems (NeurIPS), 2022
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&RoLRMAI4CEReLM
2.0K
13,860
0
28 Jan 2022
Ethical and social risks of harm from Language Models
Ethical and social risks of harm from Language Models
Laura Weidinger
John F. J. Mellor
Maribeth Rauh
Conor Griffin
J. Uesato
...
Lisa Anne Hendricks
William S. Isaac
Sean Legassick
G. Irving
Iason Gabriel
PILM
424
1,262
0
08 Dec 2021
Learning to summarize from human feedback
Learning to summarize from human feedbackNeural Information Processing Systems (NeurIPS), 2020
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan J. Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
ALM
689
2,658
0
02 Sep 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot LearnersNeural Information Processing Systems (NeurIPS), 2020
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
1.9K
50,904
0
28 May 2020
Shortcut Learning in Deep Neural Networks
Shortcut Learning in Deep Neural NetworksNature Machine Intelligence (NMI), 2020
Robert Geirhos
J. Jacobsen
Claudio Michaelis
R. Zemel
Wieland Brendel
Matthias Bethge
Felix Wichmann
770
2,371
0
16 Apr 2020
Enhanced POET: Open-Ended Reinforcement Learning through Unbounded
  Invention of Learning Challenges and their Solutions
Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their SolutionsInternational Conference on Machine Learning (ICML), 2020
Rui Wang
Joel Lehman
Aditya Rawal
Jiale Zhi
Yulun Li
Jeff Clune
Kenneth O. Stanley
270
140
0
19 Mar 2020
Fine-Tuning Language Models from Human Preferences
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
1.4K
2,130
0
18 Sep 2019
Defending Against Neural Fake News
Defending Against Neural Fake NewsNeural Information Processing Systems (NeurIPS), 2019
Rowan Zellers
Ari Holtzman
Hannah Rashkin
Yonatan Bisk
Ali Farhadi
Franziska Roesner
Yejin Choi
AAML
311
1,138
0
29 May 2019
Parameter-Efficient Transfer Learning for NLP
Parameter-Efficient Transfer Learning for NLPInternational Conference on Machine Learning (ICML), 2019
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly
553
5,463
0
02 Feb 2019
Exploration versus exploitation in reinforcement learning: a stochastic
  control approach
Exploration versus exploitation in reinforcement learning: a stochastic control approach
Haoran Wang
T. Zariphopoulou
X. Zhou
242
62
0
04 Dec 2018
Fast Lexically Constrained Decoding with Dynamic Beam Allocation for
  Neural Machine Translation
Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation
Matt Post
David Vilar
237
336
0
18 Apr 2018
Previous
123
Next