v1v2 (latest)

An Overview and Discussion on Using Large Language Models for Implementation Generation of Solutions to Open-Ended Problems

31 December 2024

Papers citing "An Overview and Discussion on Using Large Language Models for Implementation Generation of Solutions to Open-Ended Problems"

50 / 105 papers shown

Title
Loose lips sink ships: Mitigating Length Bias in Reinforcement Learning from Human FeedbackConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Wei Shen Rui Zheng Wenyu Zhan Jun Zhao Jiajun Sun Tao Gui Tao Gui Xuanjing Huang ALM 272 68 0 08 Oct 2023
Large Language Models as Analogical ReasonersInternational Conference on Learning Representations (ICLR), 2023 Michihiro Yasunaga Xinyun Chen Yujia Li Panupong Pasupat J. Leskovec Abigail Z. Jacobs Ed H. Chi Denny Zhou ReLM LRM 236 127 0 03 Oct 2023
Tool-Augmented Reward ModelingInternational Conference on Learning Representations (ICLR), 2023 Lei Li Yekun Chai Shuohuan Wang Yu Sun Hao Tian Ningyu Zhang Hua Wu OffRL 167 22 0 02 Oct 2023
Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through LogicInternational Conference on Language Resources and Evaluation (LREC), 2023 Xufeng Zhao Mengdi Li Wenhao Lu C. Weber Jae Hee Lee Kun-Mo Chu S. Wermter LRM AI4CE ReLM 228 56 0 23 Sep 2023
Self-Refined Large Language Model as Automated Reward Function Designer for Deep Reinforcement Learning in Robotics Yuheng Huang Zhehua Zhou Jiawei Liu Chunrong Fang Zhan Shu Lei Ma 199 41 0 13 Sep 2023
PACE: Improving Prompt with Actor-Critic Editing for Large Language ModelAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Yihong Dong Kangcheng Luo Xue Jiang Zhi Jin Ge Li LRM KELM 208 22 0 19 Aug 2023
Metacognitive Prompting Improves Understanding in Large Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2023 Yuqing Wang Yun Zhao ReLM LRM 216 48 0 10 Aug 2023
Augmenting Language Models with Long-Term MemoryNeural Information Processing Systems (NeurIPS), 2023 Weizhi Wang Li Dong Hao Cheng Xiaodong Liu Xifeng Yan Jianfeng Gao Furu Wei KELM RALM 185 131 0 12 Jun 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward ModelNeural Information Processing Systems (NeurIPS), 2023 Rafael Rafailov Archit Sharma E. Mitchell Stefano Ermon Christopher D. Manning Chelsea Finn ALM 711 6,303 0 29 May 2023
HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale SupervisionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Wenting Zhao Justin T. Chiu Claire Cardie Alexander M. Rush LRM 196 7 0 23 May 2023
Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous SourcesInternational Conference on Learning Representations (ICLR), 2023 Xingxuan Li Ruochen Zhao Yew Ken Chia Bosheng Ding Shafiq Joty Soujanya Poria Lidong Bing HILM BDL LRM 360 138 0 22 May 2023
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question AnsweringConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 Fangkai Yang Lu Wang Zezhong Wang Lu Wang Jue Zhang Mohit Garg Qingwei Lin Saravan Rajmohan Dongmei Zhang 201 68 0 19 May 2023
Reasoning Implicit Sentiment with Chain-of-Thought PromptingAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Hao Fei Bobo Li Qian Liu Lidong Bing Fei Li Tat-Seng Chua ReLM LRM 224 129 0 18 May 2023
Tree of Thoughts: Deliberate Problem Solving with Large Language ModelsNeural Information Processing Systems (NeurIPS), 2023 Shunyu Yao Dian Yu Jeffrey Zhao Izhak Shafran Thomas Griffiths Yuan Cao Karthik Narasimhan LM&Ro LRM AI4CE 399 2,934 0 17 May 2023
Structured Chain-of-Thought Prompting for Code GenerationACM Transactions on Software Engineering and Methodology (TOSEM), 2023 Jia Li Ge Li Yongming Li Zhi Jin LRM 343 229 0 11 May 2023
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Lei Wang Wanyu Xu Yihuai Lan Zhiqiang Hu Yunshi Lan Roy Ka-wei Lee Ee-Peng Lim ReLM LRM 341 512 0 06 May 2023
Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought FrameworkAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Ruochen Zhao Xingxuan Li Shafiq Joty Chengwei Qin Lidong Bing LRM KELM 183 193 0 05 May 2023
Federated Prompting and Chain-of-Thought Reasoning for Improving LLMs AnsweringKnowledge Science, Engineering and Management (KSEM), 2023 Xiangyang Liu Tianqi Pang Chenyou Fan FedML LRM 194 30 0 27 Apr 2023
Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering Yan Hu Iqra Ameer Jingcheng Du Xueqing Peng Vipina Kuttichi Keloth ... Zehan Li Xiaoqian Jiang Yiming Li Jianfu Li Hua Xu LM&MA 193 0 0 29 Mar 2023
MathPrompter: Mathematical Reasoning using Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Shima Imani Liang Du H. Shrivastava KELM ReLM LRM 182 259 0 04 Mar 2023
Reward Design with Language ModelsInternational Conference on Learning Representations (ICLR), 2023 Minae Kwon Sang Michael Xie Kalesha Bullard Dorsa Sadigh LM&Ro 314 276 0 27 Feb 2023
Active Prompting with Chain-of-Thought for Large Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2023 Shizhe Diao Pengcheng Wang Yong Lin Tong Zhang ReLM KELM LLMAG LRM 331 178 0 23 Feb 2023
Guiding Pretraining in Reinforcement Learning with Large Language ModelsInternational Conference on Machine Learning (ICML), 2023 Yuqing Du Olivia Watkins Zihan Wang Cédric Colas Trevor Darrell Pieter Abbeel Abhishek Gupta Jacob Andreas LM&Ro 247 225 0 13 Feb 2023
Multimodal Chain-of-Thought Reasoning in Language Models Zhuosheng Zhang Aston Zhang Mu Li Hai Zhao George Karypis Alexander J. Smola LRM 308 671 0 02 Feb 2023
Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based ReasoningAnnual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2023 Yunhu Ye Binyuan Hui Min Yang Binhua Li Fei Huang Yongbin Li LMTD ReLM LRM 255 211 0 31 Jan 2023
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks Wenhu Chen Xueguang Ma Xinyi Wang William W. Cohen ReLM ReCod LRM 933 1,044 0 22 Nov 2022
PAL: Program-aided Language ModelsInternational Conference on Machine Learning (ICML), 2022 Luyu Gao Aman Madaan Shuyan Zhou Uri Alon Pengfei Liu Yiming Yang Jamie Callan Graham Neubig ReLM LRM 410 594 0 18 Nov 2022
Scaling Laws for Reward Model OveroptimizationInternational Conference on Machine Learning (ICML), 2022 Leo Gao John Schulman Jacob Hilton ALM 265 744 0 19 Oct 2022
Automatic Chain of Thought Prompting in Large Language ModelsInternational Conference on Learning Representations (ICLR), 2022 Zhuosheng Zhang Aston Zhang Mu Li Alexander J. Smola ReLM LRM 380 812 0 07 Oct 2022
Binding Language Models in Symbolic LanguagesInternational Conference on Learning Representations (ICLR), 2022 Zhoujun Cheng Tianbao Xie Peng Shi Chengzu Li Rahul Nadkarni ... Dragomir R. Radev Mari Ostendorf Luke Zettlemoyer Noah A. Smith Tao Yu LMTD 370 262 0 06 Oct 2022
ReAct: Synergizing Reasoning and Acting in Language ModelsInternational Conference on Learning Representations (ICLR), 2022 Shunyu Yao Jeffrey Zhao Dian Yu Nan Du Izhak Shafran Karthik Narasimhan Yuan Cao LLMAG ReLM LRM 1.5K 4,795 0 06 Oct 2022
Decomposed Prompting: A Modular Approach for Solving Complex TasksInternational Conference on Learning Representations (ICLR), 2022 Tushar Khot H. Trivedi Matthew Finlayson Yao Fu Kyle Richardson Peter Clark Ashish Sabharwal ReLM LRM 405 572 0 05 Oct 2022
Complexity-Based Prompting for Multi-Step ReasoningInternational Conference on Learning Representations (ICLR), 2022 Yao Fu Hao-Chun Peng Ashish Sabharwal Peter Clark Tushar Khot ReLM LRM 492 531 0 03 Oct 2022
Guarantees for Epsilon-Greedy Reinforcement Learning with Function ApproximationInternational Conference on Machine Learning (ICML), 2022 Christoph Dann Yishay Mansour M. Mohri Ayush Sekhari Karthik Sridharan 202 67 0 19 Jun 2022
Maieutic Prompting: Logically Consistent Reasoning with Recursive ExplanationsConference on Empirical Methods in Natural Language Processing (EMNLP), 2022 Jaehun Jung Lianhui Qin Sean Welleck Faeze Brahman Chandra Bhagavatula Ronan Le Bras Yejin Choi ReLM LRM 420 217 0 24 May 2022
Least-to-Most Prompting Enables Complex Reasoning in Large Language ModelsInternational Conference on Learning Representations (ICLR), 2022 Denny Zhou Nathanael Scharli Le Hou Jason W. Wei Nathan Scales ... Dale Schuurmans Claire Cui Olivier Bousquet Quoc Le Ed H. Chi RALM LRM AI4CE 509 1,427 0 21 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language ModelsInternational Conference on Learning Representations (ICLR), 2022 Xuezhi Wang Jason W. Wei Dale Schuurmans Quoc Le Ed H. Chi Sharan Narang Aakanksha Chowdhery Denny Zhou ReLM BDL LRM AI4CE 1.5K 5,238 0 21 Mar 2022
Training language models to follow instructions with human feedbackNeural Information Processing Systems (NeurIPS), 2022 Long Ouyang Jeff Wu Xu Jiang Diogo Almeida Carroll L. Wainwright ... Amanda Askell Peter Welinder Paul Christiano Jan Leike Ryan J. Lowe OSLM ALM 1.9K 16,811 0 04 Mar 2022
Open-Ended Reinforcement Learning with Neural Reward FunctionsNeural Information Processing Systems (NeurIPS), 2022 Robert Meier Asier Mujika 242 7 0 16 Feb 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language ModelsNeural Information Processing Systems (NeurIPS), 2022 Jason W. Wei Xuezhi Wang Dale Schuurmans Maarten Bosma Brian Ichter F. Xia Ed H. Chi Quoc Le Denny Zhou LM&Ro LRM AI4CE ReLM 2.0K 13,860 0 28 Jan 2022
Ethical and social risks of harm from Language Models Laura Weidinger John F. J. Mellor Maribeth Rauh Conor Griffin J. Uesato ... Lisa Anne Hendricks William S. Isaac Sean Legassick G. Irving Iason Gabriel PILM 424 1,262 0 08 Dec 2021
Learning to summarize from human feedbackNeural Information Processing Systems (NeurIPS), 2020 Nisan Stiennon Long Ouyang Jeff Wu Daniel M. Ziegler Ryan J. Lowe Chelsea Voss Alec Radford Dario Amodei Paul Christiano ALM 689 2,658 0 02 Sep 2020
Language Models are Few-Shot LearnersNeural Information Processing Systems (NeurIPS), 2020 Tom B. Brown Benjamin Mann Nick Ryder Melanie Subbiah Jared Kaplan ... Christopher Berner Sam McCandlish Alec Radford Ilya Sutskever Dario Amodei BDL 1.9K 50,904 0 28 May 2020
Shortcut Learning in Deep Neural NetworksNature Machine Intelligence (NMI), 2020 Robert Geirhos J. Jacobsen Claudio Michaelis R. Zemel Wieland Brendel Matthias Bethge Felix Wichmann 770 2,371 0 16 Apr 2020
Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their SolutionsInternational Conference on Machine Learning (ICML), 2020 Rui Wang Joel Lehman Aditya Rawal Jiale Zhi Yulun Li Jeff Clune Kenneth O. Stanley 270 140 0 19 Mar 2020
Fine-Tuning Language Models from Human Preferences Daniel M. Ziegler Nisan Stiennon Jeff Wu Tom B. Brown Alec Radford Dario Amodei Paul Christiano G. Irving ALM 1.4K 2,130 0 18 Sep 2019
Defending Against Neural Fake NewsNeural Information Processing Systems (NeurIPS), 2019 Rowan Zellers Ari Holtzman Hannah Rashkin Yonatan Bisk Ali Farhadi Franziska Roesner Yejin Choi AAML 311 1,138 0 29 May 2019
Parameter-Efficient Transfer Learning for NLPInternational Conference on Machine Learning (ICML), 2019 N. Houlsby A. Giurgiu Stanislaw Jastrzebski Bruna Morrone Quentin de Laroussilhe Andrea Gesmundo Mona Attariyan Sylvain Gelly 553 5,463 0 02 Feb 2019
Exploration versus exploitation in reinforcement learning: a stochastic control approach Haoran Wang T. Zariphopoulou X. Zhou 242 62 0 04 Dec 2018
Fast Lexically Constrained Decoding with Dynamic Beam Allocation for Neural Machine Translation Matt Post David Vilar 237 336 0 18 Apr 2018