ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.00114
  4. Cited By
Show Your Work: Scratchpads for Intermediate Computation with Language
  Models

Show Your Work: Scratchpads for Intermediate Computation with Language Models

30 November 2021
Maxwell Nye
Anders Andreassen
Guy Gur-Ari
Henryk Michalewski
Jacob Austin
David Bieber
David Dohan
Aitor Lewkowycz
Maarten Bosma
D. Luan
Charles Sutton
Augustus Odena
    ReLM
    LRM
ArXivPDFHTML

Papers citing "Show Your Work: Scratchpads for Intermediate Computation with Language Models"

50 / 559 papers shown
Title
Inverse Scaling: When Bigger Isn't Better
Inverse Scaling: When Bigger Isn't Better
I. R. McKenzie
Alexander Lyzhov
Michael Pieler
Alicia Parrish
Aaron Mueller
...
Yuhui Zhang
Zhengping Zhou
Najoung Kim
Sam Bowman
Ethan Perez
41
128
0
15 Jun 2023
Opportunities for Large Language Models and Discourse in Engineering
  Design
Opportunities for Large Language Models and Discourse in Engineering Design
Jan Göpfert
J. Weinand
Patrick Kuckertz
D. Stolten
AI4CE
42
4
0
15 Jun 2023
Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models
Sarah J. Zhang
Samuel H. Florin
Ariel N. Lee
Eamon Niknafs
Andrei Marginean
...
Madeleine Udell
Yoon Kim
Tonio Buonassisi
Armando Solar-Lezama
Iddo Drori
ELM
40
18
0
15 Jun 2023
FLamE: Few-shot Learning from Natural Language Explanations
FLamE: Few-shot Learning from Natural Language Explanations
Yangqiaoyu Zhou
Yiming Zhang
Chenhao Tan
LRM
FAtt
33
9
0
13 Jun 2023
BoardgameQA: A Dataset for Natural Language Reasoning with Contradictory
  Information
BoardgameQA: A Dataset for Natural Language Reasoning with Contradictory Information
Mehran Kazemi
Quan Yuan
Deepti Bhatia
Najoung Kim
Xin Xu
Vaiva Imbrasaite
Deepak Ramachandran
LRM
32
41
0
13 Jun 2023
Recursion of Thought: A Divide-and-Conquer Approach to Multi-Context
  Reasoning with Language Models
Recursion of Thought: A Divide-and-Conquer Approach to Multi-Context Reasoning with Language Models
Soochan Lee
Gunhee Kim
ReLM
LRM
33
26
0
12 Jun 2023
Triggering Multi-Hop Reasoning for Question Answering in Language Models
  using Soft Prompts and Random Walks
Triggering Multi-Hop Reasoning for Question Answering in Language Models using Soft Prompts and Random Walks
Kanishka Misra
Cicero Nogueira dos Santos
Siamak Shakeri
KELM
LRM
31
1
0
06 Jun 2023
Utilizing ChatGPT to Enhance Clinical Trial Enrollment
Utilizing ChatGPT to Enhance Clinical Trial Enrollment
Georgios Peikos
S. Symeonidis
Pranav Kasela
G. Pasi
LM&MA
32
12
0
03 Jun 2023
Exposing Attention Glitches with Flip-Flop Language Modeling
Exposing Attention Glitches with Flip-Flop Language Modeling
Bingbin Liu
Jordan T. Ash
Surbhi Goel
A. Krishnamurthy
Cyril Zhang
LRM
35
46
0
01 Jun 2023
An Invariant Learning Characterization of Controlled Text Generation
An Invariant Learning Characterization of Controlled Text Generation
Carolina Zheng
Claudia Shi
Keyon Vafa
Amir Feder
David M. Blei
OOD
38
8
0
31 May 2023
Decision-Oriented Dialogue for Human-AI Collaboration
Decision-Oriented Dialogue for Human-AI Collaboration
Jessy Lin
Nicholas Tomlin
Jacob Andreas
J. Eisner
LLMAG
35
27
0
31 May 2023
Let's Verify Step by Step
Let's Verify Step by Step
Hunter Lightman
V. Kosaraju
Yura Burda
Harrison Edwards
Bowen Baker
Teddy Lee
Jan Leike
John Schulman
Ilya Sutskever
K. Cobbe
ALM
OffRL
LRM
45
922
0
31 May 2023
Monotonic Location Attention for Length Generalization
Monotonic Location Attention for Length Generalization
Jishnu Ray Chowdhury
Cornelia Caragea
LLMAG
24
8
0
31 May 2023
The Impact of Positional Encoding on Length Generalization in
  Transformers
The Impact of Positional Encoding on Length Generalization in Transformers
Amirhossein Kazemnejad
Inkit Padhi
Karthikeyan N. Ramamurthy
Payel Das
Siva Reddy
47
178
0
31 May 2023
Grammar Prompting for Domain-Specific Language Generation with Large
  Language Models
Grammar Prompting for Domain-Specific Language Generation with Large Language Models
Bailin Wang
Zi Wang
Xuezhi Wang
Yuan Cao
Rif A. Saurous
Yoon Kim
ReLM
LRM
41
54
0
30 May 2023
Strategic Reasoning with Language Models
Strategic Reasoning with Language Models
Kanishk Gandhi
Dorsa Sadigh
Noah D. Goodman
LM&Ro
LRM
42
37
0
30 May 2023
Dissecting Chain-of-Thought: Compositionality through In-Context
  Filtering and Learning
Dissecting Chain-of-Thought: Compositionality through In-Context Filtering and Learning
Yingcong Li
Kartik K. Sreenivasan
Angeliki Giannou
Dimitris Papailiopoulos
Samet Oymak
LRM
21
16
0
30 May 2023
Faith and Fate: Limits of Transformers on Compositionality
Faith and Fate: Limits of Transformers on Compositionality
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jian
...
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
ReLM
LRM
34
336
0
29 May 2023
RAMP: Retrieval and Attribute-Marking Enhanced Prompting for
  Attribute-Controlled Translation
RAMP: Retrieval and Attribute-Marking Enhanced Prompting for Attribute-Controlled Translation
Gabriele Sarti
Phu Mon Htut
Xing Niu
B. Hsu
Anna Currey
Georgiana Dinu
Maria Nadejde
LRM
45
10
0
26 May 2023
Large Language Models as Tool Makers
Large Language Models as Tool Makers
Tianle Cai
Xuezhi Wang
Tengyu Ma
Xinyun Chen
Denny Zhou
LLMAG
37
190
0
26 May 2023
Passive learning of active causal strategies in agents and language
  models
Passive learning of active causal strategies in agents and language models
Andrew Kyle Lampinen
Stephanie C. Y. Chan
Ishita Dasgupta
A. Nam
Jane X. Wang
31
15
0
25 May 2023
Towards Revealing the Mystery behind Chain of Thought: A Theoretical
  Perspective
Towards Revealing the Mystery behind Chain of Thought: A Theoretical Perspective
Guhao Feng
Bohang Zhang
Yuntian Gu
Haotian Ye
Di He
Liwei Wang
LRM
47
224
0
24 May 2023
Testing the General Deductive Reasoning Capacity of Large Language
  Models Using OOD Examples
Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples
Abulhair Saparov
Richard Yuanzhe Pang
Vishakh Padmakumar
Nitish Joshi
Seyed Mehran Kazemi
Najoung Kim
He He
ELM
LRM
24
88
0
24 May 2023
Spoken Question Answering and Speech Continuation Using
  Spectrogram-Powered LLM
Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM
Eliya Nachmani
Alon Levkovitch
Roy Hirsch
Julián Salazar
Chulayutsh Asawaroengchai
Soroosh Mariooryad
Ehud Rivlin
RJ Skerry-Ryan
Michelle Tadmor Ramanovich
AuLLM
36
34
0
24 May 2023
Reasoning with Language Model is Planning with World Model
Reasoning with Language Model is Planning with World Model
Shibo Hao
Yi Gu
Haodi Ma
Joshua Jiahua Hong
Zhen Wang
D. Wang
Zhiting Hu
ReLM
LRM
LLMAG
68
520
0
24 May 2023
GRACE: Discriminator-Guided Chain-of-Thought Reasoning
GRACE: Discriminator-Guided Chain-of-Thought Reasoning
Muhammad Khalifa
Lajanugen Logeswaran
Moontae Lee
Ho Hin Lee
Lu Wang
LRM
32
38
0
24 May 2023
Using Natural Language Explanations to Rescale Human Judgments
Using Natural Language Explanations to Rescale Human Judgments
Manya Wadhwa
Jifan Chen
Junyi Jessy Li
Greg Durrett
46
8
0
24 May 2023
Can Transformers Learn to Solve Problems Recursively?
Can Transformers Learn to Solve Problems Recursively?
Shizhuo Zhang
Curt Tigges
Stella Biderman
Maxim Raginsky
Talia Ringer
15
13
0
24 May 2023
Abductive Commonsense Reasoning Exploiting Mutually Exclusive
  Explanations
Abductive Commonsense Reasoning Exploiting Mutually Exclusive Explanations
Wenting Zhao
Justin T. Chiu
Claire Cardie
Alexander M. Rush
LRM
24
18
0
24 May 2023
Improving Factuality and Reasoning in Language Models through Multiagent
  Debate
Improving Factuality and Reasoning in Language Models through Multiagent Debate
Yilun Du
Shuang Li
Antonio Torralba
J. Tenenbaum
Igor Mordatch
LLMAG
LRM
79
618
0
23 May 2023
CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning
  of Large Language Models
CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models
Cheng Qian
Chi Han
Yi R. Fung
Yujia Qin
Zhiyuan Liu
Heng Ji
LRM
20
30
0
23 May 2023
Question Answering as Programming for Solving Time-Sensitive Questions
Question Answering as Programming for Solving Time-Sensitive Questions
Xinyu Zhu
Cheng Yang
B. Chen
Siheng Li
Jian-Guang Lou
Yujiu Yang
KELM
38
11
0
23 May 2023
Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks
Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks
Tiedong Liu
K. H. Low
ALM
43
81
0
23 May 2023
How Language Model Hallucinations Can Snowball
How Language Model Hallucinations Can Snowball
Muru Zhang
Ofir Press
William Merrill
Alisa Liu
Noah A. Smith
HILM
LRM
88
257
0
22 May 2023
Small Language Models Improve Giants by Rewriting Their Outputs
Small Language Models Improve Giants by Rewriting Their Outputs
Giorgos Vernikos
Arthur Bravzinskas
Jakub Adamek
Jonathan Mallinson
Aliaksei Severyn
Eric Malmi
BDL
LRM
38
14
0
22 May 2023
Element-aware Summarization with Large Language Models: Expert-aligned
  Evaluation and Chain-of-Thought Method
Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method
Yiming Wang
ZhuoSheng Zhang
Rui Wang
46
79
0
22 May 2023
Prompting is not a substitute for probability measurements in large
  language models
Prompting is not a substitute for probability measurements in large language models
Jennifer Hu
R. Levy
45
38
0
22 May 2023
Has It All Been Solved? Open NLP Research Questions Not Solved by Large
  Language Models
Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models
Oana Ignat
Zhijing Jin
Artem Abzaliev
Laura Biester
Santiago Castro
...
Verónica Pérez-Rosas
Siqi Shen
Zekun Wang
Winston Wu
Rada Mihalcea
LRM
41
6
0
21 May 2023
TheoremQA: A Theorem-driven Question Answering dataset
TheoremQA: A Theorem-driven Question Answering dataset
Wenhu Chen
Ming Yin
Max W.F. Ku
Pan Lu
Yixin Wan
Xueguang Ma
Jianyu Xu
Xinyi Wang
Tony Xia
AIMat
38
125
0
21 May 2023
TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks
TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks
Shubhra (Santu) Karmaker
Dongji Feng
30
51
0
19 May 2023
Instruction Tuned Models are Quick Learners
Instruction Tuned Models are Quick Learners
Himanshu Gupta
Saurabh Arjun Sawant
Swaroop Mishra
Mutsumi Nakamura
Arindam Mitra
Santosh Mashetty
Chitta Baral
26
26
0
17 May 2023
Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs
  Sampling
Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling
Weijia Xu
Andrzej Banburski-Fahey
Nebojsa Jojic
ReLM
LRM
28
32
0
17 May 2023
SatLM: Satisfiability-Aided Language Models Using Declarative Prompting
SatLM: Satisfiability-Aided Language Models Using Declarative Prompting
Xi Ye
Qiaochu Chen
Işıl Dillig
Greg Durrett
ReLM
ReCod
LRM
40
63
0
16 May 2023
Progressive Translation: Improving Domain Robustness of Neural Machine
  Translation with Intermediate Sequences
Progressive Translation: Improving Domain Robustness of Neural Machine Translation with Intermediate Sequences
Chaojun Wang
Yang Liu
Wai Lam
39
2
0
16 May 2023
TidyBot: Personalized Robot Assistance with Large Language Models
TidyBot: Personalized Robot Assistance with Large Language Models
Jimmy Wu
Rika Antonova
Adam Kan
Marion Lepert
Andy Zeng
Shuran Song
Jeannette Bohg
Szymon Rusinkiewicz
Thomas Funkhouser
LM&Ro
44
286
0
09 May 2023
Large Language Model Programs
Large Language Model Programs
Imanol Schlag
Sainbayar Sukhbaatar
Asli Celikyilmaz
Wen-tau Yih
Jason Weston
Jürgen Schmidhuber
Xian Li
LRM
44
15
0
09 May 2023
Code Execution with Pre-trained Language Models
Code Execution with Pre-trained Language Models
Chenxiao Liu
Shuai Lu
Weizhu Chen
Daxin Jiang
Alexey Svyatkovskiy
Shengyu Fu
Neel Sundaresan
Nan Duan
ELM
30
21
0
08 May 2023
Prompted LLMs as Chatbot Modules for Long Open-domain Conversation
Prompted LLMs as Chatbot Modules for Long Open-domain Conversation
Gibbeum Lee
Volker Hartmann
Jongho Park
Dimitris Papailiopoulos
Kangwook Lee
34
63
0
08 May 2023
Language Models Don't Always Say What They Think: Unfaithful
  Explanations in Chain-of-Thought Prompting
Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting
Miles Turpin
Julian Michael
Ethan Perez
Sam Bowman
ReLM
LRM
38
390
0
07 May 2023
Transformer Working Memory Enables Regular Language Reasoning and
  Natural Language Length Extrapolation
Transformer Working Memory Enables Regular Language Reasoning and Natural Language Length Extrapolation
Ta-Chung Chi
Ting-Han Fan
Alexander I. Rudnicky
Peter J. Ramadge
LRM
14
13
0
05 May 2023
Previous
123...101112789
Next