ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2308.00946
  4. Cited By
Teaching Smaller Language Models To Generalise To Unseen Compositional
  Questions

Teaching Smaller Language Models To Generalise To Unseen Compositional Questions

2 August 2023
Tim Hartill
N. Tan
Michael Witbrock
Patricia J. Riddle
    ReLM
    KELM
    LRM
ArXivPDFHTML

Papers citing "Teaching Smaller Language Models To Generalise To Unseen Compositional Questions"

50 / 80 papers shown
Title
Galactica: A Large Language Model for Science
Galactica: A Large Language Model for Science
Ross Taylor
Marcin Kardas
Guillem Cucurull
Thomas Scialom
Anthony Hartshorn
Elvis Saravia
Andrew Poulton
Viktor Kerkez
Robert Stojnic
ELM
ReLM
98
766
0
16 Nov 2022
Large Language Models with Controllable Working Memory
Large Language Models with Controllable Working Memory
Daliang Li
A. S. Rawat
Manzil Zaheer
Xin Wang
Michal Lukasik
Andreas Veit
Felix X. Yu
Surinder Kumar
KELM
106
169
0
09 Nov 2022
Transcending Scaling Laws with 0.1% Extra Compute
Transcending Scaling Laws with 0.1% Extra Compute
Yi Tay
Jason W. Wei
Hyung Won Chung
Vinh Q. Tran
David R. So
...
Donald Metzler
Slav Petrov
N. Houlsby
Quoc V. Le
Mostafa Dehghani
LRM
82
70
0
20 Oct 2022
Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple
  Tasks
Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks
Zhenhailong Wang
Xiaoman Pan
Dian Yu
Dong Yu
Jianshu Chen
Heng Ji
VLM
88
10
0
01 Oct 2022
Beyond the Imitation Game: Quantifying and extrapolating the
  capabilities of language models
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
Aarohi Srivastava
Abhinav Rastogi
Abhishek Rao
Abu Awal Md Shoeb
Abubakar Abid
...
Zhuoye Zhao
Zijian Wang
Zijie J. Wang
Zirui Wang
Ziyi Wu
ELM
166
1,749
0
09 Jun 2022
Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard
  Contexts
Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
ReLM
LRM
53
11
0
25 May 2022
Better Retrieval May Not Lead to Better Question Answering
Better Retrieval May Not Lead to Better Question Answering
Zhengzhong Liang
Tushar Khot
Steven Bethard
Mihai Surdeanu
Ashish Sabharwal
RALM
LRM
97
3
0
07 May 2022
OPT: Open Pre-trained Transformer Language Models
OPT: Open Pre-trained Transformer Language Models
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
...
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM
OSLM
AI4CE
299
3,647
0
02 May 2022
PaLM: Scaling Language Modeling with Pathways
PaLM: Scaling Language Modeling with Pathways
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILM
LRM
446
6,222
0
05 Apr 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
507
3,618
0
21 Mar 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
811
12,893
0
04 Mar 2022
UnifiedQA-v2: Stronger Generalization via Broader Cross-Format Training
UnifiedQA-v2: Stronger Generalization via Broader Cross-Format Training
Daniel Khashabi
Yeganeh Kordi
Hannaneh Hajishirzi
65
66
0
23 Feb 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
763
9,330
0
28 Jan 2022
Reasoning Like Program Executors
Reasoning Like Program Executors
Xinyu Pi
Qian Liu
Bei Chen
Morteza Ziyadi
Zeqi Lin
Qiang Fu
Yan Gao
Jian-Guang Lou
Weizhu Chen
ReLM
LRM
288
53
0
27 Jan 2022
CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
CommonsenseQA 2.0: Exposing the Limits of AI through Gamification
Alon Talmor
Ori Yoran
Ronan Le Bras
Chandrasekhar Bhagavatula
Yoav Goldberg
Yejin Choi
Jonathan Berant
ELM
92
145
0
14 Jan 2022
Unsupervised Dense Information Retrieval with Contrastive Learning
Unsupervised Dense Information Retrieval with Contrastive Learning
Gautier Izacard
Mathilde Caron
Lucas Hosseini
Sebastian Riedel
Piotr Bojanowski
Armand Joulin
Edouard Grave
RALM
180
898
0
16 Dec 2021
Learning to Retrieve Passages without Supervision
Learning to Retrieve Passages without Supervision
Ori Ram
Gal Shachaf
Omer Levy
Jonathan Berant
Amir Globerson
RALM
37
61
0
14 Dec 2021
Human Parity on CommonsenseQA: Augmenting Self-Attention with External
  Attention
Human Parity on CommonsenseQA: Augmenting Self-Attention with External Attention
Yichong Xu
Chenguang Zhu
Shuohang Wang
Siqi Sun
Hao Cheng
Xiaodong Liu
Jianfeng Gao
Pengcheng He
Michael Zeng
Xuedong Huang
LRM
269
59
0
06 Dec 2021
Multitask Prompted Training Enables Zero-Shot Task Generalization
Multitask Prompted Training Enables Zero-Shot Task Generalization
Victor Sanh
Albert Webson
Colin Raffel
Stephen H. Bach
Lintang Sutawika
...
T. Bers
Stella Biderman
Leo Gao
Thomas Wolf
Alexander M. Rush
LRM
342
1,700
0
15 Oct 2021
Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a
  Sparse One?
Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?
Xilun Chen
Kushal Lakhotia
Barlas Oğuz
Anchit Gupta
Patrick Lewis
Stanislav Peshterliev
Yashar Mehdad
Sonal Gupta
Wen-tau Yih
89
68
0
13 Oct 2021
Entity-Based Knowledge Conflicts in Question Answering
Entity-Based Knowledge Conflicts in Question Answering
Shayne Longpre
Kartik Perisetla
Anthony Chen
Nikhil Ramesh
Chris DuBois
Sameer Singh
HILM
323
258
0
10 Sep 2021
CREAK: A Dataset for Commonsense Reasoning over Entity Knowledge
CREAK: A Dataset for Commonsense Reasoning over Entity Knowledge
Yasumasa Onoe
Michael J.Q. Zhang
Eunsol Choi
Greg Durrett
HILM
69
87
0
03 Sep 2021
Finetuned Language Models Are Zero-Shot Learners
Finetuned Language Models Are Zero-Shot Learners
Jason W. Wei
Maarten Bosma
Vincent Zhao
Kelvin Guu
Adams Wei Yu
Brian Lester
Nan Du
Andrew M. Dai
Quoc V. Le
ALM
UQCV
127
3,742
0
03 Sep 2021
The paradox of the compositionality of natural language: a neural
  machine translation case study
The paradox of the compositionality of natural language: a neural machine translation case study
Verna Dankers
Elia Bruni
Dieuwke Hupkes
CoGe
190
81
0
12 Aug 2021
MuSiQue: Multihop Questions via Single-hop Question Composition
MuSiQue: Multihop Questions via Single-hop Question Composition
H. Trivedi
Niranjan Balasubramanian
Tushar Khot
Ashish Sabharwal
LRM
79
267
0
02 Aug 2021
Turning Tables: Generating Examples from Semi-structured Tables for
  Endowing Language Models with Reasoning Skills
Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning Skills
Ori Yoran
Alon Talmor
Jonathan Berant
ReLM
LRM
213
54
0
15 Jul 2021
TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and
  Textual Content in Finance
TAT-QA: A Question Answering Benchmark on a Hybrid of Tabular and Textual Content in Finance
Fengbin Zhu
Wenqiang Lei
Youcheng Huang
Chao Wang
Shuo Zhang
Jiancheng Lv
Fuli Feng
Tat-Seng Chua
AIMat
100
292
0
17 May 2021
A Dataset of Information-Seeking Questions and Answers Anchored in
  Research Papers
A Dataset of Information-Seeking Questions and Answers Anchored in Research Papers
Pradeep Dasigi
Kyle Lo
Iz Beltagy
Arman Cohan
Noah A. Smith
Matt Gardner
RALM
85
303
0
07 May 2021
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information
  Retrieval Models
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models
Nandan Thakur
Nils Reimers
Andreas Rucklé
Abhishek Srivastava
Iryna Gurevych
VLM
415
1,031
0
17 Apr 2021
Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the
  Direct-Answer AI2 Reasoning Challenge
Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge
Sumithra Bhakthavatsalam
Daniel Khashabi
Tushar Khot
Bhavana Dalvi
Kyle Richardson
Ashish Sabharwal
Carissa Schoenick
Oyvind Tafjord
Peter Clark
RALM
AI4CE
49
65
0
05 Feb 2021
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit
  Reasoning Strategies
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies
Mor Geva
Daniel Khashabi
Elad Segal
Tushar Khot
Dan Roth
Jonathan Berant
RALM
330
718
0
06 Jan 2021
Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval
Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval
Omar Khattab
Christopher Potts
Matei A. Zaharia
RALM
LRM
53
57
0
02 Jan 2021
IIRC: A Dataset of Incomplete Information Reading Comprehension
  Questions
IIRC: A Dataset of Incomplete Information Reading Comprehension Questions
James Ferguson
Matt Gardner
Hannaneh Hajishirzi
Tushar Khot
Pradeep Dasigi
RALM
30
54
0
13 Nov 2020
HoVer: A Dataset for Many-Hop Fact Extraction And Claim Verification
HoVer: A Dataset for Many-Hop Fact Extraction And Claim Verification
Yichen Jiang
Shikha Bordia
Zheng Zhong
Charles Dognin
M. Singh
Joey Tianyi Zhou
80
157
0
05 Nov 2020
Answering Open-Domain Questions of Varying Reasoning Steps from Text
Answering Open-Domain Questions of Varying Reasoning Steps from Text
Peng Qi
Haejun Lee
OghenetegiriTGSido
Christopher D. Manning
KELM
RALM
LRM
212
57
0
23 Oct 2020
Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval
Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval
Wenhan Xiong
Xiang Lorraine Li
Srini Iyer
Jingfei Du
Patrick Lewis
...
Yashar Mehdad
Wen-tau Yih
Sebastian Riedel
Douwe Kiela
Barlas Oğuz
58
192
0
27 Sep 2020
Question and Answer Test-Train Overlap in Open-Domain Question Answering
  Datasets
Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets
Patrick Lewis
Pontus Stenetorp
Sebastian Riedel
OOD
ELM
151
186
0
06 Aug 2020
Leveraging Passage Retrieval with Generative Models for Open Domain
  Question Answering
Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering
Gautier Izacard
Edouard Grave
RALM
117
1,170
0
02 Jul 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
731
41,894
0
28 May 2020
UnifiedQA: Crossing Format Boundaries With a Single QA System
UnifiedQA: Crossing Format Boundaries With a Single QA System
Daniel Khashabi
Sewon Min
Tushar Khot
Ashish Sabharwal
Oyvind Tafjord
Peter Clark
Hannaneh Hajishirzi
122
738
0
02 May 2020
Dense Passage Retrieval for Open-Domain Question Answering
Dense Passage Retrieval for Open-Domain Question Answering
Vladimir Karpukhin
Barlas Oğuz
Sewon Min
Patrick Lewis
Ledell Yu Wu
Sergey Edunov
Danqi Chen
Wen-tau Yih
RALM
161
3,749
0
10 Apr 2020
Injecting Numerical Reasoning Skills into Language Models
Injecting Numerical Reasoning Skills into Language Models
Mor Geva
Ankit Gupta
Jonathan Berant
AIMat
LRM
62
226
0
09 Apr 2020
ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning
ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning
Weihao Yu
Zihang Jiang
Yanfei Dong
Jiashi Feng
LRM
112
251
0
11 Feb 2020
REALM: Retrieval-Augmented Language Model Pre-Training
REALM: Retrieval-Augmented Language Model Pre-Training
Kelvin Guu
Kenton Lee
Zora Tung
Panupong Pasupat
Ming-Wei Chang
RALM
113
2,093
0
10 Feb 2020
Beat the AI: Investigating Adversarial Human Annotation for Reading
  Comprehension
Beat the AI: Investigating Adversarial Human Annotation for Reading Comprehension
Max Bartolo
A. Roberts
Johannes Welbl
Sebastian Riedel
Pontus Stenetorp
AAML
94
175
0
02 Feb 2020
Break It Down: A Question Understanding Benchmark
Break It Down: A Question Understanding Benchmark
Tomer Wolfson
Mor Geva
Ankit Gupta
Matt Gardner
Yoav Goldberg
Daniel Deutch
Jonathan Berant
70
188
0
31 Jan 2020
PIQA: Reasoning about Physical Commonsense in Natural Language
PIQA: Reasoning about Physical Commonsense in Natural Language
Yonatan Bisk
Rowan Zellers
Ronan Le Bras
Jianfeng Gao
Yejin Choi
OOD
LRM
125
1,789
0
26 Nov 2019
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language
  Generation, Translation, and Comprehension
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
M. Lewis
Yinhan Liu
Naman Goyal
Marjan Ghazvininejad
Abdel-rahman Mohamed
Omer Levy
Veselin Stoyanov
Luke Zettlemoyer
AIMat
VLM
241
10,815
0
29 Oct 2019
QASC: A Dataset for Question Answering via Sentence Composition
QASC: A Dataset for Question Answering via Sentence Composition
Tushar Khot
Peter Clark
Michal Guerquin
Peter Alexander Jansen
Ashish Sabharwal
CoGe
69
328
0
25 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
404
20,114
0
23 Oct 2019
12
Next