arXiv:2012.08266
*-CFQ: Analyzing the Scalability of Machine Learning on a Compositional Task
15 December 2020
Dmitry Tsarkov
Tibor Tihon
Nathan Scales
Nikola Momchev
Danila Sinopalnikov
Nathanael Scharli
Papers citing
"*-CFQ: Analyzing the Scalability of Machine Learning on a Compositional Task"
31 / 31 papers shown
Compositional Generalization in Semantic Parsing: Pre-training vs. Specialized Architectures
Daniel Furrer
Marc van Zee
Nathan Scales
Nathanael Scharli
CoGe
68
114
0
17 Jul 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
602
41,736
0
28 May 2020
Don't Stop Pretraining: Adapt Language Models to Domains and Tasks
Suchin Gururangan
Ana Marasović
Swabha Swayamdipta
Kyle Lo
Iz Beltagy
Doug Downey
Noah A. Smith
VLM
AI4CE
CLL
134
2,414
0
23 Apr 2020
Building a Multi-domain Neural Machine Translation Model using Knowledge Distillation
Idriss Mghabbar
Pirashanth Ratnamogan
39
14
0
15 Apr 2020
Scaling Laws for Neural Language Models
Jared Kaplan
Sam McCandlish
T. Henighan
Tom B. Brown
B. Chess
R. Child
Scott Gray
Alec Radford
Jeff Wu
Dario Amodei
522
4,773
0
23 Jan 2020
Measuring Compositional Generalization: A Comprehensive Method on Realistic Data
Daniel Keysers
Nathanael Scharli
Nathan Scales
Hylke Buisman
Daniel Furrer
...
Tibor Tihon
Dmitry Tsarkov
Tianlin Li
Marc van Zee
Olivier Bousquet
CoGe
56
353
0
20 Dec 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
367
20,053
0
23 Oct 2019
Compositional Generalization for Primitive Substitutions
Yuanpeng Li
Liang Zhao
Jianyu Wang
Joel Hestness
53
87
0
07 Oct 2019
Environmental drivers of systematicity and generalization in a situated agent
Felix Hill
Andrew Kyle Lampinen
R. Schneider
S. Clark
M. Botvinick
James L. McClelland
Adam Santoro
OOD
80
107
0
01 Oct 2019
A Constructive Prediction of the Generalization Error Across Scales
Jonathan S. Rosenfeld
Amir Rosenfeld
Yonatan Belinkov
Nir Shavit
91
211
0
27 Sep 2019
Learning a Multi-Domain Curriculum for Neural Machine Translation
Wei Wang
Ye Tian
Jiquan Ngiam
Yinfei Yang
Isaac Caswell
Zarana Parekh
68
39
0
28 Aug 2019
Compositionality decomposed: how do neural networks generalise?
Dieuwke Hupkes
Verna Dankers
Mathijs Mul
Elia Bruni
CoGe
118
332
0
22 Aug 2019
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
514
24,351
0
26 Jul 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
215
8,415
0
19 Jun 2019
MultiQA: An Empirical Investigation of Generalization and Transfer in Reading Comprehension
Alon Talmor
Jonathan Berant
71
173
0
31 May 2019
Curriculum Learning for Domain Adaptation in Neural Machine Translation
Xuan Zhang
Pamela Shapiro
Manish Kumar
Paul McNamee
Marine Carpuat
Kevin Duh
55
124
0
14 May 2019
Compositional generalization in a deep seq2seq model by separating syntax and semantics
Jacob Russin
Jason Jo
R. C. O'Reilly
Yoshua Bengio
60
103
0
22 Apr 2019
A Survey of Unsupervised Deep Domain Adaptation
Garrett Wilson
D. Cook
OOD
77
814
0
06 Dec 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.4K
94,511
0
11 Oct 2018
Universal Transformers
Mostafa Dehghani
Stephan Gouws
Oriol Vinyals
Jakob Uszkoreit
Lukasz Kaiser
80
752
0
10 Jul 2018
Relational inductive biases, deep learning, and graph networks
Peter W. Battaglia
Jessica B. Hamrick
V. Bapst
Alvaro Sanchez-Gonzalez
V. Zambaldi
...
Pushmeet Kohli
M. Botvinick
Oriol Vinyals
Yujia Li
Razvan Pascanu
AI4CE
NAI
584
3,112
0
04 Jun 2018
A Survey of Domain Adaptation for Neural Machine Translation
Chenhui Chu
Rui Wang
AI4CE
73
261
0
01 Jun 2018
Multi-Domain Neural Machine Translation
Sander Tars
Mark Fishel
AI4CE
41
51
0
06 May 2018
Tensor2Tensor for Neural Machine Translation
Ashish Vaswani
Samy Bengio
E. Brevdo
François Chollet
Aidan Gomez
...
Nal Kalchbrenner
Niki Parmar
Ryan Sepassi
Noam M. Shazeer
Jakob Uszkoreit
88
529
0
16 Mar 2018
Deep Learning Scaling is Predictable, Empirically
Joel Hestness
Sharan Narang
Newsha Ardalani
G. Diamos
Heewoo Jun
Hassan Kianinejad
Md. Mostofa Ali Patwary
Yang Yang
Yanqi Zhou
87
736
0
01 Dec 2017
Revisiting Unreasonable Effectiveness of Data in Deep Learning Era
Chen Sun
Abhinav Shrivastava
Saurabh Singh
Abhinav Gupta
VLM
159
2,393
0
10 Jul 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
624
130,942
0
12 Jun 2017
Neural Semantic Parsing over Multiple Knowledge-bases
Jonathan Herzig
Jonathan Berant
47
57
0
06 Feb 2017
How much data is needed to train a medical image deep learning system to achieve necessary high accuracy?
Junghwan Cho
Kyewook Lee
Ellie Shin
G. Choy
Synho Do
65
335
0
19 Nov 2015
A Unified Perspective on Multi-Domain and Multi-Task Learning
Yongxin Yang
Timothy M. Hospedales
71
163
0
23 Dec 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
501
27,263
0
01 Sep 2014