Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.02410
Cited By
Exploring the Limits of Language Modeling
7 February 2016
Rafal Jozefowicz
Oriol Vinyals
M. Schuster
Noam M. Shazeer
Yonghui Wu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Exploring the Limits of Language Modeling"
50 / 167 papers shown
Title
Video Corpus Moment Retrieval with Contrastive Learning
Hao Zhang
Aixin Sun
Wei Jing
Guoshun Nan
Liangli Zhen
Qiufeng Wang
Rick Siow Mong Goh
44
81
0
13 May 2021
Towards A Multi-agent System for Online Hate Speech Detection
Gaurav Sahu
R. Cohen
Olga Vechtomova
16
9
0
03 May 2021
Local word statistics affect reading times independently of surprisal
Adam Goodkind
K. Bicknell
14
11
0
07 Mar 2021
End-to-end deep meta modelling to calibrate and optimize energy consumption and comfort
Max H. Cohen
Sylvain Le Corff
M. Charbit
Marius Preda
Gilles Noziere
AI4CE
18
11
0
01 Feb 2021
Domain-aware Neural Language Models for Speech Recognition
Linda Liu
Yile Gu
Aditya Gourav
Ankur Gandhe
Shashank Kalmane
Denis Filimonov
Ariya Rastrow
I. Bulyko
36
21
0
05 Jan 2021
Unsupervised Learning of Discourse Structures using a Tree Autoencoder
Patrick Huber
Giuseppe Carenini
32
4
0
17 Dec 2020
Accurate 3D Object Detection using Energy-Based Models
Fredrik K. Gustafsson
Martin Danelljan
Thomas B. Schon
3DPC
38
10
0
08 Dec 2020
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters
Hicham El Boukkouri
Olivier Ferret
Thomas Lavergne
Hiroshi Noji
Pierre Zweigenbaum
Junichi Tsujii
77
156
0
20 Oct 2020
Vulgaris: Analysis of a Corpus for Middle-Age Varieties of Italian Language
Andrea Zugarini
Matteo Tiezzi
Marco Maggini
11
2
0
12 Oct 2020
Near-imperceptible Neural Linguistic Steganography via Self-Adjusting Arithmetic Coding
Jiaming Shen
Heng Ji
Jiawei Han
15
33
0
01 Oct 2020
Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News
Reuben Tan
Bryan A. Plummer
Kate Saenko
AAML
26
72
0
16 Sep 2020
Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus
Cal Peyser
S. Mavandadi
Tara N. Sainath
J. Apfel
Ruoming Pang
Shankar Kumar
29
46
0
24 Aug 2020
Efficient Urdu Caption Generation using Attention based LSTM
Inaam Ilahi
Hafiz Muhammad Abdullah Zia
Ahtazaz Ehsan
Rauf Tabassam
Armaghan Ahmed
VLM
21
2
0
02 Aug 2020
Learning for Video Compression with Recurrent Auto-Encoder and Recurrent Probability Model
Ren Yang
Fabian Mentzer
Luc Van Gool
Radu Timofte
18
138
0
24 Jun 2020
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
Andrew Rouditchenko
Angie Boggust
David Harwath
Brian Chen
D. Joshi
...
Rogerio Feris
Brian Kingsbury
M. Picheny
Antonio Torralba
James R. Glass
SSL
22
141
0
16 Jun 2020
NAS-Bench-NLP: Neural Architecture Search Benchmark for Natural Language Processing
Nikita Klyuchnikov
I. Trofimov
Ekaterina Artemova
Mikhail Salnikov
M. Fedorov
Evgeny Burnaev
VLM
15
101
0
12 Jun 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
77
40,200
0
28 May 2020
A Systematic Assessment of Syntactic Generalization in Neural Language Models
Jennifer Hu
Jon Gauthier
Peng Qian
Ethan Gotlieb Wilcox
R. Levy
ELM
35
212
0
07 May 2020
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training
Linjie Li
Yen-Chun Chen
Yu Cheng
Zhe Gan
Licheng Yu
Jingjing Liu
MLLM
VLM
OffRL
AI4TS
46
493
0
01 May 2020
TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP
John X. Morris
Eli Lifland
Jin Yong Yoo
J. E. Grigsby
Di Jin
Yanjun Qi
SILM
27
69
0
29 Apr 2020
Sequence Model Design for Code Completion in the Modern IDE
Gareth Ari Aye
Gail E. Kaiser
20
30
0
10 Apr 2020
A Survey on Contextual Embeddings
Qi Liu
Matt J. Kusner
Phil Blunsom
225
146
0
16 Mar 2020
Visual Grounding in Video for Unsupervised Word Translation
Gunnar A. Sigurdsson
Jean-Baptiste Alayrac
Aida Nematzadeh
Lucas Smaira
Mateusz Malinowski
João Carreira
Phil Blunsom
Andrew Zisserman
VGen
16
49
0
11 Mar 2020
FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence
Kihyuk Sohn
David Berthelot
Chun-Liang Li
Zizhao Zhang
Nicholas Carlini
E. D. Cubuk
Alexey Kurakin
Han Zhang
Colin Raffel
AAML
104
3,467
0
21 Jan 2020
Montage: A Neural Network Language Model-Guided JavaScript Engine Fuzzer
Suyoung Lee
HyungSeok Han
S. Cha
Sooel Son
17
85
0
13 Jan 2020
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
126
19,493
0
23 Oct 2019
Optimizing Speech Recognition For The Edge
Yuan Shangguan
Jian Li
Qiao Liang
R. Álvarez
Ian McGraw
28
64
0
26 Sep 2019
Learning Dense Representations for Entity Retrieval
D. Gillick
Sayali Kulkarni
L. Lansing
Alessandro Presta
Jason Baldridge
Eugene Ie
Diego Garcia-Olano
RALM
28
201
0
23 Sep 2019
Analysing Neural Language Models: Contextual Decomposition Reveals Default Reasoning in Number and Gender Assignment
Jaap Jumelet
Willem H. Zuidema
Dieuwke Hupkes
LRM
33
37
0
19 Sep 2019
PaLM: A Hybrid Parser and Language Model
Hao Peng
Roy Schwartz
Noah A. Smith
AIMat
23
15
0
04 Sep 2019
Optimizing Multi-GPU Parallelization Strategies for Deep Learning Training
Saptadeep Pal
Eiman Ebrahimi
A. Zulfiqar
Yaosheng Fu
Victor Zhang
Szymon Migacz
D. Nellans
Puneet Gupta
34
55
0
30 Jul 2019
Selection via Proxy: Efficient Data Selection for Deep Learning
Cody Coleman
Christopher Yeh
Stephen Mussmann
Baharan Mirzasoleiman
Peter Bailis
Percy Liang
J. Leskovec
Matei A. Zaharia
26
329
0
26 Jun 2019
Learning Video Representations using Contrastive Bidirectional Transformer
Chen Sun
Fabien Baradel
Kevin Patrick Murphy
Cordelia Schmid
SSL
ViT
27
133
0
13 Jun 2019
Likelihood Ratios for Out-of-Distribution Detection
Jie Jessie Ren
Peter J. Liu
Emily Fertig
Jasper Snoek
Ryan Poplin
M. DePristo
Joshua V. Dillon
Balaji Lakshminarayanan
OODD
50
716
0
07 Jun 2019
Defending Against Neural Fake News
Rowan Zellers
Ari Holtzman
Hannah Rashkin
Yonatan Bisk
Ali Farhadi
Franziska Roesner
Yejin Choi
AAML
55
999
0
29 May 2019
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network
V. Wan
Chun-an Chan
Tom Kenter
Jakub Vít
R. Clark
19
75
0
17 May 2019
Gmail Smart Compose: Real-Time Assisted Writing
Mengzhao Chen
Benjamin Lee
G. Bansal
Yuan Cao
Shuyuan Zhang
...
Yinan Wang
Andrew M. Dai
Z. Chen
Timothy Sohn
Yonghui Wu
16
203
0
17 May 2019
Generating Long Sequences with Sparse Transformers
R. Child
Scott Gray
Alec Radford
Ilya Sutskever
16
1,851
0
23 Apr 2019
Unsupervised Deep Structured Semantic Models for Commonsense Reasoning
Shuohang Wang
Sheng Zhang
Yelong Shen
Xiaodong Liu
Jingjing Liu
Jianfeng Gao
Jing Jiang
LRM
22
15
0
03 Apr 2019
Neural Language Models as Psycholinguistic Subjects: Representations of Syntactic State
Richard Futrell
Ethan Gotlieb Wilcox
Takashi Morita
Peng Qian
Miguel Ballesteros
R. Levy
MILM
42
191
0
08 Mar 2019
Structural Supervision Improves Learning of Non-Local Grammatical Dependencies
Ethan Gotlieb Wilcox
Peng Qian
Richard Futrell
Miguel Ballesteros
R. Levy
26
55
0
03 Mar 2019
Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks
Hafiz Malik
13
26
0
18 Feb 2019
Generating Natural Language Explanations for Visual Question Answering using Scene Graphs and Visual Attention
Shalini Ghosh
Giedrius Burachas
Arijit Ray
Avi Ziskind
19
65
0
15 Feb 2019
Cross-lingual Language Model Pretraining
Guillaume Lample
Alexis Conneau
25
2,710
0
22 Jan 2019
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
Zihang Dai
Zhilin Yang
Yiming Yang
J. Carbonell
Quoc V. Le
Ruslan Salakhutdinov
VLM
38
3,674
0
09 Jan 2019
Choosing the Right Word: Using Bidirectional LSTM Tagger for Writing Support Systems
Victor Makarenkov
Lior Rokach
Bracha Shapira
18
35
0
08 Jan 2019
Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation
Cristina Garbacea
Samuel Carton
Shiyan Yan
Qiaozhu Mei
ELM
25
29
0
02 Jan 2019
Learning Private Neural Language Modeling with Attentive Aggregation
Shaoxiong Ji
Shirui Pan
Guodong Long
Xue Li
Jing Jiang
Zi Huang
FedML
MoMe
16
136
0
17 Dec 2018
Inferring the size of the causal universe: features and fusion of causal attribution networks
Daniel Berenberg
James P. Bagrow
CML
6
0
0
14 Dec 2018
Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs
Sachin Kumar
Yulia Tsvetkov
22
70
0
10 Dec 2018
Previous
1
2
3
4
Next