Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1907.11692
Cited By
RoBERTa: A Robustly Optimized BERT Pretraining Approach
26 July 2019
Yinhan Liu
Myle Ott
Naman Goyal
Jingfei Du
Mandar Joshi
Danqi Chen
Omer Levy
M. Lewis
Luke Zettlemoyer
Veselin Stoyanov
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"RoBERTa: A Robustly Optimized BERT Pretraining Approach"
50 / 9,296 papers shown
Title
Are All Languages Created Equal in Multilingual BERT?
Shijie Wu
Mark Dredze
32
319
0
18 May 2020
Span-ConveRT: Few-shot Span Extraction for Dialog with Pretrained Conversational Representations
Sam Coope
Tyler Farghly
D. Gerz
Ivan Vulić
Matthew Henderson
40
62
0
18 May 2020
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation
Po-Han Chi
Pei-Hung Chung
Tsung-Han Wu
Chun-Cheng Hsieh
Yen-Hao Chen
Shang-Wen Li
Hung-yi Lee
SSL
18
147
0
18 May 2020
Syntax-guided Controlled Generation of Paraphrases
Ashutosh Kumar
Kabir Ahuja
Raghuram Vadapalli
Partha P. Talukdar
41
93
0
18 May 2020
T-VSE: Transformer-Based Visual Semantic Embedding
M. Bastan
Arnau Ramisa
Mehmet Tek
ViT
24
7
0
17 May 2020
TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data
Pengcheng Yin
Graham Neubig
Wen-tau Yih
Sebastian Riedel
RALM
LMTD
57
587
0
17 May 2020
ApplicaAI at SemEval-2020 Task 11: On RoBERTa-CRF, Span CLS and Whether Self-Training Helps Them
Dawid Jurkiewicz
Łukasz Borchmann
Izabela Kosmala
Filip Graliñski
22
40
0
16 May 2020
Movement Pruning: Adaptive Sparsity by Fine-Tuning
Victor Sanh
Thomas Wolf
Alexander M. Rush
37
472
0
15 May 2020
COVID-Twitter-BERT: A Natural Language Processing Model to Analyse COVID-19 Content on Twitter
Martin Müller
M. Salathé
P. Kummervold
VLM
MedIm
AI4MH
35
357
0
15 May 2020
Spelling Error Correction with Soft-Masked BERT
Shaohua Zhang
Haoran Huang
Jicong Liu
Hang Li
22
207
0
15 May 2020
Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA
Hyounghun Kim
Zineng Tang
Joey Tianyi Zhou
35
31
0
13 May 2020
INFOTABS: Inference on Tables as Semi-structured Data
Vivek Gupta
Maitrey Mehta
Pegah Nokhiz
Vivek Srikumar
LMTD
25
101
0
13 May 2020
That is a Known Lie: Detecting Previously Fact-Checked Claims
Shaden Shaar
Giovanni Da San Martino
Nikolay Babulkov
Preslav Nakov
HILM
59
155
0
12 May 2020
A Report on the 2020 Sarcasm Detection Shared Task
Debanjan Ghosh
Avijit Vajpayee
Smaranda Muresan
24
60
0
12 May 2020
WinoWhy: A Deep Diagnosis of Essential Commonsense Knowledge for Answering Winograd Schema Challenge
Hongming Zhang
Xinran Zhao
Yangqiu Song
35
55
0
12 May 2020
On the Robustness of Language Encoders against Grammatical Errors
Fan Yin
Quanyu Long
Tao Meng
Kai-Wei Chang
39
34
0
12 May 2020
SOLOIST: Building Task Bots at Scale with Transfer Learning and Machine Teaching
Baolin Peng
Chunyuan Li
Jinchao Li
Shahin Shayandeh
Lars Liden
Jianfeng Gao
41
125
0
11 May 2020
Commonsense Evidence Generation and Injection in Reading Comprehension
Ye Liu
Tao Yang
Zeyu You
Wei Fan
Philip S. Yu
38
14
0
11 May 2020
schuBERT: Optimizing Elements of BERT
A. Khetan
Zohar Karnin
36
30
0
09 May 2020
Cyberbullying Detection with Fairness Constraints
O. Gencoglu
29
48
0
09 May 2020
Temporal Common Sense Acquisition with Minimal Supervision
Ben Zhou
Qiang Ning
Daniel Khashabi
Dan Roth
21
92
0
08 May 2020
Evidence Inference 2.0: More Data, Better Models
Jay DeYoung
Eric P. Lehman
Benjamin E. Nye
Iain J. Marshall
Byron C. Wallace
62
68
0
08 May 2020
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
Marco Tulio Ribeiro
Tongshuang Wu
Carlos Guestrin
Sameer Singh
ELM
52
1,085
0
08 May 2020
SentiBERT: A Transferable Transformer-Based Architecture for Compositional Sentiment Semantics
Da Yin
Tao Meng
Kai-Wei Chang
21
138
0
08 May 2020
Detecting East Asian Prejudice on Social Media
Bertie Vidgen
Austin Botelho
David A. Broniatowski
E. Guest
Matthew Hall
Helen Z. Margetts
Rebekah Tromble
Zeerak Talat
Scott A. Hale
19
97
0
08 May 2020
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference
Ali Hadi Zadeh
Isak Edo
Omar Mohamed Awad
Andreas Moshovos
MQ
35
185
0
08 May 2020
Blind Backdoors in Deep Learning Models
Eugene Bagdasaryan
Vitaly Shmatikov
AAML
FedML
SILM
51
298
0
08 May 2020
SUPERT: Towards New Frontiers in Unsupervised Evaluation Metrics for Multi-Document Summarization
Yang Gao
Wei Zhao
Steffen Eger
ELM
32
125
0
07 May 2020
Moving Down the Long Tail of Word Sense Disambiguation with Gloss-Informed Biencoders
Terra Blevins
Luke Zettlemoyer
40
164
0
06 May 2020
The Cascade Transformer: an Application for Efficient Answer Sentence Selection
Luca Soldaini
Alessandro Moschitti
37
44
0
05 May 2020
Multi-Stage Conversational Passage Retrieval: An Approach to Fusing Term Importance Estimation and Neural Query Rewriting
Sheng-Chieh Lin
Jheng-Hong Yang
Rodrigo Nogueira
Ming-Feng Tsai
Chuan-Ju Wang
Jimmy J. Lin
34
24
0
05 May 2020
To Test Machine Comprehension, Start by Defining Comprehension
Jesse Dunietz
Greg Burnham
Akash Bharadwaj
Owen Rambow
Jennifer Chu-Carroll
D. Ferrucci
FaML
56
65
0
04 May 2020
The Sensitivity of Language Models and Humans to Winograd Schema Perturbations
Mostafa Abdou
Vinit Ravishankar
Maria Barrett
Yonatan Belinkov
Desmond Elliott
Anders Søgaard
ReLM
LRM
62
34
0
04 May 2020
From SPMRL to NMRL: What Did We Learn (and Unlearn) in a Decade of Parsing Morphologically-Rich Languages (MRLs)?
Reut Tsarfaty
Dan Bareket
Stav Klein
Amit Seker
33
39
0
04 May 2020
Knowledge Graph-Augmented Abstractive Summarization with Semantic-Driven Cloze Reward
Luyang Huang
Lingfei Wu
Lu Wang
RALM
55
161
0
03 May 2020
How Can We Accelerate Progress Towards Human-like Linguistic Generalization?
Tal Linzen
220
191
0
03 May 2020
Understanding and Improving Information Transfer in Multi-Task Learning
Sen Wu
Hongyang R. Zhang
Christopher Ré
18
155
0
02 May 2020
Improving Truthfulness of Headline Generation
Kazuki Matsumaru
Sho Takase
Naoaki Okazaki
HILM
16
49
0
02 May 2020
IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization
Wenxuan Zhou
Bill Yuchen Lin
Xiang Ren
44
25
0
02 May 2020
ForecastQA: A Question Answering Challenge for Event Forecasting with Temporal Text Data
Woojeong Jin
Rahul Khanna
Suji Kim
Dong-Ho Lee
Fred Morstatter
Aram Galstyan
Xiang Ren
AI4TS
19
37
0
02 May 2020
RICA: Evaluating Robust Inference Capabilities Based on Commonsense Axioms
Pei Zhou
Rahul Khanna
Seyeon Lee
Bill Yuchen Lin
Daniel E. Ho
Jay Pujara
Xiang Ren
ReLM
30
37
0
02 May 2020
ProtoQA: A Question Answering Dataset for Prototypical Common-Sense Reasoning
Michael Boratko
Xiang Lorraine Li
Rajarshi Das
Timothy J. O'Gorman
Daniel Le
Andrew McCallum
49
56
0
02 May 2020
BERT-kNN: Adding a kNN Search Component to Pretrained Language Models for Better QA
Nora Kassner
Hinrich Schütze
RALM
35
68
0
02 May 2020
UnifiedQA: Crossing Format Boundaries With a Single QA System
Daniel Khashabi
Sewon Min
Tushar Khot
Ashish Sabharwal
Oyvind Tafjord
Peter Clark
Hannaneh Hajishirzi
75
725
0
02 May 2020
DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering
Qingqing Cao
H. Trivedi
A. Balasubramanian
Niranjan Balasubramanian
34
66
0
02 May 2020
Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering
Peifeng Wang
Nanyun Peng
Filip Ilievski
Pedro A. Szekely
Xiang Ren
19
91
0
02 May 2020
Scalable Multi-Hop Relational Reasoning for Knowledge-Aware Question Answering
Yanlin Feng
Xinyue Chen
Bill Yuchen Lin
Peifeng Wang
Jun Yan
Xiang Ren
LRM
KELM
24
239
0
01 May 2020
From Zero to Hero: On the Limitations of Zero-Shot Cross-Lingual Transfer with Multilingual Transformers
Anne Lauscher
Vinit Ravishankar
Ivan Vulić
Goran Glavaš
40
56
0
01 May 2020
Probing Contextual Language Models for Common Ground with Visual Representations
Gabriel Ilharco
Rowan Zellers
Ali Farhadi
Hannaneh Hajishirzi
30
14
0
01 May 2020
POINTER: Constrained Progressive Text Generation via Insertion-based Generative Pre-training
Yizhe Zhang
Guoyin Wang
Chunyuan Li
Zhe Gan
Chris Brockett
Bill Dolan
39
30
0
01 May 2020
Previous
1
2
3
...
180
181
182
...
184
185
186
Next