Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
v1
v2 (latest)
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,520 papers shown
Title
Large Scale Legal Text Classification Using Transformer Models
Zein Shaheen
G. Wohlgenannt
Erwin Filtz
AILaw
80
72
0
24 Oct 2020
ReadOnce Transformers: Reusable Representations of Text for Transformers
Shih-Ting Lin
Ashish Sabharwal
Tushar Khot
117
3
0
24 Oct 2020
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models
Xian Li
Changhan Wang
Yun Tang
C. Tran
Yuqing Tang
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
62
6
0
24 Oct 2020
Open-Domain Dialogue Generation Based on Pre-trained Language Models
Yan Zeng
J. Nie
31
3
0
24 Oct 2020
ANLIzing the Adversarial Natural Language Inference Dataset
Adina Williams
Tristan Thrush
Douwe Kiela
AAML
255
47
0
24 Oct 2020
Dynamic Contextualized Word Embeddings
Valentin Hofmann
J. Pierrehumbert
Hinrich Schütze
116
52
0
23 Oct 2020
Robust Document Representations using Latent Topics and Metadata
Natraj Raman
Armineh Nourbakhsh
Sameena Shah
Manuela Veloso
26
0
0
23 Oct 2020
Improving Robustness by Augmenting Training Sentences with Predicate-Argument Structures
N. Moosavi
M. Boer
Prasetya Ajie Utama
Iryna Gurevych
82
13
0
23 Oct 2020
TweetEval: Unified Benchmark and Comparative Evaluation for Tweet Classification
Francesco Barbieri
Jose Camacho-Collados
Leonardo Neves
Luis Espinosa-Anke
VLM
97
732
0
23 Oct 2020
Unsupervised Cross-lingual Adaptation for Sequence Tagging and Beyond
Xin Li
Lidong Bing
Wenxuan Zhang
Zheng Li
Wai Lam
125
25
0
23 Oct 2020
Pre-training Graph Transformer with Multimodal Side Information for Recommendation
Yong Liu
Susen Yang
Chenyi Lei
Guoxin Wang
Haihong Tang
Juyong Zhang
Aixin Sun
Chunyan Miao
29
4
0
23 Oct 2020
Generating Long Financial Report using Conditional Variational Autoencoders with Knowledge Distillation
Yunpeng Ren
Ziao Wang
Yiyuan Wang
Xiaofeng Zhang
60
9
0
23 Oct 2020
KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi
Andre Niyongabo Rubungo
Hong Qu
Julia Kreutzer
Li Huang
65
42
0
23 Oct 2020
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
Dongling Xiao
Yukun Li
Han Zhang
Yu Sun
Hao Tian
Hua Wu
Haifeng Wang
29
39
0
23 Oct 2020
Language Models are Open Knowledge Graphs
Chenguang Wang
Xiao Liu
Basel Alomair
SSL
KELM
81
137
0
22 Oct 2020
Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval
Akari Asai
Eunsol Choi
RALM
115
54
0
22 Oct 2020
Rewriting Meaningful Sentences via Conditional BERT Sampling and an application on fooling text classifiers
Lei Xu
Ivan Ramirez
K. Veeramachaneni
AAML
32
2
0
22 Oct 2020
ConVEx: Data-Efficient and Few-Shot Slot Labeling
Matthew Henderson
Ivan Vulić
CLIP
VLM
87
38
0
22 Oct 2020
Knowledge Distillation for BERT Unsupervised Domain Adaptation
Minho Ryu
K. Lee
105
35
0
22 Oct 2020
Latte-Mix: Measuring Sentence Semantic Similarity with Latent Categorical Mixtures
Minghan Li
He Bai
Luchen Tan
Kun Xiong
Ming Li
Jimmy J. Lin
FedML
41
0
0
21 Oct 2020
Neural Networks for Entity Matching: A Survey
Nils Barlaug
J. Gulla
143
96
0
21 Oct 2020
Complaint Identification in Social Media with Transformer Networks
Mali Jin
Nikolaos Aletras
47
16
0
21 Oct 2020
Transition-based Parsing with Stack-Transformers
Ramón Fernández Astudillo
Miguel Ballesteros
Tahira Naseem
Austin Blodgett
Radu Florian
138
71
0
20 Oct 2020
An Empirical Investigation of Contextualized Number Prediction
Daniel M. Spokoyny
Taylor Berg-Kirkpatrick
AI4TS
83
38
0
20 Oct 2020
AutoMeTS: The Autocomplete for Medical Text Simplification
Hoang Van
David Kauchak
Gondy Leroy
79
31
0
20 Oct 2020
Better Highlighting: Creating Sub-Sentence Summary Highlights
Sangwoo Cho
Kaiqiang Song
Chen Li
Dong Yu
H. Foroosh
Fei Liu
87
12
0
20 Oct 2020
BERT2DNN: BERT Distillation with Massive Unlabeled Data for Online E-Commerce Search
Yunjiang Jiang
Yue Shang
Ziyang Liu
Hongwei Shen
Yun Xiao
Wei Xiong
Sulong Xu
Weipeng P. Yan
Di Jin
64
17
0
20 Oct 2020
Bi-directional Cognitive Thinking Network for Machine Reading Comprehension
Wei Peng
Yue Hu
Luxi Xing
Yuqiang Xie
Jing Yu
Yajing Sun
Xiangpeng Wei
64
7
0
20 Oct 2020
Local Knowledge Powered Conversational Agents
Sashank Santhanam
Ming-Yu Liu
Raul Puri
Mohammad Shoeybi
M. Patwary
Bryan Catanzaro
95
4
0
20 Oct 2020
Technical Question Answering across Tasks and Domains
Wenhao Yu
Lingfei Wu
Yu Deng
Qingkai Zeng
R. Mahindru
S. Guven
Meng Jiang
60
8
0
19 Oct 2020
Effects of Parameter Norm Growth During Transformer Training: Inductive Bias from Gradient Descent
William Merrill
Vivek Ramanujan
Yoav Goldberg
Roy Schwartz
Noah A. Smith
AI4CE
80
36
0
19 Oct 2020
An Empirical Study for Vietnamese Constituency Parsing with Pre-training
Tuan-Vi Tran
Xuan-Thien Pham
Duc-Vu Nguyen
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
46
4
0
19 Oct 2020
Cold-start Active Learning through Self-supervised Language Modeling
Michelle Yuan
Hsuan-Tien Lin
Jordan L. Boyd-Graber
209
185
0
19 Oct 2020
Heads-up! Unsupervised Constituency Parsing via Self-Attention Heads
Bowen Li
Taeuk Kim
Reinald Kim Amplayo
Frank Keller
SSL
101
17
0
19 Oct 2020
Towards Interpreting BERT for Reading Comprehension Based QA
Sahana Ramnath
Preksha Nema
Deep Sahni
Mitesh M. Khapra
94
30
0
18 Oct 2020
Federated Unsupervised Representation Learning
Fengda Zhang
Kun Kuang
Zhaoyang You
Tao Shen
Jun Xiao
Yin Zhang
Chao-Xiang Wu
Yueting Zhuang
Xiaolin Li
FedML
89
137
0
18 Oct 2020
Towards Data Distillation for End-to-end Spoken Conversational Question Answering
Chenyu You
Nuo Chen
Fenglin Liu
Dongchao Yang
Yuexian Zou
77
48
0
18 Oct 2020
Knowledge-Grounded Dialogue Generation with Pre-trained Language Models
Xueliang Zhao
Wei Wu
Can Xu
Chongyang Tao
Dongyan Zhao
Rui Yan
260
193
0
17 Oct 2020
Consistency and Coherency Enhanced Story Generation
Wei Wang
Piji Li
Haitao Zheng
71
11
0
17 Oct 2020
Cross-Lingual Relation Extraction with Transformers
Jian Ni
Taesun Moon
Parul Awasthy
Radu Florian
ViT
37
6
0
16 Oct 2020
Mischief: A Simple Black-Box Attack Against Transformer Architectures
Adrian de Wynter
AAML
74
1
0
16 Oct 2020
Delaying Interaction Layers in Transformer-based Encoders for Efficient Open Domain Question Answering
W. Siblini
Mohamed Challal
Charlotte Pasqual
59
3
0
16 Oct 2020
Automatic Feasibility Study via Data Quality Analysis for ML: A Case-Study on Label Noise
Cédric Renggli
Luka Rimanic
Luka Kolar
Wentao Wu
Ce Zhang
83
3
0
16 Oct 2020
WNUT-2020 Task 2: Identification of Informative COVID-19 English Tweets
Dat Quoc Nguyen
Thanh Tien Vu
A. Rahimi
M. Dao
L. T. Nguyen
Long Doan
65
74
0
16 Oct 2020
Coarse-to-Fine Pre-training for Named Entity Recognition
Mengge Xue
Yu Bowen
Zhenyu Zhang
Tingwen Liu
Yue Zhang
Bin Wang
63
53
0
16 Oct 2020
FPRaker: A Processing Element For Accelerating Neural Network Training
Omar Mohamed Awad
Mostafa Mahmoud
Isak Edo Vivancos
Ali Hadi Zadeh
Ciaran Bannon
Anand Jayarajan
Gennady Pekhimenko
Andreas Moshovos
89
15
0
15 Oct 2020
NUIG-Shubhanker@Dravidian-CodeMix-FIRE2020: Sentiment Analysis of Code-Mixed Dravidian text using XLNet
Shubhanker Banerjee
A. Jayapal
Sajeetha Thavareesan
32
16
0
15 Oct 2020
Improving Constituency Parsing with Span Attention
Yuanhe Tian
Yan Song
Fei Xia
Tong Zhang
78
45
0
15 Oct 2020
Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs
Ana Marasović
Chandra Bhagavatula
J. S. Park
Ronan Le Bras
Noah A. Smith
Yejin Choi
ReLM
LRM
99
62
0
15 Oct 2020
Neural Deepfake Detection with Factual Structure of Text
Wanjun Zhong
Duyu Tang
Zenan Xu
Ruize Wang
Nan Duan
M. Zhou
Jiahai Wang
Jian Yin
52
66
0
15 Oct 2020
Previous
1
2
3
...
53
54
55
...
69
70
71
Next