1906.08237
Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 1,486 papers shown
Multilingual Evidence Retrieval and Fact Verification to Combat Global Disinformation: The Power of Polyglotism
Denisa A.O. Roberts
48
3
0
16 Dec 2020
DialogXL: All-in-One XLNet for Multi-Party Conversation Emotion Recognition
Weizhou Shen
Junqing Chen
Xiaojun Quan
Zhixiang Xie
22
201
0
16 Dec 2020
*-CFQ: Analyzing the Scalability of Machine Learning on a Compositional Task
Dmitry Tsarkov
Tibor Tihon
Nathan Scales
Nikola Momchev
Danila Sinopalnikov
Nathanael Schärli
18
17
0
15 Dec 2020
Parameter-Efficient Transfer Learning with Diff Pruning
Demi Guo
Alexander M. Rush
Yoon Kim
13
386
0
14 Dec 2020
LRC-BERT: Latent-representation Contrastive Knowledge Distillation for Natural Language Understanding
Hao Fu
Shaojun Zhou
Qihong Yang
Junjie Tang
Guiquan Liu
Kaikui Liu
Xiaolong Li
54
58
0
14 Dec 2020
Yelp Review Rating Prediction: Machine Learning and Deep Learning Models
Zefang Liu
VLM
14
15
0
12 Dec 2020
Morphology Matters: A Multilingual Language Modeling Analysis
Hyunji Hayley Park
Katherine J. Zhang
Coleman Haley
K. Steimel
Han Liu
Lane Schwartz
53
47
0
11 Dec 2020
Reinforced Multi-Teacher Selection for Knowledge Distillation
Fei Yuan
Linjun Shou
J. Pei
Wutao Lin
Ming Gong
Yan Fu
Daxin Jiang
15
121
0
11 Dec 2020
Towards Neural Programming Interfaces
Zachary Brown
Nathaniel R. Robinson
David Wingate
Nancy Fulda
AI4CE
20
5
0
10 Dec 2020
Label Confusion Learning to Enhance Text Classification Models
Biyang Guo
Songqiao Han
Xiao Han
Hailiang Huang
Ting Lu
63
68
0
09 Dec 2020
Learning to Represent Programs with Heterogeneous Graphs
Kechi Zhang
Wenhan Wang
Huangzhao Zhang
Ge Li
Zhi Jin
GNN
21
63
0
08 Dec 2020
Cross-Layer Distillation with Semantic Calibration
Defang Chen
Jian-Ping Mei
Yuan Zhang
Can Wang
Yan Feng
Chun-Yen Chen
FedML
45
288
0
06 Dec 2020
Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation
Ruibo Liu
Guangxuan Xu
Chenyan Jia
Weicheng Ma
Lili Wang
Soroush Vosoughi
23
107
0
05 Dec 2020
Progressively Stacking 2.0: A Multi-stage Layerwise Training Method for BERT Training Speedup
Cheng Yang
Shengnan Wang
Chao Yang
Yuechuan Li
Ru He
Jingqiao Zhang
32
25
0
27 Nov 2020
Improving Clinical Outcome Predictions Using Convolution over Medical Entities with Multimodal Learning
Batuhan Bardak
Mehmet Tan
31
35
0
24 Nov 2020
GLGE: A New General Language Generation Evaluation Benchmark
Dayiheng Liu
Yu Yan
Yeyun Gong
Weizhen Qi
Hang Zhang
...
Jiancheng Lv
Ruofei Zhang
Winnie Wu
Ming Zhou
Nan Duan
ELM
40
66
0
24 Nov 2020
ActBERT: Learning Global-Local Video-Text Representations
Linchao Zhu
Yi Yang
ViT
49
417
0
14 Nov 2020
Discriminative, Generative and Self-Supervised Approaches for Target-Agnostic Learning
Yuan Jin
Wray Buntine
F. Petitjean
Geoffrey I. Webb
SSL
25
1
0
12 Nov 2020
Bi-tuning of Pre-trained Representations
Jincheng Zhong
Ximei Wang
Zhi Kou
Jianmin Wang
Mingsheng Long
21
21
0
12 Nov 2020
Multi-document Summarization via Deep Learning Techniques: A Survey
Congbo Ma
W. Zhang
Mingyu Guo
Hu Wang
Quan Z. Sheng
13
126
0
10 Nov 2020
Towards Domain-Agnostic Contrastive Learning
Vikas Verma
Minh-Thang Luong
Kenji Kawaguchi
Hieu H. Pham
Quoc V. Le
SSL
15
116
0
09 Nov 2020
Positional Artefacts Propagate Through Masked Language Model Embeddings
Ziyang Luo
Artur Kulmizev
Xiaoxi Mao
29
41
0
09 Nov 2020
Know What You Don't Need: Single-Shot Meta-Pruning for Attention Heads
Zhengyan Zhang
Fanchao Qi
Zhiyuan Liu
Qun Liu
Maosong Sun
VLM
46
30
0
07 Nov 2020
Underspecification Presents Challenges for Credibility in Modern Machine Learning
Alexander D'Amour
Katherine A. Heller
D. Moldovan
Ben Adlam
B. Alipanahi
...
Kellie Webster
Steve Yadlowsky
T. Yun
Xiaohua Zhai
D. Sculley
OffRL
77
671
0
06 Nov 2020
EXAMS: A Multi-Subject High School Examinations Dataset for Cross-Lingual and Multilingual Question Answering
Momchil Hardalov
Todor Mihaylov
Dimitrina Zlatkova
Yoan Dinkov
Ivan Koychev
Preslav Nakov
AI4Ed
ELM
41
50
0
05 Nov 2020
Indic-Transformers: An Analysis of Transformer Language Models for Indian Languages
Kushal Kumar Jain
Adwait Deshpande
Kumar Shridhar
F. Laumann
Ayushman Dash
51
51
0
04 Nov 2020
Rearrangement: A Challenge for Embodied AI
Dhruv Batra
Angel X. Chang
Sonia Chernova
Andrew J. Davison
Jia Deng
...
Jitendra Malik
Igor Mordatch
Roozbeh Mottaghi
Manolis Savva
Hao Su
LM&Ro
38
217
0
03 Nov 2020
CharBERT: Character-aware Pre-trained Language Model
Wentao Ma
Yiming Cui
Chenglei Si
Ting Liu
Shijin Wang
Guoping Hu
31
104
0
03 Nov 2020
On the Sentence Embeddings from Pre-trained Language Models
Bohan Li
Hao Zhou
Junxian He
Mingxuan Wang
Yiming Yang
Lei Li
30
213
0
02 Nov 2020
MixKD: Towards Efficient Distillation of Large-scale Language Models
Kevin J Liang
Weituo Hao
Dinghan Shen
Yufan Zhou
Weizhu Chen
Changyou Chen
Lawrence Carin
19
73
0
01 Nov 2020
Bridging Text and Knowledge with Multi-Prototype Embedding for Few-Shot Relational Triple Extraction
Haiyang Yu
Ningyu Zhang
Shumin Deng
Hongbin Ye
Wei Zhang
Huajun Chen
8
49
0
30 Oct 2020
A Comprehensive Survey on Word Representation Models: From Classical to State-Of-The-Art Word Representation Language Models
Usman Naseem
Imran Razzak
S. Khan
M. Prasad
14
156
0
28 Oct 2020
WNUT-2020 Task 1 Overview: Extracting Entities and Relations from Wet Lab Protocols
Jeniya Tabassum
Sydney Lee
Wei Xu
Alan Ritter
18
18
0
27 Oct 2020
Strongly Incremental Constituency Parsing with Graph Neural Networks
Kaiyu Yang
Jia Deng
27
35
0
27 Oct 2020
Interpretation of NLP models through input marginalization
Siwon Kim
Jihun Yi
Eunji Kim
Sungroh Yoon
MILM
FAtt
30
58
0
27 Oct 2020
Large Scale Legal Text Classification Using Transformer Models
Zein Shaheen
G. Wohlgenannt
Erwin Filtz
AILaw
35
67
0
24 Oct 2020
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models
Xian Li
Changhan Wang
Yun Tang
C. Tran
Yuqing Tang
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
21
6
0
24 Oct 2020
ANLIzing the Adversarial Natural Language Inference Dataset
Adina Williams
Tristan Thrush
Douwe Kiela
AAML
183
46
0
24 Oct 2020
Dynamic Contextualized Word Embeddings
Valentin Hofmann
J. Pierrehumbert
Hinrich Schütze
41
51
0
23 Oct 2020
Improving Robustness by Augmenting Training Sentences with Predicate-Argument Structures
N. Moosavi
M. Boer
Prasetya Ajie Utama
Iryna Gurevych
27
13
0
23 Oct 2020
TweetEval: Unified Benchmark and Comparative Evaluation for Tweet Classification
Francesco Barbieri
Jose Camacho-Collados
Leonardo Neves
Luis Espinosa-Anke
VLM
21
700
0
23 Oct 2020
Unsupervised Cross-lingual Adaptation for Sequence Tagging and Beyond
Xin Li
Lidong Bing
Wenxuan Zhang
Zheng Li
Wai Lam
59
25
0
23 Oct 2020
KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi
Andre Niyongabo Rubungo
Hong Qu
Julia Kreutzer
Li Huang
29
38
0
23 Oct 2020
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
Dongling Xiao
Yukun Li
Han Zhang
Yu Sun
Hao Tian
Hua Wu
Haifeng Wang
27
38
0
23 Oct 2020
Language Models are Open Knowledge Graphs
Chenguang Wang
Xiao Liu
D. Song
SSL
KELM
26
135
0
22 Oct 2020
Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval
Akari Asai
Eunsol Choi
RALM
47
51
0
22 Oct 2020
Knowledge Distillation for BERT Unsupervised Domain Adaptation
Minho Ryu
K. Lee
18
35
0
22 Oct 2020
Neural Networks for Entity Matching: A Survey
Nils Barlaug
J. Gulla
47
91
0
21 Oct 2020
Complaint Identification in Social Media with Transformer Networks
Mali Jin
Nikolaos Aletras
14
16
0
21 Oct 2020
Transition-based Parsing with Stack-Transformers
Ramón Fernández Astudillo
Miguel Ballesteros
Tahira Naseem
Austin Blodgett
Radu Florian
53
71
0
20 Oct 2020