Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
v1
v2 (latest)
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,522 papers shown
Title
Head-driven Phrase Structure Parsing in O(
n
3
n^3
n
3
) Time Complexity
Zuchao Li
Junru Zhou
Hai Zhao
Kevin Parnow
51
0
0
20 May 2021
KLUE: Korean Language Understanding Evaluation
Sungjoon Park
Jihyung Moon
Sungdong Kim
Won Ik Cho
Jiyoon Han
...
Seonghyun Kim
Lucy Park
Alice Oh
Jung-Woo Ha
Kyunghyun Cho
ELM
VLM
123
198
0
20 May 2021
Retrieval-Augmented Transformer-XL for Close-Domain Dialog Generation
Giovanni Bonetta
R. Cancelliere
Ding Liu
Paul Vozila
RALM
28
17
0
19 May 2021
Explainable Tsetlin Machine framework for fake news detection with credibility score assessment
Bimal Bhattarai
Ole-Christoffer Granmo
Lei Jiao
61
37
0
19 May 2021
CoTexT: Multi-task Learning with Code-Text Transformer
Long Phan
H. Tran
Daniel Le
Hieu Duy Nguyen
J. Anibal
Alec Peltekian
Yanfang Ye
92
136
0
18 May 2021
Parallel Attention Network with Sequence Matching for Video Grounding
Hao Zhang
Aixin Sun
Wei Jing
Liangli Zhen
Qiufeng Wang
Rick Siow Mong Goh
109
41
0
18 May 2021
Link Prediction on N-ary Relational Facts: A Graph-based Approach
Quan Wang
Haifeng Wang
Yajuan Lyu
Yong Zhu
95
49
0
18 May 2021
Divide and Contrast: Self-supervised Learning from Uncurated Data
Yonglong Tian
Olivier J. Hénaff
Aaron van den Oord
SSL
138
101
0
17 May 2021
Pay Attention to MLPs
Hanxiao Liu
Zihang Dai
David R. So
Quoc V. Le
AI4CE
173
672
0
17 May 2021
SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising
K. Xuan
Yongbo Wang
Yongliang Wang
Zujie Wen
Yang Dong
VLM
76
54
0
17 May 2021
How is BERT surprised? Layerwise detection of linguistic anomalies
Bai Li
Zining Zhu
Guillaume Thomas
Yang Xu
Frank Rudzicz
76
31
0
16 May 2021
BERT Busters: Outlier Dimensions that Disrupt Transformers
Olga Kovaleva
Saurabh Kulshreshtha
Anna Rogers
Anna Rumshisky
126
93
0
14 May 2021
Designing Multimodal Datasets for NLP Challenges
James Pustejovsky
E. Holderness
Jingxuan Tu
Parker Glenn
Kyeongmin Rim
Kelley Lynch
R. Brutti
59
5
0
12 May 2021
Addressing "Documentation Debt" in Machine Learning Research: A Retrospective Datasheet for BookCorpus
Jack Bandy
Nicholas Vincent
69
57
0
11 May 2021
Assessing the Syntactic Capabilities of Transformer-based Multilingual Language Models
Laura Pérez-Mayos
Alba Táboas García
Simon Mille
Leo Wanner
ELM
LRM
51
8
0
10 May 2021
R2D2: Relational Text Decoding with Transformers
Aryan Arbabi
Mingqiu Wang
Laurent El Shafey
Nan Du
Izhak Shafran
39
1
0
10 May 2021
REPT: Bridging Language Models and Machine Reading Comprehension via Retrieval-Based Pre-training
Fangkai Jiao
Yangyang Guo
Yilin Niu
Feng Ji
Feng-Lin Li
Liqiang Nie
LRM
69
12
0
10 May 2021
DocSCAN: Unsupervised Text Classification via Learning from Neighbors
Dominik Stammbach
Elliott Ash
78
9
0
09 May 2021
Dispatcher: A Message-Passing Approach To Language Modelling
A. Cetoli
84
0
0
09 May 2021
Enhancing Transformers with Gradient Boosted Decision Trees for NLI Fine-Tuning
Benjamin Minixhofer
Milan Gritta
Ignacio Iacobacci
AI4CE
28
5
0
08 May 2021
Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text
Siyuan Wang
Wanjun Zhong
Duyu Tang
Zhongyu Wei
Zhihao Fan
Daxin Jiang
Ming Zhou
Nan Duan
NAI
137
73
0
08 May 2021
Empirical Evaluation of Pre-trained Transformers for Human-Level NLP: The Role of Sample Size and Dimensionality
Adithya Ganesan
Matthew Matero
Aravind Reddy Ravula
Huy-Hien Vu
H. Andrew Schwartz
90
35
0
07 May 2021
Are Pre-trained Convolutions Better than Pre-trained Transformers?
Yi Tay
Mostafa Dehghani
J. Gupta
Dara Bahri
V. Aribandi
Zhen Qin
Donald Metzler
AI4CE
84
49
0
07 May 2021
Graph-based Multilingual Product Retrieval in E-commerce Search
Hanqing Lu
You-Heng Hu
Tong Zhao
Tony Wu
Yiwei Song
Bing Yin
115
25
0
06 May 2021
GraphFormers: GNN-nested Transformers for Representation Learning on Textual Graph
Junhan Yang
Zheng Liu
Shitao Xiao
Chaozhuo Li
Defu Lian
Sanjay Agrawal
Amit Singh
Guangzhong Sun
Xing Xie
AI4CE
91
160
0
06 May 2021
Assessing Dialogue Systems with Distribution Distances
Jiannan Xiang
Yahui Liu
Deng Cai
Huayang Li
Defu Lian
Lemao Liu
99
18
0
06 May 2021
Towards General Natural Language Understanding with Probabilistic Worldbuilding
Abulhair Saparov
Tom Michael Mitchell
101
6
0
06 May 2021
Security Vulnerability Detection Using Deep Learning Natural Language Processing
Noah Ziems
Shaoen Wu
93
58
0
06 May 2021
Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks
Meng-Hao Guo
Zheng-Ning Liu
Tai-Jiang Mu
Shimin Hu
77
504
0
05 May 2021
PreSizE: Predicting Size in E-Commerce using Transformers
Yotam Eshel
Or Levi
Haggai Roitman
A. Nus
AI4TS
34
8
0
04 May 2021
ZEN 2.0: Continue Training and Adaption for N-gram Enhanced Text Encoders
Yan Song
Tong Zhang
Yonggang Wang
Kai-Fu Lee
97
45
0
04 May 2021
Textual Analysis of Communications in COVID-19 Infected Community on Social Media
Yuhan Liu
Yuhan Gao
Zhifan Nan
Long Chen
25
0
0
03 May 2021
Goldilocks: Just-Right Tuning of BERT for Technology-Assisted Review
Eugene Yang
Sean MacAvaney
D. Lewis
O. Frieder
126
29
0
03 May 2021
Billion-scale Pre-trained E-commerce Product Knowledge Graph Model
Wen Zhang
Chi-Man Wong
Ganqiang Ye
Bo Wen
Wei Zhang
Huajun Chen
63
24
0
02 May 2021
MathBERT: A Pre-Trained Model for Mathematical Formula Understanding
Shuai Peng
Ke Yuan
Liangcai Gao
Zhi Tang
AIMat
100
109
0
02 May 2021
When to Foldém: How to answer Unanswerable questions
Marshall Ho
Zhipeng Zhou
J. He
55
2
0
01 May 2021
Adversarial Example Detection for DNN Models: A Review and Experimental Comparison
Ahmed Aldahdooh
W. Hamidouche
Sid Ahmed Fezza
Olivier Déforges
AAML
239
128
0
01 May 2021
SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale Place Recognition
Zhaoxin Fan
Zhenbo Song
Hongyan Liu
Zhiwu Lu
Jun He
Xiaoyong Du
3DPC
ViT
160
77
0
01 May 2021
Improving Response Quality with Backward Reasoning in Open-domain Dialogue Systems
Ziming Li
Julia Kiseleva
Maarten de Rijke
78
12
0
30 Apr 2021
Mitigating Political Bias in Language Models Through Reinforced Calibration
Ruibo Liu
Chenyan Jia
Jason W. Wei
Guangxuan Xu
Lili Wang
Soroush Vosoughi
75
99
0
30 Apr 2021
Using Transformers to Provide Teachers with Personalized Feedback on their Classroom Discourse: The TalkMoves Application
Abhijit Suresh
Jennifer Jacobs
Vivian Lai
Chenhao Tan
Wayne H. Ward
James H. Martin
T. Sumner
56
30
0
29 Apr 2021
MAGMA: An Optimization Framework for Mapping Multiple DNNs on Multiple Accelerator Cores
Sheng-Chun Kao
T. Krishna
123
52
0
28 Apr 2021
Inpainting Transformer for Anomaly Detection
Jonathan Pirnay
K. Chai
ViT
211
169
0
28 Apr 2021
PanGu-
α
α
α
: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation
Wei Zeng
Xiaozhe Ren
Teng Su
Hui Wang
Yi-Lun Liao
...
Gaojun Fan
Yaowei Wang
Xuefeng Jin
Qun Liu
Yonghong Tian
ALM
MoE
AI4CE
80
215
0
26 Apr 2021
Diverse Image Inpainting with Bidirectional and Autoregressive Transformers
Yingchen Yu
Fangneng Zhan
Rongliang Wu
Jianxiong Pan
Kaiwen Cui
Shijian Lu
Feiying Ma
Xuansong Xie
Chunyan Miao
ViT
124
152
0
26 Apr 2021
A Comprehensive Attempt to Research Statement Generation
Wenhao Wu
Sujian Li
39
0
0
25 Apr 2021
baller2vec++: A Look-Ahead Multi-Entity Transformer For Modeling Coordinated Agents
Michael A. Alcorn
A. Nguyen
68
15
0
24 Apr 2021
Extract then Distill: Efficient and Effective Task-Agnostic BERT Distillation
Cheng Chen
Yichun Yin
Lifeng Shang
Zhi Wang
Xin Jiang
Xiao Chen
Qun Liu
FedML
81
7
0
24 Apr 2021
Literature review on vulnerability detection using NLP technology
Jiajie Wu
150
15
0
23 Apr 2021
Framing Unpacked: A Semi-Supervised Interpretable Multi-View Model of Media Frames
Shima Khanehzar
Trevor Cohn
Gosia Mikołajczak
A. Turpin
Lea Frermann
65
11
0
22 Apr 2021
Previous
1
2
3
...
44
45
46
...
69
70
71
Next