Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
v1
v2 (latest)
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,518 papers shown
Title
DialogID: A Dialogic Instruction Dataset for Improving Teaching Effectiveness in Online Environments
Jiahao Chen
Shuyan Huang
Zitao Liu
Weiqing Luo
35
2
0
24 Jun 2022
On the Importance and Applicability of Pre-Training for Federated Learning
Hong-You Chen
Cheng-Hao Tu
Zi-hua Li
Hang Shen
Wei-Lun Chao
FedML
119
83
0
23 Jun 2022
LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs
Yukang Chen
Jianhui Liu
Xinming Zhang
Xiaojuan Qi
Jiaya Jia
124
90
0
21 Jun 2022
An Automatic and Efficient BERT Pruning for Edge AI Systems
Shaoyi Huang
Ning Liu
Yueying Liang
Hongwu Peng
Hongjia Li
Dongkuan Xu
Mimi Xie
Caiwen Ding
124
22
0
21 Jun 2022
Winning the CVPR'2022 AQTC Challenge: A Two-stage Function-centric Approach
Shiwei Wu
Weidong He
Tong Xu
Hao Wang
Enhong Chen
EgoV
64
3
0
20 Jun 2022
VReBERT: A Simple and Flexible Transformer for Visual Relationship Detection
Yunbo Cui
M. Farazi
ViT
93
1
0
18 Jun 2022
Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian Optimization
Deokjae Lee
Seungyong Moon
Junhyeok Lee
Hyun Oh Song
AAML
68
39
0
17 Jun 2022
Predicting Hate Intensity of Twitter Conversation Threads
Qing Meng
Tharun Suresh
Roy Ka-wei Lee
Tanmoy Chakraborty
124
20
0
16 Jun 2022
TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models
A. Davody
David Ifeoluwa Adelani
Thomas Kleinbauer
Dietrich Klakow
75
4
0
15 Jun 2022
SP-ViT: Learning 2D Spatial Priors for Vision Transformers
Yuxuan Zhou
Wangmeng Xiang
Chong Li
Biao Wang
Xihan Wei
Lei Zhang
Margret Keuper
Xia Hua
ViT
71
15
0
15 Jun 2022
The SIGMORPHON 2022 Shared Task on Morpheme Segmentation
Khuyagbaatar Batsuren
Gábor Bella
Aryaman Arora
Viktor Martinović
Kyle Gorman
...
Magda vSevvcíková
Katevrina Pelegrinová
Fausto Giunchiglia
Ryan Cotterell
Ekaterina Vylomova
62
40
0
15 Jun 2022
KE-QI: A Knowledge Enhanced Article Quality Identification Dataset
Chunhui Ai
Derui Wang
Xuemi Yan
Yang Xu
Wenrui Xie
Ziqiang Cao
55
0
0
15 Jun 2022
LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Tuan Dinh
Yuchen Zeng
Ruisu Zhang
Ziqian Lin
Michael Gira
Shashank Rajput
Jy-yong Sohn
Dimitris Papailiopoulos
Kangwook Lee
LMTD
176
139
0
14 Jun 2022
Multimodal Learning with Transformers: A Survey
Peng Xu
Xiatian Zhu
David Clifton
ViT
238
578
0
13 Jun 2022
MetaTPTrans: A Meta Learning Approach for Multilingual Code Representation Learning
Weiguo Pian
Hanyu Peng
Xunzhu Tang
Tiezhu Sun
Haoye Tian
Andrew Habib
Jacques Klein
Tegawende F. Bissyande
56
12
0
13 Jun 2022
Virtual embeddings and self-consistency for self-supervised learning
T. Bdair
Hossam Abdelhamid
Nassir Navab
Shadi Albarqouni
SSL
43
0
0
13 Jun 2022
Improving Pre-trained Language Model Fine-tuning with Noise Stability Regularization
Hang Hua
Xingjian Li
Dejing Dou
Chengzhong Xu
Jiebo Luo
94
15
0
12 Jun 2022
Bridging the Gap Between Training and Inference of Bayesian Controllable Language Models
Han Liu
Bingning Wang
Ting Yao
Haijin Liang
Jianjin Xu
Xiaolin Hu
BDL
67
1
0
11 Jun 2022
StructCoder: Structure-Aware Transformer for Code Generation
Sindhu Tipirneni
Ming Zhu
Chandan K. Reddy
109
60
0
10 Jun 2022
VN-Transformer: Rotation-Equivariant Attention for Vector Neurons
Serge Assaad
Carlton Downey
Rami Al-Rfou
Nigamaa Nayakanti
Benjamin Sapp
71
20
0
08 Jun 2022
STable: Table Generation Framework for Encoder-Decoder Models
Michal Pietruszka
M. Turski
Łukasz Borchmann
Tomasz Dwojak
Gabriela Pałka
Karolina Szyndler
Dawid Jurkiewicz
Lukasz Garncarek
LMTD
87
18
0
08 Jun 2022
Modularized Transfer Learning with Multiple Knowledge Graphs for Zero-shot Commonsense Reasoning
Yu Jin Kim
Beong-woo Kwak
Youngwook Kim
Reinald Kim Amplayo
Seung-won Hwang
Jinyoung Yeo
LRM
64
14
0
08 Jun 2022
Can CNNs Be More Robust Than Transformers?
Zeyu Wang
Yutong Bai
Yuyin Zhou
Cihang Xie
UQCV
OOD
115
46
0
07 Jun 2022
Always Keep your Target in Mind: Studying Semantics and Improving Performance of Neural Lexical Substitution
N. Arefyev
Boris Sheludko
Alexander Podolskiy
Alexander Panchenko
KELM
114
31
0
07 Jun 2022
Assessing Project-Level Fine-Tuning of ML4SE Models
Egor Bogomolov
Sergey Zhuravlev
Egor Spirin
T. Bryksin
44
7
0
07 Jun 2022
Fooling Explanations in Text Classifiers
Adam Ivankay
Ivan Girardi
Chiara Marchiori
P. Frossard
AAML
85
20
0
07 Jun 2022
Signal Propagation in Transformers: Theoretical Perspectives and the Role of Rank Collapse
Lorenzo Noci
Sotiris Anagnostidis
Luca Biggio
Antonio Orvieto
Sidak Pal Singh
Aurelien Lucchi
108
75
0
07 Jun 2022
An Empirical Study of IoT Security Aspects at Sentence-Level in Developer Textual Discussions
Nibir Mandal
Gias Uddin
36
10
0
07 Jun 2022
Recent Advances for Quantum Neural Networks in Generative Learning
Jinkai Tian
Xiaoyun Sun
Yuxuan Du
Shanshan Zhao
Qing Liu
...
Xingyao Wu
Min-hsiu Hsieh
Tongliang Liu
Wen-Bin Yang
Dacheng Tao
AI4CE
98
85
0
07 Jun 2022
Differentially Private Model Compression
Fatemehsadat Mireshghallah
A. Backurs
Huseyin A. Inan
Lukas Wutschitz
Janardhan Kulkarni
SyDa
55
14
0
03 Jun 2022
Task-Adaptive Pre-Training for Boosting Learning With Noisy Labels: A Study on Text Classification for African Languages
D. Zhu
Michael A. Hedderich
Fangzhou Zhai
David Ifeoluwa Adelani
Dietrich Klakow
NoLa
51
1
0
03 Jun 2022
Towards Improving the Generation Quality of Autoregressive Slot VAEs
Patrick Emami
Pan He
Sanjay Ranka
Anand Rangarajan
OCL
75
1
0
03 Jun 2022
A comparative study between vision transformers and CNNs in digital pathology
Luca Deininger
Bernhard Stimpel
Anil Yüce
Samaneh Abbasi-Sureshjani
Simon Schönenberger
P. Ocampo
Konstanty Korski
F. Gaire
ViT
MedIm
47
30
0
01 Jun 2022
Transformer with Fourier Integral Attentions
T. Nguyen
Minh Pham
Tam Nguyen
Khai Nguyen
Stanley J. Osher
Nhat Ho
68
4
0
01 Jun 2022
On the Usefulness of Embeddings, Clusters and Strings for Text Generator Evaluation
Tiago Pimentel
Clara Meister
Ryan Cotterell
128
7
0
31 May 2022
Multilingual Transformers for Product Matching -- Experiments and a New Benchmark in Polish
Michal Mo.zd.zonek
Anna Wróblewska
Sergiy Tkachuk
S. Lukasik
68
0
0
31 May 2022
Label-Enhanced Graph Neural Network for Semi-supervised Node Classification
Le Yu
Leilei Sun
Bowen Du
T. Zhu
Weifeng Lv
80
16
0
31 May 2022
E2S2: Encoding-Enhanced Sequence-to-Sequence Pretraining for Language Understanding and Generation
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
114
27
0
30 May 2022
SFE-AI at SemEval-2022 Task 11: Low-Resource Named Entity Recognition using Large Pre-trained Language Models
Changyu Hou
Jun Wang
Yixuan Qiao
Pengxiang Jiang
Peng Gao
...
Qizhi Lin
Xiaopeng Wang
Xiandi Jiang
Benqi Wang
Qifeng Xiao
24
2
0
29 May 2022
Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers
R. Liu
Young Jin Kim
Alexandre Muzio
Hany Awadalla
MoE
81
22
0
28 May 2022
AANG: Automating Auxiliary Learning
Lucio Dery
Paul Michel
M. Khodak
Graham Neubig
Ameet Talwalkar
114
9
0
27 May 2022
A Survey on Long-Tailed Visual Recognition
Lu Yang
He Jiang
Q. Song
Jun Guo
93
135
0
27 May 2022
Training and Inference on Any-Order Autoregressive Models the Right Way
Andy Shih
Dorsa Sadigh
Stefano Ermon
BDL
TPM
OOD
CML
98
28
0
26 May 2022
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Shoufa Chen
Chongjian Ge
Zhan Tong
Jiangliu Wang
Yibing Song
Jue Wang
Ping Luo
249
703
0
26 May 2022
Learning to Reconstruct Missing Data from Spatiotemporal Graphs with Sparse Observations
Ivan Marisca
Andrea Cini
Cesare Alippi
AI4TS
90
68
0
26 May 2022
The Document Vectors Using Cosine Similarity Revisited
Bingyu Zhang
N. Arefyev
53
10
0
26 May 2022
Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling
Kaitao Song
Yichong Leng
Xu Tan
Yicheng Zou
Tao Qin
Dongsheng Li
107
11
0
25 May 2022
Inception Transformer
Chenyang Si
Weihao Yu
Pan Zhou
Yichen Zhou
Xinchao Wang
Shuicheng Yan
ViT
120
200
0
25 May 2022
LOPS: Learning Order Inspired Pseudo-Label Selection for Weakly Supervised Text Classification
Dheeraj Mekala
Chengyu Dong
Jingbo Shang
61
20
0
25 May 2022
Conditional set generation using Seq2seq models
Aman Madaan
Dheeraj Rajagopal
Niket Tandon
Yiming Yang
Antoine Bosselut
73
10
0
25 May 2022
Previous
1
2
3
...
27
28
29
...
69
70
71
Next