Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 1,487 papers shown
Title
Backdoor Attacks on Pre-trained Models by Layerwise Weight Poisoning
Linyang Li
Demin Song
Xiaonan Li
Jiehang Zeng
Ruotian Ma
Xipeng Qiu
33
135
0
31 Aug 2021
Automated Mining of Leaderboards for Empirical AI Research
Salomon Kabongo KABENAMUALU
Jennifer D'Souza
Sören Auer
46
29
0
31 Aug 2021
Improving Multimodal fusion via Mutual Dependency Maximisation
Pierre Colombo
E. Chapuis
Matthieu Labeau
Chloé Clavel
15
30
0
31 Aug 2021
How Does Adversarial Fine-Tuning Benefit BERT?
J. Ebrahimi
Hao Yang
Wei Zhang
AAML
28
4
0
31 Aug 2021
N24News: A New Dataset for Multimodal News Classification
Zhen Wang
Xu Shan
Xiangxie Zhang
Jie Yang
VLM
31
33
0
30 Aug 2021
Are Training Resources Insufficient? Predict First Then Explain!
Myeongjun Jang
Thomas Lukasiewicz
LRM
26
7
0
29 Aug 2021
NoiER: An Approach for Training more Reliable Fine-TunedDownstream Task Models
Myeongjun Jang
Thomas Lukasiewicz
24
4
0
29 Aug 2021
Evaluating the Robustness of Neural Language Models to Input Perturbations
M. Moradi
Matthias Samwald
AAML
55
96
0
27 Aug 2021
EmoBERTa: Speaker-Aware Emotion Recognition in Conversation with RoBERTa
Taewoon Kim
Piek Vossen
38
98
0
26 Aug 2021
Auxiliary Task Update Decomposition: The Good, The Bad and The Neutral
Lucio Dery
Yann N. Dauphin
David Grangier
MoMe
26
29
0
25 Aug 2021
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
Zirui Wang
Jiahui Yu
Adams Wei Yu
Zihang Dai
Yulia Tsvetkov
Yuan Cao
VLM
MLLM
51
782
0
24 Aug 2021
sigmoidF1: A Smooth F1 Score Surrogate Loss for Multilabel Classification
Gabriel Bénédict
Vincent Koops
Daan Odijk
Maarten de Rijke
40
30
0
24 Aug 2021
Explaining Bayesian Neural Networks
Kirill Bykov
Marina M.-C. Höhne
Adelaida Creosteanu
Klaus-Robert Muller
Frederick Klauschen
Shinichi Nakajima
Marius Kloft
BDL
AAML
36
25
0
23 Aug 2021
Regularizing Transformers With Deep Probabilistic Layers
Aurora Cobo Aguilera
Pablo Martínez Olmos
Antonio Artés-Rodríguez
Fernando Pérez-Cruz
41
7
0
23 Aug 2021
MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
Jiawei Chen
C. Ho
ViT
26
77
0
20 Aug 2021
Knowledge Perceived Multi-modal Pretraining in E-commerce
Yushan Zhu
Huaixiao Tou
Wen Zhang
Ganqiang Ye
Hui Chen
Ningyu Zhang
Huajun Chen
33
32
0
20 Aug 2021
SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining
Taolin Zhang
Zerui Cai
Chengyu Wang
Minghui Qiu
Bite Yang
Xiaofeng He
AI4MH
28
52
0
20 Aug 2021
MigrationsKB: A Knowledge Base of Public Attitudes towards Migrations and their Driving Factors
Yiyi Chen
Harald Sack
Mehwish Alam
24
3
0
17 Aug 2021
ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration
Yuhao Cui
Zhou Yu
Chunqi Wang
Zhongzhou Zhao
Ji Zhang
Meng Wang
Jun-chen Yu
VLM
27
53
0
16 Aug 2021
MUSIQ: Multi-scale Image Quality Transformer
Junjie Ke
Qifei Wang
Yilin Wang
P. Milanfar
Feng Yang
177
632
0
12 Aug 2021
Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval
Luyu Gao
Jamie Callan
RALM
175
330
0
12 Aug 2021
Differentiable Subset Pruning of Transformer Heads
Jiaoda Li
Ryan Cotterell
Mrinmaya Sachan
45
54
0
10 Aug 2021
Making Transformers Solve Compositional Tasks
Santiago Ontañón
Joshua Ainslie
Vaclav Cvicek
Zachary Kenneth Fisher
44
70
0
09 Aug 2021
Unifying Heterogeneous Electronic Health Records Systems via Text-Based Code Embedding
Kyunghoon Hur
Jiyoung Lee
Jungwoo Oh
Wesley Price
Young-Hak Kim
Edward Choi
43
17
0
08 Aug 2021
Language Model Evaluation in Open-ended Text Generation
An Nguyen
46
3
0
08 Aug 2021
Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation
Zaid Khan
Y. Fu
43
132
0
03 Aug 2021
Musical Speech: A Transformer-based Composition Tool
Jason dÉon
Sri Harsha Dumpala
Chandramouli Shama Sastry
Daniel Oore
Sageev Oore
18
1
0
02 Aug 2021
Polarity in the Classroom: A Case Study Leveraging Peer Sentiment Toward Scalable Assessment
Zachariah J. Beasley
L. Piegl
Paul Rosen
14
3
0
02 Aug 2021
LICHEE: Improving Language Model Pre-training with Multi-grained Tokenization
Weidong Guo
Mingjun Zhao
Lusheng Zhang
Di Niu
Jinwen Luo
Zhenhua Liu
Zhenyang Li
J. Tang
32
8
0
02 Aug 2021
BadEncoder: Backdoor Attacks to Pre-trained Encoders in Self-Supervised Learning
Jinyuan Jia
Yupei Liu
Neil Zhenqiang Gong
SILM
SSL
47
152
0
01 Aug 2021
Enhancing Social Relation Inference with Concise Interaction Graph and Discriminative Scene Representation
Xiaotian Yu
Hanling Yi
Yi Yu
Ling Xing
Shiliang Zhang
Xiaoyu Wang
GNN
34
0
0
30 Jul 2021
Self-Supervised Transformer for Sparse and Irregularly Sampled Multivariate Clinical Time-Series
Sindhu Tipirneni
Chandan K. Reddy
AI4TS
15
105
0
29 Jul 2021
Rethinking and Improving Relative Position Encoding for Vision Transformer
Kan Wu
Houwen Peng
Minghao Chen
Jianlong Fu
Hongyang Chao
ViT
53
330
0
29 Jul 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLM
SyDa
99
3,858
0
28 Jul 2021
Sentiment Analysis of the COVID-related r/Depression Posts
Zihan Chen
Marina Sokolova
27
4
0
28 Jul 2021
Predicting the Future from First Person (Egocentric) Vision: A Survey
Ivan Rodin
Antonino Furnari
Dimitrios Mavroeidis
G. Farinella
EgoV
34
42
0
28 Jul 2021
Cross-lingual Transferring of Pre-trained Contextualized Language Models
Zuchao Li
Kevin Parnow
Hai Zhao
Zhuosheng Zhang
Rui Wang
Masao Utiyama
Eiichiro Sumita
13
8
0
27 Jul 2021
Dual Slot Selector via Local Reliability Verification for Dialogue State Tracking
Jinyu Guo
Kai Shuang
Jijie Li
Zihan Wang
23
18
0
27 Jul 2021
Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction
Lu Xu
Yew Ken Chia
Lidong Bing
39
180
0
26 Jul 2021
CycleMLP: A MLP-like Architecture for Dense Prediction
Shoufa Chen
Enze Xie
Chongjian Ge
Runjian Chen
Ding Liang
Ping Luo
33
231
0
21 Jul 2021
CausalBERT: Injecting Causal Knowledge Into Pre-trained Models with Minimal Supervision
Zhongyang Li
Xiao Ding
Kuo Liao
Bing Qin
Ting Liu
CML
29
17
0
21 Jul 2021
RAMS-Trans: Recurrent Attention Multi-scale Transformer forFine-grained Image Recognition
Yunqing Hu
Xuan Jin
Yin Zhang
Ha Hong
Jingfeng Zhang
Yuan He
Hui Xue
ViT
33
97
0
17 Jul 2021
FewCLUE: A Chinese Few-shot Learning Evaluation Benchmark
Liang Xu
Xiaojing Lu
Chenyang Yuan
Xuanwei Zhang
Huilin Xu
...
Guoao Wei
X. Pan
Xin Tian
Libo Qin
Hai Hu
ELM
24
57
0
15 Jul 2021
AutoBERT-Zero: Evolving BERT Backbone from Scratch
Jiahui Gao
Hang Xu
Han Shi
Xiaozhe Ren
Philip L. H. Yu
Xiaodan Liang
Xin Jiang
Zhenguo Li
21
37
0
15 Jul 2021
The Benchmark Lottery
Mostafa Dehghani
Yi Tay
A. Gritsenko
Zhe Zhao
N. Houlsby
Fernando Diaz
Donald Metzler
Oriol Vinyals
44
89
0
14 Jul 2021
BERT Fine-Tuning for Sentiment Analysis on Indonesian Mobile Apps Reviews
Kuncahyo Setyo Nugroho
Anantha Yullian Sukmadewa
DW HaftittahWuswilahaken
F. A. Bachtiar
N. Yudistira
18
43
0
14 Jul 2021
Visual Parser: Representing Part-whole Hierarchies with Transformers
Shuyang Sun
Xiaoyu Yue
S. Bai
Philip Torr
50
27
0
13 Jul 2021
Combiner: Full Attention Transformer with Sparse Computation Cost
Hongyu Ren
H. Dai
Zihang Dai
Mengjiao Yang
J. Leskovec
Dale Schuurmans
Bo Dai
87
77
0
12 Jul 2021
CoBERL: Contrastive BERT for Reinforcement Learning
Andrea Banino
Adria Puidomenech Badia
Jacob Walker
Tim Scholtes
Jovana Mitrović
Charles Blundell
OffRL
32
36
0
12 Jul 2021
Noise Stability Regularization for Improving BERT Fine-tuning
Hang Hua
Xingjian Li
Dejing Dou
Chengzhong Xu
Jiebo Luo
19
44
0
10 Jul 2021
Previous
1
2
3
...
16
17
18
...
28
29
30
Next