ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,523 papers shown
Title
BadEncoder: Backdoor Attacks to Pre-trained Encoders in Self-Supervised
  Learning
BadEncoder: Backdoor Attacks to Pre-trained Encoders in Self-Supervised Learning
Jinyuan Jia
Yupei Liu
Neil Zhenqiang Gong
SILMSSL
127
163
0
01 Aug 2021
The History of Speech Recognition to the Year 2030
The History of Speech Recognition to the Year 2030
Awni Y. Hannun
AI4TS
121
21
0
30 Jul 2021
Perceiver IO: A General Architecture for Structured Inputs & Outputs
Perceiver IO: A General Architecture for Structured Inputs & Outputs
Andrew Jaegle
Sebastian Borgeaud
Jean-Baptiste Alayrac
Carl Doersch
Catalin Ionescu
...
Olivier J. Hénaff
M. Botvinick
Andrew Zisserman
Oriol Vinyals
João Carreira
MLLMVLMGNN
176
585
0
30 Jul 2021
Enhancing Social Relation Inference with Concise Interaction Graph and
  Discriminative Scene Representation
Enhancing Social Relation Inference with Concise Interaction Graph and Discriminative Scene Representation
Xiaotian Yu
Hanling Yi
Yi Yu
Ling Xing
Shiliang Zhang
Xiaoyu Wang
GNN
110
0
0
30 Jul 2021
Self-Supervised Transformer for Sparse and Irregularly Sampled
  Multivariate Clinical Time-Series
Self-Supervised Transformer for Sparse and Irregularly Sampled Multivariate Clinical Time-Series
Sindhu Tipirneni
Chandan K. Reddy
AI4TS
72
111
0
29 Jul 2021
Rethinking and Improving Relative Position Encoding for Vision
  Transformer
Rethinking and Improving Relative Position Encoding for Vision Transformer
Kan Wu
Houwen Peng
Minghao Chen
Jianlong Fu
Hongyang Chao
ViT
124
340
0
29 Jul 2021
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Adapting GPT, GPT-2 and BERT Language Models for Speech Recognition
Xianrui Zheng
Chao Zhang
P. Woodland
34
49
0
29 Jul 2021
AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient
  Pre-trained Language Models
AutoTinyBERT: Automatic Hyper-parameter Optimization for Efficient Pre-trained Language Models
Yichun Yin
Cheng Chen
Lifeng Shang
Xin Jiang
Xiao Chen
Qun Liu
VLM
69
50
0
29 Jul 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods
  in Natural Language Processing
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLMSyDa
437
4,053
0
28 Jul 2021
Sentiment Analysis of the COVID-related r/Depression Posts
Sentiment Analysis of the COVID-related r/Depression Posts
Zihan Chen
Marina Sokolova
136
4
0
28 Jul 2021
Predicting the Future from First Person (Egocentric) Vision: A Survey
Predicting the Future from First Person (Egocentric) Vision: A Survey
Ivan Rodin
Antonino Furnari
Dimitrios Mavroeidis
G. Farinella
EgoV
104
44
0
28 Jul 2021
Cross-lingual Transferring of Pre-trained Contextualized Language Models
Cross-lingual Transferring of Pre-trained Contextualized Language Models
Zuchao Li
Kevin Parnow
Hai Zhao
Zhuosheng Zhang
Rui Wang
Masao Utiyama
Eiichiro Sumita
55
8
0
27 Jul 2021
PiSLTRc: Position-informed Sign Language Transformer with Content-aware
  Convolution
PiSLTRc: Position-informed Sign Language Transformer with Content-aware Convolution
Pan Xie
Mengyi Zhao
Xiaohui Hu
ViTSLR
99
35
0
27 Jul 2021
Dual Slot Selector via Local Reliability Verification for Dialogue State
  Tracking
Dual Slot Selector via Local Reliability Verification for Dialogue State Tracking
Jinyu Guo
Kai Shuang
Jijie Li
Zihan Wang
75
18
0
27 Jul 2021
Meta-Learning Adversarial Domain Adaptation Network for Few-Shot Text
  Classification
Meta-Learning Adversarial Domain Adaptation Network for Few-Shot Text Classification
Chengcheng Han
Zeqiu Fan
Dongxiang Zhang
Minghui Qiu
Ming Gao
Aoying Zhou
VLM
56
64
0
26 Jul 2021
Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction
Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction
Lu Xu
Yew Ken Chia
Lidong Bing
163
188
0
26 Jul 2021
Fine-Grained Emotion Prediction by Modeling Emotion Definitions
Fine-Grained Emotion Prediction by Modeling Emotion Definitions
Gargi Singh
Dhanajit Brahma
Piyush Rai
Ashutosh Modi
65
10
0
26 Jul 2021
FNetAR: Mixing Tokens with Autoregressive Fourier Transforms
FNetAR: Mixing Tokens with Autoregressive Fourier Transforms
Tim Lou
M. Park
M. Ramezanali
Vincent Tang
AI4TS
18
3
0
22 Jul 2021
CycleMLP: A MLP-like Architecture for Dense Prediction
CycleMLP: A MLP-like Architecture for Dense Prediction
Shoufa Chen
Enze Xie
Chongjian Ge
Runjian Chen
Ding Liang
Ping Luo
151
235
0
21 Jul 2021
Generative Models for Security: Attacks, Defenses, and Opportunities
Generative Models for Security: Attacks, Defenses, and Opportunities
L. A. Bauer
Vincent Bindschaedler
114
4
0
21 Jul 2021
CausalBERT: Injecting Causal Knowledge Into Pre-trained Models with
  Minimal Supervision
CausalBERT: Injecting Causal Knowledge Into Pre-trained Models with Minimal Supervision
Zhongyang Li
Xiao Ding
Kuo Liao
Bing Qin
Ting Liu
CML
107
19
0
21 Jul 2021
Group Contrastive Self-Supervised Learning on Graphs
Group Contrastive Self-Supervised Learning on Graphs
Xinyi Xu
Cheng Deng
Yaochen Xie
Shuiwang Ji
SSL
53
19
0
20 Jul 2021
Cross-Lingual BERT Contextual Embedding Space Mapping with Isotropic and
  Isometric Conditions
Cross-Lingual BERT Contextual Embedding Space Mapping with Isotropic and Isometric Conditions
Haoran Xu
Philipp Koehn
79
9
0
19 Jul 2021
Clinical Relation Extraction Using Transformer-based Models
Clinical Relation Extraction Using Transformer-based Models
Xi Yang
Zehao Yu
Yi Guo
Jiang Bian
Yonghui Wu
LM&MAMedIm
65
20
0
19 Jul 2021
Stock Movement Prediction with Financial News using Contextualized
  Embedding from BERT
Stock Movement Prediction with Financial News using Contextualized Embedding from BERT
Qinkai Chen
AIFin
52
19
0
19 Jul 2021
On the Copying Behaviors of Pre-Training for Neural Machine Translation
On the Copying Behaviors of Pre-Training for Neural Machine Translation
Xuebo Liu
Longyue Wang
Derek F. Wong
Liang Ding
Lidia S. Chao
Shuming Shi
Zhaopeng Tu
81
25
0
17 Jul 2021
RAMS-Trans: Recurrent Attention Multi-scale Transformer forFine-grained
  Image Recognition
RAMS-Trans: Recurrent Attention Multi-scale Transformer forFine-grained Image Recognition
Yunqing Hu
Xuan Jin
Yin Zhang
Ha Hong
Jingfeng Zhang
Yuan He
Hui Xue
ViT
92
105
0
17 Jul 2021
From block-Toeplitz matrices to differential equations on graphs:
  towards a general theory for scalable masked Transformers
From block-Toeplitz matrices to differential equations on graphs: towards a general theory for scalable masked Transformers
K. Choromanski
Han Lin
Haoxian Chen
Tianyi Zhang
Arijit Sehanobish
Valerii Likhosherstov
Jack Parker-Holder
Tamás Sarlós
Adrian Weller
Thomas Weingarten
160
34
0
16 Jul 2021
FewCLUE: A Chinese Few-shot Learning Evaluation Benchmark
FewCLUE: A Chinese Few-shot Learning Evaluation Benchmark
Liang Xu
Xiaojing Lu
Chenyang Yuan
Xuanwei Zhang
Huilin Xu
...
Guoao Wei
X. Pan
Xin Tian
Libo Qin
Hai Hu
ELM
99
57
0
15 Jul 2021
AutoBERT-Zero: Evolving BERT Backbone from Scratch
AutoBERT-Zero: Evolving BERT Backbone from Scratch
Jiahui Gao
Hang Xu
Han Shi
Xiaozhe Ren
Philip L. H. Yu
Xiaodan Liang
Xin Jiang
Zhenguo Li
85
37
0
15 Jul 2021
Solving ESL Sentence Completion Questions via Pre-trained Neural
  Language Models
Solving ESL Sentence Completion Questions via Pre-trained Neural Language Models
Qiongqiong Liu
Tianqiao Liu
Jiafu Zhao
Qiang Fang
Wenbiao Ding
Zhongqin Wu
Xiwei Xu
Jiliang Tang
Zitao Liu
AI4Ed
22
2
0
15 Jul 2021
Multi-Task Learning based Online Dialogic Instruction Detection with
  Pre-trained Language Models
Multi-Task Learning based Online Dialogic Instruction Detection with Pre-trained Language Models
Y. Hao
Hang Li
Wenbiao Ding
Zhongqin Wu
Jiliang Tang
R. Luckin
Zitao Liu
49
2
0
15 Jul 2021
The Benchmark Lottery
The Benchmark Lottery
Mostafa Dehghani
Yi Tay
A. Gritsenko
Zhe Zhao
N. Houlsby
Fernando Diaz
Donald Metzler
Oriol Vinyals
124
92
0
14 Jul 2021
BERT Fine-Tuning for Sentiment Analysis on Indonesian Mobile Apps
  Reviews
BERT Fine-Tuning for Sentiment Analysis on Indonesian Mobile Apps Reviews
Kuncahyo Setyo Nugroho
Anantha Yullian Sukmadewa
DW HaftittahWuswilahaken
F. A. Bachtiar
N. Yudistira
35
45
0
14 Jul 2021
Large-Scale News Classification using BERT Language Model: Spark NLP
  Approach
Large-Scale News Classification using BERT Language Model: Spark NLP Approach
Kuncahyo Setyo Nugroho
Anantha Yullian Sukmadewa
N. Yudistira
75
27
0
14 Jul 2021
Detection of Abnormal Behavior with Self-Supervised Gaze Estimation
Detection of Abnormal Behavior with Self-Supervised Gaze Estimation
Suneung Kim
Seong-Whan Lee
23
2
0
14 Jul 2021
Visual Parser: Representing Part-whole Hierarchies with Transformers
Visual Parser: Representing Part-whole Hierarchies with Transformers
Shuyang Sun
Xiaoyu Yue
S. Bai
Philip Torr
128
27
0
13 Jul 2021
Combiner: Full Attention Transformer with Sparse Computation Cost
Combiner: Full Attention Transformer with Sparse Computation Cost
Hongyu Ren
H. Dai
Zihang Dai
Mengjiao Yang
J. Leskovec
Dale Schuurmans
Bo Dai
171
80
0
12 Jul 2021
Revisiting Uncertainty-based Query Strategies for Active Learning with
  Transformers
Revisiting Uncertainty-based Query Strategies for Active Learning with Transformers
Christopher Schröder
A. Niekler
Martin Potthast
102
81
0
12 Jul 2021
CoBERL: Contrastive BERT for Reinforcement Learning
CoBERL: Contrastive BERT for Reinforcement Learning
Andrea Banino
Adria Puidomenech Badia
Jacob Walker
Tim Scholtes
Jovana Mitrović
Charles Blundell
OffRL
88
36
0
12 Jul 2021
The Brownian motion in the transformer model
The Brownian motion in the transformer model
Yingshi Chen
118
1
0
12 Jul 2021
BERT-like Pre-training for Symbolic Piano Music Classification Tasks
BERT-like Pre-training for Symbolic Piano Music Classification Tasks
Yi-Hui Chou
I-Chun Chen
Chin-Jui Chang
Joann Ching
Yi-Hsuan Yang
102
25
0
12 Jul 2021
LexSubCon: Integrating Knowledge from Lexical Resources into Contextual
  Embeddings for Lexical Substitution
LexSubCon: Integrating Knowledge from Lexical Resources into Contextual Embeddings for Lexical Substitution
George Michalopoulos
I. McKillop
Alexander Wong
Helen H. Chen
124
19
0
11 Jul 2021
Transformers with multi-modal features and post-fusion context for
  e-commerce session-based recommendation
Transformers with multi-modal features and post-fusion context for e-commerce session-based recommendation
Gabriel de Souza P. Moreira
Sara Rabhi
Ronay Ak
Md Yasin Kabir
Even Oldridge
64
28
0
11 Jul 2021
Improving Low-resource Reading Comprehension via Cross-lingual
  Transposition Rethinking
Improving Low-resource Reading Comprehension via Cross-lingual Transposition Rethinking
Gaochen Wu
Bin Xu
Yuxin Qin
Fei Kong
Bangchang Liu
Hongwen Zhao
Dejie Chang
134
3
0
11 Jul 2021
Noise Stability Regularization for Improving BERT Fine-tuning
Noise Stability Regularization for Improving BERT Fine-tuning
Hang Hua
Xingjian Li
Dejing Dou
Chengzhong Xu
Jiebo Luo
79
45
0
10 Jul 2021
An Initial Investigation of Non-Native Spoken Question-Answering
An Initial Investigation of Non-Native Spoken Question-Answering
V. Raina
Mark Gales
66
1
0
09 Jul 2021
Joint Models for Answer Verification in Question Answering Systems
Joint Models for Answer Verification in Question Answering Systems
Zeyu Zhang
Thuy Vu
Alessandro Moschitti
53
24
0
09 Jul 2021
Deep Learning for Embodied Vision Navigation: A Survey
Deep Learning for Embodied Vision Navigation: A Survey
Fengda Zhu
Yi Zhu
Vincent CS Lee
Xiaodan Liang
Xiaojun Chang
EgoVLM&Ro
101
0
0
07 Jul 2021
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge
  Transfer
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer
Zineng Tang
Jaemin Cho
Hao Tan
Joey Tianyi Zhou
VLM
61
29
0
06 Jul 2021
Previous
123...404142...697071
Next