ResearchTrend.AI
XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
arXiv: 1906.08237

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,521 papers shown
Exploiting Method Names to Improve Code Summarization: A Deliberation Multi-Task Learning Approach
Rui Xie
Wei Ye
Jinan Sun
Shikun Zhang
65
28
0
21 Mar 2021
ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques
Yuanxin Liu
Zheng Lin
Fengcheng Yuan
VLM, MQ
80
20
0
21 Mar 2021
Language-Agnostic Representation Learning of Source Code from Structure and Context
Daniel Zügner
Tobias Kirschstein
Michele Catasta
J. Leskovec
Stephan Günnemann
78
121
0
21 Mar 2021
Self-Supervised Test-Time Learning for Reading Comprehension
Pratyay Banerjee
Tejas Gokhale
Chitta Baral
SSL
59
29
0
20 Mar 2021
Attribute Alignment: Controlling Text Generation from Pre-trained Language Models
Dian Yu
Zhou Yu
Kenji Sagae
82
40
0
20 Mar 2021
Play the Shannon Game With Language Models: A Human-Free Approach to Summary Evaluation
Nicholas Egan
Oleg V. Vasilyev
John Bohannon
HILM
44
20
0
19 Mar 2021
Extractive Summarization of Call Transcripts
Pratik K. Biswas
Aleksandr Iakubovich
52
10
0
19 Mar 2021
GPT Understands, Too
Xiao Liu
Yanan Zheng
Zhengxiao Du
Ming Ding
Yujie Qian
Zhilin Yang
Jie Tang
VLM
187
1,188
0
18 Mar 2021
GLM: General Language Model Pretraining with Autoregressive Blank Infilling
Zhengxiao Du
Yujie Qian
Xiao Liu
Ming Ding
J. Qiu
Zhilin Yang
Jie Tang
BDL, AI4CE
162
1,568
0
18 Mar 2021
Model Extraction and Adversarial Transferability, Your BERT is Vulnerable!
Xuanli He
Lingjuan Lyu
Xingliang Yuan
Lichao Sun
MIACV, SILM
95
96
0
18 Mar 2021
Towards Few-Shot Fact-Checking via Perplexity
Nayeon Lee
Yejin Bang
Andrea Madotto
Madian Khabsa
Pascale Fung
AAML
50
93
0
17 Mar 2021
Graph Convolutional Network for Swahili News Classification
Alexandros Kastanos
Tyler Martin
GNN
71
3
0
16 Mar 2021
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval
Siqi Sun
Yen-Chun Chen
Linjie Li
Shuohang Wang
Yuwei Fang
Jingjing Liu
VLM
89
84
0
16 Mar 2021
TransFG: A Transformer Architecture for Fine-grained Recognition
Ju He
Jieneng Chen
Shuai Liu
Adam Kortylewski
Cheng Yang
Yutong Bai
Changhu Wang
ViT
131
397
0
14 Mar 2021
Learning a Word-Level Language Model with Sentence-Level Noise Contrastive Estimation for Contextual Sentence Probability Estimation
Heewoong Park
Sukhyun Cho
Jonghun Park
32
0
0
14 Mar 2021
Comparing the Performance of NLP Toolkits and Evaluation measures in Legal Tech
Muhammad Zohaib Khan
ELM, AILaw
37
3
0
12 Mar 2021
TAG: Gradient Attack on Transformer-based Language Models
Jieren Deng
Yijue Wang
Ji Li
Chao Shang
Hang Liu
Sanguthevar Rajasekaran
Caiwen Ding
FedML, PILM
93
79
0
11 Mar 2021
Involution: Inverting the Inherence of Convolution for Visual Recognition
Duo Li
Jie Hu
Changhu Wang
Xiangtai Li
Qi She
Lei Zhu
Tong Zhang
Qifeng Chen
BDL
86
306
0
10 Mar 2021
Variable-rate discrete representation learning
Sander Dieleman
C. Nash
Jesse Engel
Karen Simonyan
BDL, DRL
82
24
0
10 Mar 2021
Team Phoenix at WASSA 2021: Emotion Analysis on News Stories with Pre-Trained Language Models
Yash Butala
Kanishk Singh
Adarsh Kumar
Shrey Shrivastava
60
10
0
10 Mar 2021
When is it permissible for artificial intelligence to lie? A trust-based approach
Tae Wan Kim
Tong Lu
Kyusong Lee
Zhaoqi Cheng
Yanhan Tang
J. N. Hooker
56
4
0
09 Mar 2021
Self-supervised Regularization for Text Classification
Meng Zhou
Zechen Li
P. Xie
60
16
0
09 Mar 2021
Large Pre-trained Language Models Contain Human-like Biases of What is Right and Wrong to Do
P. Schramowski
Cigdem Turan
Nico Andersen
Constantin Rothkopf
Kristian Kersting
120
298
0
08 Mar 2021
MTLHealth: A Deep Learning System for Detecting Disturbing Content in Student Essays
Joseph Valencia
Erin Yao
37
0
0
07 Mar 2021
Perspectives and Prospects on Transformer Architecture for Cross-Modal Tasks with Language and Vision
Andrew Shin
Masato Ishii
T. Narihira
142
39
0
06 Mar 2021
Overcoming Poor Word Embeddings with Word Definitions
Christopher Malon
36
3
0
05 Mar 2021
MalBERT: Using Transformers for Cybersecurity and Malicious Software Detection
Abir Rahali
M. Akhloufi
94
30
0
05 Mar 2021
IOT: Instance-wise Layer Reordering for Transformer Structures
Jinhua Zhu
Lijun Wu
Yingce Xia
Shufang Xie
Tao Qin
Wen-gang Zhou
Houqiang Li
Tie-Yan Liu
87
7
0
05 Mar 2021
Attention is Not All You Need: Pure Attention Loses Rank Doubly Exponentially with Depth
Yihe Dong
Jean-Baptiste Cordonnier
Andreas Loukas
177
388
0
05 Mar 2021
Natural Language Understanding for Argumentative Dialogue Systems in the Opinion Building Domain
W. A. Abro
Annalena Aicher
Niklas Rach
Stefan Ultes
Wolfgang Minker
Guilin Qi
78
34
0
03 Mar 2021
Stay on Topic, Please: Aligning User Comments to the Content of a News Article
Jumanah Alshehri
Marija Stanojevic
Eduard Constantin Dragut
Z. Obradovic
23
7
0
03 Mar 2021
OAG-BERT: Towards A Unified Backbone Language Model For Academic Knowledge Services
Xiao Liu
Da Yin
Jingnan Zheng
Xingjian Zhang
Peng Zhang
Hongxia Yang
Yuxiao Dong
Jie Tang
VLM
115
32
0
03 Mar 2021
M6: A Chinese Multimodal Pretrainer
Junyang Lin
Rui Men
An Yang
Chan Zhou
Ming Ding
...
Yong Li
Wei Lin
Jingren Zhou
J. Tang
Hongxia Yang
VLM, MoE
159
134
0
01 Mar 2021
Unbiased Sentence Encoder For Large-Scale Multi-lingual Search Engines
Mahdi Hajiaghayi
Monir Hajiaghayi
Mark R. Bolin
39
0
0
01 Mar 2021
NLP-CUET@DravidianLangTech-EACL2021: Investigating Visual and Textual Features to Identify Trolls from Multimodal Social Media Memes
E. Hossain
Omar Sharif
M. M. Hoque
18
13
0
28 Feb 2021
NLP-CUET@LT-EDI-EACL2021: Multilingual Code-Mixed Hope Speech Detection using Cross-lingual Representation Learner
E. Hossain
Omar Sharif
M. M. Hoque
65
25
0
28 Feb 2021
LRG at TREC 2020: Document Ranking with XLNet-Based Models
Abheesht Sharma
Harshit Pandey
20
2
0
28 Feb 2021
COVID-19 Tweets Analysis through Transformer Language Models
Abdul Hameed Azeemi
Adeel A Waheed
41
3
0
27 Feb 2021
Graph Self-Supervised Learning: A Survey
Yixin Liu
Ming Jin
Shirui Pan
Chuan Zhou
Yu Zheng
Xiwei Xu
Philip S. Yu
SSL
126
574
0
27 Feb 2021
PharmKE: Knowledge Extraction Platform for Pharmaceutical Texts using Transfer Learning
Nasi Jofche
Kostadin Mishev
Riste Stojanov
Milos Jovanovik
D. Trajanov
56
18
0
25 Feb 2021
Automated essay scoring using efficient transformer-based language models
C. Ormerod
Akanksha Malhotra
Amir Jafari
48
31
0
25 Feb 2021
ZJUKLAB at SemEval-2021 Task 4: Negative Augmentation with Language Model for Reading Comprehension of Abstract Meaning
Xin Xie
Xiangnan Chen
Xiang Chen
Yong Wang
Ningyu Zhang
Shumin Deng
Huajun Chen
63
2
0
25 Feb 2021
SocialNLP EmotionGIF 2020 Challenge Overview: Predicting Reaction GIF Categories on Social Media
Boaz Shmueli
Lun-Wei Ku
Soumya Ray
61
3
0
24 Feb 2021
Do Transformer Modifications Transfer Across Implementations and Applications?
Sharan Narang
Hyung Won Chung
Yi Tay
W. Fedus
Thibault Févry
...
Wei Li
Nan Ding
Jake Marcus
Adam Roberts
Colin Raffel
100
128
0
23 Feb 2021
Neural ranking models for document retrieval
M. Trabelsi
Zhiyu Zoey Chen
Brian D. Davison
J. Heflin
FedML
85
29
0
23 Feb 2021
Automated Quality Assessment of Cognitive Behavioral Therapy Sessions Through Highly Contextualized Language Representations
Nikolaos Flemotomos
Víctor R. Martínez
Zhuohao Chen
Torrey A. Creed
David C. Atkins
Shrikanth Narayanan
64
31
0
23 Feb 2021
Robust and Transferable Anomaly Detection in Log Data using Pre-Trained Language Models
Harold Ott
Jasmin Bogatinovski
Alexander Acker
S. Nedelkoski
O. Kao
23
31
0
23 Feb 2021
LogME: Practical Assessment of Pre-trained Models for Transfer Learning
Kaichao You
Yong Liu
Jianmin Wang
Mingsheng Long
99
190
0
22 Feb 2021
RUBERT: A Bilingual Roman Urdu BERT Using Cross Lingual Transfer Learning
Usama Khalid
M. O. Beg
Muhammad Umair Arshad
59
11
0
22 Feb 2021
Bilingual Language Modeling, A transfer learning technique for Roman Urdu
Usama Khalid
M. O. Beg
Muhammad Umair Arshad
41
3
0
22 Feb 2021