ResearchTrend.AI
XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,519 papers shown
Input Perturbation Reduces Exposure Bias in Diffusion Models
Mang Ning
E. Sangineto
Angelo Porrello
Simone Calderara
Rita Cucchiara
DiffM
96
67
0
27 Jan 2023
Towards Personalized Review Summarization by Modeling Historical Reviews from Customer and Product Separately
Xin Cheng
Shen Gao
Yuchi Zhang
Yongliang Wang
Preslav Nakov
Mingzhe Li
Dongyan Zhao
Rui Yan
89
10
0
27 Jan 2023
Robust Transformer with Locality Inductive Bias and Feature Normalization
Omid Nejati Manzari
Hossein Kashiani
Hojat Asgarian Dehkordi
S. B. Shokouhi
ViT
77
15
0
27 Jan 2023
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
115
2
0
26 Jan 2023
Characterizing the Entities in Harmful Memes: Who is the Hero, the Villain, the Victim?
Shivam Sharma
Atharva Kulkarni
Tharun Suresh
Himanshi Mathur
Preslav Nakov
Md. Shad Akhtar
Tanmoy Chakraborty
105
17
0
26 Jan 2023
A benchmark for toxic comment classification on Civil Comments dataset
Corentin Duchene
Henri Jamet
Pierre Guillaume
Reda Dehak
74
9
0
26 Jan 2023
Out of Distribution Performance of State of Art Vision Model
Salman Rahman
W. Lee
115
3
0
25 Jan 2023
ViDeBERTa: A powerful pre-trained language model for Vietnamese
Cong Dao Tran
Nhut Huy Pham
Anh-Viêt Nguyên
Truong-Son Hy
Tu Vu
67
17
0
25 Jan 2023
BDMMT: Backdoor Sample Detection for Language Models through Model Mutation Testing
Jiali Wei
Ming Fan
Wenjing Jiao
Wuxia Jin
Ting Liu
AAML
99
15
0
25 Jan 2023
Selective Explanations: Leveraging Human Input to Align Explainable AI
Vivian Lai
Yiming Zhang
Chacha Chen
Q. V. Liao
Chenhao Tan
102
45
0
23 Jan 2023
SPEC5G: A Dataset for 5G Cellular Network Protocol Analysis
Imtiaz Karim
Kazi Samin Mubasshir
Mirza Masfiqur Rahman
Elisa Bertino
55
25
0
22 Jan 2023
Blacks is to Anger as Whites is to Joy? Understanding Latent Affective Bias in Large Pre-trained Neural Language Models
Anoop Kadan
P Deepak
Sahely Bhadra
Manjary P. Gangan
Lajish V. L.
39
2
0
21 Jan 2023
REDAffectiveLM: Leveraging Affect Enriched Embedding and Transformer-based Neural Language Model for Readers' Emotion Detection
Anoop Kadan
Deepak P
Manjary P. Gangan
Savitha Sam Abraham
Lajish V. L.
117
1
0
21 Jan 2023
Ankh: Optimized Protein Language Model Unlocks General-Purpose Modelling
Ahmed Elnaggar
Hazem Essam
Wafaa Salah-Eldin
Walid Moustafa
Mohamed Elkerdawy
Charlotte Rochereau
B. Rost
238
103
0
16 Jan 2023
Hawk: An Industrial-strength Multi-label Document Classifier
Arshad Javeed
79
1
0
15 Jan 2023
Everyone's Voice Matters: Quantifying Annotation Disagreement Using Demographic Information
Ruyuan Wan
Jaehyung Kim
Dongyeop Kang
60
38
0
12 Jan 2023
Topics in Contextualised Attention Embeddings
Mozhgan Talebpour
A. G. S. D. Herrera
Shoaib Jameel
69
2
0
11 Jan 2023
CHRONOS: Time-Aware Zero-Shot Identification of Libraries from Vulnerability Reports
Yu-zeng Lyu
Thanh Le-Cong
Hong Jin Kang
Ratnadira Widyasari
Zhipeng Zhao
X. Le
Ming Li
David Lo
90
18
0
10 Jan 2023
Understanding the Complexity and Its Impact on Testing in ML-Enabled Systems
Junming Cao
Bihuan Chen
Longjie Hu
Jie Ying Gao
Kaifeng Huang
Xin Peng
69
3
0
10 Jan 2023
Universal Multimodal Representation for Language Understanding
Zhuosheng Zhang
Kehai Chen
Rui Wang
Masao Utiyama
Eiichiro Sumita
Z. Li
Hai Zhao
SSL
109
22
0
09 Jan 2023
Mind Reasoning Manners: Enhancing Type Perception for Generalized Zero-shot Logical Reasoning over Text
Fangzhi Xu
Jun Liu
Qika Lin
Tianzhe Zhao
Jian Zhang
Lingling Zhang
ReLM
LRM
69
4
0
08 Jan 2023
Traditional Readability Formulas Compared for English
Bruce W. Lee
J. Lee
AIMat
84
6
0
08 Jan 2023
Does compressing activations help model parallel training?
S. Bian
Dacheng Li
Hongyi Wang
Eric P. Xing
Shivaram Venkataraman
74
9
0
06 Jan 2023
Causal Categorization of Mental Health Posts using Transformers
Simranjeet Kaur
Ritika Bhardwaj
Aastha Jain
Muskan Garg
Chandni Saxena
AI4MH
85
1
0
06 Jan 2023
Parameter-Efficient Fine-Tuning Design Spaces
Jiaao Chen
Aston Zhang
Xingjian Shi
Mu Li
Alexander J. Smola
Diyi Yang
124
67
0
04 Jan 2023
MessageNet: Message Classification using Natural Language Processing and Meta-data
Adar Kahana
Oren Elisha
26
0
0
04 Jan 2023
Multi-Aspect Explainable Inductive Relation Prediction by Sentence Transformer
Zhixiang Su
Di Wang
Steven C. H. Hoi
Li-zhen Cui
114
8
0
04 Jan 2023
PIE-QG: Paraphrased Information Extraction for Unsupervised Question Generation from Small Corpora
D. Nagumothu
B. Ofoghi
G. Huang
Peter W. Eklund
RALM
62
5
0
03 Jan 2023
Semi-Structured Object Sequence Encoders
V. Rudramurthy
Riyaz Ahmad Bhat
Chulaka Gunasekara
Siva Sankalp Patel
H. Wan
Tejas I. Dhamecha
Danish Contractor
Marina Danilevsky
124
0
0
03 Jan 2023
Leveraging Semantic Representations Combined with Contextual Word Representations for Recognizing Textual Entailment in Vietnamese
Quoc-Loc Duong
Duc-Vu Nguyen
Ngan Luu-Thuy Nguyen
68
1
0
01 Jan 2023
Relevance Classification of Flood-related Twitter Posts via Multiple Transformers
Wisal Mukhtiar
Waliiya Rizwan
A. Habib
Y. Afridi
Laiq Hasan
Kashif Ahmad
38
3
0
01 Jan 2023
Efficient Movie Scene Detection using State-Space Transformers
Md. Mohaiminul Islam
Mahmudul Hasan
Kishan Athrey
Tony Braskich
Gedas Bertasius
ViT
68
45
0
29 Dec 2022
On Transforming Reinforcement Learning by Transformer: The Development Trajectory
Shengchao Hu
Li Shen
Ya Zhang
Yixin Chen
Dacheng Tao
OffRL
148
30
0
29 Dec 2022
Cramming: Training a Language Model on a Single GPU in One Day
Jonas Geiping
Tom Goldstein
MoE
117
91
0
28 Dec 2022
Relational Local Explanations
V. Borisov
Gjergji Kasneci
FAtt
70
0
0
23 Dec 2022
Data-Centric Artificial Intelligence
Johannes Jakubik
Michael Vossing
Niklas Kühl
J. Walk
G. Satzger
GNN
67
51
0
22 Dec 2022
Automatic Emotion Modelling in Written Stories
Lukas Christ
Shahin Amiriparian
M. Milling
Ilhan Aslan
Björn W. Schuller
60
2
0
21 Dec 2022
Text classification in shipping industry using unsupervised models and Transformer based supervised models
Yingyi Xie
Dongping Song
112
1
0
21 Dec 2022
Analyzing Semantic Faithfulness of Language Models via Input Intervention on Question Answering
Akshay Chaturvedi
Swarnadeep Bhar
Soumadeep Saha
Utpal Garain
Nicholas Asher
61
5
0
21 Dec 2022
A Length-Extrapolatable Transformer
Yutao Sun
Li Dong
Barun Patra
Shuming Ma
Shaohan Huang
Alon Benhaim
Vishrav Chaudhary
Xia Song
Furu Wei
115
124
0
20 Dec 2022
Is GPT-3 a Good Data Annotator?
Bosheng Ding
Chengwei Qin
Linlin Liu
Yew Ken Chia
Shafiq Joty
Boyang Albert Li
Lidong Bing
95
250
0
20 Dec 2022
CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Data Limitation With Contrastive Learning
Xiaoming Liu
Zhaohan Zhang
Yichen Wang
Hang Pu
Y. Lan
Chao Shen
95
41
0
20 Dec 2022
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
Jian Yang
Shuming Ma
Li Dong
Shaohan Huang
Haoyang Huang
Yuwei Yin
Dongdong Zhang
Liqun Yang
Furu Wei
Zhoujun Li
SyDa
AI4CE
76
25
0
20 Dec 2022
DIONYSUS: A Pre-trained Model for Low-Resource Dialogue Summarization
Yu Li
Baolin Peng
Pengcheng He
Michel Galley
Zhou Yu
Jianfeng Gao
83
8
0
20 Dec 2022
Memory-efficient NLLB-200: Language-specific Expert Pruning of a Massively Multilingual Machine Translation Model
Yeskendir Koishekenov
Alexandre Berard
Vassilina Nikoulina
MoE
84
31
0
19 Dec 2022
Mu$^{2}$SLAM: Multitask, Multilingual Speech and Language Models
Yong Cheng
Yu Zhang
Melvin Johnson
Wolfgang Macherey
Ankur Bapna
66
8
0
19 Dec 2022
Improving the Generalizability of Text-Based Emotion Detection by Leveraging Transformers with Psycholinguistic Features
S. Zanwar
Daniel Wiechmann
Yu Qiao
E. Kerz
61
3
0
19 Dec 2022
Enriching Relation Extraction with OpenIE
Alessandro Temperoni
M. Biryukov
Martin Theobald
51
1
0
19 Dec 2022
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation
Hongyi Yuan
Zheng Yuan
Chuanqi Tan
Fei Huang
Songfang Huang
91
15
0
17 Dec 2022
Injecting Domain Knowledge in Language Models for Task-Oriented Dialogue Systems
Denis Emelin
Daniele Bonadiman
Sawsan Alqahtani
Yi Zhang
Saab Mansour
82
17
0
15 Dec 2022