XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,518 papers shown
CNN-Trans-Enc: A CNN-Enhanced Transformer-Encoder On Top Of Static BERT representations for Document Classification
Charaf Eddine Benarab
Shenglin Gui
41
6
0
13 Sep 2022
Pre-trained Language Models for the Legal Domain: A Case Study on Indian Law
Shounak Paul
A. Mandal
Pawan Goyal
Saptarshi Ghosh
AILaw, VLM, ELM
77
48
0
13 Sep 2022
Socially Enhanced Situation Awareness from Microblogs using Artificial Intelligence: A Survey
Rabindra Lamsal
Aaron Harwood
M. Read
101
21
0
13 Sep 2022
An Embedding-Based Grocery Search Model at Instacart
Yuqing Xie
Taesik Na
X. Xiao
Saurav Manchanda
Young Rao
Zhihong Xu
Guanghua Shu
Esther Vasiete
Tejaswi Tenneti
Haixun Wang
DML, RALM
58
6
0
12 Sep 2022
Semantic-Preserving Adversarial Code Comprehension
Yiyang Li
Hongqiu Wu
Hai Zhao
AAML
85
7
0
12 Sep 2022
Leveraging Language Foundation Models for Human Mobility Forecasting
Hao Xue
Bhanu Prakash Voutharoja
Flora D. Salim
AI4TS
179
75
0
11 Sep 2022
Pre-Training a Graph Recurrent Network for Language Representation
Yile Wang
Linyi Yang
Zhiyang Teng
M. Zhou
Yue Zhang
GNN
81
1
0
08 Sep 2022
SSL-WM: A Black-Box Watermarking Approach for Encoders Pre-trained by Self-supervised Learning
Peizhuo Lv
Pan Li
Shenchen Zhu
Shengzhi Zhang
Kai Chen
...
Fan Xiang
Yuling Cai
Hualong Ma
Yingjun Zhang
Guozhu Meng
AAML
86
7
0
08 Sep 2022
Blessing of Class Diversity in Pre-training
Yulai Zhao
Jianshu Chen
S. Du
AI4CE
77
3
0
07 Sep 2022
AutoPruner: Transformer-Based Call Graph Pruning
Thanh Le-Cong
Hong Jin Kang
Truong-Giang Nguyen
S. A. Haryono
David Lo
X. Le
H. Thang
66
20
0
07 Sep 2022
Mimose: An Input-Aware Checkpointing Planner for Efficient Training on GPU
Jian-He Liao
Mingzhen Li
Qingxiao Sun
Jiwei Hao
F. Yu
...
Ye Tao
Zicheng Zhang
Hailong Yang
Zhongzhi Luan
D. Qian
76
4
0
06 Sep 2022
EnergonAI: An Inference System for 10-100 Billion Parameter Transformer Models
Jiangsu Du
Ziming Liu
Jiarui Fang
Shenggui Li
Yongbin Li
Yutong Lu
Yang You
MoE
52
4
0
06 Sep 2022
A Survey on Measuring and Mitigating Reasoning Shortcuts in Machine Reading Comprehension
Xanh Ho
Johannes Mario Meissner
Saku Sugawara
Akiko Aizawa
OffRL
92
4
0
05 Sep 2022
Attack Tactic Identification by Transfer Learning of Language Model
Lily Lin
Shun-Wen Hsiao
67
2
0
01 Sep 2022
Unified Knowledge Prompt Pre-training for Customer Service Dialogues
Keqing He
Jingang Wang
Chaobo Sun
Wei Wu
72
4
0
31 Aug 2022
Transformers with Learnable Activation Functions
Haishuo Fang
Ji-Ung Lee
N. Moosavi
Iryna Gurevych
AI4CE
46
8
0
30 Aug 2022
Survey: Exploiting Data Redundancy for Optimization of Deep Learning
Jou-An Chen
Wei Niu
Bin Ren
Yanzhi Wang
Xipeng Shen
60
24
0
29 Aug 2022
A Compact Pretraining Approach for Neural Language Models
Shahriar Golchin
Mihai Surdeanu
N. Tavabi
A. Kiapour
VLM
35
1
0
25 Aug 2022
Of Human Criteria and Automatic Metrics: A Benchmark of the Evaluation of Story Generation
Cyril Chhun
Pierre Colombo
Chloé Clavel
Fabian M. Suchanek
191
55
0
24 Aug 2022
Visual Subtitle Feature Enhanced Video Outline Generation
Qi Lv
Ziqiang Cao
Wenrui Xie
Derui Wang
Jingwen Wang
...
Yuan-Fang Li
Min Cao
Wenjie Li
Sujian Li
Guohong Fu
VGen
103
0
0
24 Aug 2022
CLOWER: A Pre-trained Language Model with Contrastive Learning over Word and Character Representations
Borun Chen
Hongyin Tang
Jiahao Bu
Kai Zhang
Jingang Wang
Qifan Wang
Haitao Zheng
Wei Wu
Liqian Yu
VLM
54
1
0
23 Aug 2022
Learning Dynamic Contextualised Word Embeddings via Template-based Temporal Adaptation
Xiaohang Tang
Yi Zhou
Danushka Bollegala
95
6
0
23 Aug 2022
A Novel Multi-Task Learning Approach for Context-Sensitive Compound Type Identification in Sanskrit
Jivnesh Sandhan
Ashish Gupta
Hrishikesh Terdalkar
Tushar Sandhan
S. Samanta
Laxmidhar Behera
Pawan Goyal
77
4
0
22 Aug 2022
PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
VLM, CLL
94
44
0
22 Aug 2022
Heterogeneous Graph Masked Autoencoders
Yijun Tian
Kaiwen Dong
Chunhui Zhang
Chuxu Zhang
Nitesh Chawla
129
82
0
21 Aug 2022
DiscrimLoss: A Universal Loss for Hard Samples and Incorrect Samples Discrimination
Tingting Wu
Xiao Ding
Hao Zhang
Jin-Fang Gao
Li Du
Bing Qin
Ting Liu
78
9
0
21 Aug 2022
gBuilder: A Scalable Knowledge Graph Construction System for Unstructured Corpus
Yanzeng Li
Lei Zou
62
5
0
20 Aug 2022
Lost in Context? On the Sense-wise Variance of Contextualized Word Embeddings
Yile Wang
Yue Zhang
59
5
0
20 Aug 2022
Pretrained Language Encoders are Natural Tagging Frameworks for Aspect Sentiment Triplet Extraction
Yanjie Gou
Yinjie Lei
Lingqiao Liu
Yong Dai
Chun-Yen Shen
Yongqi Tong
ViT
58
0
0
20 Aug 2022
Graph-Augmented Cyclic Learning Framework for Similarity Estimation of Medical Clinical Notes
Can Zheng
Yanshan Wang
X. Jia
62
0
0
19 Aug 2022
Federated Select: A Primitive for Communication- and Memory-Efficient Federated Learning
Zachary B. Charles
Kallista A. Bonawitz
Stanislav Chiknavaryan
H. B. McMahan
Blaise Agüera y Arcas
FedML
60
13
0
19 Aug 2022
Coarse-to-Fine: Hierarchical Multi-task Learning for Natural Language Understanding
Zhaoye Fei
Yu Tian
Yongkang Wu
Xinyu Zhang
Yutao Zhu
...
Dejiang Kong
Ruofei Lai
Bo Zhao
Zhicheng Dou
Xipeng Qiu
287
1
0
19 Aug 2022
A Kind Introduction to Lexical and Grammatical Aspect, with a Survey of Computational Approaches
Annemarie Friedrich
Nianwen Xue
Alexis Palmer
89
3
0
18 Aug 2022
Brand Celebrity Matching Model Based on Natural Language Processing
Han Yang
Kejian Yang
Erhan Zhang
81
1
0
18 Aug 2022
Exploring and Exploiting Multi-Granularity Representations for Machine Reading Comprehension
Nuo Chen
Chenyu You
82
0
0
18 Aug 2022
Boosting Distributed Training Performance of the Unpadded BERT Model
Jinle Zeng
Min Li
Zhihua Wu
Jiaqi Liu
Yuang Liu
Dianhai Yu
Yanjun Ma
67
11
0
17 Aug 2022
BERT(s) to Detect Multiword Expressions
Damith Premasiri
Tharindu Ranasinghe
55
6
0
16 Aug 2022
Domain-Specific Text Generation for Machine Translation
Yasmin Moslem
Rejwanul Haque
John D. Kelleher
Andy Way
66
18
0
11 Aug 2022
A Comprehensive Survey of Natural Language Generation Advances from the Perspective of Digital Deception
Keenan I. Jones
Enes ALTUNCU
V. N. Franqueira
Yi-Chia Wang
Shujun Li
DeLMO
80
3
0
11 Aug 2022
Can Brain Signals Reveal Inner Alignment with Human Languages?
William Jongwon Han
Jielin Qiu
Jiacheng Zhu
Mengdi Xu
Douglas Weber
Yue Liu
Ding Zhao
119
13
0
10 Aug 2022
Weak Supervision in Analysis of News: Application to Economic Policy Uncertainty
Paul Trust
A. Zahran
R. Minghim
28
0
0
10 Aug 2022
Exploring Hate Speech Detection with HateXplain and BERT
Arvind Subramaniam
A. Mehra
Sayani Kundu
47
3
0
09 Aug 2022
The Analysis of Synonymy and Antonymy in Discourse Relations: An interpretable Modeling Approach
Assela Reig-Alamillo
David Torres-Moreno
Eliseo Morales-González
Mauricio Toledo-Acosta
Antoine Taroni
Jorge Hermosillo Valadez
19
4
0
09 Aug 2022
Global Pointer: Novel Efficient Span-based Approach for Named Entity Recognition
Jianlin Su
Ahmed Murtadha
Shengfeng Pan
Jing Hou
Jun Sun
Wanwei Huang
Bo Wen
Yunfeng Liu
74
80
0
05 Aug 2022
PointConvFormer: Revenge of the Point-based Convolution
Wenxuan Wu
Li Fuxin
Qi Shan
3DPC
84
32
0
04 Aug 2022
Prompt Tuning for Generative Multimodal Pretrained Models
Han Yang
Junyang Lin
An Yang
Peng Wang
Chang Zhou
Hongxia Yang
VLM, LRM, VPVLM
86
31
0
04 Aug 2022
SpanDrop: Simple and Effective Counterfactual Learning for Long Sequences
Peng Qi
Guangtao Wang
Jing Huang
48
0
0
03 Aug 2022
Masked Vision and Language Modeling for Multi-modal Representation Learning
Gukyeong Kwon
Zhaowei Cai
Avinash Ravichandran
Erhan Bas
Rahul Bhotika
Stefano Soatto
92
68
0
03 Aug 2022
To Answer or Not to Answer? Improving Machine Reading Comprehension Model with Span-based Contrastive Learning
Yunjie Ji
Liangyu Chen
Chenxiao Dou
Baochang Ma
Xiangang Li
82
5
0
02 Aug 2022
giMLPs: Gate with Inhibition Mechanism in MLPs
Cheng Kang
Jindich Prokop
Lei Tong
Huiyu Zhou
Yong Hu
Daneil Novak
33
0
0
01 Aug 2022