ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,519 papers shown
Title
Boosting Retailer Revenue by Generated Optimized Combined Multiple
  Digital Marketing Campaigns
Boosting Retailer Revenue by Generated Optimized Combined Multiple Digital Marketing Campaigns
Yafei Xu
Tian Xie
Yu Zhang
25
1
0
09 Sep 2020
Probabilistic Predictions of People Perusing: Evaluating Metrics of
  Language Model Performance for Psycholinguistic Modeling
Probabilistic Predictions of People Perusing: Evaluating Metrics of Language Model Performance for Psycholinguistic Modeling
Sophie Hao
S. Mendelsohn
Rachel Sterneck
Randi Martinez
Robert Frank
41
48
0
08 Sep 2020
LynyrdSkynyrd at WNUT-2020 Task 2: Semi-Supervised Learning for
  Identification of Informative COVID-19 English Tweets
LynyrdSkynyrd at WNUT-2020 Task 2: Semi-Supervised Learning for Identification of Informative COVID-19 English Tweets
Abhilasha Sancheti
Kushal Chawla
Gaurav Verma
16
3
0
08 Sep 2020
Robust Conversational AI with Grounded Text Generation
Robust Conversational AI with Grounded Text Generation
Jianfeng Gao
Baolin Peng
Chunyuan Li
Jinchao Li
Shahin Shayandeh
Lars Liden
H. Shum
76
21
0
07 Sep 2020
Scaling up Differentially Private Deep Learning with Fast Per-Example
  Gradient Clipping
Scaling up Differentially Private Deep Learning with Fast Per-Example Gradient Clipping
Jaewoo Lee
Daniel Kifer
88
57
0
07 Sep 2020
Adversarial Watermarking Transformer: Towards Tracing Text Provenance
  with Data Hiding
Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding
Sahar Abdelnabi
Mario Fritz
WaLM
90
152
0
07 Sep 2020
UIT-HSE at WNUT-2020 Task 2: Exploiting CT-BERT for Identifying COVID-19
  Information on the Twitter Social Network
UIT-HSE at WNUT-2020 Task 2: Exploiting CT-BERT for Identifying COVID-19 Information on the Twitter Social Network
Khiem Vinh Tran
Hao Phu Phan
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
68
8
0
07 Sep 2020
E-BERT: A Phrase and Product Knowledge Enhanced Language Model for
  E-commerce
E-BERT: A Phrase and Product Knowledge Enhanced Language Model for E-commerce
Denghui Zhang
Zixuan Yuan
Yanchi Liu
Fuzhen Zhuang
Haifeng Chen
Hui Xiong
84
34
0
07 Sep 2020
BANANA at WNUT-2020 Task 2: Identifying COVID-19 Information on Twitter
  by Combining Deep Learning and Transfer Learning Models
BANANA at WNUT-2020 Task 2: Identifying COVID-19 Information on Twitter by Combining Deep Learning and Transfer Learning Models
Tin Van Huynh
Luan Thanh Nguyen
Son T. Luu
29
3
0
06 Sep 2020
QiaoNing at SemEval-2020 Task 4: Commonsense Validation and Explanation
  system based on ensemble of language model
QiaoNing at SemEval-2020 Task 4: Commonsense Validation and Explanation system based on ensemble of language model
Pai Liu
LRM
66
6
0
06 Sep 2020
MIDAS at SemEval-2020 Task 10: Emphasis Selection using Label
  Distribution Learning and Contextual Embeddings
MIDAS at SemEval-2020 Task 10: Emphasis Selection using Label Distribution Learning and Contextual Embeddings
Sarthak Anand
Pradyumna Gupta
Hemant Yadav
Debanjan Mahata
Rakesh Gosangi
Haimin Zhang
R. Shah
30
3
0
06 Sep 2020
Recent Trends in the Use of Deep Learning Models for Grammar Error
  Handling
Recent Trends in the Use of Deep Learning Models for Grammar Error Handling
Mina Naghshnejad
Tarun Joshi
V. Nair
VLM
43
6
0
04 Sep 2020
A Comparison of Pre-trained Vision-and-Language Models for Multimodal
  Representation Learning across Medical Images and Reports
A Comparison of Pre-trained Vision-and-Language Models for Multimodal Representation Learning across Medical Images and Reports
Yikuan Li
Hanyin Wang
Yuan Luo
70
67
0
03 Sep 2020
ASTRAL: Adversarial Trained LSTM-CNN for Named Entity Recognition
ASTRAL: Adversarial Trained LSTM-CNN for Named Entity Recognition
Jiuniu Wang
Wenjia Xu
Xingyu Fu
Guangluan Xu
Yirong Wu
50
58
0
02 Sep 2020
On SkipGram Word Embedding Models with Negative Sampling: Unified
  Framework and Impact of Noise Distributions
On SkipGram Word Embedding Models with Negative Sampling: Unified Framework and Impact of Noise Distributions
Ziqiao Wang
Yongyi Mao
Hongyu Guo
Richong Zhang
SyDa
32
1
0
02 Sep 2020
Active Contrastive Learning of Audio-Visual Video Representations
Active Contrastive Learning of Audio-Visual Video Representations
Shuang Ma
Zhaoyang Zeng
Daniel J. McDuff
Yale Song
VLMSSL
60
8
0
31 Aug 2020
Langevin Cooling for Domain Translation
Langevin Cooling for Domain Translation
Vignesh Srinivasan
Klaus-Robert Muller
Wojciech Samek
Shinichi Nakajima
74
1
0
31 Aug 2020
Zero-Resource Knowledge-Grounded Dialogue Generation
Zero-Resource Knowledge-Grounded Dialogue Generation
Linxiao Li
Can Xu
Wei Wu
Yufan Zhao
Xueliang Zhao
Chongyang Tao
99
72
0
29 Aug 2020
Rethinking the Objectives of Extractive Question Answering
Rethinking the Objectives of Extractive Question Answering
Martin Fajcik
Josef Jon
Pavel Smrz
99
12
0
28 Aug 2020
A Dataset and Baselines for Visual Question Answering on Art
A Dataset and Baselines for Visual Question Answering on Art
Noa Garcia
Chentao Ye
Zihua Liu
Qingtao Hu
Mayu Otani
Chenhui Chu
Yuta Nakashima
Teruko Mitamura
CoGe
57
56
0
28 Aug 2020
Language Models as Emotional Classifiers for Textual Conversations
Language Models as Emotional Classifiers for Textual Conversations
Connor T. Heaton
David M. Schwartz
41
6
0
27 Aug 2020
A Fast and Robust BERT-based Dialogue State Tracker for Schema-Guided
  Dialogue Dataset
A Fast and Robust BERT-based Dialogue State Tracker for Schema-Guided Dialogue Dataset
Vahid Noroozi
Yang Zhang
Evelina Bakhturina
Tomasz Kornuta
47
16
0
27 Aug 2020
Entity and Evidence Guided Relation Extraction for DocRED
Entity and Evidence Guided Relation Extraction for DocRED
Kevin Huang
Guangtao Wang
Tengyu Ma
Jing Huang
71
9
0
27 Aug 2020
GREEK-BERT: The Greeks visiting Sesame Street
GREEK-BERT: The Greeks visiting Sesame Street
John Koutsikakis
Ilias Chalkidis
Prodromos Malakasiotis
Ion Androutsopoulos
80
92
0
27 Aug 2020
A Survey of Evaluation Metrics Used for NLG Systems
A Survey of Evaluation Metrics Used for NLG Systems
Ananya B. Sai
Akash Kumar Mohankumar
Mitesh M. Khapra
ELM
99
237
0
27 Aug 2020
Improvement of a dedicated model for open domain persona-aware dialogue
  generation
Improvement of a dedicated model for open domain persona-aware dialogue generation
Qiang Han
49
0
0
27 Aug 2020
Relation/Entity-Centric Reading Comprehension
Relation/Entity-Centric Reading Comprehension
Takeshi Onishi
28
0
0
27 Aug 2020
AMBERT: A Pre-trained Language Model with Multi-Grained Tokenization
AMBERT: A Pre-trained Language Model with Multi-Grained Tokenization
Xinsong Zhang
Pengshuai Li
Hang Li
95
52
0
27 Aug 2020
A Multitask Deep Learning Approach for User Depression Detection on Sina
  Weibo
A Multitask Deep Learning Approach for User Depression Detection on Sina Weibo
Yiding Wang
Zhenyi Wang
Chenghao Li
Yilin Zhang
Haizhou Wang
40
20
0
26 Aug 2020
What is being transferred in transfer learning?
What is being transferred in transfer learning?
Behnam Neyshabur
Hanie Sedghi
Chiyuan Zhang
152
530
0
26 Aug 2020
Inno at SemEval-2020 Task 11: Leveraging Pure Transformer for
  Multi-Class Propaganda Detection
Inno at SemEval-2020 Task 11: Leveraging Pure Transformer for Multi-Class Propaganda Detection
D. Grigorev
V. Ivanov
21
2
0
26 Aug 2020
Conceptualized Representation Learning for Chinese Biomedical Text
  Mining
Conceptualized Representation Learning for Chinese Biomedical Text Mining
Ningyu Zhang
Qianghuai Jia
Kangping Yin
Liang Dong
Feng Gao
Nengwei Hua
OOD
82
68
0
25 Aug 2020
End to End Dialogue Transformer
End to End Dialogue Transformer
Ondrej Mekota
Memduh Gökirmak
Petr Laitoch
30
1
0
24 Aug 2020
Matching Guided Distillation
Matching Guided Distillation
Kaiyu Yue
Jiangfan Deng
Feng Zhou
53
50
0
23 Aug 2020
Do Syntax Trees Help Pre-trained Transformers Extract Information?
Do Syntax Trees Help Pre-trained Transformers Extract Information?
Devendra Singh Sachan
Yuhao Zhang
Peng Qi
William L. Hamilton
50
79
0
20 Aug 2020
Transformer based Multilingual document Embedding model
Transformer based Multilingual document Embedding model
Wei Li
Brian Mak
121
6
0
19 Aug 2020
MEANTIME: Mixture of Attention Mechanisms with Multi-temporal Embeddings
  for Sequential Recommendation
MEANTIME: Mixture of Attention Mechanisms with Multi-temporal Embeddings for Sequential Recommendation
S. Cho
Eunhyeok Park
S. Yoo
AI4TS
36
70
0
19 Aug 2020
A Survey of Active Learning for Text Classification using Deep Neural
  Networks
A Survey of Active Learning for Text Classification using Deep Neural Networks
Christopher Schröder
A. Niekler
83
101
0
17 Aug 2020
Adaptable Multi-Domain Language Model for Transformer ASR
Adaptable Multi-Domain Language Model for Transformer ASR
Taewoo Lee
Min-Joong Lee
Tae Gyoon Kang
Seokyeong Jung
Minseok Kwon
...
Ho-Gyeong Kim
Jiseung Jeong
Jihyun Lee
Hosik Lee
Y. S. Choi
56
18
0
14 Aug 2020
Prosody Learning Mechanism for Speech Synthesis System Without Text
  Length Limit
Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit
Zhen Zeng
Jianzong Wang
Ning Cheng
Jing Xiao
61
8
0
13 Aug 2020
Variance-reduced Language Pretraining via a Mask Proposal Network
Variance-reduced Language Pretraining via a Mask Proposal Network
Liang Chen
SSL
58
8
0
12 Aug 2020
Compression of Deep Learning Models for Text: A Survey
Compression of Deep Learning Models for Text: A Survey
Manish Gupta
Puneet Agrawal
VLMMedImAI4CE
79
119
0
12 Aug 2020
Evaluating the Impact of Knowledge Graph Context on Entity
  Disambiguation Models
Evaluating the Impact of Knowledge Graph Context on Entity Disambiguation Models
I. Mulang'
Kuldeep Singh
Chaitali Prabhu
Abhishek Nadgeri
Johannes Hoffart
Jens Lehmann
105
56
0
12 Aug 2020
Hybrid Ranking Network for Text-to-SQL
Hybrid Ranking Network for Text-to-SQL
Qin Lyu
K. Chakrabarti
Shobhit Hathi
Souvik Kundu
Jianwen Zhang
Zheng Chen
AIMat
68
85
0
11 Aug 2020
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Hayato Futami
Hirofumi Inaguma
Sei Ueno
Masato Mimura
S. Sakai
Tatsuya Kawahara
78
53
0
09 Aug 2020
SemEval-2020 Task 8: Memotion Analysis -- The Visuo-Lingual Metaphor!
SemEval-2020 Task 8: Memotion Analysis -- The Visuo-Lingual Metaphor!
Chhavi Sharma
Deepesh Bhageria
W. Scott
Srinivas Pykl
A. Das
Tanmoy Chakraborty
Viswanath Pulabaigari
Björn Gambäck
92
180
0
09 Aug 2020
SemEval-2020 Task 10: Emphasis Selection for Written Text in Visual
  Media
SemEval-2020 Task 10: Emphasis Selection for Written Text in Visual Media
Amirreza Shirani
Franck Dernoncourt
Nedim Lipka
P. Asente
J. Echevarria
Thamar Solorio
51
21
0
07 Aug 2020
ConvBERT: Improving BERT with Span-based Dynamic Convolution
ConvBERT: Improving BERT with Span-based Dynamic Convolution
Zihang Jiang
Weihao Yu
Daquan Zhou
Yunpeng Chen
Jiashi Feng
Shuicheng Yan
135
163
0
06 Aug 2020
Effective Transfer Learning for Identifying Similar Questions: Matching
  User Questions to COVID-19 FAQs
Effective Transfer Learning for Identifying Similar Questions: Matching User Questions to COVID-19 FAQs
Clara H. McCreery
Namit Katariya
A. Kannan
Manish Chablani
X. Amatriain
MedImOOD
57
76
0
04 Aug 2020
NLPDove at SemEval-2020 Task 12: Improving Offensive Language Detection
  with Cross-lingual Transfer
NLPDove at SemEval-2020 Task 12: Improving Offensive Language Detection with Cross-lingual Transfer
Hwijeen Ahn
Jimin Sun
Chan Young Park
Jungyun Seo
72
26
0
04 Aug 2020
Previous
123...575859...697071
Next