ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,520 papers shown
Title
Unsupervised Pre-training for Biomedical Question Answering
Unsupervised Pre-training for Biomedical Question Answering
Vaishnavi Kommaraju
K. Gunasekaran
Kun Li
Trapit Bansal
Andrew McCallum
Ivana Williams
Ana-Maria Istrate
SSLMedIm
50
18
0
27 Sep 2020
Learning to Plan and Realize Separately for Open-Ended Dialogue Systems
Learning to Plan and Realize Separately for Open-Ended Dialogue Systems
Sashank Santhanam
Zhuo Cheng
Brodie Mather
Bonnie J. Dorr
Archna Bhatia
Bryanna Hebenstreit
Alan Zemel
Adam Dalton
T. Strzalkowski
Samira Shaikh
58
6
0
26 Sep 2020
BET: A Backtranslation Approach for Easy Data Augmentation in
  Transformer-based Paraphrase Identification Context
BET: A Backtranslation Approach for Easy Data Augmentation in Transformer-based Paraphrase Identification Context
Jean-Philippe Corbeil
Hadi Abdi Ghadivel
40
28
0
25 Sep 2020
HetSeq: Distributed GPU Training on Heterogeneous Infrastructure
HetSeq: Distributed GPU Training on Heterogeneous Infrastructure
Yifan Ding
Nicholas Botzer
Tim Weninger
VLMMoE
37
7
0
25 Sep 2020
RecoBERT: A Catalog Language Model for Text-Based Recommendations
RecoBERT: A Catalog Language Model for Text-Based Recommendations
Itzik Malkiel
Oren Barkan
Avi Caciularu
Noam Razin
Ori Katz
Noam Koenigstein
103
13
0
25 Sep 2020
Weird AI Yankovic: Generating Parody Lyrics
Weird AI Yankovic: Generating Parody Lyrics
Mark O. Riedl
31
3
0
25 Sep 2020
No Answer is Better Than Wrong Answer: A Reflection Model for Document
  Level Machine Reading Comprehension
No Answer is Better Than Wrong Answer: A Reflection Model for Document Level Machine Reading Comprehension
Xuguang Wang
Linjun Shou
Ming Gong
Nan Duan
Daxin Jiang
58
12
0
25 Sep 2020
MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
Zhaojiang Lin
Andrea Madotto
Genta Indra Winata
Pascale Fung
81
173
0
25 Sep 2020
Feature Adaptation of Pre-Trained Language Models across Languages and
  Domains with Robust Self-Training
Feature Adaptation of Pre-Trained Language Models across Languages and Domains with Robust Self-Training
Hai Ye
Qingyu Tan
Ruidan He
Juntao Li
Hwee Tou Ng
Lidong Bing
VLM
80
7
0
24 Sep 2020
AnchiBERT: A Pre-Trained Model for Ancient ChineseLanguage Understanding
  and Generation
AnchiBERT: A Pre-Trained Model for Ancient ChineseLanguage Understanding and Generation
Huishuang Tian
Kexin Yang
Dayiheng Liu
Jiancheng Lv
67
31
0
24 Sep 2020
Hierarchical Pre-training for Sequence Labelling in Spoken Dialog
Hierarchical Pre-training for Sequence Labelling in Spoken Dialog
E. Chapuis
Pierre Colombo
Matteo Manica
Matthieu Labeau
Chloé Clavel
170
59
0
23 Sep 2020
Global-to-Local Neural Networks for Document-Level Relation Extraction
Global-to-Local Neural Networks for Document-Level Relation Extraction
D. Wang
Wei Hu
E. Cao
Weijian Sun
NAI
90
122
0
22 Sep 2020
Public Health Informatics: Proposing Causal Sequence of Death Using
  Neural Machine Translation
Public Health Informatics: Proposing Causal Sequence of Death Using Neural Machine Translation
Yuanda Zhu
Ying Sha
Hang Wu
Mai Li
R. Hoffman
May D. Wang
HAI
26
0
0
22 Sep 2020
Preserving Integrity in Online Social Networks
Preserving Integrity in Online Social Networks
A. Halevy
Cristian Canton Ferrer
Hao Ma
Umut Ozertem
Patrick Pantel
Marzieh Saeidi
Fabrizio Silvestri
Ves Stoyanov
75
59
0
22 Sep 2020
"When they say weed causes depression, but it's your fav
  antidepressant": Knowledge-aware Attention Framework for Relationship
  Extraction
"When they say weed causes depression, but it's your fav antidepressant": Knowledge-aware Attention Framework for Relationship Extraction
S. Yadav
Usha Lokala
Raminta Daniulaityte
K. Thirunarayan
Francois R. Lamy
A. Sheth
36
17
0
21 Sep 2020
Learning to Attack: Towards Textual Adversarial Attacking in Real-world
  Situations
Learning to Attack: Towards Textual Adversarial Attacking in Real-world Situations
Yuan Zang
Bairu Hou
Fanchao Qi
Zhiyuan Liu
Xiaojun Meng
Maosong Sun
60
11
0
19 Sep 2020
CLEVR Parser: A Graph Parser Library for Geometric Learning on Language
  Grounded Image Scenes
CLEVR Parser: A Graph Parser Library for Geometric Learning on Language Grounded Image Scenes
Raeid Saqur
Ameet Deshpande
GNNNAI
22
0
0
19 Sep 2020
Weight Distillation: Transferring the Knowledge in Neural Network
  Parameters
Weight Distillation: Transferring the Knowledge in Neural Network Parameters
Ye Lin
Yanyang Li
Ziyang Wang
Bei Li
Quan Du
Tong Xiao
Jingbo Zhu
62
24
0
19 Sep 2020
Long-Short Term Masking Transformer: A Simple but Effective Baseline for
  Document-level Neural Machine Translation
Long-Short Term Masking Transformer: A Simple but Effective Baseline for Document-level Neural Machine Translation
Pei Zhang
Boxing Chen
Niyu Ge
Kai Fan
89
37
0
19 Sep 2020
The birth of Romanian BERT
The birth of Romanian BERT
Stefan Daniel Dumitrescu
Andrei-Marius Avram
S. Pyysalo
VLM
63
78
0
18 Sep 2020
Hierarchical GPT with Congruent Transformers for Multi-Sentence Language
  Models
Hierarchical GPT with Congruent Transformers for Multi-Sentence Language Models
Jihyeon Roh
Huiseong Gim
Soo-Young Lee
43
1
0
18 Sep 2020
Self-Supervised Meta-Learning for Few-Shot Natural Language
  Classification Tasks
Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks
Trapit Bansal
Rishikesh Jha
Tsendsuren Munkhdalai
Andrew McCallum
SSLVLM
108
88
0
17 Sep 2020
GraphCodeBERT: Pre-training Code Representations with Data Flow
GraphCodeBERT: Pre-training Code Representations with Data Flow
Daya Guo
Shuo Ren
Shuai Lu
Zhangyin Feng
Duyu Tang
...
Dawn Drain
Neel Sundaresan
Jian Yin
Daxin Jiang
M. Zhou
194
1,161
0
17 Sep 2020
Code-switching pre-training for neural machine translation
Code-switching pre-training for neural machine translation
Zhen Yang
Bojie Hu
Ambyera Han
Shen Huang
Qi Ju
102
74
0
17 Sep 2020
Efficient Transformer-based Large Scale Language Representations using
  Hardware-friendly Block Structured Pruning
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning
Bingbing Li
Zhenglun Kong
Tianyun Zhang
Ji Li
Zechao Li
Hang Liu
Caiwen Ding
VLM
192
65
0
17 Sep 2020
Reasoning about Goals, Steps, and Temporal Ordering with WikiHow
Reasoning about Goals, Steps, and Temporal Ordering with WikiHow
Li Zhang
Qing Lyu
Chris Callison-Burch
ReLMLRM
84
89
0
16 Sep 2020
Measuring Information Transfer in Neural Networks
Measuring Information Transfer in Neural Networks
Xiao Zhang
Xingjian Li
Dejing Dou
Ji Wu
71
3
0
16 Sep 2020
Question Directed Graph Attention Network for Numerical Reasoning over
  Text
Question Directed Graph Attention Network for Numerical Reasoning over Text
Kunlong Chen
Weidi Xu
Xingyi Cheng
Zou Xiaochuan
Yuyu Zhang
Le Song
Taifeng Wang
Yuan Qi
Wei Chu
AIMatOOD
85
67
0
16 Sep 2020
Retrofitting Structure-aware Transformer Language Model for End Tasks
Retrofitting Structure-aware Transformer Language Model for End Tasks
Hao Fei
Yafeng Ren
Donghong Ji
46
45
0
16 Sep 2020
Multi-span Style Extraction for Generative Reading Comprehension
Multi-span Style Extraction for Generative Reading Comprehension
Junjie Yang
Zhuosheng Zhang
Hai Zhao
SyDa
53
14
0
15 Sep 2020
Augmented Natural Language for Generative Sequence Labeling
Augmented Natural Language for Generative Sequence Labeling
Ben Athiwaratkun
Cicero Nogueira dos Santos
Jason Krone
Bing Xiang
VLM
75
64
0
15 Sep 2020
BERT-QE: Contextualized Query Expansion for Document Re-ranking
BERT-QE: Contextualized Query Expansion for Document Re-ranking
Zhi Zheng
Kai Hui
Xianpei Han
Xianpei Han
Le Sun
Andrew Yates
73
97
0
15 Sep 2020
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot
  Learners
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
Timo Schick
Hinrich Schütze
203
979
0
15 Sep 2020
MLMLM: Link Prediction with Mean Likelihood Masked Language Model
MLMLM: Link Prediction with Mean Likelihood Masked Language Model
Louis Clouâtre
P. Trempe
Payel Das
Sarath Chandar
114
44
0
15 Sep 2020
Current Limitations of Language Models: What You Need is Retrieval
Current Limitations of Language Models: What You Need is Retrieval
Aran Komatsuzaki
LRM
39
3
0
15 Sep 2020
Real-Time Execution of Large-scale Language Models on Mobile
Real-Time Execution of Large-scale Language Models on Mobile
Wei Niu
Zhenglun Kong
Geng Yuan
Weiwen Jiang
Jiexiong Guan
Caiwen Ding
Pu Zhao
Sijia Liu
Bin Ren
Yanzhi Wang
MQ
62
7
0
15 Sep 2020
Not-NUTs at W-NUT 2020 Task 2: A BERT-based System in Identifying
  Informative COVID-19 English Tweets
Not-NUTs at W-NUT 2020 Task 2: A BERT-based System in Identifying Informative COVID-19 English Tweets
T. Hoang
Phuong Thu Vu
16
0
0
14 Sep 2020
Learning an Effective Context-Response Matching Model with
  Self-Supervised Tasks for Retrieval-based Dialogues
Learning an Effective Context-Response Matching Model with Self-Supervised Tasks for Retrieval-based Dialogues
Ruijian Xu
Chongyang Tao
Daxin Jiang
Xueliang Zhao
Dongyan Zhao
Rui Yan
77
73
0
14 Sep 2020
On Robustness and Bias Analysis of BERT-based Relation Extraction
On Robustness and Bias Analysis of BERT-based Relation Extraction
Luoqiu Li
Xiang Chen
Hongbin Ye
Zhen Bi
Shumin Deng
Ningyu Zhang
Huajun Chen
81
18
0
14 Sep 2020
Fine-tuning Pre-trained Contextual Embeddings for Citation Content
  Analysis in Scholarly Publication
Fine-tuning Pre-trained Contextual Embeddings for Citation Content Analysis in Scholarly Publication
Haihua Chen
Huyen Nguyen
23
0
0
12 Sep 2020
Syntax Role for Neural Semantic Role Labeling
Syntax Role for Neural Semantic Role Labeling
Z. Li
Hai Zhao
Shexia He
Jiaxun Cai
NAI
67
19
0
12 Sep 2020
Compressed Deep Networks: Goodbye SVD, Hello Robust Low-Rank
  Approximation
Compressed Deep Networks: Goodbye SVD, Hello Robust Low-Rank Approximation
M. Tukan
Alaa Maalouf
Matan Weksler
Dan Feldman
84
9
0
11 Sep 2020
Generating Accurate Assert Statements for Unit Test Cases using
  Pretrained Transformers
Generating Accurate Assert Statements for Unit Test Cases using Pretrained Transformers
Michele Tufano
Dawn Drain
Alexey Svyatkovskiy
Neel Sundaresan
ViT
79
91
0
11 Sep 2020
UPB at SemEval-2020 Task 6: Pretrained Language Models for Definition
  Extraction
UPB at SemEval-2020 Task 6: Pretrained Language Models for Definition Extraction
Andrei-Marius Avram
Dumitru-Clementin Cercel
Costin-Gabriel Chiru
49
7
0
11 Sep 2020
A Comparison of LSTM and BERT for Small Corpus
A Comparison of LSTM and BERT for Small Corpus
Aysu Ezen-Can
74
113
0
11 Sep 2020
Rank over Class: The Untapped Potential of Ranking in Natural Language
  Processing
Rank over Class: The Untapped Potential of Ranking in Natural Language Processing
Amir Atapour-Abarghouei
Stephen Bonner
A. Mcgough
57
4
0
10 Sep 2020
Patient Cohort Retrieval using Transformer Language Models
Patient Cohort Retrieval using Transformer Language Models
Sarvesh Soni
Kirk Roberts
46
9
0
10 Sep 2020
Investigating Gender Bias in BERT
Investigating Gender Bias in BERT
Rishabh Bhardwaj
Navonil Majumder
Soujanya Poria
83
108
0
10 Sep 2020
Dialogue-adaptive Language Model Pre-training From Quality Estimation
Dialogue-adaptive Language Model Pre-training From Quality Estimation
Junlong Li
Zhuosheng Zhang
Hai Zhao
OffRL
68
12
0
10 Sep 2020
Learning Universal Representations from Word to Sentence
Learning Universal Representations from Word to Sentence
Yian Li
Hai Zhao
SSL
36
2
0
10 Sep 2020
Previous
123...565758...697071
Next