ResearchTrend.AI
XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
arXiv: 1906.08237

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,518 papers shown
Exploring Benefits of Transfer Learning in Neural Machine Translation
Tom Kocmi
60
17
0
06 Jan 2020
oLMpics -- On what Language Model Pre-training Captures
Alon Talmor
Yanai Elazar
Yoav Goldberg
Jonathan Berant
LRM
105
304
0
31 Dec 2019
Clinical XLNet: Modeling Sequential Clinical Notes and Predicting Prolonged Mechanical Ventilation
Kexin Huang
Abhishek Singh
Sitong Chen
E. Moseley
Chih-ying Deng
Naomi George
C. Lindvall
119
59
0
27 Dec 2019
Is Attention All What You Need? -- An Empirical Investigation on Convolution-Based Active Memory and Self-Attention
Thomas D. Dowdell
Hongyu Zhang
36
4
0
27 Dec 2019
Multi-Graph Transformer for Free-Hand Sketch Recognition
Peng Xu
Chaitanya K. Joshi
Xavier Bresson
ViT
115
87
0
24 Dec 2019
A Multimodal Target-Source Classifier with Attention Branches to Understand Ambiguous Instructions for Fetching Daily Objects
A. Magassouba
K. Sugiura
Hisashi Kawai
81
9
0
23 Dec 2019
Learning and Evaluating Contextual Embedding of Source Code
Aditya Kanade
Petros Maniatis
Gogul Balakrishnan
Kensen Shi
ELM
87
77
0
21 Dec 2019
Are Transformers universal approximators of sequence-to-sequence functions?
Chulhee Yun
Srinadh Bhojanapalli
A. S. Rawat
Sashank J. Reddi
Sanjiv Kumar
140
358
0
20 Dec 2019
End-to-end Named Entity Recognition and Relation Extraction using Pre-trained Language Models
John Giorgi
Xindi Wang
Nicola Sahar
W. Shin
Gary D. Bader
Bo Wang
76
38
0
20 Dec 2019
Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language Model
Wenhan Xiong
Jingfei Du
William Yang Wang
Veselin Stoyanov
SSL
KELM
105
201
0
20 Dec 2019
Asymmetrical Hierarchical Networks with Attentive Interactions for Interpretable Review-Based Recommendation
Xin Dong
Jingchao Ni
Wei Cheng
Zhengzhang Chen
Bo Zong
Dongjin Song
Yanchi Liu
Haifeng Chen
Gerard de Melo
144
55
0
18 Dec 2019
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
Jingqing Zhang
Yao-Min Zhao
Mohammad Saleh
Peter J. Liu
RALM
3DGS
313
2,057
0
18 Dec 2019
Curriculum Learning Strategies for IR: An Empirical Study on Conversation Response Ranking
Gustavo Penha
C. Hauff
77
24
0
18 Dec 2019
Chinese Named Entity Recognition Augmented with Lexicon Memory
Yi Zhou
Xiaoqing Zheng
Xuanjing Huang
29
5
0
17 Dec 2019
Cross-Lingual Ability of Multilingual BERT: An Empirical Study
Karthikeyan K
Zihan Wang
Stephen D. Mayhew
Dan Roth
LRM
98
340
0
17 Dec 2019
WaLDORf: Wasteless Language-model Distillation On Reading-comprehension
J. Tian
A. Kreuzer
Pai-Hung Chen
Hans-Martin Will
VLM
60
3
0
13 Dec 2019
Spatial-Temporal Self-Attention Network for Flow Prediction
Haoxing Lin
Weijia Jia
Yiping Sun
Yongjian You
3DPC
AI4TS
62
8
0
13 Dec 2019
Extending Machine Language Models toward Human-Level Language Understanding
James L. McClelland
Felix Hill
Maja R. Rudolph
Jason Baldridge
Hinrich Schütze
LRM
78
35
0
12 Dec 2019
FlauBERT: Unsupervised Language Model Pre-training for French
Hang Le
Loïc Vial
Jibril Frej
Vincent Segonne
Maximin Coavoux
Benjamin Lecouteux
A. Allauzen
Benoît Crabbé
Laurent Besacier
D. Schwab
AI4CE
111
401
0
11 Dec 2019
Automatic Spanish Translation of the SQuAD Dataset for Multilingual Question Answering
C. Carrino
Marta R. Costa-jussá
José A. R. Fonollosa
69
89
0
11 Dec 2019
Zero-shot Text Classification With Generative Language Models
Raul Puri
Bryan Catanzaro
VLM
81
106
0
10 Dec 2019
Learning Norms from Stories: A Prior for Value Aligned Agents
Spencer Frazier
Md Sultan al Nahian
Mark O. Riedl
Brent Harrison
73
39
0
07 Dec 2019
Personalized Patent Claim Generation and Measurement
Jieh-Sheng Lee
25
4
0
07 Dec 2019
Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline
Vishvak Murahari
Dhruv Batra
Devi Parikh
Abhishek Das
VLM
111
117
0
05 Dec 2019
12-in-1: Multi-Task Vision and Language Representation Learning
Jiasen Lu
Vedanuj Goswami
Marcus Rohrbach
Devi Parikh
Stefan Lee
VLM
ObjD
131
481
0
05 Dec 2019
Natural Alpha Embeddings
Riccardo Volpi
Luigi Malagò
53
5
0
04 Dec 2019
An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering
Shayne Longpre
Yi Lu
Zhucheng Tu
Christopher DuBois
71
70
0
04 Dec 2019
Acquiring Knowledge from Pre-trained Model to Neural Machine Translation
Rongxiang Weng
Heng Yu
Shujian Huang
Shanbo Cheng
Weihua Luo
93
67
0
04 Dec 2019
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition
Shaoshi Ling
Yuzong Liu
Julian Salazar
Katrin Kirchhoff
SSL
86
139
0
03 Dec 2019
BERT for Large-scale Video Segment Classification with Test-time Augmentation
Tianqi Liu
Qizhan Shao
57
4
0
02 Dec 2019
Leveraging Contextual Embeddings for Detecting Diachronic Semantic Shift
Matej Martinc
Petra Kralj Novak
Senja Pollak
80
72
0
02 Dec 2019
EDA: Enriching Emotional Dialogue Acts using an Ensemble of Neural Annotators
Chandrakant Bothe
C. Weber
S. Magg
S. Wermter
57
10
0
02 Dec 2019
Bimodal Speech Emotion Recognition Using Pre-Trained Language Models
Verena Heusser
Niklas Freymuth
Stefan Constantin
A. Waibel
92
26
0
29 Nov 2019
Inducing Relational Knowledge from BERT
Zied Bouraoui
Jose Camacho-Collados
Steven Schockaert
96
167
0
28 Nov 2019
Multi-label Classification for Automatic Tag Prediction in the Context of Programming Challenges
Bianca Iancu
Gabriele Mazzola
Kyriakos Psarakis
Panagiotis Soilis
21
5
0
27 Nov 2019
Taking a Stance on Fake News: Towards Automatic Disinformation Assessment via Deep Bidirectional Transformer Language Models for Stance Detection
Chris Dulhanty
Jason L. Deglint
Ibrahim Ben Daya
A. Wong
49
22
0
27 Nov 2019
Evaluating Commonsense in Pre-trained Language Models
Xuhui Zhou
Yue Zhang
Leyang Cui
Dandan Huang
AI4MH
LRM
88
185
0
27 Nov 2019
Word-Class Embeddings for Multiclass Text Classification
Alejandro Moreo
Andrea Esuli
Fabrizio Sebastiani
33
36
0
26 Nov 2019
Pre-Training of Deep Bidirectional Protein Sequence Representations with Structural Information
Seonwoo Min
Seunghyun Park
Siwon Kim
Hyun-Soo Choi
Byunghan Lee
Sungroh Yoon
SSL
73
63
0
25 Nov 2019
FairyTED: A Fair Rating Predictor for TED Talk Data
Rupam Acharyya
Shouman Das
Ankani Chattoraj
Md. Iftekhar Tanveer
41
12
0
25 Nov 2019
Unsupervised Domain Adaptation of Language Models for Reading Comprehension
Kosuke Nishida
Kyosuke Nishida
Itsumi Saito
Hisako Asano
J. Tomita
102
26
0
25 Nov 2019
End-to-End Trainable Non-Collaborative Dialog System
Yu Li
Kun Qian
Weiyan Shi
Zhou Yu
87
46
0
25 Nov 2019
Invenio: Discovering Hidden Relationships Between Tasks/Domains Using Structured Meta Learning
Sameeksha Katoch
Kowshik Thopalli
Jayaraman J. Thiagarajan
Pavan Turaga
A. Spanias
39
4
0
24 Nov 2019
A Transformer-based approach to Irony and Sarcasm detection
Rolandos Alexandros Potamias
Georgios Siolas
A. Stafylopatis
47
213
0
23 Nov 2019
Factorized Multimodal Transformer for Multimodal Sequential Learning
Amir Zadeh
Chengfeng Mao
Kelly Shi
Yiwei Zhang
Paul Pu Liang
Soujanya Poria
Louis-Philippe Morency
69
45
0
22 Nov 2019
Learning Multi-level Dependencies for Robust Word Recognition
Z. Wang
Hui Liu
Jiliang Tang
Songfan Yang
Gale Yan Huang
Zitao Liu
67
8
0
22 Nov 2019
Outside the Box: Abstraction-Based Monitoring of Neural Networks
T. Henzinger
Anna Lukina
Christian Schilling
AAML
93
59
0
20 Nov 2019
Global Greedy Dependency Parsing
Z. Li
Zhao Hai
Kevin Parnow
113
31
0
20 Nov 2019
Towards non-toxic landscapes: Automatic toxic comment detection using DNN
Ashwin Geet D'Sa
Irina Illina
Dominique Fohr
56
22
0
19 Nov 2019
DARB: A Density-Aware Regular-Block Pruning for Deep Neural Networks
Ao Ren
Tao Zhang
Yuhao Wang
Sheng Lin
Peiyan Dong
Yen-kuang Chen
Yuan Xie
Yanzhi Wang
78
11
0
19 Nov 2019