ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,520 papers shown
Title
Large Scale Legal Text Classification Using Transformer Models
Large Scale Legal Text Classification Using Transformer Models
Zein Shaheen
G. Wohlgenannt
Erwin Filtz
AILaw
80
72
0
24 Oct 2020
ReadOnce Transformers: Reusable Representations of Text for Transformers
ReadOnce Transformers: Reusable Representations of Text for Transformers
Shih-Ting Lin
Ashish Sabharwal
Tushar Khot
117
3
0
24 Oct 2020
Multilingual Speech Translation with Efficient Finetuning of Pretrained
  Models
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models
Xian Li
Changhan Wang
Yun Tang
C. Tran
Yuqing Tang
J. Pino
Alexei Baevski
Alexis Conneau
Michael Auli
62
6
0
24 Oct 2020
Open-Domain Dialogue Generation Based on Pre-trained Language Models
Open-Domain Dialogue Generation Based on Pre-trained Language Models
Yan Zeng
J. Nie
31
3
0
24 Oct 2020
ANLIzing the Adversarial Natural Language Inference Dataset
ANLIzing the Adversarial Natural Language Inference Dataset
Adina Williams
Tristan Thrush
Douwe Kiela
AAML
255
47
0
24 Oct 2020
Dynamic Contextualized Word Embeddings
Dynamic Contextualized Word Embeddings
Valentin Hofmann
J. Pierrehumbert
Hinrich Schütze
116
52
0
23 Oct 2020
Robust Document Representations using Latent Topics and Metadata
Robust Document Representations using Latent Topics and Metadata
Natraj Raman
Armineh Nourbakhsh
Sameena Shah
Manuela Veloso
26
0
0
23 Oct 2020
Improving Robustness by Augmenting Training Sentences with
  Predicate-Argument Structures
Improving Robustness by Augmenting Training Sentences with Predicate-Argument Structures
N. Moosavi
M. Boer
Prasetya Ajie Utama
Iryna Gurevych
82
13
0
23 Oct 2020
TweetEval: Unified Benchmark and Comparative Evaluation for Tweet
  Classification
TweetEval: Unified Benchmark and Comparative Evaluation for Tweet Classification
Francesco Barbieri
Jose Camacho-Collados
Leonardo Neves
Luis Espinosa-Anke
VLM
97
732
0
23 Oct 2020
Unsupervised Cross-lingual Adaptation for Sequence Tagging and Beyond
Unsupervised Cross-lingual Adaptation for Sequence Tagging and Beyond
Xin Li
Lidong Bing
Wenxuan Zhang
Zheng Li
Wai Lam
125
25
0
23 Oct 2020
Pre-training Graph Transformer with Multimodal Side Information for
  Recommendation
Pre-training Graph Transformer with Multimodal Side Information for Recommendation
Yong Liu
Susen Yang
Chenyi Lei
Guoxin Wang
Haihong Tang
Juyong Zhang
Aixin Sun
Chunyan Miao
29
4
0
23 Oct 2020
Generating Long Financial Report using Conditional Variational
  Autoencoders with Knowledge Distillation
Generating Long Financial Report using Conditional Variational Autoencoders with Knowledge Distillation
Yunpeng Ren
Ziao Wang
Yiyuan Wang
Xiaofeng Zhang
60
9
0
23 Oct 2020
KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for
  Kinyarwanda and Kirundi
KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi
Andre Niyongabo Rubungo
Hong Qu
Julia Kreutzer
Li Huang
65
42
0
23 Oct 2020
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling
  for Natural Language Understanding
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
Dongling Xiao
Yukun Li
Han Zhang
Yu Sun
Hao Tian
Hua Wu
Haifeng Wang
29
39
0
23 Oct 2020
Language Models are Open Knowledge Graphs
Language Models are Open Knowledge Graphs
Chenguang Wang
Xiao Liu
Basel Alomair
SSLKELM
81
137
0
22 Oct 2020
Challenges in Information-Seeking QA: Unanswerable Questions and
  Paragraph Retrieval
Challenges in Information-Seeking QA: Unanswerable Questions and Paragraph Retrieval
Akari Asai
Eunsol Choi
RALM
115
54
0
22 Oct 2020
Rewriting Meaningful Sentences via Conditional BERT Sampling and an application on fooling text classifiers
Lei Xu
Ivan Ramirez
K. Veeramachaneni
AAML
32
2
0
22 Oct 2020
ConVEx: Data-Efficient and Few-Shot Slot Labeling
ConVEx: Data-Efficient and Few-Shot Slot Labeling
Matthew Henderson
Ivan Vulić
CLIPVLM
87
38
0
22 Oct 2020
Knowledge Distillation for BERT Unsupervised Domain Adaptation
Knowledge Distillation for BERT Unsupervised Domain Adaptation
Minho Ryu
K. Lee
105
35
0
22 Oct 2020
Latte-Mix: Measuring Sentence Semantic Similarity with Latent
  Categorical Mixtures
Latte-Mix: Measuring Sentence Semantic Similarity with Latent Categorical Mixtures
Minghan Li
He Bai
Luchen Tan
Kun Xiong
Ming Li
Jimmy J. Lin
FedML
41
0
0
21 Oct 2020
Neural Networks for Entity Matching: A Survey
Neural Networks for Entity Matching: A Survey
Nils Barlaug
J. Gulla
143
96
0
21 Oct 2020
Complaint Identification in Social Media with Transformer Networks
Complaint Identification in Social Media with Transformer Networks
Mali Jin
Nikolaos Aletras
47
16
0
21 Oct 2020
Transition-based Parsing with Stack-Transformers
Transition-based Parsing with Stack-Transformers
Ramón Fernández Astudillo
Miguel Ballesteros
Tahira Naseem
Austin Blodgett
Radu Florian
138
71
0
20 Oct 2020
An Empirical Investigation of Contextualized Number Prediction
An Empirical Investigation of Contextualized Number Prediction
Daniel M. Spokoyny
Taylor Berg-Kirkpatrick
AI4TS
83
38
0
20 Oct 2020
AutoMeTS: The Autocomplete for Medical Text Simplification
AutoMeTS: The Autocomplete for Medical Text Simplification
Hoang Van
David Kauchak
Gondy Leroy
79
31
0
20 Oct 2020
Better Highlighting: Creating Sub-Sentence Summary Highlights
Better Highlighting: Creating Sub-Sentence Summary Highlights
Sangwoo Cho
Kaiqiang Song
Chen Li
Dong Yu
H. Foroosh
Fei Liu
87
12
0
20 Oct 2020
BERT2DNN: BERT Distillation with Massive Unlabeled Data for Online
  E-Commerce Search
BERT2DNN: BERT Distillation with Massive Unlabeled Data for Online E-Commerce Search
Yunjiang Jiang
Yue Shang
Ziyang Liu
Hongwei Shen
Yun Xiao
Wei Xiong
Sulong Xu
Weipeng P. Yan
Di Jin
64
17
0
20 Oct 2020
Bi-directional Cognitive Thinking Network for Machine Reading
  Comprehension
Bi-directional Cognitive Thinking Network for Machine Reading Comprehension
Wei Peng
Yue Hu
Luxi Xing
Yuqiang Xie
Jing Yu
Yajing Sun
Xiangpeng Wei
64
7
0
20 Oct 2020
Local Knowledge Powered Conversational Agents
Local Knowledge Powered Conversational Agents
Sashank Santhanam
Ming-Yu Liu
Raul Puri
Mohammad Shoeybi
M. Patwary
Bryan Catanzaro
95
4
0
20 Oct 2020
Technical Question Answering across Tasks and Domains
Technical Question Answering across Tasks and Domains
Wenhao Yu
Lingfei Wu
Yu Deng
Qingkai Zeng
R. Mahindru
S. Guven
Meng Jiang
60
8
0
19 Oct 2020
Effects of Parameter Norm Growth During Transformer Training: Inductive
  Bias from Gradient Descent
Effects of Parameter Norm Growth During Transformer Training: Inductive Bias from Gradient Descent
William Merrill
Vivek Ramanujan
Yoav Goldberg
Roy Schwartz
Noah A. Smith
AI4CE
80
36
0
19 Oct 2020
An Empirical Study for Vietnamese Constituency Parsing with Pre-training
An Empirical Study for Vietnamese Constituency Parsing with Pre-training
Tuan-Vi Tran
Xuan-Thien Pham
Duc-Vu Nguyen
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
46
4
0
19 Oct 2020
Cold-start Active Learning through Self-supervised Language Modeling
Cold-start Active Learning through Self-supervised Language Modeling
Michelle Yuan
Hsuan-Tien Lin
Jordan L. Boyd-Graber
209
185
0
19 Oct 2020
Heads-up! Unsupervised Constituency Parsing via Self-Attention Heads
Heads-up! Unsupervised Constituency Parsing via Self-Attention Heads
Bowen Li
Taeuk Kim
Reinald Kim Amplayo
Frank Keller
SSL
101
17
0
19 Oct 2020
Towards Interpreting BERT for Reading Comprehension Based QA
Towards Interpreting BERT for Reading Comprehension Based QA
Sahana Ramnath
Preksha Nema
Deep Sahni
Mitesh M. Khapra
94
30
0
18 Oct 2020
Federated Unsupervised Representation Learning
Federated Unsupervised Representation Learning
Fengda Zhang
Kun Kuang
Zhaoyang You
Tao Shen
Jun Xiao
Yin Zhang
Chao-Xiang Wu
Yueting Zhuang
Xiaolin Li
FedML
89
137
0
18 Oct 2020
Towards Data Distillation for End-to-end Spoken Conversational Question
  Answering
Towards Data Distillation for End-to-end Spoken Conversational Question Answering
Chenyu You
Nuo Chen
Fenglin Liu
Dongchao Yang
Yuexian Zou
77
48
0
18 Oct 2020
Knowledge-Grounded Dialogue Generation with Pre-trained Language Models
Knowledge-Grounded Dialogue Generation with Pre-trained Language Models
Xueliang Zhao
Wei Wu
Can Xu
Chongyang Tao
Dongyan Zhao
Rui Yan
260
193
0
17 Oct 2020
Consistency and Coherency Enhanced Story Generation
Consistency and Coherency Enhanced Story Generation
Wei Wang
Piji Li
Haitao Zheng
71
11
0
17 Oct 2020
Cross-Lingual Relation Extraction with Transformers
Cross-Lingual Relation Extraction with Transformers
Jian Ni
Taesun Moon
Parul Awasthy
Radu Florian
ViT
37
6
0
16 Oct 2020
Mischief: A Simple Black-Box Attack Against Transformer Architectures
Mischief: A Simple Black-Box Attack Against Transformer Architectures
Adrian de Wynter
AAML
74
1
0
16 Oct 2020
Delaying Interaction Layers in Transformer-based Encoders for Efficient
  Open Domain Question Answering
Delaying Interaction Layers in Transformer-based Encoders for Efficient Open Domain Question Answering
W. Siblini
Mohamed Challal
Charlotte Pasqual
59
3
0
16 Oct 2020
Automatic Feasibility Study via Data Quality Analysis for ML: A
  Case-Study on Label Noise
Automatic Feasibility Study via Data Quality Analysis for ML: A Case-Study on Label Noise
Cédric Renggli
Luka Rimanic
Luka Kolar
Wentao Wu
Ce Zhang
83
3
0
16 Oct 2020
WNUT-2020 Task 2: Identification of Informative COVID-19 English Tweets
WNUT-2020 Task 2: Identification of Informative COVID-19 English Tweets
Dat Quoc Nguyen
Thanh Tien Vu
A. Rahimi
M. Dao
L. T. Nguyen
Long Doan
65
74
0
16 Oct 2020
Coarse-to-Fine Pre-training for Named Entity Recognition
Coarse-to-Fine Pre-training for Named Entity Recognition
Mengge Xue
Yu Bowen
Zhenyu Zhang
Tingwen Liu
Yue Zhang
Bin Wang
63
53
0
16 Oct 2020
FPRaker: A Processing Element For Accelerating Neural Network Training
FPRaker: A Processing Element For Accelerating Neural Network Training
Omar Mohamed Awad
Mostafa Mahmoud
Isak Edo Vivancos
Ali Hadi Zadeh
Ciaran Bannon
Anand Jayarajan
Gennady Pekhimenko
Andreas Moshovos
89
15
0
15 Oct 2020
NUIG-Shubhanker@Dravidian-CodeMix-FIRE2020: Sentiment Analysis of
  Code-Mixed Dravidian text using XLNet
NUIG-Shubhanker@Dravidian-CodeMix-FIRE2020: Sentiment Analysis of Code-Mixed Dravidian text using XLNet
Shubhanker Banerjee
A. Jayapal
Sajeetha Thavareesan
32
16
0
15 Oct 2020
Improving Constituency Parsing with Span Attention
Improving Constituency Parsing with Span Attention
Yuanhe Tian
Yan Song
Fei Xia
Tong Zhang
78
45
0
15 Oct 2020
Natural Language Rationales with Full-Stack Visual Reasoning: From
  Pixels to Semantic Frames to Commonsense Graphs
Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs
Ana Marasović
Chandra Bhagavatula
J. S. Park
Ronan Le Bras
Noah A. Smith
Yejin Choi
ReLMLRM
99
62
0
15 Oct 2020
Neural Deepfake Detection with Factual Structure of Text
Neural Deepfake Detection with Factual Structure of Text
Wanjun Zhong
Duyu Tang
Zenan Xu
Ruize Wang
Nan Duan
M. Zhou
Jiahai Wang
Jian Yin
52
66
0
15 Oct 2020
Previous
123...535455...697071
Next