ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,518 papers shown
Title
AVA: an Automatic eValuation Approach to Question Answering Systems
AVA: an Automatic eValuation Approach to Question Answering Systems
Thuy Vu
Alessandro Moschitti
49
13
0
02 May 2020
Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer
Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer
Jieyu Zhao
Subhabrata Mukherjee
Saghar Hosseini
Kai-Wei Chang
Ahmed Hassan Awadallah
96
91
0
02 May 2020
DeFormer: Decomposing Pre-trained Transformers for Faster Question
  Answering
DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering
Qingqing Cao
H. Trivedi
A. Balasubramanian
Niranjan Balasubramanian
86
68
0
02 May 2020
DagoBERT: Generating Derivational Morphology with a Pretrained Language
  Model
DagoBERT: Generating Derivational Morphology with a Pretrained Language Model
Valentin Hofmann
J. Pierrehumbert
Hinrich Schütze
100
2
0
02 May 2020
On Faithfulness and Factuality in Abstractive Summarization
On Faithfulness and Factuality in Abstractive Summarization
Joshua Maynez
Shashi Narayan
Bernd Bohnet
Ryan T. McDonald
HILM
98
1,048
0
02 May 2020
KLEJ: Comprehensive Benchmark for Polish Language Understanding
KLEJ: Comprehensive Benchmark for Polish Language Understanding
Piotr Rybak
Robert Mroczkowski
Janusz Tracz
Ireneusz Gawlik
ELM
73
84
0
01 May 2020
POINTER: Constrained Progressive Text Generation via Insertion-based
  Generative Pre-training
POINTER: Constrained Progressive Text Generation via Insertion-based Generative Pre-training
Yizhe Zhang
Guoyin Wang
Chunyuan Li
Zhe Gan
Chris Brockett
Bill Dolan
84
30
0
01 May 2020
Self-supervised Knowledge Triplet Learning for Zero-shot Question
  Answering
Self-supervised Knowledge Triplet Learning for Zero-shot Question Answering
Pratyay Banerjee
Chitta Baral
90
65
0
01 May 2020
HERO: Hierarchical Encoder for Video+Language Omni-representation
  Pre-training
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training
Linjie Li
Yen-Chun Chen
Yu Cheng
Zhe Gan
Licheng Yu
Jingjing Liu
MLLMVLMOffRLAI4TS
133
507
0
01 May 2020
Why and when should you pool? Analyzing Pooling in Recurrent
  Architectures
Why and when should you pool? Analyzing Pooling in Recurrent Architectures
Pratyush Maini
Keshav Kolluru
Danish Pruthi
Mausam
66
6
0
01 May 2020
Hide-and-Seek: A Template for Explainable AI
Hide-and-Seek: A Template for Explainable AI
Thanos Tagaris
A. Stafylopatis
26
6
0
30 Apr 2020
Template Guided Text Generation for Task-Oriented Dialogue
Template Guided Text Generation for Task-Oriented Dialogue
Mihir Kale
Abhinav Rastogi
59
12
0
30 Apr 2020
Word Rotator's Distance
Word Rotator's Distance
Sho Yokoi
Ryo Takahashi
Reina Akama
Jun Suzuki
Kentaro Inui
OT
75
59
0
30 Apr 2020
Segatron: Segment-Aware Transformer for Language Modeling and
  Understanding
Segatron: Segment-Aware Transformer for Language Modeling and Understanding
Richard He Bai
Peng Shi
Jimmy J. Lin
Yuqing Xie
Luchen Tan
Kun Xiong
Wen Gao
Ming Li
38
8
0
30 Apr 2020
Recipes for Adapting Pre-trained Monolingual and Multilingual Models to
  Machine Translation
Recipes for Adapting Pre-trained Monolingual and Multilingual Models to Machine Translation
Asa Cooper Stickland
Xian Li
Marjan Ghazvininejad
101
46
0
30 Apr 2020
Analyzing the Surprising Variability in Word Embedding Stability Across
  Languages
Analyzing the Surprising Variability in Word Embedding Stability Across Languages
Laura Burdick
Jonathan K. Kummerfeld
Rada Mihalcea
44
9
0
30 Apr 2020
Enriched Pre-trained Transformers for Joint Slot Filling and Intent
  Detection
Enriched Pre-trained Transformers for Joint Slot Filling and Intent Detection
Momchil Hardalov
Ivan Koychev
Preslav Nakov
VLM
47
17
0
30 Apr 2020
Character-Level Translation with Self-attention
Character-Level Translation with Self-attention
Yingqiang Gao
Nikola I. Nikolov
Yuhuang Hu
Richard H. R. Hahnloser
46
27
0
30 Apr 2020
Look at the First Sentence: Position Bias in Question Answering
Look at the First Sentence: Position Bias in Question Answering
Miyoung Ko
Jinhyuk Lee
Hyunjae Kim
Gangwoo Kim
Jaewoo Kang
FaMLOOD
80
100
0
30 Apr 2020
memeBot: Towards Automatic Image Meme Generation
memeBot: Towards Automatic Image Meme Generation
Aadhavan Sadasivam
K. Gunasekar
H. Davulcu
Yezhou Yang
29
10
0
30 Apr 2020
RikiNet: Reading Wikipedia Pages for Natural Question Answering
RikiNet: Reading Wikipedia Pages for Natural Question Answering
Dayiheng Liu
Yeyun Gong
Jie Fu
Yu Yan
Jiusheng Chen
Daxin Jiang
Jiancheng Lv
Nan Duan
RALM
94
55
0
30 Apr 2020
TAVAT: Token-Aware Virtual Adversarial Training for Language
  Understanding
TAVAT: Token-Aware Virtual Adversarial Training for Language Understanding
Linyang Li
Xipeng Qiu
78
17
0
30 Apr 2020
An Empirical Study of Pre-trained Transformers for Arabic Information
  Extraction
An Empirical Study of Pre-trained Transformers for Arabic Information Extraction
Wuwei Lan
Yang Chen
Wei Xu
Alan Ritter
39
4
0
30 Apr 2020
"The Boating Store Had Its Best Sail Ever": Pronunciation-attentive
  Contextualized Pun Recognition
"The Boating Store Had Its Best Sail Ever": Pronunciation-attentive Contextualized Pun Recognition
Yichao Zhou
Jyun-Yu Jiang
Jieyu Zhao
Kai-Wei Chang
Wei Wang
35
13
0
29 Apr 2020
The Effect of Natural Distribution Shift on Question Answering Models
The Effect of Natural Distribution Shift on Question Answering Models
John Miller
K. Krauth
Benjamin Recht
Ludwig Schmidt
OOD
105
145
0
29 Apr 2020
General Purpose Text Embeddings from Pre-trained Language Models for
  Scalable Inference
General Purpose Text Embeddings from Pre-trained Language Models for Scalable Inference
Jingfei Du
Myle Ott
Haoran Li
Xing Zhou
Veselin Stoyanov
AI4CE
66
10
0
29 Apr 2020
Exploiting Structured Knowledge in Text via Graph-Guided Representation
  Learning
Exploiting Structured Knowledge in Text via Graph-Guided Representation Learning
Tao Shen
Yi Mao
Pengcheng He
Guodong Long
Adam Trischler
Weizhu Chen
84
63
0
29 Apr 2020
Zero-Shot Learning and its Applications from Autonomous Vehicles to
  COVID-19 Diagnosis: A Review
Zero-Shot Learning and its Applications from Autonomous Vehicles to COVID-19 Diagnosis: A Review
Mahdi Rezaei
Mahsa Shahidi
113
55
0
29 Apr 2020
Adversarial Subword Regularization for Robust Neural Machine Translation
Adversarial Subword Regularization for Robust Neural Machine Translation
Jungsoo Park
Mujeen Sung
Jinhyuk Lee
Jaewoo Kang
64
8
0
29 Apr 2020
Enhancing Answer Boundary Detection for Multilingual Machine Reading
  Comprehension
Enhancing Answer Boundary Detection for Multilingual Machine Reading Comprehension
Fei Yuan
Linjun Shou
X. Bai
Ming Gong
Yaobo Liang
Nan Duan
Yan Fu
Daxin Jiang
88
23
0
29 Apr 2020
Benchmarking Robustness of Machine Reading Comprehension Models
Benchmarking Robustness of Machine Reading Comprehension Models
Chenglei Si
Ziqing Yang
Yiming Cui
Wentao Ma
Ting Liu
Shijin Wang
ELMAAML
112
42
0
29 Apr 2020
Knowledgeable Dialogue Reading Comprehension on Key Turns
Knowledgeable Dialogue Reading Comprehension on Key Turns
Junlong Li
Zhuosheng Zhang
Hai Zhao
71
1
0
29 Apr 2020
BURT: BERT-inspired Universal Representation from Twin Structure
BURT: BERT-inspired Universal Representation from Twin Structure
Yian Li
Hai Zhao
40
0
0
29 Apr 2020
Span-based Localizing Network for Natural Language Video Localization
Span-based Localizing Network for Natural Language Video Localization
Hao Zhang
Aixin Sun
Wei Jing
Qiufeng Wang
110
316
0
29 Apr 2020
Revisiting Pre-Trained Models for Chinese Natural Language Processing
Revisiting Pre-Trained Models for Chinese Natural Language Processing
Yiming Cui
Wanxiang Che
Ting Liu
Bing Qin
Shijin Wang
Guoping Hu
104
705
0
29 Apr 2020
Exploring Self-attention for Image Recognition
Exploring Self-attention for Image Recognition
Hengshuang Zhao
Jiaya Jia
V. Koltun
SSL
100
790
0
28 Apr 2020
The Curse of Performance Instability in Analysis Datasets: Consequences,
  Source, and Suggestions
The Curse of Performance Instability in Analysis Datasets: Consequences, Source, and Suggestions
Xiang Zhou
Yixin Nie
Hao Tan
Joey Tianyi Zhou
111
41
0
28 Apr 2020
Conversational Word Embedding for Retrieval-Based Dialog System
Conversational Word Embedding for Retrieval-Based Dialog System
Wentao Ma
Yiming Cui
Ting Liu
Dong Wang
Shijin Wang
Guoping Hu
45
5
0
28 Apr 2020
UXLA: A Robust Unsupervised Data Augmentation Framework for
  Zero-Resource Cross-Lingual NLP
UXLA: A Robust Unsupervised Data Augmentation Framework for Zero-Resource Cross-Lingual NLP
M Saiful Bari
Tasnim Mohiuddin
Shafiq Joty
85
24
0
28 Apr 2020
SCELMo: Source Code Embeddings from Language Models
SCELMo: Source Code Embeddings from Language Models
Rafael-Michael Karampatsis
Charles Sutton
67
53
0
28 Apr 2020
The Impact of the Mini-batch Size on the Variance of Gradients in
  Stochastic Gradient Descent
The Impact of the Mini-batch Size on the Variance of Gradients in Stochastic Gradient Descent
Xin-Yao Qian
Diego Klabjan
ODL
72
36
0
27 Apr 2020
DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference
DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference
Ji Xin
Raphael Tang
Jaejun Lee
Yaoliang Yu
Jimmy J. Lin
65
377
0
27 Apr 2020
ColBERT: Using BERT Sentence Embedding in Parallel Neural Networks for
  Computational Humor
ColBERT: Using BERT Sentence Embedding in Parallel Neural Networks for Computational Humor
Issa Annamoradnejad
Gohar Zoghi
83
26
0
27 Apr 2020
The Gutenberg Dialogue Dataset
The Gutenberg Dialogue Dataset
Richard Csaky
Gábor Recski
84
14
0
27 Apr 2020
Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less
  Forgetting
Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting
Sanyuan Chen
Yutai Hou
Yiming Cui
Wanxiang Che
Ting Liu
Xiangzhan Yu
KELMCLL
140
226
0
27 Apr 2020
Masking as an Efficient Alternative to Finetuning for Pretrained
  Language Models
Masking as an Efficient Alternative to Finetuning for Pretrained Language Models
Mengjie Zhao
Tao R. Lin
Fei Mi
Martin Jaggi
Hinrich Schütze
77
121
0
26 Apr 2020
Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical
  Encoder for Long-Form Document Matching
Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical Encoder for Long-Form Document Matching
Liu Yang
Mingyang Zhang
Cheng Li
Michael Bendersky
Marc Najork
96
89
0
26 Apr 2020
MixText: Linguistically-Informed Interpolation of Hidden Space for
  Semi-Supervised Text Classification
MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification
Jiaao Chen
Zichao Yang
Diyi Yang
VLM
103
365
0
25 Apr 2020
Quantifying the Contextualization of Word Representations with Semantic
  Class Probing
Quantifying the Contextualization of Word Representations with Semantic Class Probing
Mengjie Zhao
Philipp Dufter
Yadollah Yaghoobzadeh
Hinrich Schütze
83
27
0
25 Apr 2020
How Does NLP Benefit Legal System: A Summary of Legal Artificial
  Intelligence
How Does NLP Benefit Legal System: A Summary of Legal Artificial Intelligence
Haoxiang Zhong
Chaojun Xiao
Cunchao Tu
Tianyang Zhang
Zhiyuan Liu
Maosong Sun
AILaw
140
307
0
25 Apr 2020
Previous
123...626364...697071
Next