ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,522 papers shown
Title
Exploring Discourse Structures for Argument Impact Classification
Exploring Discourse Structures for Argument Impact Classification
Xin Liu
Jiefu Ou
Yangqiu Song
Xin Jiang
48
12
0
02 Jun 2021
A Multi-Level Attention Model for Evidence-Based Fact Checking
A Multi-Level Attention Model for Evidence-Based Fact Checking
Canasai Kruengkrai
Junichi Yamagishi
Xin Wang
GNN
52
26
0
02 Jun 2021
Unsupervised Out-of-Domain Detection via Pre-trained Transformers
Unsupervised Out-of-Domain Detection via Pre-trained Transformers
Keyang Xu
Zhaolin Ren
Shikun Zhang
Yihao Feng
Caiming Xiong
ViT
81
41
0
02 Jun 2021
Conversational Question Answering: A Survey
Conversational Question Answering: A Survey
Munazza Zaib
Wei Emma Zhang
Quan Z. Sheng
A. Mahmood
Yang Zhang
89
91
0
02 Jun 2021
Dialogue-oriented Pre-training
Dialogue-oriented Pre-training
Yi Xu
Hai Zhao
80
14
0
01 Jun 2021
Distribution Matching for Rationalization
Distribution Matching for Rationalization
Yongfeng Huang
Yujun Chen
Yulun Du
Zhilin Yang
OOD
67
18
0
01 Jun 2021
Volta at SemEval-2021 Task 9: Statement Verification and Evidence
  Finding with Tables using TAPAS and Transfer Learning
Volta at SemEval-2021 Task 9: Statement Verification and Evidence Finding with Tables using TAPAS and Transfer Learning
Devansh Gautam
Kshitij Gupta
Manish Shrivastava
LMTD
49
6
0
01 Jun 2021
Adversarial VQA: A New Benchmark for Evaluating the Robustness of VQA
  Models
Adversarial VQA: A New Benchmark for Evaluating the Robustness of VQA Models
Linjie Li
Jie Lei
Zhe Gan
Jingjing Liu
AAMLVLM
116
75
0
01 Jun 2021
HiddenCut: Simple Data Augmentation for Natural Language Understanding
  with Better Generalization
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalization
Jiaao Chen
Dinghan Shen
Weizhu Chen
Diyi Yang
BDL
77
48
0
31 May 2021
Corpus-Based Paraphrase Detection Experiments and Review
Corpus-Based Paraphrase Detection Experiments and Review
T. Vrbanec
A. Meštrović
129
31
0
31 May 2021
Training ELECTRA Augmented with Multi-word Selection
Training ELECTRA Augmented with Multi-word Selection
Jiaming Shen
Jialu Liu
Tianqi Liu
Cong Yu
Jiawei Han
79
9
0
31 May 2021
How transfer learning impacts linguistic knowledge in deep NLP models?
How transfer learning impacts linguistic knowledge in deep NLP models?
Nadir Durrani
Hassan Sajjad
Fahim Dalvi
45
51
0
31 May 2021
M6-T: Exploring Sparse Expert Models and Beyond
M6-T: Exploring Sparse Expert Models and Beyond
An Yang
Junyang Lin
Rui Men
Chang Zhou
Le Jiang
...
Dingyang Zhang
Wei Lin
Lin Qu
Jingren Zhou
Hongxia Yang
MoE
124
24
0
31 May 2021
SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning
SemEval-2021 Task 4: Reading Comprehension of Abstract Meaning
Boyuan Zheng
Xiaoyu Yang
Yu-Ping Ruan
Zhen-Hua Ling
Quan Liu
Si Wei
Xiao-Dan Zhu
ELM
46
13
0
31 May 2021
Cascaded Head-colliding Attention
Cascaded Head-colliding Attention
Lin Zheng
Zhiyong Wu
Lingpeng Kong
55
2
0
31 May 2021
On the Interplay Between Fine-tuning and Composition in Transformers
On the Interplay Between Fine-tuning and Composition in Transformers
Lang-Chi Yu
Allyson Ettinger
77
14
0
31 May 2021
A Compression-Compilation Framework for On-mobile Real-time BERT
  Applications
A Compression-Compilation Framework for On-mobile Real-time BERT Applications
Wei Niu
Zhenglun Kong
Geng Yuan
Weiwen Jiang
Jiexiong Guan
Caiwen Ding
Pu Zhao
Sijia Liu
Bin Ren
Yanzhi Wang
MQ
37
4
0
30 May 2021
StyTr$^2$: Image Style Transfer with Transformers
StyTr2^22: Image Style Transfer with Transformers
Yingying Deng
Fan Tang
Weiming Dong
Chongyang Ma
Xingjia Pan
Lei Wang
Changsheng Xu
ViT
123
269
0
30 May 2021
Pre-training Universal Language Representation
Pre-training Universal Language Representation
Yian Li
Hai Zhao
SSL
62
8
0
30 May 2021
NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural
  Architecture Search
NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural Architecture Search
Jin Xu
Xu Tan
Renqian Luo
Kaitao Song
Jian Li
Tao Qin
Tie-Yan Liu
MQ
62
79
0
30 May 2021
Grammatical Error Correction as GAN-like Sequence Labeling
Grammatical Error Correction as GAN-like Sequence Labeling
Kevin Parnow
Zuchao Li
Hai Zhao
121
12
0
29 May 2021
Controllable Abstractive Dialogue Summarization with Sketch Supervision
Controllable Abstractive Dialogue Summarization with Sketch Supervision
Chien-Sheng Wu
Linqing Liu
Wenhao Liu
Pontus Stenetorp
Caiming Xiong
83
52
0
28 May 2021
Learning to Extend Program Graphs to Work-in-Progress Code
Learning to Extend Program Graphs to Work-in-Progress Code
Xuechen Li
Chris J. Maddison
Daniel Tarlow
50
2
0
28 May 2021
Cisco at SemEval-2021 Task 5: What's Toxic?: Leveraging Transformers for
  Multiple Toxic Span Extraction from Online Comments
Cisco at SemEval-2021 Task 5: What's Toxic?: Leveraging Transformers for Multiple Toxic Span Extraction from Online Comments
Sreyan Ghosh
Sonal Kumar
53
8
0
28 May 2021
Knowledge Inheritance for Pre-trained Language Models
Knowledge Inheritance for Pre-trained Language Models
Yujia Qin
Yankai Lin
Jing Yi
Jiajie Zhang
Xu Han
...
Yusheng Su
Zhiyuan Liu
Peng Li
Maosong Sun
Jie Zhou
VLM
85
50
0
28 May 2021
Domain-Adaptive Pretraining Methods for Dialogue Understanding
Domain-Adaptive Pretraining Methods for Dialogue Understanding
Han Wu
Kun Xu
Linfeng Song
Lifeng Jin
Haisong Zhang
Linqi Song
AI4CE
62
18
0
28 May 2021
ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment
  Prediction and Explanation
ILDC for CJPE: Indian Legal Documents Corpus for Court Judgment Prediction and Explanation
Vijit Malik
Rishabh Sanjay
S. Nigam
Kripabandhu Ghosh
S. Guha
Arnab Bhattacharya
Ashutosh Modi
ELMAILaw
117
149
0
28 May 2021
Leveraging Linguistic Coordination in Reranking N-Best Candidates For
  End-to-End Response Selection Using BERT
Leveraging Linguistic Coordination in Reranking N-Best Candidates For End-to-End Response Selection Using BERT
Mingzhi Yu
Diane Litman
31
2
0
27 May 2021
Inspecting the concept knowledge graph encoded by modern language models
Inspecting the concept knowledge graph encoded by modern language models
Carlos Aspillaga
Marcelo Mendoza
Alvaro Soto
72
13
0
27 May 2021
Verb Sense Clustering using Contextualized Word Representations for
  Semantic Frame Induction
Verb Sense Clustering using Contextualized Word Representations for Semantic Frame Induction
Kosuke Yamada
Ryohei Sasano
Koichi Takeda
39
7
0
27 May 2021
RAW-C: Relatedness of Ambiguous Words--in Context (A New Lexical
  Resource for English)
RAW-C: Relatedness of Ambiguous Words--in Context (A New Lexical Resource for English)
Sean Trott
Benjamin Bergen
135
20
0
27 May 2021
Path-based knowledge reasoning with textual semantic information for
  medical knowledge graph completion
Path-based knowledge reasoning with textual semantic information for medical knowledge graph completion
Yinyu Lan
Shizhu He
Xiangrong Zeng
Shengping Liu
Kang Liu
Jun Zhao
57
27
0
27 May 2021
Improve Query Focused Abstractive Summarization by Incorporating Answer
  Relevance
Improve Query Focused Abstractive Summarization by Incorporating Answer Relevance
Jane Polak Scowcroft
Tiezheng Yu
Pascale Fung
78
28
0
27 May 2021
Directed Acyclic Graph Network for Conversational Emotion Recognition
Directed Acyclic Graph Network for Conversational Emotion Recognition
Weizhou Shen
Siyue Wu
Yunyi Yang
Xiaojun Quan
124
245
0
27 May 2021
A Full-Stack Search Technique for Domain Optimized Deep Learning
  Accelerators
A Full-Stack Search Technique for Domain Optimized Deep Learning Accelerators
Dan Zhang
Safeen Huda
Ebrahim M. Songhori
Kartik Prabhu
Quoc V. Le
Anna Goldie
Azalia Mirhoseini
94
53
0
26 May 2021
Deception detection in text and its relation to the cultural dimension
  of individualism/collectivism
Deception detection in text and its relation to the cultural dimension of individualism/collectivism
Katerina Papantoniou
P. Papadakos
Theodore Patkos
G. Flouris
Ion Androutsopoulos
Dimitris Plexousakis
94
7
0
26 May 2021
TreeBERT: A Tree-Based Pre-Trained Model for Programming Language
TreeBERT: A Tree-Based Pre-Trained Model for Programming Language
Xue Jiang
Zhuoran Zheng
Chen Lyu
Liang Li
Lei Lyu
85
91
0
26 May 2021
LMMS Reloaded: Transformer-based Sense Embeddings for Disambiguation and
  Beyond
LMMS Reloaded: Transformer-based Sense Embeddings for Disambiguation and Beyond
Daniel Loureiro
A. Jorge
Jose Camacho-Collados
92
26
0
26 May 2021
Read, Listen, and See: Leveraging Multimodal Information Helps Chinese
  Spell Checking
Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking
Heng-Da Xu
Zhongli Li
Qingyu Zhou
Chao Li
Zizhen Wang
Yunbo Cao
Heyan Huang
Xian-Ling Mao
98
97
0
26 May 2021
Database Workload Characterization with Query Plan Encoders
Database Workload Characterization with Query Plan Encoders
Debjyoti Paul
Jie Cao
Feifei Li
Vivek Srikumar
34
18
0
26 May 2021
Context-Sensitive Visualization of Deep Learning Natural Language
  Processing Models
Context-Sensitive Visualization of Deep Learning Natural Language Processing Models
A. Dunn
Diana Inkpen
Razvan Andonie
44
8
0
25 May 2021
Focus Attention: Promoting Faithfulness and Diversity in Summarization
Focus Attention: Promoting Faithfulness and Diversity in Summarization
Rahul Aralikatte
Shashi Narayan
Joshua Maynez
S. Rothe
Ryan T. McDonald
114
46
0
25 May 2021
Estimating Redundancy in Clinical Text
Estimating Redundancy in Clinical Text
Thomas Searle
Zina M. Ibrahim
J. Teo
Richard J. B. Dobson
69
21
0
25 May 2021
TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference
TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference
Deming Ye
Yankai Lin
Yufei Huang
Maosong Sun
MQ
84
65
0
25 May 2021
HIN-RNN: A Graph Representation Learning Neural Network for Fraudster
  Group Detection With No Handcrafted Features
HIN-RNN: A Graph Representation Learning Neural Network for Fraudster Group Detection With No Handcrafted Features
Saeedreza Shehnepoor
R. Togneri
Wei Liu
Bennamoun
40
19
0
25 May 2021
Heterogeneous Graph Representation Learning with Relation Awareness
Heterogeneous Graph Representation Learning with Relation Awareness
Le Yu
Leilei Sun
Bowen Du
Chuanren Liu
Weifeng Lv
Hui Xiong
76
55
0
24 May 2021
Structural Pre-training for Dialogue Comprehension
Structural Pre-training for Dialogue Comprehension
Zhuosheng Zhang
Hai Zhao
94
31
0
23 May 2021
Killing One Bird with Two Stones: Model Extraction and Attribute
  Inference Attacks against BERT-based APIs
Killing One Bird with Two Stones: Model Extraction and Attribute Inference Attacks against BERT-based APIs
Chen Chen
Xuanli He
Lingjuan Lyu
Fangzhao Wu
SILMMIACV
102
8
0
23 May 2021
DepressionNet: A Novel Summarization Boosted Deep Framework for
  Depression Detection on Social Media
DepressionNet: A Novel Summarization Boosted Deep Framework for Depression Detection on Social Media
Hamad Zogan
Imran Razzak
Shoaib Jameel
Guandong Xu
83
60
0
23 May 2021
RST Parsing from Scratch
RST Parsing from Scratch
Thanh-Tung Nguyen
Xuan-Phi Nguyen
Shafiq Joty
Xiaoli Li
66
24
0
23 May 2021
Previous
123...434445...697071
Next