ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,522 papers shown
Title
Head-driven Phrase Structure Parsing in O($n^3$) Time Complexity
Head-driven Phrase Structure Parsing in O(n3n^3n3) Time Complexity
Zuchao Li
Junru Zhou
Hai Zhao
Kevin Parnow
51
0
0
20 May 2021
KLUE: Korean Language Understanding Evaluation
KLUE: Korean Language Understanding Evaluation
Sungjoon Park
Jihyung Moon
Sungdong Kim
Won Ik Cho
Jiyoon Han
...
Seonghyun Kim
Lucy Park
Alice Oh
Jung-Woo Ha
Kyunghyun Cho
ELMVLM
123
198
0
20 May 2021
Retrieval-Augmented Transformer-XL for Close-Domain Dialog Generation
Retrieval-Augmented Transformer-XL for Close-Domain Dialog Generation
Giovanni Bonetta
R. Cancelliere
Ding Liu
Paul Vozila
RALM
28
17
0
19 May 2021
Explainable Tsetlin Machine framework for fake news detection with
  credibility score assessment
Explainable Tsetlin Machine framework for fake news detection with credibility score assessment
Bimal Bhattarai
Ole-Christoffer Granmo
Lei Jiao
61
37
0
19 May 2021
CoTexT: Multi-task Learning with Code-Text Transformer
CoTexT: Multi-task Learning with Code-Text Transformer
Long Phan
H. Tran
Daniel Le
Hieu Duy Nguyen
J. Anibal
Alec Peltekian
Yanfang Ye
92
136
0
18 May 2021
Parallel Attention Network with Sequence Matching for Video Grounding
Parallel Attention Network with Sequence Matching for Video Grounding
Hao Zhang
Aixin Sun
Wei Jing
Liangli Zhen
Qiufeng Wang
Rick Siow Mong Goh
109
41
0
18 May 2021
Link Prediction on N-ary Relational Facts: A Graph-based Approach
Link Prediction on N-ary Relational Facts: A Graph-based Approach
Quan Wang
Haifeng Wang
Yajuan Lyu
Yong Zhu
95
49
0
18 May 2021
Divide and Contrast: Self-supervised Learning from Uncurated Data
Divide and Contrast: Self-supervised Learning from Uncurated Data
Yonglong Tian
Olivier J. Hénaff
Aaron van den Oord
SSL
138
101
0
17 May 2021
Pay Attention to MLPs
Pay Attention to MLPs
Hanxiao Liu
Zihang Dai
David R. So
Quoc V. Le
AI4CE
173
672
0
17 May 2021
SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising
SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising
K. Xuan
Yongbo Wang
Yongliang Wang
Zujie Wen
Yang Dong
VLM
76
54
0
17 May 2021
How is BERT surprised? Layerwise detection of linguistic anomalies
How is BERT surprised? Layerwise detection of linguistic anomalies
Bai Li
Zining Zhu
Guillaume Thomas
Yang Xu
Frank Rudzicz
76
31
0
16 May 2021
BERT Busters: Outlier Dimensions that Disrupt Transformers
BERT Busters: Outlier Dimensions that Disrupt Transformers
Olga Kovaleva
Saurabh Kulshreshtha
Anna Rogers
Anna Rumshisky
126
93
0
14 May 2021
Designing Multimodal Datasets for NLP Challenges
Designing Multimodal Datasets for NLP Challenges
James Pustejovsky
E. Holderness
Jingxuan Tu
Parker Glenn
Kyeongmin Rim
Kelley Lynch
R. Brutti
59
5
0
12 May 2021
Addressing "Documentation Debt" in Machine Learning Research: A
  Retrospective Datasheet for BookCorpus
Addressing "Documentation Debt" in Machine Learning Research: A Retrospective Datasheet for BookCorpus
Jack Bandy
Nicholas Vincent
69
57
0
11 May 2021
Assessing the Syntactic Capabilities of Transformer-based Multilingual
  Language Models
Assessing the Syntactic Capabilities of Transformer-based Multilingual Language Models
Laura Pérez-Mayos
Alba Táboas García
Simon Mille
Leo Wanner
ELMLRM
51
8
0
10 May 2021
R2D2: Relational Text Decoding with Transformers
R2D2: Relational Text Decoding with Transformers
Aryan Arbabi
Mingqiu Wang
Laurent El Shafey
Nan Du
Izhak Shafran
39
1
0
10 May 2021
REPT: Bridging Language Models and Machine Reading Comprehension via
  Retrieval-Based Pre-training
REPT: Bridging Language Models and Machine Reading Comprehension via Retrieval-Based Pre-training
Fangkai Jiao
Yangyang Guo
Yilin Niu
Feng Ji
Feng-Lin Li
Liqiang Nie
LRM
69
12
0
10 May 2021
DocSCAN: Unsupervised Text Classification via Learning from Neighbors
DocSCAN: Unsupervised Text Classification via Learning from Neighbors
Dominik Stammbach
Elliott Ash
78
9
0
09 May 2021
Dispatcher: A Message-Passing Approach To Language Modelling
Dispatcher: A Message-Passing Approach To Language Modelling
A. Cetoli
84
0
0
09 May 2021
Enhancing Transformers with Gradient Boosted Decision Trees for NLI
  Fine-Tuning
Enhancing Transformers with Gradient Boosted Decision Trees for NLI Fine-Tuning
Benjamin Minixhofer
Milan Gritta
Ignacio Iacobacci
AI4CE
28
5
0
08 May 2021
Logic-Driven Context Extension and Data Augmentation for Logical
  Reasoning of Text
Logic-Driven Context Extension and Data Augmentation for Logical Reasoning of Text
Siyuan Wang
Wanjun Zhong
Duyu Tang
Zhongyu Wei
Zhihao Fan
Daxin Jiang
Ming Zhou
Nan Duan
NAI
137
73
0
08 May 2021
Empirical Evaluation of Pre-trained Transformers for Human-Level NLP:
  The Role of Sample Size and Dimensionality
Empirical Evaluation of Pre-trained Transformers for Human-Level NLP: The Role of Sample Size and Dimensionality
Adithya Ganesan
Matthew Matero
Aravind Reddy Ravula
Huy-Hien Vu
H. Andrew Schwartz
90
35
0
07 May 2021
Are Pre-trained Convolutions Better than Pre-trained Transformers?
Are Pre-trained Convolutions Better than Pre-trained Transformers?
Yi Tay
Mostafa Dehghani
J. Gupta
Dara Bahri
V. Aribandi
Zhen Qin
Donald Metzler
AI4CE
84
49
0
07 May 2021
Graph-based Multilingual Product Retrieval in E-commerce Search
Graph-based Multilingual Product Retrieval in E-commerce Search
Hanqing Lu
You-Heng Hu
Tong Zhao
Tony Wu
Yiwei Song
Bing Yin
115
25
0
06 May 2021
GraphFormers: GNN-nested Transformers for Representation Learning on
  Textual Graph
GraphFormers: GNN-nested Transformers for Representation Learning on Textual Graph
Junhan Yang
Zheng Liu
Shitao Xiao
Chaozhuo Li
Defu Lian
Sanjay Agrawal
Amit Singh
Guangzhong Sun
Xing Xie
AI4CE
91
160
0
06 May 2021
Assessing Dialogue Systems with Distribution Distances
Assessing Dialogue Systems with Distribution Distances
Jiannan Xiang
Yahui Liu
Deng Cai
Huayang Li
Defu Lian
Lemao Liu
99
18
0
06 May 2021
Towards General Natural Language Understanding with Probabilistic
  Worldbuilding
Towards General Natural Language Understanding with Probabilistic Worldbuilding
Abulhair Saparov
Tom Michael Mitchell
101
6
0
06 May 2021
Security Vulnerability Detection Using Deep Learning Natural Language
  Processing
Security Vulnerability Detection Using Deep Learning Natural Language Processing
Noah Ziems
Shaoen Wu
93
58
0
06 May 2021
Beyond Self-attention: External Attention using Two Linear Layers for
  Visual Tasks
Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks
Meng-Hao Guo
Zheng-Ning Liu
Tai-Jiang Mu
Shimin Hu
77
504
0
05 May 2021
PreSizE: Predicting Size in E-Commerce using Transformers
PreSizE: Predicting Size in E-Commerce using Transformers
Yotam Eshel
Or Levi
Haggai Roitman
A. Nus
AI4TS
34
8
0
04 May 2021
ZEN 2.0: Continue Training and Adaption for N-gram Enhanced Text
  Encoders
ZEN 2.0: Continue Training and Adaption for N-gram Enhanced Text Encoders
Yan Song
Tong Zhang
Yonggang Wang
Kai-Fu Lee
97
45
0
04 May 2021
Textual Analysis of Communications in COVID-19 Infected Community on
  Social Media
Textual Analysis of Communications in COVID-19 Infected Community on Social Media
Yuhan Liu
Yuhan Gao
Zhifan Nan
Long Chen
25
0
0
03 May 2021
Goldilocks: Just-Right Tuning of BERT for Technology-Assisted Review
Goldilocks: Just-Right Tuning of BERT for Technology-Assisted Review
Eugene Yang
Sean MacAvaney
D. Lewis
O. Frieder
126
29
0
03 May 2021
Billion-scale Pre-trained E-commerce Product Knowledge Graph Model
Billion-scale Pre-trained E-commerce Product Knowledge Graph Model
Wen Zhang
Chi-Man Wong
Ganqiang Ye
Bo Wen
Wei Zhang
Huajun Chen
63
24
0
02 May 2021
MathBERT: A Pre-Trained Model for Mathematical Formula Understanding
MathBERT: A Pre-Trained Model for Mathematical Formula Understanding
Shuai Peng
Ke Yuan
Liangcai Gao
Zhi Tang
AIMat
100
109
0
02 May 2021
When to Foldém: How to answer Unanswerable questions
When to Foldém: How to answer Unanswerable questions
Marshall Ho
Zhipeng Zhou
J. He
55
2
0
01 May 2021
Adversarial Example Detection for DNN Models: A Review and Experimental
  Comparison
Adversarial Example Detection for DNN Models: A Review and Experimental Comparison
Ahmed Aldahdooh
W. Hamidouche
Sid Ahmed Fezza
Olivier Déforges
AAML
239
128
0
01 May 2021
SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale
  Place Recognition
SVT-Net: Super Light-Weight Sparse Voxel Transformer for Large Scale Place Recognition
Zhaoxin Fan
Zhenbo Song
Hongyan Liu
Zhiwu Lu
Jun He
Xiaoyong Du
3DPCViT
160
77
0
01 May 2021
Improving Response Quality with Backward Reasoning in Open-domain
  Dialogue Systems
Improving Response Quality with Backward Reasoning in Open-domain Dialogue Systems
Ziming Li
Julia Kiseleva
Maarten de Rijke
78
12
0
30 Apr 2021
Mitigating Political Bias in Language Models Through Reinforced
  Calibration
Mitigating Political Bias in Language Models Through Reinforced Calibration
Ruibo Liu
Chenyan Jia
Jason W. Wei
Guangxuan Xu
Lili Wang
Soroush Vosoughi
75
99
0
30 Apr 2021
Using Transformers to Provide Teachers with Personalized Feedback on
  their Classroom Discourse: The TalkMoves Application
Using Transformers to Provide Teachers with Personalized Feedback on their Classroom Discourse: The TalkMoves Application
Abhijit Suresh
Jennifer Jacobs
Vivian Lai
Chenhao Tan
Wayne H. Ward
James H. Martin
T. Sumner
56
30
0
29 Apr 2021
MAGMA: An Optimization Framework for Mapping Multiple DNNs on Multiple
  Accelerator Cores
MAGMA: An Optimization Framework for Mapping Multiple DNNs on Multiple Accelerator Cores
Sheng-Chun Kao
T. Krishna
123
52
0
28 Apr 2021
Inpainting Transformer for Anomaly Detection
Inpainting Transformer for Anomaly Detection
Jonathan Pirnay
K. Chai
ViT
211
169
0
28 Apr 2021
PanGu-$α$: Large-scale Autoregressive Pretrained Chinese Language
  Models with Auto-parallel Computation
PanGu-ααα: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation
Wei Zeng
Xiaozhe Ren
Teng Su
Hui Wang
Yi-Lun Liao
...
Gaojun Fan
Yaowei Wang
Xuefeng Jin
Qun Liu
Yonghong Tian
ALMMoEAI4CE
80
215
0
26 Apr 2021
Diverse Image Inpainting with Bidirectional and Autoregressive
  Transformers
Diverse Image Inpainting with Bidirectional and Autoregressive Transformers
Yingchen Yu
Fangneng Zhan
Rongliang Wu
Jianxiong Pan
Kaiwen Cui
Shijian Lu
Feiying Ma
Xuansong Xie
Chunyan Miao
ViT
124
152
0
26 Apr 2021
A Comprehensive Attempt to Research Statement Generation
A Comprehensive Attempt to Research Statement Generation
Wenhao Wu
Sujian Li
39
0
0
25 Apr 2021
baller2vec++: A Look-Ahead Multi-Entity Transformer For Modeling
  Coordinated Agents
baller2vec++: A Look-Ahead Multi-Entity Transformer For Modeling Coordinated Agents
Michael A. Alcorn
A. Nguyen
68
15
0
24 Apr 2021
Extract then Distill: Efficient and Effective Task-Agnostic BERT
  Distillation
Extract then Distill: Efficient and Effective Task-Agnostic BERT Distillation
Cheng Chen
Yichun Yin
Lifeng Shang
Zhi Wang
Xin Jiang
Xiao Chen
Qun Liu
FedML
81
7
0
24 Apr 2021
Literature review on vulnerability detection using NLP technology
Literature review on vulnerability detection using NLP technology
Jiajie Wu
150
15
0
23 Apr 2021
Framing Unpacked: A Semi-Supervised Interpretable Multi-View Model of
  Media Frames
Framing Unpacked: A Semi-Supervised Interpretable Multi-View Model of Media Frames
Shima Khanehzar
Trevor Cohn
Gosia Mikołajczak
A. Turpin
Lea Frermann
65
11
0
22 Apr 2021
Previous
123...444546...697071
Next