ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,520 papers shown
Title
UMass PCL at SemEval-2022 Task 4: Pre-trained Language Model Ensembles
  for Detecting Patronizing and Condescending Language
UMass PCL at SemEval-2022 Task 4: Pre-trained Language Model Ensembles for Detecting Patronizing and Condescending Language
David Koleczek
Alexander Scarlatos
Siddha Makarand Karkare
Preshma Linet Pereira
56
0
0
18 Apr 2022
Nested Named Entity Recognition as Holistic Structure Parsing
Nested Named Entity Recognition as Holistic Structure Parsing
Yifei Yang
Z. Li
Hai Zhao
62
0
0
17 Apr 2022
What If: Generating Code to Answer Simulation Questions
What If: Generating Code to Answer Simulation Questions
G. Peretz
Kira Radinsky
77
3
0
16 Apr 2022
Towards Lightweight Transformer via Group-wise Transformation for
  Vision-and-Language Tasks
Towards Lightweight Transformer via Group-wise Transformation for Vision-and-Language Tasks
Gen Luo
Yiyi Zhou
Xiaoshuai Sun
Yan Wang
Liujuan Cao
Yongjian Wu
Feiyue Huang
Rongrong Ji
ViT
64
47
0
16 Apr 2022
mGPT: Few-Shot Learners Go Multilingual
mGPT: Few-Shot Learners Go Multilingual
Oleh Shliazhko
Alena Fenogenova
Maria Tikhonova
Vladislav Mikhailov
Anastasia Kozlova
Tatiana Shavrina
137
155
0
15 Apr 2022
Label Semantic Aware Pre-training for Few-shot Text Classification
Label Semantic Aware Pre-training for Few-shot Text Classification
Aaron Mueller
Jason Krone
Salvatore Romeo
Saab Mansour
Elman Mansimov
Yi Zhang
Dan Roth
VLM
67
38
0
14 Apr 2022
SNP2Vec: Scalable Self-Supervised Pre-Training for Genome-Wide
  Association Study
SNP2Vec: Scalable Self-Supervised Pre-Training for Genome-Wide Association Study
Samuel Cahyawijaya
Tiezheng Yu
Zihan Liu
Tiffany Mak
Xiaopu Zhou
N. Ip
Pascale Fung
57
8
0
14 Apr 2022
METRO: Efficient Denoising Pretraining of Large Scale Autoencoding
  Language Models with Model Generated Signals
METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals
Payal Bajaj
Chenyan Xiong
Guolin Ke
Xiaodong Liu
Di He
Saurabh Tiwary
Tie-Yan Liu
Paul N. Bennett
Xia Song
Jianfeng Gao
118
32
0
13 Apr 2022
A Novel Approach to Train Diverse Types of Language Models for Health
  Mention Classification of Tweets
A Novel Approach to Train Diverse Types of Language Models for Health Mention Classification of Tweets
Pervaiz Iqbal Khan
Imran Razzak
Andreas Dengel
Sheraz Ahmed
MedIm
48
5
0
13 Apr 2022
Curriculum: A Broad-Coverage Benchmark for Linguistic Phenomena in
  Natural Language Understanding
Curriculum: A Broad-Coverage Benchmark for Linguistic Phenomena in Natural Language Understanding
Zeming Chen
Qiyue Gao
ELM
67
4
0
13 Apr 2022
TangoBERT: Reducing Inference Cost by using Cascaded Architecture
TangoBERT: Reducing Inference Cost by using Cascaded Architecture
Jonathan Mamou
Oren Pereg
Moshe Wasserblat
Roy Schwartz
44
12
0
13 Apr 2022
Probing for Constituency Structure in Neural Language Models
Probing for Constituency Structure in Neural Language Models
David Arps
Younes Samih
Laura Kallmeyer
Hassan Sajjad
57
14
0
13 Apr 2022
Impossible Triangle: What's Next for Pre-trained Language Models?
Impossible Triangle: What's Next for Pre-trained Language Models?
Chenguang Zhu
Michael Zeng
78
1
0
13 Apr 2022
A Call for Clarity in Beam Search: How It Works and When It Stops
A Call for Clarity in Beam Search: How It Works and When It Stops
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Dragomir R. Radev
Yejin Choi
Noah A. Smith
104
9
0
11 Apr 2022
A Comparative Study of Pre-trained Encoders for Low-Resource Named
  Entity Recognition
A Comparative Study of Pre-trained Encoders for Low-Resource Named Entity Recognition
Yuxuan Chen
Jonas Mikkelsen
Arne Binder
Christoph Alt
Leonhard Hennig
72
2
0
11 Apr 2022
Accurate Portraits of Scientific Resources and Knowledge Service
  Components
Accurate Portraits of Scientific Resources and Knowledge Service Components
Yue Wang
Zhe Xue
Ang Li
36
0
0
11 Apr 2022
A Survey on Legal Judgment Prediction: Datasets, Metrics, Models and
  Challenges
A Survey on Legal Judgment Prediction: Datasets, Metrics, Models and Challenges
Junyun Cui
Xiaoyu Shen
Feiping Nie
Ziyi Wang
Jinglong Wang
Yulong Chen
AILawELM
69
73
0
11 Apr 2022
Towards Understanding Large-Scale Discourse Structures in Pre-Trained
  and Fine-Tuned Language Models
Towards Understanding Large-Scale Discourse Structures in Pre-Trained and Fine-Tuned Language Models
Patrick Huber
Giuseppe Carenini
71
11
0
08 Apr 2022
Contextual Representation Learning beyond Masked Language Modeling
Contextual Representation Learning beyond Masked Language Modeling
Zhiyi Fu
Wangchunshu Zhou
Jingjing Xu
Hao Zhou
Lei Li
78
26
0
08 Apr 2022
Improving Tokenisation by Alternative Treatment of Spaces
Improving Tokenisation by Alternative Treatment of Spaces
Edward Gow-Smith
Harish Tayyar Madabushi
Carolina Scarton
Aline Villavicencio
89
21
0
08 Apr 2022
Transfer Attacks Revisited: A Large-Scale Empirical Study in Real
  Computer Vision Settings
Transfer Attacks Revisited: A Large-Scale Empirical Study in Real Computer Vision Settings
Yuhao Mao
Chong Fu
Sai-gang Wang
S. Ji
Xuhong Zhang
Zhenguang Liu
Junfeng Zhou
A. Liu
R. Beyah
Ting Wang
AAML
105
19
0
07 Apr 2022
Using Decision Tree as Local Interpretable Model in Autoencoder-based
  LIME
Using Decision Tree as Local Interpretable Model in Autoencoder-based LIME
Niloofar Ranjbar
Reza Safabakhsh
FAtt
40
5
0
07 Apr 2022
Pretraining Text Encoders with Adversarial Mixture of Training Signal
  Generators
Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators
Yu Meng
Chenyan Xiong
Payal Bajaj
Saurabh Tiwary
Paul N. Bennett
Jiawei Han
Xia Song
MoE
78
16
0
07 Apr 2022
Domain Specific Fine-tuning of Denoising Sequence-to-Sequence Models for
  Natural Language Summarization
Domain Specific Fine-tuning of Denoising Sequence-to-Sequence Models for Natural Language Summarization
Brydon Parker
A. Sokolov
Mahtab Ahmed
Matt Kalebic
S. Koçak
Ofer Shai
92
1
0
06 Apr 2022
Inducing Positive Perspectives with Text Reframing
Inducing Positive Perspectives with Text Reframing
Caleb Ziems
Minzhi Li
Anthony Zhang
Diyi Yang
DiffM
92
37
0
06 Apr 2022
On the Transferability of Pre-trained Language Models for Low-Resource
  Programming Languages
On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages
Fuxiang Chen
F. Fard
David Lo
T. Bryksin
81
49
0
05 Apr 2022
MaxViT: Multi-Axis Vision Transformer
MaxViT: Multi-Axis Vision Transformer
Zhengzhong Tu
Hossein Talebi
Han Zhang
Feng Yang
P. Milanfar
A. Bovik
Yinxiao Li
ViT
163
676
0
04 Apr 2022
Long Movie Clip Classification with State-Space Video Models
Long Movie Clip Classification with State-Space Video Models
Md. Mohaiminul Islam
Gedas Bertasius
VLM
136
105
0
04 Apr 2022
Graph Enhanced BERT for Query Understanding
Graph Enhanced BERT for Query Understanding
Juanhui Li
Yao Ma
Weizhen Zeng
Suqi Cheng
Jiliang Tang
Shuaiqiang Wang
D. Yin
47
9
0
03 Apr 2022
Exploiting Local and Global Features in Transformer-based Extreme
  Multi-label Text Classification
Exploiting Local and Global Features in Transformer-based Extreme Multi-label Text Classification
Ruohong Zhang
Yau-Shian Wang
Yiming Yang
Tom Vu
Li Lei
57
2
0
02 Apr 2022
Efficient comparison of sentence embeddings
Efficient comparison of sentence embeddings
Spyros Zoupanos
Stratis Kolovos
Athanasios Kanavos
Orestis Papadimitriou
M. Maragoudakis
35
12
0
02 Apr 2022
CharacterBERT and Self-Teaching for Improving the Robustness of Dense
  Retrievers on Queries with Typos
CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos
Shengyao Zhuang
Guido Zuccon
OOD
92
30
0
01 Apr 2022
Uncertainty Determines the Adequacy of the Mode and the Tractability of
  Decoding in Sequence-to-Sequence Models
Uncertainty Determines the Adequacy of the Mode and the Tractability of Decoding in Sequence-to-Sequence Models
Felix Stahlberg
Ilia Kulikov
Shankar Kumar
UQLM
136
10
0
01 Apr 2022
COOL, a Context Outlooker, and its Application to Question Answering and
  other Natural Language Processing Tasks
COOL, a Context Outlooker, and its Application to Question Answering and other Natural Language Processing Tasks
Fangyi Zhu
See-Kiong Ng
S. Bressan
LRM
58
1
0
01 Apr 2022
Making Pre-trained Language Models End-to-end Few-shot Learners with
  Contrastive Prompt Tuning
Making Pre-trained Language Models End-to-end Few-shot Learners with Contrastive Prompt Tuning
Ziyun Xu
Chengyu Wang
Minghui Qiu
Fuli Luo
Runxin Xu
Songfang Huang
Jun Huang
VLM
103
34
0
01 Apr 2022
Scaling Language Model Size in Cross-Device Federated Learning
Scaling Language Model Size in Cross-Device Federated Learning
Jae Hun Ro
Theresa Breiner
Lara McConnaughey
Mingqing Chen
A. Suresh
Shankar Kumar
Rajiv Mathews
FedML
61
26
0
31 Mar 2022
Auto-MLM: Improved Contrastive Learning for Self-supervised
  Multi-lingual Knowledge Retrieval
Auto-MLM: Improved Contrastive Learning for Self-supervised Multi-lingual Knowledge Retrieval
Wenshen Xu
M. Maimaiti
Yuanhang Zheng
Xin Tang
Ji Zhang
RALMSSL
47
2
0
30 Mar 2022
Transfer Learning Framework for Low-Resource Text-to-Speech using a
  Large-Scale Unlabeled Speech Corpus
Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus
Minchan Kim
Myeonghun Jeong
Byoung Jin Choi
Sunghwan Ahn
Joun Yeop Lee
N. Kim
106
26
0
29 Mar 2022
A Fast Post-Training Pruning Framework for Transformers
A Fast Post-Training Pruning Framework for Transformers
Woosuk Kwon
Sehoon Kim
Michael W. Mahoney
Joseph Hassoun
Kurt Keutzer
A. Gholami
113
157
0
29 Mar 2022
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic
  Speaker Verification
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification
Yang Zhang
Zhiqiang Lv
Haibin Wu
Shanshan Zhang
Pengfei Hu
Zhiyong Wu
Hung-yi Lee
Helen Meng
ViT
100
137
0
29 Mar 2022
Parameter-efficient Model Adaptation for Vision Transformers
Parameter-efficient Model Adaptation for Vision Transformers
Xuehai He
Chunyuan Li
Pengchuan Zhang
Jianwei Yang
Xinze Wang
77
90
0
29 Mar 2022
ANNA: Enhanced Language Representation for Question Answering
ANNA: Enhanced Language Representation for Question Answering
Changwook Jun
Hansol Jang
Myoseop Sim
Hyun Kim
Jooyoung Choi
Kyungkoo Min
Kyunghoon Bae
73
8
0
28 Mar 2022
Reinforcement Guided Multi-Task Learning Framework for Low-Resource
  Stereotype Detection
Reinforcement Guided Multi-Task Learning Framework for Low-Resource Stereotype Detection
Rajkumar Pujari
Erik Oveson
Priyanka Kulkarni
E. Nouri
105
10
0
27 Mar 2022
Beyond Masking: Demystifying Token-Based Pre-Training for Vision
  Transformers
Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers
Yunjie Tian
Lingxi Xie
Jiemin Fang
Mengnan Shi
Junran Peng
Xiaopeng Zhang
Jianbin Jiao
Qi Tian
QiXiang Ye
81
20
0
27 Mar 2022
Lite Unified Modeling for Discriminative Reading Comprehension
Lite Unified Modeling for Discriminative Reading Comprehension
Yilin Zhao
Hai Zhao
Libin Shen
Yinggong Zhao
86
2
0
26 Mar 2022
Exploring Self-Attention for Visual Intersection Classification
Exploring Self-Attention for Visual Intersection Classification
Haruki Nakata
Kanji Tanaka
Koji Takeda
23
0
0
26 Mar 2022
On the Intrinsic and Extrinsic Fairness Evaluation Metrics for
  Contextualized Language Representations
On the Intrinsic and Extrinsic Fairness Evaluation Metrics for Contextualized Language Representations
Yang Trista Cao
Yada Pruksachatkun
Kai-Wei Chang
Rahul Gupta
Varun Kumar
Jwala Dhamala
Aram Galstyan
74
99
0
25 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSLOnRL
107
123
0
25 Mar 2022
A Comparative Evaluation Of Transformer Models For De-Identification Of
  Clinical Text Data
A Comparative Evaluation Of Transformer Models For De-Identification Of Clinical Text Data
C. Meaney
Wali Hakimpour
S. Kalia
R. Moineddin
39
7
0
25 Mar 2022
Email Summarization to Assist Users in Phishing Identification
Email Summarization to Assist Users in Phishing Identification
Amir Kashapov
Tingmin Wu
A. Abuadbba
Carsten Rudolph
21
16
0
24 Mar 2022
Previous
123...303132...697071
Next