Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
v1
v2 (latest)
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,520 papers shown
Title
UMass PCL at SemEval-2022 Task 4: Pre-trained Language Model Ensembles for Detecting Patronizing and Condescending Language
David Koleczek
Alexander Scarlatos
Siddha Makarand Karkare
Preshma Linet Pereira
56
0
0
18 Apr 2022
Nested Named Entity Recognition as Holistic Structure Parsing
Yifei Yang
Z. Li
Hai Zhao
62
0
0
17 Apr 2022
What If: Generating Code to Answer Simulation Questions
G. Peretz
Kira Radinsky
77
3
0
16 Apr 2022
Towards Lightweight Transformer via Group-wise Transformation for Vision-and-Language Tasks
Gen Luo
Yiyi Zhou
Xiaoshuai Sun
Yan Wang
Liujuan Cao
Yongjian Wu
Feiyue Huang
Rongrong Ji
ViT
64
47
0
16 Apr 2022
mGPT: Few-Shot Learners Go Multilingual
Oleh Shliazhko
Alena Fenogenova
Maria Tikhonova
Vladislav Mikhailov
Anastasia Kozlova
Tatiana Shavrina
137
155
0
15 Apr 2022
Label Semantic Aware Pre-training for Few-shot Text Classification
Aaron Mueller
Jason Krone
Salvatore Romeo
Saab Mansour
Elman Mansimov
Yi Zhang
Dan Roth
VLM
67
38
0
14 Apr 2022
SNP2Vec: Scalable Self-Supervised Pre-Training for Genome-Wide Association Study
Samuel Cahyawijaya
Tiezheng Yu
Zihan Liu
Tiffany Mak
Xiaopu Zhou
N. Ip
Pascale Fung
57
8
0
14 Apr 2022
METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals
Payal Bajaj
Chenyan Xiong
Guolin Ke
Xiaodong Liu
Di He
Saurabh Tiwary
Tie-Yan Liu
Paul N. Bennett
Xia Song
Jianfeng Gao
118
32
0
13 Apr 2022
A Novel Approach to Train Diverse Types of Language Models for Health Mention Classification of Tweets
Pervaiz Iqbal Khan
Imran Razzak
Andreas Dengel
Sheraz Ahmed
MedIm
48
5
0
13 Apr 2022
Curriculum: A Broad-Coverage Benchmark for Linguistic Phenomena in Natural Language Understanding
Zeming Chen
Qiyue Gao
ELM
67
4
0
13 Apr 2022
TangoBERT: Reducing Inference Cost by using Cascaded Architecture
Jonathan Mamou
Oren Pereg
Moshe Wasserblat
Roy Schwartz
44
12
0
13 Apr 2022
Probing for Constituency Structure in Neural Language Models
David Arps
Younes Samih
Laura Kallmeyer
Hassan Sajjad
57
14
0
13 Apr 2022
Impossible Triangle: What's Next for Pre-trained Language Models?
Chenguang Zhu
Michael Zeng
78
1
0
13 Apr 2022
A Call for Clarity in Beam Search: How It Works and When It Stops
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Dragomir R. Radev
Yejin Choi
Noah A. Smith
104
9
0
11 Apr 2022
A Comparative Study of Pre-trained Encoders for Low-Resource Named Entity Recognition
Yuxuan Chen
Jonas Mikkelsen
Arne Binder
Christoph Alt
Leonhard Hennig
72
2
0
11 Apr 2022
Accurate Portraits of Scientific Resources and Knowledge Service Components
Yue Wang
Zhe Xue
Ang Li
36
0
0
11 Apr 2022
A Survey on Legal Judgment Prediction: Datasets, Metrics, Models and Challenges
Junyun Cui
Xiaoyu Shen
Feiping Nie
Ziyi Wang
Jinglong Wang
Yulong Chen
AILaw
ELM
69
73
0
11 Apr 2022
Towards Understanding Large-Scale Discourse Structures in Pre-Trained and Fine-Tuned Language Models
Patrick Huber
Giuseppe Carenini
71
11
0
08 Apr 2022
Contextual Representation Learning beyond Masked Language Modeling
Zhiyi Fu
Wangchunshu Zhou
Jingjing Xu
Hao Zhou
Lei Li
78
26
0
08 Apr 2022
Improving Tokenisation by Alternative Treatment of Spaces
Edward Gow-Smith
Harish Tayyar Madabushi
Carolina Scarton
Aline Villavicencio
89
21
0
08 Apr 2022
Transfer Attacks Revisited: A Large-Scale Empirical Study in Real Computer Vision Settings
Yuhao Mao
Chong Fu
Sai-gang Wang
S. Ji
Xuhong Zhang
Zhenguang Liu
Junfeng Zhou
A. Liu
R. Beyah
Ting Wang
AAML
105
19
0
07 Apr 2022
Using Decision Tree as Local Interpretable Model in Autoencoder-based LIME
Niloofar Ranjbar
Reza Safabakhsh
FAtt
40
5
0
07 Apr 2022
Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators
Yu Meng
Chenyan Xiong
Payal Bajaj
Saurabh Tiwary
Paul N. Bennett
Jiawei Han
Xia Song
MoE
78
16
0
07 Apr 2022
Domain Specific Fine-tuning of Denoising Sequence-to-Sequence Models for Natural Language Summarization
Brydon Parker
A. Sokolov
Mahtab Ahmed
Matt Kalebic
S. Koçak
Ofer Shai
92
1
0
06 Apr 2022
Inducing Positive Perspectives with Text Reframing
Caleb Ziems
Minzhi Li
Anthony Zhang
Diyi Yang
DiffM
92
37
0
06 Apr 2022
On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages
Fuxiang Chen
F. Fard
David Lo
T. Bryksin
81
49
0
05 Apr 2022
MaxViT: Multi-Axis Vision Transformer
Zhengzhong Tu
Hossein Talebi
Han Zhang
Feng Yang
P. Milanfar
A. Bovik
Yinxiao Li
ViT
163
676
0
04 Apr 2022
Long Movie Clip Classification with State-Space Video Models
Md. Mohaiminul Islam
Gedas Bertasius
VLM
136
105
0
04 Apr 2022
Graph Enhanced BERT for Query Understanding
Juanhui Li
Yao Ma
Weizhen Zeng
Suqi Cheng
Jiliang Tang
Shuaiqiang Wang
D. Yin
47
9
0
03 Apr 2022
Exploiting Local and Global Features in Transformer-based Extreme Multi-label Text Classification
Ruohong Zhang
Yau-Shian Wang
Yiming Yang
Tom Vu
Li Lei
57
2
0
02 Apr 2022
Efficient comparison of sentence embeddings
Spyros Zoupanos
Stratis Kolovos
Athanasios Kanavos
Orestis Papadimitriou
M. Maragoudakis
35
12
0
02 Apr 2022
CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos
Shengyao Zhuang
Guido Zuccon
OOD
92
30
0
01 Apr 2022
Uncertainty Determines the Adequacy of the Mode and the Tractability of Decoding in Sequence-to-Sequence Models
Felix Stahlberg
Ilia Kulikov
Shankar Kumar
UQLM
136
10
0
01 Apr 2022
COOL, a Context Outlooker, and its Application to Question Answering and other Natural Language Processing Tasks
Fangyi Zhu
See-Kiong Ng
S. Bressan
LRM
58
1
0
01 Apr 2022
Making Pre-trained Language Models End-to-end Few-shot Learners with Contrastive Prompt Tuning
Ziyun Xu
Chengyu Wang
Minghui Qiu
Fuli Luo
Runxin Xu
Songfang Huang
Jun Huang
VLM
103
34
0
01 Apr 2022
Scaling Language Model Size in Cross-Device Federated Learning
Jae Hun Ro
Theresa Breiner
Lara McConnaughey
Mingqing Chen
A. Suresh
Shankar Kumar
Rajiv Mathews
FedML
61
26
0
31 Mar 2022
Auto-MLM: Improved Contrastive Learning for Self-supervised Multi-lingual Knowledge Retrieval
Wenshen Xu
M. Maimaiti
Yuanhang Zheng
Xin Tang
Ji Zhang
RALM
SSL
47
2
0
30 Mar 2022
Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus
Minchan Kim
Myeonghun Jeong
Byoung Jin Choi
Sunghwan Ahn
Joun Yeop Lee
N. Kim
106
26
0
29 Mar 2022
A Fast Post-Training Pruning Framework for Transformers
Woosuk Kwon
Sehoon Kim
Michael W. Mahoney
Joseph Hassoun
Kurt Keutzer
A. Gholami
113
157
0
29 Mar 2022
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification
Yang Zhang
Zhiqiang Lv
Haibin Wu
Shanshan Zhang
Pengfei Hu
Zhiyong Wu
Hung-yi Lee
Helen Meng
ViT
100
137
0
29 Mar 2022
Parameter-efficient Model Adaptation for Vision Transformers
Xuehai He
Chunyuan Li
Pengchuan Zhang
Jianwei Yang
Xinze Wang
77
90
0
29 Mar 2022
ANNA: Enhanced Language Representation for Question Answering
Changwook Jun
Hansol Jang
Myoseop Sim
Hyun Kim
Jooyoung Choi
Kyungkoo Min
Kyunghoon Bae
73
8
0
28 Mar 2022
Reinforcement Guided Multi-Task Learning Framework for Low-Resource Stereotype Detection
Rajkumar Pujari
Erik Oveson
Priyanka Kulkarni
E. Nouri
105
10
0
27 Mar 2022
Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers
Yunjie Tian
Lingxi Xie
Jiemin Fang
Mengnan Shi
Junran Peng
Xiaopeng Zhang
Jianbin Jiao
Qi Tian
QiXiang Ye
81
20
0
27 Mar 2022
Lite Unified Modeling for Discriminative Reading Comprehension
Yilin Zhao
Hai Zhao
Libin Shen
Yinggong Zhao
86
2
0
26 Mar 2022
Exploring Self-Attention for Visual Intersection Classification
Haruki Nakata
Kanji Tanaka
Koji Takeda
23
0
0
26 Mar 2022
On the Intrinsic and Extrinsic Fairness Evaluation Metrics for Contextualized Language Representations
Yang Trista Cao
Yada Pruksachatkun
Kai-Wei Chang
Rahul Gupta
Varun Kumar
Jwala Dhamala
Aram Galstyan
74
99
0
25 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
107
123
0
25 Mar 2022
A Comparative Evaluation Of Transformer Models For De-Identification Of Clinical Text Data
C. Meaney
Wali Hakimpour
S. Kalia
R. Moineddin
39
7
0
25 Mar 2022
Email Summarization to Assist Users in Phishing Identification
Amir Kashapov
Tingmin Wu
A. Abuadbba
Carsten Rudolph
21
16
0
24 Mar 2022
Previous
1
2
3
...
30
31
32
...
69
70
71
Next