1906.08237
v1
v2 (latest)
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,518 papers shown
UniMASK: Unified Inference in Sequential Decision Problems
Micah Carroll
Orr Paradise
Jessy Lin
Raluca Georgescu
Mingfei Sun
...
Stephanie Milani
Katja Hofmann
Matthew J. Hausknecht
Anca Dragan
Sam Devlin
OffRL
109
22
0
20 Nov 2022
Combining State-of-the-Art Models with Maximal Marginal Relevance for Few-Shot and Zero-Shot Multi-Document Summarization
David Adams
Gandharv Suri
Yllias Chali
VLM
57
3
0
19 Nov 2022
A survey on knowledge-enhanced multimodal learning
Maria Lymperaiou
Giorgos Stamou
161
15
0
19 Nov 2022
A Transformer Framework for Data Fusion and Multi-Task Learning in Smart Cities
Alexander C. DeRieux
Walid Saad
W. Zuo
R. Budiarto
M. D. Koerniawan
D. Novitasari
29
1
0
18 Nov 2022
Where did you tweet from? Inferring the origin locations of tweets based on contextual information
Rabindra Lamsal
Aaron Harwood
M. Read
62
11
0
18 Nov 2022
ReLER@ZJU Submission to the Ego4D Moment Queries Challenge 2022
Jiayi Shao
Xiaohan Wang
Yi Yang
41
1
0
17 Nov 2022
Feature-augmented Machine Reading Comprehension with Auxiliary Tasks
Yifeng Xie
65
0
0
17 Nov 2022
Deep Emotion Recognition in Textual Conversations: A Survey
Patrícia Pereira
Helena Moniz
Joao Paulo Carvalho
97
18
0
16 Nov 2022
Fast and Accurate FSA System Using ELBERT: An Efficient and Lightweight BERT
Siyuan Lu
Chenchen Zhou
Keli Xie
Jun Lin
Zhongfeng Wang
49
1
0
16 Nov 2022
Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed Representations
Linlin Liu
Xingxuan Li
Megh Thakkar
Xin Li
Shafiq Joty
Luo Si
Lidong Bing
86
2
0
16 Nov 2022
GAMMT: Generative Ambiguity Modeling Using Multiple Transformers
Xingcheng Xu
81
0
0
16 Nov 2022
Empowering Language Models with Knowledge Graph Reasoning for Question Answering
Ziniu Hu
Yichong Xu
Wenhao Yu
Shuohang Wang
Ziyi Yang
Chenguang Zhu
Kai-Wei Chang
Yizhou Sun
KELM
RALM
LRM
102
26
0
15 Nov 2022
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Linyi Yang
Shuibai Zhang
Libo Qin
Yafu Li
Yidong Wang
Hanmeng Liu
Jindong Wang
Xingxu Xie
Yue Zhang
ELM
188
82
0
15 Nov 2022
An Automatic ICD Coding Network Using Partition-Based Label Attention
Daeseong Kim
Haanju Yoo
Sewon Kim
91
5
0
15 Nov 2022
Language models are good pathologists: using attention-based sequence reduction and text-pretrained transformers for efficient WSI classification
Juan Pisula
Katarzyna Bozek
VLM
MedIm
83
3
0
14 Nov 2022
Finding Skill Neurons in Pre-trained Transformer-based Language Models
Xiaozhi Wang
Kaiyue Wen
Zhengyan Zhang
Lei Hou
Zhiyuan Liu
Juanzi Li
MILM
MoE
88
52
0
14 Nov 2022
Dark patterns in e-commerce: a dataset and its baseline evaluations
Yukiharu Yada
J. Feng
Tsuneo Matsumoto
Naotake Fukushima
Fuyuko Kido
Hayato Yamana
98
14
0
12 Nov 2022
Assistive Completion of Agrammatic Aphasic Sentences: A Transfer Learning Approach using Neurolinguistics-based Synthetic Dataset
Rohit Misra
S. Mishra
Tapan K. Gandhi
93
2
0
10 Nov 2022
Can Transformers Reason in Fragments of Natural Language?
Viktor Schlegel
Kamen V. Pavlov
Ian Pratt-Hartmann
LRM
ReLM
77
7
0
10 Nov 2022
FormLM: Recommending Creation Ideas for Online Forms by Modelling Semantic and Structural Information
Yijia Shao
Mengyu Zhou
Yifan Zhong
Tao Wu
Hongwei Han
Shi Han
Gideon Huang
Dongmei Zhang
3DV
66
2
0
10 Nov 2022
ADEPT: A DEbiasing PrompT Framework
Ke Yang
Charles Yu
Yi R. Fung
Manling Li
Heng Ji
124
27
0
10 Nov 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop:
Teven Le Scao
Angela Fan
Christopher Akiki
...
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
480
2,400
0
09 Nov 2022
Exploiting Transformer-based Multitask Learning for the Detection of Media Bias in News Articles
Timo Spinde
Jan-David Krieger
Terry Ruas
Jelena Mitrović
Franz Götz-Hahn
Akiko Aizawa
Bela Gipp
90
28
0
07 Nov 2022
Textual Manifold-based Defense Against Natural Language Adversarial Examples
D. M. Nguyen
Anh Tuan Luu
AAML
84
17
0
05 Nov 2022
KGLM: Integrating Knowledge Graph Structure in Language Models for Link Prediction
Jason Youn
I. Tagkopoulos
KELM
69
23
0
04 Nov 2022
A Comparison of SVM against Pre-trained Language Models (PLMs) for Text Classification Tasks
Yasmen Wahba
N. Madhavji
John Steinbacher
VLM
31
22
0
04 Nov 2022
BERT-Deep CNN: State-of-the-Art for Sentiment Analysis of COVID-19 Tweets
Javad Hassannataj Joloudari
Sadiq Hussain
M. Nematollahi
Rouhollah Bagheri
Fatemeh Fazl
R. Alizadehsani
Reza Lashgari
Ashis Talukder
55
47
0
04 Nov 2022
Miko Team: Deep Learning Approach for Legal Question Answering in ALQAC 2022
Hieu Nguyen Van
Dat Nguyen
Phuong Minh Nguyen
Minh Le Nguyen
AILaw
132
7
0
04 Nov 2022
Transformers on Multilingual Clause-Level Morphology
Emre Can Acikgoz
T. Chubakov
Muge Kural
Gözde Gül Şahin
Deniz Yuret
60
4
0
03 Nov 2022
Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks Adaptively
Haojie Zhang
Ge Li
Jia Li
Zhongjin Zhang
Yuqi Zhu
Zhi Jin
AI4CE
57
28
0
03 Nov 2022
MPCFormer: fast, performant and private Transformer inference with MPC
Dacheng Li
Rulin Shao
Hongyi Wang
Han Guo
Eric P. Xing
Haotong Zhang
92
87
0
02 Nov 2022
Generative Adversarial Training Can Improve Neural Language Models
Sajad Movahedi
A. Shakery
GAN
AI4CE
77
2
0
02 Nov 2022
Multi-level Distillation of Semantic Knowledge for Pre-training Multilingual Language Model
Mingqi Li
Fei Ding
Dan Zhang
Long Cheng
Hongxin Hu
Feng Luo
82
7
0
02 Nov 2022
Processing Long Legal Documents with Pre-trained Transformers: Modding LegalBERT and Longformer
Dimitris Mamakas
Petros Tsotsi
Ion Androutsopoulos
Ilias Chalkidis
VLM
AILaw
65
29
0
02 Nov 2022
VarMAE: Pre-training of Variational Masked Autoencoder for Domain-adaptive Language Understanding
Dou Hu
Xiaolong Hou
Xiyang Du
Mengyuan Zhou
Lian-Xin Jiang
Yang Mo
Xiaofeng Shi
97
13
0
01 Nov 2022
Order-sensitive Neural Constituency Parsing
Zhicheng Wang
Tianyuan Shi
Liyin Xiao
Cong Liu
53
0
0
01 Nov 2022
Leveraging Pre-trained Models for Failure Analysis Triplets Generation
Kenneth Ezukwoke
Anis Hoayek
M. Batton-Hubert
Xavier Boucher
Pascal Gounet
Jerome Adrian
64
1
0
31 Oct 2022
Multimodal Information Bottleneck: Learning Minimal Sufficient Unimodal and Multimodal Representations
Sijie Mai
Ying Zeng
Haifeng Hu
129
71
0
31 Oct 2022
Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change
Zhao-yu Su
Zecheng Tang
Xinyan Guan
Juntao Li
Lijun Wu
Hao Fei
CLL
AI4CE
90
23
0
31 Oct 2022
GPS: Genetic Prompt Search for Efficient Few-shot Learning
Hanwei Xu
Yujun Chen
Yulun Du
Nan Shao
Yanggang Wang
Haiyu Li
Zhilin Yang
VLM
63
31
0
31 Oct 2022
Character-level White-Box Adversarial Attacks against Transformers via Attachable Subwords Substitution
Aiwei Liu
Honghai Yu
Xuming Hu
Shuang Li
Li Lin
Fukun Ma
Yawen Yang
Lijie Wen
86
35
0
31 Oct 2022
Parameter-Efficient Tuning Makes a Good Classification Head
Zhuoyi Yang
Ming Ding
Yanhui Guo
Qingsong Lv
Jie Tang
VLM
108
14
0
30 Oct 2022
token2vec: A Joint Self-Supervised Pre-training Framework Using Unpaired Speech and Text
Xianghu Yue
Junyi Ao
Xiaoxue Gao
Haizhou Li
SSL
60
8
0
30 Oct 2022
Empirical Evaluation of Post-Training Quantization Methods for Language Tasks
Ting Hu
Christoph Meinel
Haojin Yang
MQ
96
3
0
29 Oct 2022
CascadeXML: Rethinking Transformers for End-to-end Multi-resolution Training in Extreme Multi-label Classification
Siddhant Kharbanda
Atmadeep Banerjee
Erik Schultheis
Rohit Babbar
99
14
0
29 Oct 2022
DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention
Fenglin Liu
Xian Wu
Shen Ge
Xuancheng Ren
Wei Fan
Xu Sun
Yuexian Zou
VLM
108
13
0
28 Oct 2022
Probing for targeted syntactic knowledge through grammatical error detection
Christopher Davis
Christopher Bryant
Andrew Caines
Marek Rei
P. Buttery
45
4
0
28 Oct 2022
On the Use of Modality-Specific Large-Scale Pre-Trained Encoders for Multimodal Sentiment Analysis
Atsushi Ando
Ryo Masumura
Akihiko Takashima
Satoshi Suzuki
Naoki Makishima
Keita Suzuki
Takafumi Moriya
Takanori Ashihara
Hiroshi Sato
96
9
0
28 Oct 2022
COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models
Bowen Shen
Zheng Lin
Yuanxin Liu
Zhengxiao Liu
Lei Wang
Weiping Wang
VLM
77
5
0
27 Oct 2022
Dial2vec: Self-Guided Contrastive Learning of Unsupervised Dialogue Embeddings
Che Liu
Rui Wang
Junfeng Jiang
Yongbin Li
Fei Huang
SSL
113
9
0
27 Oct 2022