Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 1,336 papers shown
Title
QBERT: Generalist Model for Processing Questions
Zhaozhen Xu
N. Cristianini
22
1
0
05 Dec 2022
MiLMo:Minority Multilingual Pre-trained Language Model
Sisi Liu
Hanru Shi
Xinhe Yu
Wugedele Bao
Yuan Sun
Xiaobing Zhao
31
0
0
04 Dec 2022
Exploring Stochastic Autoregressive Image Modeling for Visual Representation
Yu-Hang Qi
Fan Yang
Yousong Zhu
Yufei Liu
Liwei Wu
Rui Zhao
Wei Li
DiffM
27
13
0
03 Dec 2022
Event knowledge in large language models: the gap between the impossible and the unlikely
Carina Kauf
Anna A. Ivanova
Giulia Rambelli
Emmanuele Chersoni
Jingyuan Selena She
Zawad Chowdhury
Evelina Fedorenko
Alessandro Lenci
37
67
0
02 Dec 2022
SOLD: Sinhala Offensive Language Dataset
Tharindu Ranasinghe
Isuri Anuradha
Damith Premasiri
Kanishka Silva
Hansi Hettiarachchi
Lasitha Uyangodage
Marcos Zampieri
41
8
0
01 Dec 2022
UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding
Dave Zhenyu Chen
Ronghang Hu
Xinlei Chen
Matthias Nießner
Angel X. Chang
29
52
0
01 Dec 2022
Language Model Pre-training on True Negatives
ZhuoSheng Zhang
Hai Zhao
Masao Utiyama
Eiichiro Sumita
34
2
0
01 Dec 2022
Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images
Meng Wang
Kai-An Yu
Chun-Mei Feng
K. Zou
Yanyu Xu
Qingquan Meng
Rick Siow Mong Goh
Yong Liu
Huazhu Fu
MedIm
30
3
0
01 Dec 2022
BudgetLongformer: Can we Cheaply Pretrain a SotA Legal Language Model From Scratch?
Joel Niklaus
Daniele Giofré
33
11
0
30 Nov 2022
Model Extraction Attack against Self-supervised Speech Models
Tsung-Yuan Hsu
Chen An Li
Tung-Yu Wu
Hung-yi Lee
27
1
0
29 Nov 2022
Survey on Self-Supervised Multimodal Representation Learning and Foundation Models
Sushil Thapa
AI4TS
SSL
20
1
0
29 Nov 2022
Predicting Digital Asset Prices using Natural Language Processing: a survey
Trang Tran
8
1
0
28 Nov 2022
Understanding BLOOM: An empirical study on diverse NLP tasks
Parag Dakle
Sai Krishna Rallabandi
Preethi Raghavan
AI4CE
39
3
0
27 Nov 2022
Deep representation learning: Fundamentals, Perspectives, Applications, and Open Challenges
K. T. Baghaei
Amirreza Payandeh
Pooya Fayyazsanavi
Shahram Rahimi
Zhiqian Chen
Somayeh Bakhtiari Ramezani
FaML
AI4TS
38
6
0
27 Nov 2022
Asymmetric Cross-Scale Alignment for Text-Based Person Search
Zhong Ji
Junhua Hu
Deyin Liu
Yuan Wu
Ye Zhao
31
42
0
26 Nov 2022
Using Selective Masking as a Bridge between Pre-training and Fine-tuning
Tanish Lad
Himanshu Maheshwari
Shreyas Kottukkal
R. Mamidi
24
3
0
24 Nov 2022
Indian Commercial Truck License Plate Detection and Recognition for Weighbridge Automation
Siddharth Agrawal
Keyur D. Joshi
35
4
0
23 Nov 2022
Unified Multimodal Model with Unlikelihood Training for Visual Dialog
Zihao Wang
Junli Wang
Changjun Jiang
MLLM
29
10
0
23 Nov 2022
Predicting the Type and Target of Offensive Social Media Posts in Marathi
Marcos Zampieri
Tharindu Ranasinghe
Mrinal Chaudhari
Saurabh Gaikwad
P. Krishna
Mayuresh Nene
Shrunali Paygude
32
24
0
22 Nov 2022
Evaluating the Knowledge Dependency of Questions
Hyeongdon Moon
Yoonseok Yang
Jamin Shin
Hangyeol Yu
Seunghyun Lee
Myeongho Jeong
Juneyoung Park
Minsam Kim
Seungtaek Choi
AI4Ed
31
10
0
21 Nov 2022
Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention
Zineng Tang
Jaemin Cho
Jie Lei
Joey Tianyi Zhou
VLM
24
9
0
21 Nov 2022
AF Adapter: Continual Pretraining for Building Chinese Biomedical Language Model
Yongyu Yan
Kui Xue
Xiaoming Shi
Qi Ye
Jingping Liu
Tong Ruan
CLL
47
1
0
21 Nov 2022
UniMASK: Unified Inference in Sequential Decision Problems
Micah Carroll
Orr Paradise
Jessy Lin
Raluca Georgescu
Mingfei Sun
...
Stephanie Milani
Katja Hofmann
Matthew J. Hausknecht
Anca Dragan
Sam Devlin
OffRL
26
21
0
20 Nov 2022
Combining State-of-the-Art Models with Maximal Marginal Relevance for Few-Shot and Zero-Shot Multi-Document Summarization
David Adams
Gandharv Suri
Yllias Chali
VLM
32
3
0
19 Nov 2022
A Transformer Framework for Data Fusion and Multi-Task Learning in Smart Cities
Alexander C. DeRieux
Walid Saad
W. Zuo
R. Budiarto
M. D. Koerniawan
D. Novitasari
20
1
0
18 Nov 2022
Feature-augmented Machine Reading Comprehension with Auxiliary Tasks
Yifeng Xie
26
0
0
17 Nov 2022
Deep Emotion Recognition in Textual Conversations: A Survey
Patrícia Pereira
Helena Moniz
Joao Paulo Carvalho
37
15
0
16 Nov 2022
Fast and Accurate FSA System Using ELBERT: An Efficient and Lightweight BERT
Siyuan Lu
Chenchen Zhou
Keli Xie
Jun Lin
Zhongfeng Wang
27
1
0
16 Nov 2022
Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed Representations
Linlin Liu
Xingxuan Li
Megh Thakkar
Xin Li
Chenyu You
Luo Si
Lidong Bing
27
2
0
16 Nov 2022
GAMMT: Generative Ambiguity Modeling Using Multiple Transformers
Xingcheng Xu
30
0
0
16 Nov 2022
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Linyi Yang
Shuibai Zhang
Libo Qin
Yafu Li
Yidong Wang
Hanmeng Liu
Jindong Wang
Xingxu Xie
Yue Zhang
ELM
46
79
0
15 Nov 2022
Language models are good pathologists: using attention-based sequence reduction and text-pretrained transformers for efficient WSI classification
Juan Pisula
Katarzyna Bozek
VLM
MedIm
36
3
0
14 Nov 2022
Finding Skill Neurons in Pre-trained Transformer-based Language Models
Xiaozhi Wang
Kaiyue Wen
Zhengyan Zhang
Lei Hou
Zhiyuan Liu
Juanzi Li
MILM
MoE
27
50
0
14 Nov 2022
Dark patterns in e-commerce: a dataset and its baseline evaluations
Yukiharu Yada
J. Feng
Tsuneo Matsumoto
Naotake Fukushima
Fuyuko Kido
Hayato Yamana
30
14
0
12 Nov 2022
Assistive Completion of Agrammatic Aphasic Sentences: A Transfer Learning Approach using Neurolinguistics-based Synthetic Dataset
Rohit Misra
S. Mishra
Tapan K. Gandhi
19
2
0
10 Nov 2022
Can Transformers Reason in Fragments of Natural Language?
Viktor Schlegel
Kamen V. Pavlov
Ian Pratt-Hartmann
LRM
ReLM
35
7
0
10 Nov 2022
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
BigScience Workshop
:
Teven Le Scao
Angela Fan
Christopher Akiki
...
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
118
2,315
0
09 Nov 2022
Exploiting Transformer-based Multitask Learning for the Detection of Media Bias in News Articles
Timo Spinde
Jan-David Krieger
Terry Ruas
Jelena Mitrović
Franz Götz-Hahn
Akiko Aizawa
Bela Gipp
35
27
0
07 Nov 2022
Textual Manifold-based Defense Against Natural Language Adversarial Examples
D. M. Nguyen
Anh Tuan Luu
AAML
27
17
0
05 Nov 2022
KGLM: Integrating Knowledge Graph Structure in Language Models for Link Prediction
Jason Youn
I. Tagkopoulos
KELM
22
20
0
04 Nov 2022
BERT-Deep CNN: State-of-the-Art for Sentiment Analysis of COVID-19 Tweets
Javad Hassannataj Joloudari
Sadiq Hussain
M. Nematollahi
Rouhollah Bagheri
Fatemeh Fazl
R. Alizadehsani
Reza Lashgari
Ashis Talukder
18
38
0
04 Nov 2022
MPCFormer: fast, performant and private Transformer inference with MPC
Dacheng Li
Rulin Shao
Hongyi Wang
Han Guo
Eric P. Xing
Haotong Zhang
18
79
0
02 Nov 2022
Generative Adversarial Training Can Improve Neural Language Models
Sajad Movahedi
A. Shakery
GAN
AI4CE
34
2
0
02 Nov 2022
Multi-level Distillation of Semantic Knowledge for Pre-training Multilingual Language Model
Mingqi Li
Fei Ding
Dan Zhang
Long Cheng
Hongxin Hu
Feng Luo
41
6
0
02 Nov 2022
Order-sensitive Neural Constituency Parsing
Zhicheng Wang
Tianyuan Shi
Liyin Xiao
Cong Liu
30
0
0
01 Nov 2022
Multimodal Information Bottleneck: Learning Minimal Sufficient Unimodal and Multimodal Representations
Sijie Mai
Ying Zeng
Haifeng Hu
40
67
0
31 Oct 2022
Improving Temporal Generalization of Pre-trained Language Models with Lexical Semantic Change
Zhao-yu Su
Zecheng Tang
Xinyan Guan
Juntao Li
Lijun Wu
Hao Fei
CLL
AI4CE
32
22
0
31 Oct 2022
GPS: Genetic Prompt Search for Efficient Few-shot Learning
Hanwei Xu
Yujun Chen
Yulun Du
Nan Shao
Yanggang Wang
Haiyu Li
Zhilin Yang
VLM
14
28
0
31 Oct 2022
Character-level White-Box Adversarial Attacks against Transformers via Attachable Subwords Substitution
Aiwei Liu
Honghai Yu
Xuming Hu
Shuang Li
Li Lin
Fukun Ma
Yawen Yang
Lijie Wen
36
33
0
31 Oct 2022
Parameter-Efficient Tuning Makes a Good Classification Head
Zhuoyi Yang
Ming Ding
Yanhui Guo
Qingsong Lv
Jie Tang
VLM
58
14
0
30 Oct 2022
Previous
1
2
3
...
6
7
8
...
25
26
27
Next