Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
v1
v2 (latest)
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,518 papers shown
Title
Dictionary-Assisted Supervised Contrastive Learning
Patrick Y. Wu
Richard Bonneau
Joshua A. Tucker
Jonathan Nagler
CLIP
66
0
0
27 Oct 2022
Privately Fine-Tuning Large Language Models with Differential Privacy
R. Behnia
Mohammadreza Ebrahimi
Jason L. Pacheco
B. Padmanabhan
127
51
0
26 Oct 2022
Causality Detection using Multiple Annotation Decisions
Quynh-Anh Nguyen
Arka Mitra
29
2
0
26 Oct 2022
Learning on Large-scale Text-attributed Graphs via Variational Inference
Jianan Zhao
Meng Qu
Chaozhuo Li
Hao Yan
Qian Liu
Rui Li
Xing Xie
Jian Tang
VLM
132
142
0
26 Oct 2022
Uncertainty Sentence Sampling by Virtual Adversarial Perturbation
Han Zhang
Zhen Zhang
Hongfei Jiang
Yang Song
40
0
0
26 Oct 2022
Leveraging Affirmative Interpretations from Negation Improves Natural Language Understanding
Md Mosharaf Hossain
Eduardo Blanco
73
5
0
26 Oct 2022
Exploring Robustness of Prefix Tuning in Noisy Data: A Case Study in Financial Sentiment Analysis
Sudhandar Balakrishnan
Yihao Fang
Xioadan Zhu
46
1
0
26 Oct 2022
Causal Analysis of Syntactic Agreement Neurons in Multilingual Language Models
Aaron Mueller
Yudi Xia
Tal Linzen
MILM
113
10
0
25 Oct 2022
Revision for Concision: A Constrained Paraphrase Generation Task
Wenchuan Mu
Kwanin Lim
70
3
0
25 Oct 2022
Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models
Hong Liu
Sang Michael Xie
Zhiyuan Li
Tengyu Ma
AI4CE
135
55
0
25 Oct 2022
PolyHope: Two-Level Hope Speech Detection from Tweets
F. Balouchzahi
Grigori Sidorov
Alexander Gelbukh
53
50
0
25 Oct 2022
Audio MFCC-gram Transformers for respiratory insufficiency detection in COVID-19
M. Gauy
Marcelo Finger
54
9
0
25 Oct 2022
MemoNet: Memorizing All Cross Features' Representations Efficiently via Multi-Hash Codebook Network for CTR Prediction
P. Zhang
Junlin Zhang
52
3
0
25 Oct 2022
Effective Pre-Training Objectives for Transformer-based Autoencoders
Luca Di Liello
Matteo Gabburo
Alessandro Moschitti
41
3
0
24 Oct 2022
Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models
Hao Liu
Xinyang Geng
Lisa Lee
Igor Mordatch
Sergey Levine
Sharan Narang
Pieter Abbeel
KELM
CLL
89
2
0
24 Oct 2022
ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
81
17
0
24 Oct 2022
A Visual Tour Of Current Challenges In Multimodal Language Models
Shashank Sonkar
Naiming Liu
Richard G. Baraniuk
DiffM
47
2
0
22 Oct 2022
DiscoSense: Commonsense Reasoning with Discourse Connectives
Prajjwal Bhargava
Vincent Ng
LRM
335
4
0
22 Oct 2022
Leveraging Large Language Models for Multiple Choice Question Answering
Joshua Robinson
Christopher Rytting
David Wingate
ELM
244
200
0
22 Oct 2022
P
3
^3
3
LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-Training
Junwei Bao
Yifan Wang
Jiangyong Ying
Yeyun Gong
Jing Zhao
Youzheng Wu
Xiaodong He
70
1
0
22 Oct 2022
TCAB: A Large-Scale Text Classification Attack Benchmark
Kalyani Asthana
Zhouhang Xie
Wencong You
Adam Noack
Jonathan Brophy
Sameer Singh
Daniel Lowd
121
3
0
21 Oct 2022
Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding
Yuechen Wang
Wen-gang Zhou
Houqiang Li
AI4TS
63
13
0
21 Oct 2022
Extracted BERT Model Leaks More Information than You Think!
Xuanli He
Chen Chen
Lingjuan Lyu
Xingliang Yuan
SILM
MIACV
74
6
0
21 Oct 2022
Design a Sustainable Micro-mobility Future: Trends and Challenges in the United States and European Union Using Natural Language Processing Techniques
Lilit Avetisyan
Chengxin Zhang
Sue Bai
Ehsan Moradi-Pari
Fred Feng
Shan Bao
Feng Zhou
21
0
0
21 Oct 2022
Transcending Scaling Laws with 0.1% Extra Compute
Yi Tay
Jason W. Wei
Hyung Won Chung
Vinh Q. Tran
David R. So
...
Donald Metzler
Slav Petrov
N. Houlsby
Quoc V. Le
Mostafa Dehghani
LRM
109
71
0
20 Oct 2022
Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts
Xiangyang Liu
Tianxiang Sun
Xuanjing Huang
Xipeng Qiu
VLM
103
29
0
20 Oct 2022
How Does a Deep Learning Model Architecture Impact Its Privacy? A Comprehensive Study of Privacy Attacks on CNNs and Transformers
Guangsheng Zhang
B. Liu
Huan Tian
Tianqing Zhu
Ming Ding
Wanlei Zhou
PILM
MIACV
85
6
0
20 Oct 2022
Attribution and Obfuscation of Neural Text Authorship: A Data Mining Perspective
Adaku Uchendu
Thai Le
Dongwon Lee
DeLMO
121
45
0
19 Oct 2022
Language Detoxification with Attribute-Discriminative Latent Space
Jin Myung Kwak
Minseon Kim
Sung Ju Hwang
70
14
0
19 Oct 2022
Forging Multiple Training Objectives for Pre-trained Language Models via Meta-Learning
Hongqiu Wu
Ruixue Ding
Haizhen Zhao
Boli Chen
Pengjun Xie
Fei Huang
Min Zhang
MoMe
102
8
0
19 Oct 2022
Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models
Hao Zhang
29
0
0
19 Oct 2022
ELASTIC: Numerical Reasoning with Adaptive Symbolic Compiler
Jiaxin Zhang
Yashar Moshfeghi
AIMat
68
18
0
18 Oct 2022
Detecting and analyzing missing citations to published scientific entities
Jialiang Lin
Yao Yu
Jia-Qi Song
X. Shi
52
4
0
18 Oct 2022
Tiny-Attention Adapter: Contexts Are More Important Than the Number of Parameters
Hongyu Zhao
Hao Tan
Hongyuan Mei
MoE
81
18
0
18 Oct 2022
Controllable Fake Document Infilling for Cyber Deception
Yibo Hu
Yu Lin
Eric Parolin
Latif Khan
Kevin W. Hamlen
66
8
0
18 Oct 2022
Sentiment-Aware Word and Sentence Level Pre-training for Sentiment Analysis
Shuai Fan
Chen Lin
Haonan Li
Zheng-Wen Lin
Jinsong Su
Hang Zhang
Yeyun Gong
Jian Guo
Nan Duan
VLM
67
19
0
18 Oct 2022
Modelling Emotion Dynamics in Song Lyrics with State Space Models
Yingjin Song
Daniel Beck
87
5
0
17 Oct 2022
Flipped Classroom: Effective Teaching for Time Series Forecasting
P. Teutsch
Patrick Mäder
AI4TS
67
8
0
17 Oct 2022
Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding
Jiadong Wang
Wenkang Huang
Qiuhui Shi
Hongbin Wang
Minghui Qiu
Xiang Li
Ming Gao
KELM
VLM
90
19
0
16 Oct 2022
A Simple and Strong Baseline for End-to-End Neural RST-style Discourse Parsing
Naoki Kobayashi
Tsutomu Hirao
Hidetaka Kamigaito
Manabu Okumura
Masaaki Nagata
38
11
0
15 Oct 2022
Style Transfer as Data Augmentation: A Case Study on Named Entity Recognition
Shuguang Chen
Leonardo Neves
Thamar Solorio
110
4
0
14 Oct 2022
HashFormers: Towards Vocabulary-independent Pre-trained Transformers
Huiyin Xue
Nikolaos Aletras
51
4
0
14 Oct 2022
BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation
Tianxiang Sun
Junliang He
Xipeng Qiu
Xuanjing Huang
85
47
0
14 Oct 2022
Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning
Tianxiang Sun
Zhengfu He
Qinen Zhu
Xipeng Qiu
Xuanjing Huang
VLM
VPVLM
38
21
0
14 Oct 2022
StyLEx: Explaining Style Using Human Lexical Annotations
Shirley Anugrah Hayati
Kyumin Park
Dheeraj Rajagopal
Lyle Ungar
Dongyeop Kang
96
3
0
14 Oct 2022
Frustratingly Easy Sentiment Analysis of Text Streams: Generating High-Quality Emotion Arcs Using Emotion Lexicons
Daniela Teodorescu
Saif M. Mohammad
55
8
0
13 Oct 2022
Predicting Fine-Tuning Performance with Probing
Zining Zhu
Soroosh Shahtalebi
Frank Rudzicz
64
10
0
13 Oct 2022
Counterfactual Multihop QA: A Cause-Effect Approach for Reducing Disconnected Reasoning
Wangzhen Guo
Qinkang Gong
Hanjiang Lai
LRM
75
4
0
13 Oct 2022
Spontaneous Emerging Preference in Two-tower Language Model
Zhengqi He
Taro Toyoizumi
LRM
50
1
0
13 Oct 2022
On the Evaluation of the Plausibility and Faithfulness of Sentiment Analysis Explanations
Julia El Zini
Mohamad Mansour
Basel Mousi
M. Awad
64
8
0
13 Oct 2022
Previous
1
2
3
...
23
24
25
...
69
70
71
Next