1906.08237
Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 1,454 papers shown
Title
A context-aware knowledge transferring strategy for CTC-based ASR
Keda Lu
Kuan-Yu Chen
15
15
0
12 Oct 2022
Decoupled Context Processing for Context Augmented Language Modeling
Zonglin Li
Ruiqi Guo
Surinder Kumar
RALM
KELM
21
23
0
11 Oct 2022
Enriching Biomedical Knowledge for Low-resource Language Through Large-Scale Translation
Long Phan
Tai Dang
H. Tran
Trieu H. Trinh
Vy Phan
Lam D. Chau
Minh-Thang Luong
20
8
0
11 Oct 2022
Multi-CLS BERT: An Efficient Alternative to Traditional Ensembling
Haw-Shiuan Chang
Ruei-Yao Sun
Kathryn Ricci
Andrew McCallum
43
15
0
10 Oct 2022
Uncertainty Quantification with Pre-trained Language Models: A Large-Scale Empirical Analysis
Yuxin Xiao
Paul Pu Liang
Umang Bhatt
Willie Neiswanger
Ruslan Salakhutdinov
Louis-Philippe Morency
175
86
0
10 Oct 2022
Parameter-Efficient Tuning with Special Token Adaptation
Xiaocong Yang
James Y. Huang
Wenxuan Zhou
Muhao Chen
34
12
0
10 Oct 2022
SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters
Shwai He
Liang Ding
Daize Dong
Miao Zhang
Dacheng Tao
MoE
32
87
0
09 Oct 2022
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training
Zi-Hua Zhang
Long Zhou
Junyi Ao
Shujie Liu
Lirong Dai
Jinyu Li
Furu Wei
61
57
0
07 Oct 2022
Debiasing isn't enough! -- On the Effectiveness of Debiasing MLMs and their Social Biases in Downstream Tasks
Masahiro Kaneko
Danushka Bollegala
Naoaki Okazaki
28
41
0
06 Oct 2022
XDoc: Unified Pre-training for Cross-Format Document Understanding
Jingye Chen
Tengchao Lv
Lei Cui
Changrong Zhang
Furu Wei
52
13
0
06 Oct 2022
Melody Infilling with User-Provided Structural Context
Chih-Pin Tan
A. Su
Yi-Hsuan Yang
41
3
0
06 Oct 2022
Transformer-based conditional generative adversarial network for multivariate time series generation
Abdellah Madane
M. Dilmi
Florent Forest
Hanane Azzag
M. Lebbah
J. Lacaille
30
10
0
05 Oct 2022
Understanding Prior Bias and Choice Paralysis in Transformer-based Language Representation Models through Four Experimental Probes
Ke Shen
Mayank Kejriwal
27
4
0
03 Oct 2022
Neural Media Bias Detection Using Distant Supervision With BABE -- Bias Annotations By Experts
Timo Spinde
Manuel Plank
Jan-David Krieger
Terry Ruas
Bela Gipp
Akiko Aizawa
27
69
0
29 Sep 2022
Downstream Datasets Make Surprisingly Good Pretraining Corpora
Kundan Krishna
Saurabh Garg
Jeffrey P. Bigham
Zachary Chase Lipton
50
30
0
28 Sep 2022
TVLT: Textless Vision-Language Transformer
Zineng Tang
Jaemin Cho
Yixin Nie
Joey Tianyi Zhou
VLM
54
28
0
28 Sep 2022
Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models
Xiuying Wei
Yunchen Zhang
Xiangguo Zhang
Ruihao Gong
Shanghang Zhang
Qi Zhang
F. Yu
Xianglong Liu
MQ
38
147
0
27 Sep 2022
Adaptation of domain-specific transformer models with text oversampling for sentiment analysis of social media posts on Covid-19 vaccines
Anmol Bansal
Arjun Choudhry
Anubhav Sharma
Seba Susan
MedIm
21
4
0
22 Sep 2022
Deep Lake: a Lakehouse for Deep Learning
S. Hambardzumyan
Abhinav Tuli
Levon Ghukasyan
Fariz Rahman
Hrant Topchyan
...
Mark McQuade
M. Harutyunyan
Tatevik Hakobyan
I. Stranic
Davit Buniatyan
29
17
0
22 Sep 2022
Knowledge-Aware Bayesian Deep Topic Model
Dongsheng Wang
Yishi Xu
Miaoge Li
Zhibin Duan
Chaojie Wang
Bo Chen
Mingyuan Zhou
BDL
38
15
0
20 Sep 2022
Probabilistic Generative Transformer Language models for Generative Design of Molecules
Lai Wei
Nihang Fu
Yuqi Song
Qian Wang
Jianjun Hu
AI4CE
38
11
0
20 Sep 2022
SAMP: A Model Inference Toolkit of Post-Training Quantization for Text Processing via Self-Adaptive Mixed-Precision
Rong Tian
Zijing Zhao
Weijie Liu
Haoyan Liu
Weiquan Mao
Zhe Zhao
Kimmo Yan
MQ
22
5
0
19 Sep 2022
Batch Layer Normalization, A new normalization layer for CNNs and RNN
A. Ziaee
Erion Çano
19
13
0
19 Sep 2022
Probing Spurious Correlations in Popular Event-Based Rumor Detection Benchmarks
Jiaying Wu
Bryan Hooi
28
5
0
19 Sep 2022
Evons: A Dataset for Fake and Real News Virality Analysis and Prediction
K. Krstovski
A. Ryu
B. Kogut
GNN
26
5
0
16 Sep 2022
Linear Transformations for Cross-lingual Sentiment Analysis
Pavel Přibáň
Jakub Šmíd
Adam Mištera
Pavel Král
26
3
0
15 Sep 2022
Revisiting the Practical Effectiveness of Constituency Parse Extraction from Pre-trained Language Models
Taeuk Kim
37
1
0
15 Sep 2022
A semantic hierarchical graph neural network for text classification
Shuai Hua
Xinxin Li
Yun Jing
Qu Liu
17
3
0
15 Sep 2022
Drawing Causal Inferences About Performance Effects in NLP
Sandra Wankmüller
CML
16
1
0
14 Sep 2022
SPACE-3: Unified Dialog Model Pre-training for Task-Oriented Dialog Understanding and Generation
Wanwei He
Yinpei Dai
Min Yang
Jian Sun
Fei Huang
Luo Si
Yongbin Li
33
60
0
14 Sep 2022
Pre-training for Information Retrieval: Are Hyperlinks Fully Explored?
Jiawen Wu
Xinyu Zhang
Yutao Zhu
Zheng Liu
Zikai Guo
Zhaoye Fei
Ruofei Lai
Yongkang Wu
Bo Zhao
Zhicheng Dou
38
5
0
14 Sep 2022
PainPoints: A Framework for Language-based Detection of Chronic Pain and Expert-Collaborative Text-Summarization
S. Fadnavis
Amit Dhurandhar
R. Norel
Jenna M. Reinen
C. Agurto
E. Secchettin
V. Schweiger
Giovanni Perini
Guillermo Cecchi
34
1
0
14 Sep 2022
CNN-Trans-Enc: A CNN-Enhanced Transformer-Encoder On Top Of Static BERT representations for Document Classification
Charaf Eddine Benarab
Shenglin Gui
27
6
0
13 Sep 2022
Pre-trained Language Models for the Legal Domain: A Case Study on Indian Law
Shounak Paul
A. Mandal
Pawan Goyal
Saptarshi Ghosh
AILaw
VLM
ELM
40
45
0
13 Sep 2022
Socially Enhanced Situation Awareness from Microblogs using Artificial Intelligence: A Survey
Rabindra Lamsal
Aaron Harwood
M. Read
42
20
0
13 Sep 2022
An Embedding-Based Grocery Search Model at Instacart
Yuqing Xie
Taesik Na
X. Xiao
Saurav Manchanda
Young Rao
Zhihong Xu
Guanghua Shu
Esther Vasiete
Tejaswi Tenneti
Haixun Wang
DML
RALM
32
6
0
12 Sep 2022
Blessing of Class Diversity in Pre-training
Yulai Zhao
Jianshu Chen
S. Du
AI4CE
26
3
0
07 Sep 2022
AutoPruner: Transformer-Based Call Graph Pruning
Thanh Le-Cong
Hong Jin Kang
Truong-Giang Nguyen
S. A. Haryono
David Lo
X. Le
H. Thang
32
19
0
07 Sep 2022
Mimose: An Input-Aware Checkpointing Planner for Efficient Training on GPU
Jian-He Liao
Mingzhen Li
Qingxiao Sun
Jiwei Hao
F. Yu
...
Ye Tao
Zicheng Zhang
Hailong Yang
Zhongzhi Luan
D. Qian
23
4
0
06 Sep 2022
A Survey on Measuring and Mitigating Reasoning Shortcuts in Machine Reading Comprehension
Xanh Ho
Johannes Mario Meissner
Saku Sugawara
Akiko Aizawa
OffRL
40
4
0
05 Sep 2022
Survey: Exploiting Data Redundancy for Optimization of Deep Learning
Jou-An Chen
Wei Niu
Bin Ren
Yanzhi Wang
Xipeng Shen
23
24
0
29 Aug 2022
Of Human Criteria and Automatic Metrics: A Benchmark of the Evaluation of Story Generation
Cyril Chhun
Pierre Colombo
Chloé Clavel
Fabian M. Suchanek
53
51
0
24 Aug 2022
Visual Subtitle Feature Enhanced Video Outline Generation
Qi Lv
Ziqiang Cao
Wenrui Xie
Derui Wang
Jingwen Wang
...
Yuan-Fang Li
Min Cao
Wenjie Li
Sujian Li
Guohong Fu
VGen
26
0
0
24 Aug 2022
CLOWER: A Pre-trained Language Model with Contrastive Learning over Word and Character Representations
Borun Chen
Hongyin Tang
Jiahao Bu
Kai Zhang
Jingang Wang
Qifan Wang
Haitao Zheng
Wei Wu
Liqian Yu
VLM
27
1
0
23 Aug 2022
A Novel Multi-Task Learning Approach for Context-Sensitive Compound Type Identification in Sanskrit
Jivnesh Sandhan
Ashish Gupta
Hrishikesh Terdalkar
Tushar Sandhan
S. Samanta
Laxmidhar Behera
Pawan Goyal
26
3
0
22 Aug 2022
PANDA: Prompt Transfer Meets Knowledge Distillation for Efficient Model Adaptation
Qihuang Zhong
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
VLM
CLL
34
41
0
22 Aug 2022
Lost in Context? On the Sense-wise Variance of Contextualized Word Embeddings
Yile Wang
Yue Zhang
26
4
0
20 Aug 2022
Pretrained Language Encoders are Natural Tagging Frameworks for Aspect Sentiment Triplet Extraction
Yanjie Gou
Yinjie Lei
Lingqiao Liu
Yong Dai
Chun-Yen Shen
Yongqi Tong
ViT
34
0
0
20 Aug 2022
Graph-Augmented Cyclic Learning Framework for Similarity Estimation of Medical Clinical Notes
Can Zheng
Yanshan Wang
X. Jia
32
0
0
19 Aug 2022
A Kind Introduction to Lexical and Grammatical Aspect, with a Survey of Computational Approaches
Annemarie Friedrich
Nianwen Xue
Alexis Palmer
40
2
0
18 Aug 2022