1906.08237
Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,520 papers shown
Disentangled Variational Autoencoder for Emotion Recognition in Conversations
Kailai Yang
Tianlin Zhang
Sophia Ananiadou
DRL
95
11
0
23 May 2023
Assessing Linguistic Generalisation in Language Models: A Dataset for Brazilian Portuguese
Rodrigo Wilkens
Leonardo Zilio
Aline Villavicencio
58
1
0
23 May 2023
BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation
Liyan Kang
Luyang Huang
Ningxin Peng
Peihao Zhu
Zewei Sun
Shanbo Cheng
Mingxuan Wang
Degen Huang
Jinsong Su
79
10
0
23 May 2023
Towards Zero-shot Relation Extraction in Web Mining: A Multimodal Approach with Relative XML Path
Zilong Wang
Jingbo Shang
82
0
0
23 May 2023
TransCoder: Towards Unified Transferable Code Representation Learning Inspired by Human Skills
Qiushi Sun
Nuo Chen
Jiadong Wang
Xiang Li
Ming Gao
79
8
0
23 May 2023
Regex-augmented Domain Transfer Topic Classification based on a Pre-trained Language Model: An application in Financial Domain
Vanessa Liao
Syed Shariyar Murtaza
Yifan Nie
Jimmy J. Lin
52
0
0
23 May 2023
Can LLMs facilitate interpretation of pre-trained language models?
Basel Mousi
Nadir Durrani
Fahim Dalvi
93
13
0
22 May 2023
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Shayne Longpre
Gregory Yauney
Emily Reif
Katherine Lee
Adam Roberts
...
Denny Zhou
Jason W. Wei
Kevin Robinson
David M. Mimno
Daphne Ippolito
117
168
0
22 May 2023
Exploring User Perspectives on ChatGPT: Applications, Perceptions, and Implications for AI-Integrated Education
Reza Hadi Mogavi
Chaohua Deng
Justin Juho Kim
Pengyuan Zhou
Young D. Kwon
...
Simone Bassanelli
A. Bucchiarone
Sujit Gujar
Lennart E. Nacke
Pan Hui
90
49
0
22 May 2023
Should We Attend More or Less? Modulating Attention for Fairness
A. Zayed
Gonçalo Mordido
Samira Shabanian
Sarath Chandar
85
10
0
22 May 2023
Bidirectional Transformer Reranker for Grammatical Error Correction
Ying Zhang
Hidetaka Kamigaito
Manabu Okumura
54
2
0
22 May 2023
Data-efficient Active Learning for Structured Prediction with Partial Annotation and Self-Training
Zhisong Zhang
Emma Strubell
Eduard H. Hovy
77
1
0
22 May 2023
F-PABEE: Flexible-patience-based Early Exiting for Single-label and Multi-label text Classification Tasks
Xiangxiang Gao
Wei-wei Zhu
Jiasheng Gao
Congrui Yin
VLM
92
12
0
21 May 2023
PINA: Leveraging Side Information in eXtreme Multi-label Classification via Predicted Instance Neighborhood Aggregation
Eli Chien
Jiong Zhang
Cho-Jui Hsieh
Jyun-Yu Jiang
Wei-Cheng Chang
O. Milenkovic
Hsiang-Fu Yu
86
10
0
21 May 2023
Lifelong Language Pretraining with Distribution-Specialized Experts
Wuyang Chen
Yan-Quan Zhou
Nan Du
Yanping Huang
James Laudon
Zhiwen Chen
Claire Cu
KELM
111
52
0
20 May 2023
Contextualizing Argument Quality Assessment with Relevant Knowledge
D. Deshpande
Zhivar Sourati
Filip Ilievski
Fred Morstatter
88
2
0
20 May 2023
Patton: Language Model Pretraining on Text-Rich Networks
Bowen Jin
Wentao Zhang
Yu Zhang
Yu Meng
Xinyang Zhang
Qi Zhu
Jiawei Han
VLM
112
46
0
20 May 2023
Deep Learning Approaches to Lexical Simplification: A Survey
Kai North
Tharindu Ranasinghe
Matthew Shardlow
Marcos Zampieri
50
15
0
19 May 2023
SeeGULL: A Stereotype Benchmark with Broad Geo-Cultural Coverage Leveraging Generative Models
Akshita Jha
Aida Mostafazadeh Davani
Chandan K. Reddy
Shachi Dave
Vinodkumar Prabhakaran
Sunipa Dev
87
50
0
19 May 2023
Decouple knowledge from parameters for plug-and-play language modeling
Xin Cheng
Yankai Lin
Preslav Nakov
Dongyan Zhao
Rui Yan
KELM
86
2
0
19 May 2023
A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation
Xiaowei Huang
Wenjie Ruan
Wei Huang
Gao Jin
Yizhen Dong
...
Sihao Wu
Peipei Xu
Dengyu Wu
André Freitas
Mustafa A. Mustafa
ALM
132
96
0
19 May 2023
Human Behavioral Benchmarking: Numeric Magnitude Comparison Effects in Large Language Models
Raj Sanjay Shah
Vijay Marupudi
Reba Koenen
Khushi Bhardwaj
Sashank Varma
78
6
0
18 May 2023
NoisywikiHow: A Benchmark for Learning with Real-world Noisy Labels in Natural Language Processing
Tingting Wu
Xiao Ding
Minji Tang
Haotian Zhang
Bing Qin
Ting Liu
NoLa
94
11
0
18 May 2023
Think Outside the Code: Brainstorming Boosts Large Language Models in Code Generation
Xinyu Li
Jiang-Tian Xue
Zheng Xie
Ming Li
LRM
83
28
0
18 May 2023
Statistical Knowledge Assessment for Large Language Models
Qingxiu Dong
Jingjing Xu
Lingpeng Kong
Zhifang Sui
Lei Li
HILM
63
8
0
17 May 2023
When Gradient Descent Meets Derivative-Free Optimization: A Match Made in Black-Box Scenario
Chengcheng Han
Liqing Cui
Renyu Zhu
Jiadong Wang
Nuo Chen
Qiushi Sun
Xiang Li
Ming Gao
82
7
0
17 May 2023
CageViT: Convolutional Activation Guided Efficient Vision Transformer
Hao Zheng
Jinbao Wang
Xiantong Zhen
Hao Chen
Jingkuan Song
Feng Zheng
ViT
78
0
0
17 May 2023
Machine-Made Media: Monitoring the Mobilization of Machine-Generated Articles on Misinformation and Mainstream News Websites
Hans W. A. Hanley
Zakir Durumeric
DeLMO
65
32
0
16 May 2023
Tailoring Instructions to Student's Learning Levels Boosts Knowledge Distillation
Yuxin Ren
Zi-Qi Zhong
Xingjian Shi
Yi Zhu
Chun Yuan
Mu Li
105
7
0
16 May 2023
UOR: Universal Backdoor Attacks on Pre-trained Language Models
Wei Du
Peixuan Li
Yue Liu
Haodong Zhao
Gongshen Liu
AAML
59
9
0
16 May 2023
DLUE: Benchmarking Document Language Understanding
Ruoxi Xu
Hongyu Lin
Xinyan Guan
Xianpei Han
Yingfei Sun
Le Sun
ELM
80
0
0
16 May 2023
Weight-Inherited Distillation for Task-Agnostic BERT Compression
Taiqiang Wu
Cheng-An Hou
Shanshan Lao
Jiayi Li
Ngai Wong
Zhe Zhao
Yujiu Yang
136
10
0
16 May 2023
Exploring In-Context Learning Capabilities of Foundation Models for Generating Knowledge Graphs from Text
H. Khorashadizadeh
Nandana Mihindukulasooriya
Sanju Tiwari
Jinghua Groppe
Sven Groppe
73
23
0
15 May 2023
Assessing Hidden Risks of LLMs: An Empirical Study on Robustness, Consistency, and Credibility
Wen-song Ye
Mingfeng Ou
Tianyi Li
Yipeng Chen
Xuetao Ma
...
Sai Wu
Jie Fu
Gang Chen
Haobo Wang
Jiaqi Zhao
104
38
0
15 May 2023
AdamR at SemEval-2023 Task 10: Solving the Class Imbalance Problem in Sexism Detection with Ensemble Learning
Adam Rydelek
Daryna Dementieva
Georg Groh
33
2
0
15 May 2023
MeeQA: Natural Questions in Meeting Transcripts
Reut Apel
Tom Braude
Amir Kantor
Eyal Kolman
RALM
63
2
0
15 May 2023
Text Classification via Large Language Models
Xiaofei Sun
Xiaoya Li
Jiwei Li
Leilei Gan
Shangwei Guo
Tianwei Zhang
Guoyin Wang
RALM
LRM
104
150
0
15 May 2023
STORYWARS: A Dataset and Instruction Tuning Baselines for Collaborative Story Understanding and Generation
Yulun Du
Lydia B. Chilton
88
8
0
14 May 2023
ParaLS: Lexical Substitution via Pretrained Paraphraser
Jipeng Qiang
Kang Liu
Yun Li
Yunhao Yuan
Yi Zhu
KELM
94
11
0
14 May 2023
Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives
Qiushi Sun
Chengcheng Han
Nuo Chen
Renyu Zhu
Jing Gong
Xiang Li
Ming Gao
VLM
47
9
0
14 May 2023
Efficient Asynchronize Stochastic Gradient Algorithm with Structured Data
Zhao Song
Mingquan Ye
72
4
0
13 May 2023
EAML: Ensemble Self-Attention-based Mutual Learning Network for Document Image Classification
Souhail Bakkali
Zuheng Ming
Mickael Coustaty
Marçal Rusiñol
65
6
0
11 May 2023
INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models
H. S. V. N. S. K. Renduchintala
Krishnateja Killamsetty
S. Bhatia
Milan Aggarwal
Ganesh Ramakrishnan
Rishabh K. Iyer
Balaji Krishnamurthy
AIFin
36
4
0
11 May 2023
ChatGPT-Like Large-Scale Foundation Models for Prognostics and Health Management: A Survey and Roadmaps
Yanfang Li
Huan Wang
Muxia Sun
LM&MA
AI4TS
AI4CE
103
59
0
10 May 2023
Dynamic Graph Representation Learning for Depression Screening with Transformer
Ai-Te Kuo
Haiquan Chen
Yu-Hsuan Kuo
Wei-Shinn Ku
31
3
0
10 May 2023
ORKG-Leaderboards: A Systematic Workflow for Mining Leaderboards as a Knowledge Graph
Salomon Kabongo KABENAMUALU
Jennifer D'Souza
Sören Auer
111
20
0
10 May 2023
VTPNet for 3D deep learning on point cloud
Wei Zhou
Weiwei Jin
Qian Wang
Yifan Wang
Dekui Wang
Xingxing Hao
Yong Yu
3DPC
ViT
49
0
0
10 May 2023
SPSQL: Step-by-step Parsing Based Framework for Text-to-SQL Generation
Ran Shen
Gang Sun
Hao Shen
Yiling Li
Liangfeng Jin
Han Jiang
55
5
0
10 May 2023
ANALOGICAL -- A Novel Benchmark for Long Text Analogy Evaluation in Large Language Models
Thilini Wijesiriwardene
Ruwan Wickramarachchi
Bimal Gajera
Shreeyash Mukul Gowaikar
Chandan Gupta
Aman Chadha
Aishwarya N. Reganti
Amit P. Sheth
Amitava Das
ELM
81
14
0
08 May 2023
SignBERT+: Hand-model-aware Self-supervised Pre-training for Sign Language Understanding
Hezhen Hu
Weichao Zhao
Wen-gang Zhou
Houqiang Li
ViT
95
74
0
08 May 2023