Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
v1
v2 (latest)
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,518 papers shown
Title
Modeling and Analyzing the Influence of Non-Item Pages on Sequential Next-Item Prediction
Elisabeth Fischer
Albin Zehe
Andreas Hotho
Daniel Schlor
HAI
161
0
0
28 Aug 2024
A Survey of Large Language Models for European Languages
Wazir Ali
S. Pyysalo
153
3
0
27 Aug 2024
CVPT: Cross-Attention help Visual Prompt Tuning adapt visual task
Lingyun Huang
Jianxu Mao
Yaonan Wang
Junfei Yi
Ziming Tao
VLM
VPVLM
88
2
0
27 Aug 2024
Sapiens: Foundation for Human Vision Models
Rawal Khirodkar
Timur M. Bagautdinov
Julieta Martinez
Su Zhaoen
Austin James
Peter Selednik
Stuart Anderson
Shunsuke Saito
VLM
143
81
0
22 Aug 2024
Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders
Yuan Xin
Zehan Li
Ning Yu
Dingfan Chen
Mario Fritz
Michael Backes
Yang Zhang
PILM
MIACV
106
2
0
20 Aug 2024
Leveraging Superfluous Information in Contrastive Representation Learning
Xuechu Yu
SSL
67
2
0
19 Aug 2024
Zero Day Ransomware Detection with Pulse: Function Classification with Transformer Models and Assembly Language
Matthew G. Gaber
Mohiuddin Ahmed
Helge Janicke
123
10
0
15 Aug 2024
End-to-end Semantic-centric Video-based Multimodal Affective Computing
Ronghao Lin
Ying Zeng
Sijie Mai
Haifeng Hu
VGen
118
0
0
14 Aug 2024
Multilingual Models for Check-Worthy Social Media Posts Detection
Sebastian Kula
Michal Gregor
65
0
0
13 Aug 2024
A Psychology-based Unified Dynamic Framework for Curriculum Learning
Guangyu Meng
Qingkai Zeng
John P. Lalor
Hong-ye Yu
76
0
0
09 Aug 2024
Survey: Transformer-based Models in Data Modality Conversion
Elyas Rashno
Amir Eskandari
Aman Anand
F. Zulkernine
MedIm
91
0
0
08 Aug 2024
Recognizing Emotion Regulation Strategies from Human Behavior with Large Language Models
Philipp Müller
Alexander Heimerl
Sayed Muddashir Hossain
Lea Siegel
Jan Alexandersson
Patrick Gebhard
Elisabeth André
T. Schneeberger
88
0
0
08 Aug 2024
Semantics or spelling? Probing contextual word embeddings with orthographic noise
Jacob A. Matthews
John R. Starr
Marten van Schijndel
70
2
0
08 Aug 2024
Improving the quality of Persian clinical text with a novel spelling correction system
Seyed Mohammad Sadegh Dashti
S. F. Dashti
87
0
0
07 Aug 2024
SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection
Yonghui Wang
Shaokai Liu
Li Li
Wengang Zhou
Houqiang Li
ViT
83
1
0
07 Aug 2024
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets
Shima Foolad
Kourosh Kiani
R. Rastgoo
FaML
89
0
0
04 Aug 2024
GalleryGPT: Analyzing Paintings with Large Multimodal Models
Yi Bin
Wenhao Shi
Yujuan Ding
Zhiqiang Hu
Zheng Wang
Yang Yang
See-Kiong Ng
H. Shen
MLLM
89
11
0
01 Aug 2024
What comes after transformers? -- A selective survey connecting ideas in deep learning
Johannes Schneider
AI4CE
112
2
0
01 Aug 2024
Big Cooperative Learning
Yulai Cong
AI4CE
70
0
0
31 Jul 2024
Informed Correctors for Discrete Diffusion Models
Yixiu Zhao
Jiaxin Shi
F. Chen
Shaul Druckmann
Lester W. Mackey
Scott W. Linderman
133
15
0
30 Jul 2024
Evaluating Large Language Models for automatic analysis of teacher simulations
David de-Fitero-Dominguez
Mariano Albaladejo-González
Antonio Garcia-Cabot
Eva García-López
Antonio Moreno-Cediel
Erin Barno
Justin Reich
ELM
52
0
0
29 Jul 2024
What Matters in Explanations: Towards Explainable Fake Review Detection Focusing on Transformers
Md. Shajalal
Md. Atabuzzaman
Alexander Boden
Gunnar Stevens
Delong Du
85
0
0
24 Jul 2024
NarrationDep: Narratives on Social Media For Automatic Depression Detection
Hamad Zogan
Imran Razzak
Shoaib Jameel
Guandong Xu
39
0
0
24 Jul 2024
Pre-Training and Prompting for Few-Shot Node Classification on Text-Attributed Graphs
Huan-jing Zhao
Beining Yang
Yukuo Cen
Junyu Ren
Chenhui Zhang
Yuxiao Dong
Evgeny Kharlamov
Shu Zhao
Jie Tang
VLM
94
8
0
22 Jul 2024
ALLaM: Large Language Models for Arabic and English
M Saiful Bari
Yazeed Alnumay
Norah A. Alzahrani
Nouf M. Alotaibi
H. A. Alyahya
...
Jeril Kuriakose
Abdalghani Abujabal
Nora Al-Twairesh
Areeb Alowisheq
Haidar Khan
73
17
0
22 Jul 2024
Recent Advances in Generative AI and Large Language Models: Current Status, Challenges, and Perspectives
D. Hagos
Rick Battle
Danda B. Rawat
LM&MA
OffRL
114
27
0
20 Jul 2024
PERCORE: A Deep Learning-Based Framework for Persian Spelling Correction with Phonetic Analysis
S. Dashti
A. K. Bardsiri
M. J. Shahbazzadeh
109
4
0
20 Jul 2024
PassTSL: Modeling Human-Created Passwords through Two-Stage Learning
Yangde Wang
Haozhang Li
Weidong Qiu
Shujun Li
Peng Tang
78
1
0
19 Jul 2024
Detecting and Characterising Mobile App Metamorphosis in Google Play Store
Dishanika Denipitiyage
B. Silva
K. Gunathilaka
Suranga Seneviratne
A. Mahanti
A. Seneviratne
Sanjay Chawla
60
1
0
19 Jul 2024
Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by Direct Preference Optimization
Md Sultan al Nahian
R. Kavuluru
MedIm
AI4CE
56
0
0
19 Jul 2024
Temporal Representation Learning for Stock Similarities and Its Applications in Investment Management
Yoon-Jeong Hwang
Stefan Zohren
Yongjae Lee
AIFin
73
1
0
18 Jul 2024
CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications
Mirza Masfiqur Rahman
Imtiaz Karim
Elisa Bertino
59
3
0
18 Jul 2024
Transformer-based Single-Cell Language Model: A Survey
Wei Lan
Guohang He
Mingyang Liu
Qingfeng Chen
Junyue Cao
Wei Peng
MedIm
LRM
62
7
0
18 Jul 2024
Sharif-STR at SemEval-2024 Task 1: Transformer as a Regression Model for Fine-Grained Scoring of Textual Semantic Relations
Seyedeh Fatemeh Ebrahimi
Karim Akhavan Azari
Amirmasoud Iravani
Hadi Alizadeh
Zeinab Taghavi
Hossein Sameti
60
4
0
17 Jul 2024
SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models
Yang Zhou
Yongjian Wu
Jiya Saiyin
Bingzheng Wei
Maode Lai
Eric Chang
Yan Xu
VLM
87
1
0
16 Jul 2024
Hierarchical Multi-modal Transformer for Cross-modal Long Document Classification
Tengfei Liu
Yongli Hu
Junbin Gao
Yanfeng Sun
Baocai Yin
83
0
0
14 Jul 2024
FarFetched: Entity-centric Reasoning and Claim Validation for the Greek Language based on Textually Represented Environments
D. Papadopoulos
Katerina Metropoulou
N. Matsatsinis
N. Papadakis
LRM
57
3
0
13 Jul 2024
Mitigating Entity-Level Hallucination in Large Language Models
Weihang Su
Yichen Tang
Qingyao Ai
Changyue Wang
Zhijing Wu
Yiqun Liu
HILM
91
8
0
12 Jul 2024
Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text
Lucio La Cava
Davide Costa
Andrea Tagarelli
DeLMO
110
5
0
12 Jul 2024
A Review of the Challenges with Massive Web-mined Corpora Used in Large Language Models Pre-Training
Michał Perełkiewicz
Rafał Poświata
71
3
0
10 Jul 2024
NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification
Hongfei Huang
Tingting Liang
Xixi Sun
Zikang Jin
Yuyu Yin
NoLa
77
1
0
09 Jul 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
152
56
0
09 Jul 2024
Data, Data Everywhere: A Guide for Pretraining Dataset Construction
Jupinder Parmar
Shrimai Prabhumoye
Joseph Jennings
Bo Liu
Aastha Jhunjhunwala
Zhilin Wang
M. Patwary
Mohammad Shoeybi
Bryan Catanzaro
122
10
0
08 Jul 2024
MST5 -- Multilingual Question Answering over Knowledge Graphs
Nikit Srivastava
Mengshi Ma
Daniel Vollmers
Hamada M. Zahera
Diego Moussallem
A. N. Ngomo
59
1
0
08 Jul 2024
MSP-Podcast SER Challenge 2024: Lántenne du Ventoux Multimodal Self-Supervised Learning for Speech Emotion Recognition
J. Duret
Mickael Rouvier
Yannick Esteve
49
3
0
08 Jul 2024
Unmasking Trees for Tabular Data
Calvin McCarter
90
3
0
08 Jul 2024
Advancing Prompt Recovery in NLP: A Deep Dive into the Integration of Gemma-2b-it and Phi2 Models
Jianlong Chen
Wei Xu
Zhicheng Ding
Jinxin Xu
Hao Yan
Xinyu Zhang
128
2
0
07 Jul 2024
HYBRINFOX at CheckThat! 2024 -- Task 1: Enhancing Language Models with Structured Information for Check-Worthiness Estimation
Géraud Faye
Morgane Casanova
B. Icard
Julien Chanson
Guillaume Gadek
Guillaume Gravier
Paul Égré
27
2
0
04 Jul 2024
Aspect-Based Sentiment Analysis Techniques: A Comparative Study
Dineth Jayakody
Koshila Isuranda
A. V. A. Malkith
Nisansa de Silva
Sachintha Rajith Ponnamperuma
G. Sandamali
K. L. Sudheera
65
1
0
03 Jul 2024
MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models
Ying Zhang
Ziheng Yang
Shufan Ji
KELM
46
1
0
03 Jul 2024
Previous
1
2
3
4
5
6
...
69
70
71
Next