ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,518 papers shown
Title
Modeling and Analyzing the Influence of Non-Item Pages on Sequential Next-Item Prediction
Modeling and Analyzing the Influence of Non-Item Pages on Sequential Next-Item Prediction
Elisabeth Fischer
Albin Zehe
Andreas Hotho
Daniel Schlor
HAI
161
0
0
28 Aug 2024
A Survey of Large Language Models for European Languages
A Survey of Large Language Models for European Languages
Wazir Ali
S. Pyysalo
153
3
0
27 Aug 2024
CVPT: Cross-Attention help Visual Prompt Tuning adapt visual task
CVPT: Cross-Attention help Visual Prompt Tuning adapt visual task
Lingyun Huang
Jianxu Mao
Yaonan Wang
Junfei Yi
Ziming Tao
VLMVPVLM
88
2
0
27 Aug 2024
Sapiens: Foundation for Human Vision Models
Sapiens: Foundation for Human Vision Models
Rawal Khirodkar
Timur M. Bagautdinov
Julieta Martinez
Su Zhaoen
Austin James
Peter Selednik
Stuart Anderson
Shunsuke Saito
VLM
143
81
0
22 Aug 2024
Inside the Black Box: Detecting Data Leakage in Pre-trained Language
  Encoders
Inside the Black Box: Detecting Data Leakage in Pre-trained Language Encoders
Yuan Xin
Zehan Li
Ning Yu
Dingfan Chen
Mario Fritz
Michael Backes
Yang Zhang
PILMMIACV
106
2
0
20 Aug 2024
Leveraging Superfluous Information in Contrastive Representation
  Learning
Leveraging Superfluous Information in Contrastive Representation Learning
Xuechu Yu
SSL
67
2
0
19 Aug 2024
Zero Day Ransomware Detection with Pulse: Function Classification with
  Transformer Models and Assembly Language
Zero Day Ransomware Detection with Pulse: Function Classification with Transformer Models and Assembly Language
Matthew G. Gaber
Mohiuddin Ahmed
Helge Janicke
123
10
0
15 Aug 2024
End-to-end Semantic-centric Video-based Multimodal Affective Computing
End-to-end Semantic-centric Video-based Multimodal Affective Computing
Ronghao Lin
Ying Zeng
Sijie Mai
Haifeng Hu
VGen
118
0
0
14 Aug 2024
Multilingual Models for Check-Worthy Social Media Posts Detection
Multilingual Models for Check-Worthy Social Media Posts Detection
Sebastian Kula
Michal Gregor
65
0
0
13 Aug 2024
A Psychology-based Unified Dynamic Framework for Curriculum Learning
A Psychology-based Unified Dynamic Framework for Curriculum Learning
Guangyu Meng
Qingkai Zeng
John P. Lalor
Hong-ye Yu
76
0
0
09 Aug 2024
Survey: Transformer-based Models in Data Modality Conversion
Survey: Transformer-based Models in Data Modality Conversion
Elyas Rashno
Amir Eskandari
Aman Anand
F. Zulkernine
MedIm
91
0
0
08 Aug 2024
Recognizing Emotion Regulation Strategies from Human Behavior with Large
  Language Models
Recognizing Emotion Regulation Strategies from Human Behavior with Large Language Models
Philipp Müller
Alexander Heimerl
Sayed Muddashir Hossain
Lea Siegel
Jan Alexandersson
Patrick Gebhard
Elisabeth André
T. Schneeberger
88
0
0
08 Aug 2024
Semantics or spelling? Probing contextual word embeddings with
  orthographic noise
Semantics or spelling? Probing contextual word embeddings with orthographic noise
Jacob A. Matthews
John R. Starr
Marten van Schijndel
70
2
0
08 Aug 2024
Improving the quality of Persian clinical text with a novel spelling
  correction system
Improving the quality of Persian clinical text with a novel spelling correction system
Seyed Mohammad Sadegh Dashti
S. F. Dashti
87
0
0
07 Aug 2024
SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection
SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection
Yonghui Wang
Shaokai Liu
Li Li
Wengang Zhou
Houqiang Li
ViT
83
1
0
07 Aug 2024
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey
  on Methods and Datasets
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets
Shima Foolad
Kourosh Kiani
R. Rastgoo
FaML
89
0
0
04 Aug 2024
GalleryGPT: Analyzing Paintings with Large Multimodal Models
GalleryGPT: Analyzing Paintings with Large Multimodal Models
Yi Bin
Wenhao Shi
Yujuan Ding
Zhiqiang Hu
Zheng Wang
Yang Yang
See-Kiong Ng
H. Shen
MLLM
89
11
0
01 Aug 2024
What comes after transformers? -- A selective survey connecting ideas in
  deep learning
What comes after transformers? -- A selective survey connecting ideas in deep learning
Johannes Schneider
AI4CE
112
2
0
01 Aug 2024
Big Cooperative Learning
Big Cooperative Learning
Yulai Cong
AI4CE
70
0
0
31 Jul 2024
Informed Correctors for Discrete Diffusion Models
Informed Correctors for Discrete Diffusion Models
Yixiu Zhao
Jiaxin Shi
F. Chen
Shaul Druckmann
Lester W. Mackey
Scott W. Linderman
133
15
0
30 Jul 2024
Evaluating Large Language Models for automatic analysis of teacher
  simulations
Evaluating Large Language Models for automatic analysis of teacher simulations
David de-Fitero-Dominguez
Mariano Albaladejo-González
Antonio Garcia-Cabot
Eva García-López
Antonio Moreno-Cediel
Erin Barno
Justin Reich
ELM
52
0
0
29 Jul 2024
What Matters in Explanations: Towards Explainable Fake Review Detection
  Focusing on Transformers
What Matters in Explanations: Towards Explainable Fake Review Detection Focusing on Transformers
Md. Shajalal
Md. Atabuzzaman
Alexander Boden
Gunnar Stevens
Delong Du
85
0
0
24 Jul 2024
NarrationDep: Narratives on Social Media For Automatic Depression
  Detection
NarrationDep: Narratives on Social Media For Automatic Depression Detection
Hamad Zogan
Imran Razzak
Shoaib Jameel
Guandong Xu
39
0
0
24 Jul 2024
Pre-Training and Prompting for Few-Shot Node Classification on
  Text-Attributed Graphs
Pre-Training and Prompting for Few-Shot Node Classification on Text-Attributed Graphs
Huan-jing Zhao
Beining Yang
Yukuo Cen
Junyu Ren
Chenhui Zhang
Yuxiao Dong
Evgeny Kharlamov
Shu Zhao
Jie Tang
VLM
94
8
0
22 Jul 2024
ALLaM: Large Language Models for Arabic and English
ALLaM: Large Language Models for Arabic and English
M Saiful Bari
Yazeed Alnumay
Norah A. Alzahrani
Nouf M. Alotaibi
H. A. Alyahya
...
Jeril Kuriakose
Abdalghani Abujabal
Nora Al-Twairesh
Areeb Alowisheq
Haidar Khan
73
17
0
22 Jul 2024
Recent Advances in Generative AI and Large Language Models: Current
  Status, Challenges, and Perspectives
Recent Advances in Generative AI and Large Language Models: Current Status, Challenges, and Perspectives
D. Hagos
Rick Battle
Danda B. Rawat
LM&MAOffRL
114
27
0
20 Jul 2024
PERCORE: A Deep Learning-Based Framework for Persian Spelling Correction
  with Phonetic Analysis
PERCORE: A Deep Learning-Based Framework for Persian Spelling Correction with Phonetic Analysis
S. Dashti
A. K. Bardsiri
M. J. Shahbazzadeh
109
4
0
20 Jul 2024
PassTSL: Modeling Human-Created Passwords through Two-Stage Learning
PassTSL: Modeling Human-Created Passwords through Two-Stage Learning
Yangde Wang
Haozhang Li
Weidong Qiu
Shujun Li
Peng Tang
78
1
0
19 Jul 2024
Detecting and Characterising Mobile App Metamorphosis in Google Play
  Store
Detecting and Characterising Mobile App Metamorphosis in Google Play Store
Dishanika Denipitiyage
B. Silva
K. Gunathilaka
Suranga Seneviratne
A. Mahanti
A. Seneviratne
Sanjay Chawla
60
1
0
19 Jul 2024
Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by
  Direct Preference Optimization
Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by Direct Preference Optimization
Md Sultan al Nahian
R. Kavuluru
MedImAI4CE
56
0
0
19 Jul 2024
Temporal Representation Learning for Stock Similarities and Its
  Applications in Investment Management
Temporal Representation Learning for Stock Similarities and Its Applications in Investment Management
Yoon-Jeong Hwang
Stefan Zohren
Yongjae Lee
AIFin
73
1
0
18 Jul 2024
CellularLint: A Systematic Approach to Identify Inconsistent Behavior in
  Cellular Network Specifications
CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications
Mirza Masfiqur Rahman
Imtiaz Karim
Elisa Bertino
59
3
0
18 Jul 2024
Transformer-based Single-Cell Language Model: A Survey
Transformer-based Single-Cell Language Model: A Survey
Wei Lan
Guohang He
Mingyang Liu
Qingfeng Chen
Junyue Cao
Wei Peng
MedImLRM
62
7
0
18 Jul 2024
Sharif-STR at SemEval-2024 Task 1: Transformer as a Regression Model for
  Fine-Grained Scoring of Textual Semantic Relations
Sharif-STR at SemEval-2024 Task 1: Transformer as a Regression Model for Fine-Grained Scoring of Textual Semantic Relations
Seyedeh Fatemeh Ebrahimi
Karim Akhavan Azari
Amirmasoud Iravani
Hadi Alizadeh
Zeinab Taghavi
Hossein Sameti
60
4
0
17 Jul 2024
SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language
  Pre-trained Models
SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models
Yang Zhou
Yongjian Wu
Jiya Saiyin
Bingzheng Wei
Maode Lai
Eric Chang
Yan Xu
VLM
87
1
0
16 Jul 2024
Hierarchical Multi-modal Transformer for Cross-modal Long Document
  Classification
Hierarchical Multi-modal Transformer for Cross-modal Long Document Classification
Tengfei Liu
Yongli Hu
Junbin Gao
Yanfeng Sun
Baocai Yin
83
0
0
14 Jul 2024
FarFetched: Entity-centric Reasoning and Claim Validation for the Greek
  Language based on Textually Represented Environments
FarFetched: Entity-centric Reasoning and Claim Validation for the Greek Language based on Textually Represented Environments
D. Papadopoulos
Katerina Metropoulou
N. Matsatsinis
N. Papadakis
LRM
57
3
0
13 Jul 2024
Mitigating Entity-Level Hallucination in Large Language Models
Mitigating Entity-Level Hallucination in Large Language Models
Weihang Su
Yichen Tang
Qingyao Ai
Changyue Wang
Zhijing Wu
Yiqun Liu
HILM
91
8
0
12 Jul 2024
Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text
Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text
Lucio La Cava
Davide Costa
Andrea Tagarelli
DeLMO
110
5
0
12 Jul 2024
A Review of the Challenges with Massive Web-mined Corpora Used in Large
  Language Models Pre-Training
A Review of the Challenges with Massive Web-mined Corpora Used in Large Language Models Pre-Training
Michał Perełkiewicz
Rafał Poświata
71
3
0
10 Jul 2024
NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in
  Text Classification
NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification
Hongfei Huang
Tingting Liang
Xixi Sun
Zikang Jin
Yuyu Yin
NoLa
77
1
0
09 Jul 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
152
56
0
09 Jul 2024
Data, Data Everywhere: A Guide for Pretraining Dataset Construction
Data, Data Everywhere: A Guide for Pretraining Dataset Construction
Jupinder Parmar
Shrimai Prabhumoye
Joseph Jennings
Bo Liu
Aastha Jhunjhunwala
Zhilin Wang
M. Patwary
Mohammad Shoeybi
Bryan Catanzaro
122
10
0
08 Jul 2024
MST5 -- Multilingual Question Answering over Knowledge Graphs
MST5 -- Multilingual Question Answering over Knowledge Graphs
Nikit Srivastava
Mengshi Ma
Daniel Vollmers
Hamada M. Zahera
Diego Moussallem
A. N. Ngomo
59
1
0
08 Jul 2024
MSP-Podcast SER Challenge 2024: Lántenne du Ventoux Multimodal
  Self-Supervised Learning for Speech Emotion Recognition
MSP-Podcast SER Challenge 2024: Lántenne du Ventoux Multimodal Self-Supervised Learning for Speech Emotion Recognition
J. Duret
Mickael Rouvier
Yannick Esteve
49
3
0
08 Jul 2024
Unmasking Trees for Tabular Data
Unmasking Trees for Tabular Data
Calvin McCarter
90
3
0
08 Jul 2024
Advancing Prompt Recovery in NLP: A Deep Dive into the Integration of
  Gemma-2b-it and Phi2 Models
Advancing Prompt Recovery in NLP: A Deep Dive into the Integration of Gemma-2b-it and Phi2 Models
Jianlong Chen
Wei Xu
Zhicheng Ding
Jinxin Xu
Hao Yan
Xinyu Zhang
128
2
0
07 Jul 2024
HYBRINFOX at CheckThat! 2024 -- Task 1: Enhancing Language Models with
  Structured Information for Check-Worthiness Estimation
HYBRINFOX at CheckThat! 2024 -- Task 1: Enhancing Language Models with Structured Information for Check-Worthiness Estimation
Géraud Faye
Morgane Casanova
B. Icard
Julien Chanson
Guillaume Gadek
Guillaume Gravier
Paul Égré
27
2
0
04 Jul 2024
Aspect-Based Sentiment Analysis Techniques: A Comparative Study
Aspect-Based Sentiment Analysis Techniques: A Comparative Study
Dineth Jayakody
Koshila Isuranda
A. V. A. Malkith
Nisansa de Silva
Sachintha Rajith Ponnamperuma
G. Sandamali
K. L. Sudheera
65
1
0
03 Jul 2024
MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language
  Models
MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models
Ying Zhang
Ziheng Yang
Shufan Ji
KELM
46
1
0
03 Jul 2024
Previous
123456...697071
Next