ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,518 papers shown
Title
DEUCE: Dual-diversity Enhancement and Uncertainty-awareness for Cold-start Active Learning
DEUCE: Dual-diversity Enhancement and Uncertainty-awareness for Cold-start Active Learning
Jiaxin Guo
Cheng Chen
Shuzhen Li
Tianze Zhang
144
0
0
01 Feb 2025
Adversarial Attacks on AI-Generated Text Detection Models: A Token Probability-Based Approach Using Embeddings
Adversarial Attacks on AI-Generated Text Detection Models: A Token Probability-Based Approach Using Embeddings
Ahmed K. Kadhim
Lei Jiao
Rishad Shafik
Ole-Christoffer Granmo
DeLMO
175
1
0
31 Jan 2025
Detecting harassment and defamation in cyberbullying with emotion-adaptive training
Detecting harassment and defamation in cyberbullying with emotion-adaptive training
Peiling Yi
A. Zubiaga
Yunfei Long
169
0
0
28 Jan 2025
Optimizing Sentence Embedding with Pseudo-Labeling and Model Ensembles: A Hierarchical Framework for Enhanced NLP Tasks
Ziwei Liu
Qi Zhang
Lifu Gao
86
0
0
28 Jan 2025
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data
P. Tiwald
Ivona Krchova
Andrey Sidorenko
Mariana Vargas-Vieyra
Mario Scriminaci
Michael Platzer
138
3
0
21 Jan 2025
Prediction-Assisted Online Distributed Deep Learning Workload Scheduling in GPU Clusters
Prediction-Assisted Online Distributed Deep Learning Workload Scheduling in GPU Clusters
Ziyue Luo
Jia-Wei Liu
Myungjin Lee
Ness B. Shroff
79
0
0
09 Jan 2025
Trust Modeling in Counseling Conversations: A Benchmark Study
Aseem Srivastava
Zuhair Hasan Shaik
Tanmoy Chakraborty
Md. Shad Akhtar
82
0
0
06 Jan 2025
Swift Cross-Dataset Pruning: Enhancing Fine-Tuning Efficiency in Natural Language Understanding
Swift Cross-Dataset Pruning: Enhancing Fine-Tuning Efficiency in Natural Language Understanding
Binh-Nguyen Nguyen
Yang He
116
1
0
05 Jan 2025
TED: Turn Emphasis with Dialogue Feature Attention for Emotion Recognition in Conversation
Junya Ono
Hiromi Wakaki
88
0
0
03 Jan 2025
Efficient support ticket resolution using Knowledge Graphs
Sherwin Varghese
James Tian
57
0
0
03 Jan 2025
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
Hanguang Xiao
Feizhong Zhou
Xianglong Liu
Tianqi Liu
Zhipeng Li
Xin Liu
Xiaoxuan Huang
AILawLM&MALRM
145
30
0
31 Dec 2024
Context-Aware Deep Learning for Multi Modal Depression Detection
Context-Aware Deep Learning for Multi Modal Depression Detection
Genevieve Lam
Huang Dongyan
Weisi Lin
81
0
0
26 Dec 2024
Double Landmines: Invisible Textual Backdoor Attacks based on Dual-Trigger
Double Landmines: Invisible Textual Backdoor Attacks based on Dual-Trigger
Yang Hou
Qiuling Yue
Lujia Chai
Guozhao Liao
Wenbao Han
Wei Ou
86
0
0
23 Dec 2024
ImagePiece: Content-aware Re-tokenization for Efficient Image
  Recognition
ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition
Seungdong Yoa
Seungjun Lee
Hyeseung Cho
Bumsoo Kim
Woohyung Lim
ViT
106
0
0
21 Dec 2024
Automated CVE Analysis: Harnessing Machine Learning In Designing
  Question-Answering Models For Cybersecurity Information Extraction
Automated CVE Analysis: Harnessing Machine Learning In Designing Question-Answering Models For Cybersecurity Information Extraction
Tanjim Bin Faruk
81
0
0
21 Dec 2024
Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental
  Health
Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental Health
Vivek Kumar
Eirini Ntoutsi
Pushpraj Singh Rajawat
Giacomo Medda
Diego Reforgiato Recupero
AI4MH
112
1
0
17 Dec 2024
Multi-Head Encoding for Extreme Label Classification
Multi-Head Encoding for Extreme Label Classification
Daojun Liang
Haixia Zhang
Dongfeng Yuan
Minggao Zhang
113
0
0
13 Dec 2024
TECO: Improving Multimodal Intent Recognition with Text Enhancement
  through Commonsense Knowledge Extraction
TECO: Improving Multimodal Intent Recognition with Text Enhancement through Commonsense Knowledge Extraction
Quynh-Mai Thi Nguyen
Lan-Nhi Thi Nguyen
Cam-Van Thi Nguyen
74
0
0
11 Dec 2024
Comateformer: Combined Attention Transformer for Semantic Sentence
  Matching
Comateformer: Combined Attention Transformer for Semantic Sentence Matching
Bo Li
Di Liang
Zixin Zhang
104
2
0
10 Dec 2024
A Review of Human Emotion Synthesis Based on Generative Technology
A Review of Human Emotion Synthesis Based on Generative Technology
Fei Ma
Yongqian Li
Yifan Xie
Y. He
Yize Zhang
...
Z. Liu
Wei Yao
Fuji Ren
Fei Richard Yu
Shiguang Ni
115
2
0
10 Dec 2024
Investigating Acoustic-Textual Emotional Inconsistency Information for
  Automatic Depression Detection
Investigating Acoustic-Textual Emotional Inconsistency Information for Automatic Depression Detection
Rongfeng Su
Changqing Xu
Xinyi Wu
Feng Xu
Xie Chen
Lan Wangt
Nan Yan
92
0
0
09 Dec 2024
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
Ziqi Pang
Tianyuan Zhang
Fujun Luan
Yunze Man
Hao Tan
Kai Zhang
William T. Freeman
Yu-Xiong Wang
VGen
135
20
0
02 Dec 2024
Impromptu Cybercrime Euphemism Detection
Impromptu Cybercrime Euphemism Detection
Xiang Li
Yimiao Zhou
Laiping Zhao
Jing Li
Fengyuan Liu
132
2
0
02 Dec 2024
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
Kaixin Wu
Yixin Ji
Ziyang Chen
Qiang Wang
Cunxiang Wang
...
Jia Xu
Zhongyi Liu
Jinjie Gu
Yuan Zhou
Linjian Mo
KELMCLL
251
0
0
02 Dec 2024
Generative Language Models Potential for Requirement Engineering
  Applications: Insights into Current Strengths and Limitations
Generative Language Models Potential for Requirement Engineering Applications: Insights into Current Strengths and Limitations
Summra Saleem
Muhammad Nabeel Asim
L. V. Elst
Andreas Dengel
109
0
0
01 Dec 2024
Can bidirectional encoder become the ultimate winner for downstream
  applications of foundation models?
Can bidirectional encoder become the ultimate winner for downstream applications of foundation models?
Lewen Yang
Xuanyu Zhou
Juao Fan
Xinyi Xie
Shengxin Zhu
AI4CE
123
0
0
27 Nov 2024
What Differentiates Educational Literature? A Multimodal Fusion Approach
  of Transformers and Computational Linguistics
What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics
Jordan J. Bird
121
0
0
26 Nov 2024
MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language
  Model
MolMetaLM: a Physicochemical Knowledge-Guided Molecular Meta Language Model
Yifan Wu
Min Zeng
Yang Li
Yize Zhang
Min Li
154
1
0
23 Nov 2024
Forecasting Future International Events: A Reliable Dataset for
  Text-Based Event Modeling
Forecasting Future International Events: A Reliable Dataset for Text-Based Event Modeling
Daehoon Gwak
Junwoo Park
Minho Park
C. Park
Hyunchan Lee
E. Choi
Jaegul Choo
110
1
0
21 Nov 2024
Hysteresis Activation Function for Efficient Inference
Hysteresis Activation Function for Efficient Inference
Moshe Kimhi
Idan Kashani
A. Mendelson
Chaim Baskin
LLMSV
122
0
0
15 Nov 2024
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic
  Survey
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey
Longxuan Ma
Mingda Li
Weinan Zhang
Jiapeng Li
Ting Liu
124
17
0
14 Nov 2024
Multi-head Span-based Detector for AI-generated Fragments in Scientific
  Papers
Multi-head Span-based Detector for AI-generated Fragments in Scientific Papers
German Gritsai
Ildar Khabutdinov
Andrey Grabovoy
DeLMO
102
4
0
11 Nov 2024
TrajGPT: Controlled Synthetic Trajectory Generation Using a Multitask
  Transformer-Based Spatiotemporal Model
TrajGPT: Controlled Synthetic Trajectory Generation Using a Multitask Transformer-Based Spatiotemporal Model
Shang-Ling Hsu
Emmanuel Tung
John Krumm
Cyrus Shahabi
Khurram Hassan-Shafique
43
4
0
07 Nov 2024
Performance-Guided LLM Knowledge Distillation for Efficient Text
  Classification at Scale
Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale
Flavio Di Palo
Prateek Singhi
Bilal Fadlallah
35
4
0
07 Nov 2024
Pseudo-labeling with Keyword Refining for Few-Supervised Video
  Captioning
Pseudo-labeling with Keyword Refining for Few-Supervised Video Captioning
Ping Li
Tao Wang
Xinkui Zhao
Xianghua Xu
Mingli Song
71
4
0
06 Nov 2024
A Library Perspective on Supervised Text Processing in Digital
  Libraries: An Investigation in the Biomedical Domain
A Library Perspective on Supervised Text Processing in Digital Libraries: An Investigation in the Biomedical Domain
H. Kroll
Pascal Sackhoff
Bill Matthias Thang
Maha Ksouri
Wolf-Tilo Balke
94
0
0
06 Nov 2024
Trustworthy Federated Learning: Privacy, Security, and Beyond
Trustworthy Federated Learning: Privacy, Security, and Beyond
Chunlu Chen
Ji Liu
Haowen Tan
Xingjian Li
Kevin I-Kai Wang
Peng Li
Kouichi Sakurai
Dejing Dou
FedML
105
11
0
03 Nov 2024
Randomized Autoregressive Visual Generation
Randomized Autoregressive Visual Generation
Qihang Yu
Ju He
XueQing Deng
Xiaohui Shen
Liang-Chieh Chen
VGenDiffM
138
40
1
01 Nov 2024
GigaCheck: Detecting LLM-generated Content
GigaCheck: Detecting LLM-generated Content
Irina Tolstykh
Aleksandra Tsybina
Sergey Yakubson
Aleksandr Gordeev
Vladimir Dokholyan
Maksim Kuprashevich
DeLMO
86
2
0
31 Oct 2024
Bonafide at LegalLens 2024 Shared Task: Using Lightweight DeBERTa Based
  Encoder For Legal Violation Detection and Resolution
Bonafide at LegalLens 2024 Shared Task: Using Lightweight DeBERTa Based Encoder For Legal Violation Detection and Resolution
Shikha Bordia
AILaw
104
0
0
30 Oct 2024
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive
  Learning
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning
Xun Guo
Shan Zhang
Yongxin He
Ting Zhang
Wanquan Feng
Haibin Huang
Chongyang Ma
DeLMO
86
10
0
28 Oct 2024
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
Beyond Autoregression: Fast LLMs via Self-Distillation Through Time
Justin Deschenaux
Çağlar Gülçehre
131
5
0
28 Oct 2024
Uncovering Capabilities of Model Pruning in Graph Contrastive Learning
Uncovering Capabilities of Model Pruning in Graph Contrastive Learning
Wu Junran
Chen Xueyuan
Li Shangzhe
95
1
0
27 Oct 2024
Deep Insights into Cognitive Decline: A Survey of Leveraging
  Non-Intrusive Modalities with Deep Learning Techniques
Deep Insights into Cognitive Decline: A Survey of Leveraging Non-Intrusive Modalities with Deep Learning Techniques
David Ortiz-Perez
Manuel Benavent-Lledo
José García Rodríguez
David Tomás
M. Flores Vizcaya-Moreno
69
1
0
24 Oct 2024
Building Dialogue Understanding Models for Low-resource Language
  Indonesian from Scratch
Building Dialogue Understanding Models for Low-resource Language Indonesian from Scratch
Donglin Di
Weinan Zhang
Yue Zhang
Fanglin Wang
87
1
0
24 Oct 2024
Dependency Graph Parsing as Sequence Labeling
Dependency Graph Parsing as Sequence Labeling
Ana Ezquerro
David Vilares
Carlos Gómez-Rodríguez
43
0
0
23 Oct 2024
Future Token Prediction -- Causal Language Modelling with Per-Token
  Semantic State Vector for Multi-Token Prediction
Future Token Prediction -- Causal Language Modelling with Per-Token Semantic State Vector for Multi-Token Prediction
Nicholas Walker
63
0
0
23 Oct 2024
BadFair: Backdoored Fairness Attacks with Group-conditioned Triggers
BadFair: Backdoored Fairness Attacks with Group-conditioned Triggers
Jiaqi Xue
Qian Lou
Mengxin Zheng
77
1
0
23 Oct 2024
Multi-head Sequence Tagging Model for Grammatical Error Correction
Multi-head Sequence Tagging Model for Grammatical Error Correction
Kamal Al-Sabahi
Kang Yang
Wangwang Liu
Guanyu Jiang
Xian Li
Ming Yang
62
2
0
21 Oct 2024
Evaluation Of P300 Speller Performance Using Large Language Models Along
  With Cross-Subject Training
Evaluation Of P300 Speller Performance Using Large Language Models Along With Cross-Subject Training
Nithin Parthasarathy
J. Soetedjo
S. Panchavati
Nitya Parthasarathy
C. Arnold
N. Pouratian
W. Speier
25
0
0
19 Oct 2024
Previous
123456...697071
Next