1906.08237
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,518 papers shown
Title
A Hybrid DeBERTa and Gated Broad Learning System for Cyberbullying Detection in English Text
Devesh Kumar
17
0
0
19 Jun 2025
GeoGuess: Multimodal Reasoning based on Hierarchy of Visual Information in Street View
Fenghua Cheng
Jinxiang Wang
Sen Wang
Zi Huang
Xue Li
LRM
19
0
0
19 Jun 2025
CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model
Jiangtong Li
Yiyun Zhu
Dawei Cheng
Zhijun Ding
Changjun Jiang
25
0
0
16 Jun 2025
Assessing the Limits of In-Context Learning beyond Functions using Partially Ordered Relation
Debanjan Dutta
Faizanuddin Ansari
Swagatam Das
20
0
0
16 Jun 2025
INTERPOS: Interaction Rhythm Guided Positional Morphing for Mobile App Recommender Systems
M. H. Maqbool
Moghis Fereidouni
Umar Farooq
A.B. Siddique
H. Foroosh
AI4TS
15
0
0
14 Jun 2025
Multimodal Representation Alignment for Cross-modal Information Retrieval
Fan Xu
Luis A. Leiva
17
0
0
10 Jun 2025
Edit Flows: Flow Matching with Edit Operations
Marton Havasi
Brian Karrer
Itai Gat
Ricky T. Q. Chen
BDL
37
0
0
10 Jun 2025
Label-semantics Aware Generative Approach for Domain-Agnostic Multilabel Classification
Subhendu Khatuya
Shashwat Naidu
Saptarshi Ghosh
Pawan Goyal
Niloy Ganguly
VLM
25
0
0
07 Jun 2025
RecGPT: A Foundation Model for Sequential Recommendation
Yangqin Jiang
Xubin Ren
Lianghao Xia
Da Luo
Kangyi Lin
Chao Huang
LRM
107
0
0
06 Jun 2025
Corrector Sampling in Language Models
Itai Gat
Neta Shaul
Uriel Singer
Y. Lipman
KELM
AI4TS
40
0
0
06 Jun 2025
SoK: Are Watermarks in LLMs Ready for Deployment?
Kieu Dang
Phung Lai
Nhathai Phan
Yelong Shen
Ruoming Jin
Abdallah Khreishah
My T. Thai
32
0
0
05 Jun 2025
A MISMATCHED Benchmark for Scientific Natural Language Inference
Firoz Shaik
Mobashir Sadat
Nikita Gautam
Doina Caragea
Cornelia Caragea
77
0
0
05 Jun 2025
HACo-Det: A Study Towards Fine-Grained Machine-Generated Text Detection under Human-AI Coauthoring
Zhixiong Su
Yichen Wang
Herun Wan
Zhaohan Zhang
Minnan Luo
DeLMO
57
0
0
03 Jun 2025
Improving QA Efficiency with DistilBERT: Fine-Tuning and Inference on mobile Intel CPUs
Ngeyen Yinkfu
12
0
0
28 May 2025
Xinyu AI Search: Enhanced Relevance and Comprehensive Results with Rich Answer Presentations
Bo Tang
Junyi Zhu
Chenyang Xi
Yunhang Ge
Jiahao Wu
...
Yebin Yang
Jiajia Wang
Zhiyu Li
Feiyu Xiong
Jingrun Chen
50
0
0
28 May 2025
Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems
Christopher Ormerod
25
0
0
28 May 2025
MultiPhishGuard: An LLM-based Multi-Agent System for Phishing Email Detection
Yinuo Xue
Eric Spero
Yun Sing Koh
Giovanni Russello
AAML
26
1
0
26 May 2025
Multi-Party Conversational Agents: A Survey
Sagar Sapkota
M. Hasan
Mubarak Shah
Santu Karmaker
LLMAG
69
0
0
24 May 2025
A Position Paper on the Automatic Generation of Machine Learning Leaderboards
Roelien C Timmer
Yufang Hou
Stephen Wan
227
0
0
23 May 2025
Large Language Models and Their Applications in Roadway Safety and Mobility Enhancement: A Comprehensive Review
Muhammad Monjurul Karim
Yan Shi
Shucheng Zhang
Bingzhang Wang
Mehrdad Nasri
Yinhai Wang
26
0
0
19 May 2025
Spatial-LLaVA: Enhancing Large Language Models with Spatial Referring Expressions for Visual Understanding
Xuefei Sun
Doncey Albin
Cecilia Mauceri
Dusty Woods
Christoffer Heckman
LRM
46
0
0
18 May 2025
Class Distillation with Mahalanobis Contrast: An Efficient Training Paradigm for Pragmatic Language Understanding Tasks
Chenlu Wang
Weimin Lyu
Ritwik Banerjee
70
0
0
17 May 2025
Hierarchical Bracketing Encodings for Dependency Parsing as Tagging
Ana Ezquerro
David Vilares
Anssi Yli-Jyrä
Carlos Gómez-Rodríguez
128
0
0
16 May 2025
An empirical study of task and feature correlations in the reuse of pre-trained models
Jama Hussein Mohamud
17
0
0
15 May 2025
Multi-Token Prediction Needs Registers
Anastasios Gerontopoulos
Spyros Gidaris
N. Komodakis
115
0
0
15 May 2025
Structural-Temporal Coupling Anomaly Detection with Dynamic Graph Transformer
Chang Zong
Yueting Zhuang
Jian Shao
Weiming Lu
89
0
0
13 May 2025
Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies
Xiaoliang Luo
Xinyi Xu
Michael Ramscar
Bradley C. Love
71
0
0
13 May 2025
Boosting Neural Language Inference via Cascaded Interactive Reasoning
Min Li
Chun Yuan
ReLM
LRM
65
0
0
10 May 2025
I Know What You Said: Unveiling Hardware Cache Side-Channels in Local Large Language Model Inference
Zibo Gao
Junjie Hu
Feng Guo
Yixin Zhang
Yinglong Han
Siyuan Liu
Haiyang Li
Zhiqiang Lv
103
0
0
10 May 2025
Insertion Language Models: Sequence Generation with Arbitrary-Position Insertions
Dhruvesh Patel
Aishwarya Sahoo
Avinash Amballa
Tahira Naseem
Tim G. J. Rudner
Andrew McCallum
KELM
130
0
0
09 May 2025
An Adaptive Data-Resilient Multi-Modal Framework for Hierarchical Multi-Label Book Genre Identification
Utsav Nareti
S. Chattopadhyay
Prolay Mallick
Suraj Kumar
Ayush Vikas Daga
Chandranath Adak
Adarsh Wase
Arjab Roy
161
1
0
05 May 2025
A Character-based Diffusion Embedding Algorithm for Enhancing the Generation Quality of Generative Linguistic Steganographic Texts
Yingquan Chen
Qianmu Li
Xiaocong Wu
Huifeng Li
Qing Chang
DiffM
114
0
0
02 May 2025
Bridging Cognition and Emotion: Empathy-Driven Multimodal Misinformation Detection
Zihan Wang
Lu Yuan
Zhengxuan Zhang
Qing Zhao
48
1
0
24 Apr 2025
HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language Models
Junxuan Zhang
Jiadong Wang
Haoyang Li
Lidan Shou
Ke Chen
Gang Chen
Qin Xie
Guiming Xie
Xuejian Gong
44
0
0
24 Apr 2025
The Ultimate Cookbook for Invisible Poison: Crafting Subtle Clean-Label Text Backdoors with Style Attributes
Wencong You
Daniel Lowd
94
0
0
24 Apr 2025
RAGAT-Mind: A Multi-Granular Modeling Approach for Rumor Detection Based on MindSpore
Zhenkai Qin
Guifang Yang
Dongze Wu
MoE
80
0
0
24 Apr 2025
Distilling semantically aware orders for autoregressive image generation
Rishav Pramanik
Antoine Poupon
Juan A. Rodriguez
Masih Aminbeidokhti
David Vazquez
Christopher Pal
Zhaozheng Yin
M. Pedersoli
78
0
0
23 Apr 2025
Sentiment Analysis in Software Engineering: Evaluating Generative Pre-trained Transformers
KM Khalid Saifullah
Faiaz Azmain
Habiba Hye
47
0
0
22 Apr 2025
VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform
Xingyu Lu
Tianke Zhang
Chang Meng
Xinyu Wang
Jinpeng Wang
...
Hai-Tao Zheng
Fan Yang
Yan Li
Di Zhang
Kun Gai
OffRL
87
0
0
21 Apr 2025
Q-FAKER: Query-free Hard Black-box Attack via Controlled Generation
CheolWon Na
YunSeok Choi
Jee-Hyong Lee
AAML
71
0
0
18 Apr 2025
Transformers Can Overcome the Curse of Dimensionality: A Theoretical Study from an Approximation Perspective
Yuling Jiao
Yanming Lai
Yang Wang
Bokai Yan
62
0
0
18 Apr 2025
You Don't Need All Attentions: Distributed Dynamic Fine-Tuning for Foundation Models
Shiwei Ding
Lan Zhang
Zhenlin Wang
Giuseppe Ateniese
Xiaoyong Yuan
70
0
0
16 Apr 2025
Looking beyond the next token
Abitha Thankaraj
Yiding Jiang
J. Zico Kolter
Yonatan Bisk
LRM
128
1
0
15 Apr 2025
C-MTCSD: A Chinese Multi-Turn Conversational Stance Detection Dataset
Fuqiang Niu
Yue Yang
Xianghua Fu
Genan Dai
Bowen Zhang
120
1
0
14 Apr 2025
Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data
Shuai Zhao
Linchao Zhu
Yi Yang
93
3
0
14 Apr 2025
Confidence Regularized Masked Language Modeling using Text Length
Seunghyun Ji
Soowon Lee
214
0
0
08 Apr 2025
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling
Hengran Zhang
Keping Bi
Jiafeng Guo
Xiaojie Sun
Shihao Liu
Daiting Shi
Dawei Yin
Xueqi Cheng
RALM
479
0
0
07 Apr 2025
SapiensID: Foundation for Human Recognition
Minchul Kim
Dingqiang Ye
Yiyang Su
Feng Liu
Xiaoming Liu
CVBM
VLM
89
1
0
07 Apr 2025
TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context
S. Nigam
Balaramamahanthi Deepak Patnaik
Shivam Mishra
Noel Shallum
Kripabandhu Ghosh
Arnab Bhattacharya
AILaw
ELM
109
0
0
07 Apr 2025
Pyramid-based Mamba Multi-class Unsupervised Anomaly Detection
Nasar Iqbal
Niki Martinel
Mamba
79
1
0
04 Apr 2025