1906.08237
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,518 papers shown
Title
Controllable Discovery of Intents: Incremental Deep Clustering Using Semi-Supervised Contrastive Learning
Mrinal Rawat
Hithesh Sankararaman
Victor Barrès
86
0
0
18 Oct 2024
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
Jiacheng Ye
Jiahui Gao
Shansan Gong
Lin Zheng
Xin Jiang
Zhiyu Li
Dianbo Sui
DiffM
LRM
180
25
0
18 Oct 2024
Rationale Behind Essay Scores: Enhancing S-LLM's Multi-Trait Essay Scoring with Rationale Generated by LLMs
SeongYeub Chu
JongWoo Kim
Bryan Wong
MunYong Yi
LRM
94
3
0
18 Oct 2024
Fine-Tuning Language Models on Multiple Datasets for Citation Intention Classification
Zeren Shui
Petros Karypis
Daniel S. Karls
Mingjian Wen
Saurav Manchanda
E. Tadmor
George Karypis
46
1
0
17 Oct 2024
The Mystery of the Pathological Path-star Task for Language Models
Arvid Frydenlund
LRM
127
4
0
17 Oct 2024
CMAL: A Novel Cross-Modal Associative Learning Framework for Vision-Language Pre-Training
Zhiyuan Ma
Jianjun Li
Guohui Li
Kaiyan Huang
VLM
120
9
0
16 Oct 2024
MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator
Taozhe Li
Wei Sun
61
0
0
14 Oct 2024
Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era of Large Language Models
S. Nigam
Aniket Deroy
Subhankar Maity
Arnab Bhattacharya
ELM
AILaw
76
6
0
14 Oct 2024
Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling
Wenze Liu
Le Zhuo
Yi Xin
Sheng Xia
Peng Gao
Xiangyu Yue
125
9
0
14 Oct 2024
COrAL: Order-Agnostic Language Modeling for Efficient Iterative Refinement
Yuxi Xie
Anirudh Goyal
Xiaobao Wu
Xunjian Yin
Xiao Xu
Min-Yen Kan
Liangming Pan
William Yang Wang
LRM
358
1
0
12 Oct 2024
Emphasis Rendering for Conversational Text-to-Speech with Multi-modal Multi-scale Context Modeling
Rui Liu
Zhenqi Jia
Jie Yang
Yifan Hu
Hong Li
98
2
0
12 Oct 2024
Text Classification using Graph Convolutional Networks: A Comprehensive Survey
Syed Mustafa Haider Rizvi
Ramsha Imran
Arif Mahmood
GNN
OOD
FaML
51
2
0
12 Oct 2024
HLM-Cite: Hybrid Language Model Workflow for Text-based Scientific Citation Prediction
Qianyue Hao
Jingyang Fan
Fengli Xu
Jian Yuan
Yong Li
67
9
0
10 Oct 2024
Chain and Causal Attention for Efficient Entity Tracking
Erwan Fagnou
Paul Caillon
Blaise Delattre
Alexandre Allauzen
92
5
0
07 Oct 2024
Investigating large language models for their competence in extracting grammatically sound sentences from transcribed noisy utterances
Alina Wróblewska
58
0
0
07 Oct 2024
Computational design of target-specific linear peptide binders with TransformerBeta
Haowen Zhao
Francesco A. Aprile
Barbara Bravi
77
0
0
07 Oct 2024
Hyper-multi-step: The Truth Behind Difficult Long-context Tasks
Yijiong Yu
Ma Xiufa
Fang Jianwei
Zhi-liang Xu
Su Guangyao
...
Zhixiao Qi
Wei Wang
Wen Liu
Ran Chen
Ji Pei
LRM
RALM
73
0
0
06 Oct 2024
Fundamental Limitations on Subquadratic Alternatives to Transformers
Josh Alman
Hantao Yu
119
4
0
05 Oct 2024
Variational Language Concepts for Interpreting Foundation Language Models
Hengyi Wang
Shiwei Tan
Zhiqing Hong
Desheng Zhang
Hao Wang
148
3
0
04 Oct 2024
Linear Transformer Topological Masking with Graph Random Features
Isaac Reid
Kumar Avinava Dubey
Deepali Jain
Will Whitney
Amr Ahmed
...
Connor Schenck
Richard E. Turner
René Wagner
Adrian Weller
Krzysztof Choromanski
87
1
0
04 Oct 2024
Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding with LLMs
Wei Wu
Chao Wang
L. Chen
Mingze Yin
Yiheng Zhu
Kun Fu
Jieping Ye
Hui Xiong
Zheng Wang
143
1
0
04 Oct 2024
Graph-tree Fusion Model with Bidirectional Information Propagation for Long Document Classification
Sudipta Singha Roy
Xindi Wang
Robert E. Mercer
Frank Rudzicz
62
0
0
03 Oct 2024
On The Adaptation of Unlimiformer for Decoder-Only Transformers
Kian Ahrabian
Alon Benhaim
Barun Patra
Jay Pujara
Saksham Singhal
Xia Song
68
0
0
02 Oct 2024
Preserving Generalization of Language models in Few-shot Continual Relation Extraction
Quyen Tran
Nguyen Xuan Thanh
Nguyen Hoang Anh
Nam Le Hai
Trung Le
Linh Van Ngo
Thien Huu Nguyen
CLL
KELM
80
7
0
01 Oct 2024
Leveraging Long-Context Large Language Models for Multi-Document Understanding and Summarization in Enterprise Applications
Aditi Godbole
Jabin Geevarghese George
Smita Shandilya
84
5
0
27 Sep 2024
Trustworthy AI: Securing Sensitive Data in Large Language Models
G. Feretzakis
V. Verykios
58
17
0
26 Sep 2024
Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions
Zeyneb N. Kaya
Souvick Ghosh
55
0
0
25 Sep 2024
The Roles of Generative Artificial Intelligence in Internet of Electric Vehicles
Hanwen Zhang
Dusit Niyato
Wei Zhang
Changyuan Zhao
Hongyang Du
Abbas Jamalipour
Sumei Sun
Yiyang Pei
AI4CE
70
2
0
24 Sep 2024
Improving Academic Skills Assessment with NLP and Ensemble Learning
Xinyi Huang
Yingyi Wu
Danyang Zhang
Jiacheng Hu
Yujian Long
49
7
0
23 Sep 2024
"I Never Said That": A dataset, taxonomy and baselines on response clarity classification
Konstantinos Thomas
Giorgos Filandrianos
Maria Lymperaiou
Chrysoula Zerva
Giorgos Stamou
58
0
0
20 Sep 2024
GAProtoNet: A Multi-head Graph Attention-based Prototypical Network for Interpretable Text Classification
Ximing Wen
Wenjuan Tan
Rosina O. Weber
89
2
0
20 Sep 2024
Incremental and Data-Efficient Concept Formation to Support Masked Word Prediction
Xin Lian
Nishant Baglodi
Christopher J. MacLellan
61
1
0
19 Sep 2024
VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer
Humen Zhong
Zhibo Yang
Zhaohai Li
Peng Wang
Jun Tang
Wenqing Cheng
Cong Yao
71
1
0
18 Sep 2024
Evaluation of pretrained language models on music understanding
Yannis Vasilakis
Rachel M. Bittner
Johan Pauwels
99
1
0
17 Sep 2024
OneEncoder: A Lightweight Framework for Progressive Alignment of Modalities
Bilal Faye
Hanane Azzag
M. Lebbah
ObjD
105
0
0
17 Sep 2024
BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation
Seyed Rohollah Hosseyni
Ali Ahmad Rahmani
S. J. Seyedmohammadi
Sanaz Seyedin
Arash Mohammadi
DiffM
93
7
0
17 Sep 2024
Language Models Learn Metadata: Political Stance Detection Case Study
Stanley Cao
Felix Drinkall
51
0
0
15 Sep 2024
AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs
Madhusudan Ghosh
Shrimon Mukherjee
Asmit Ganguly
Partha Basuchowdhuri
S. Naskar
Debasis Ganguly
99
8
0
15 Sep 2024
Synthetic4Health: Generating Annotated Synthetic Clinical Letters
Libo Ren
Samuel Belkadi
Lifeng Han
Warren Del-Pinto
Goran Nenadic
SyDa
57
2
0
14 Sep 2024
Layerwise Change of Knowledge in Neural Networks
Xu Cheng
Lei Cheng
Zhaoran Peng
Yang Xu
Tian Han
Quanshi Zhang
KELM
FAtt
74
5
0
13 Sep 2024
TheraGen: Therapy for Every Generation
Kartikey Doshi
Jimit Shah
Narendra Shekokar
AI4MH
53
0
0
12 Sep 2024
Multimodal Emotion Recognition with Vision-language Prompting and Modality Dropout
Anbin QI
Zhongliang Liu
Xinyong Zhou
Jinba Xiao
Fengrun Zhang
Qi Gan
Ming Tao
Gaozheng Zhang
Lu Zhang
VLM
48
2
0
11 Sep 2024
DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models
Maryam Akhavan Aghdam
Hongpeng Jin
Yanzhao Wu
MoE
63
3
0
10 Sep 2024
Expanding Expressivity in Transformer Models with MöbiusAttention
Anna-Maria Halacheva
M. Nayyeri
Steffen Staab
78
1
0
08 Sep 2024
An overview of domain-specific foundation model: key technologies, applications and challenges
Haolong Chen
Hanzhi Chen
Zijian Zhao
Kaifeng Han
Guangxu Zhu
Yichen Zhao
Ying Du
Wei Xu
Qingjiang Shi
ALM
VLM
111
5
0
06 Sep 2024
Revolutionizing Database Q&A with Large Language Models: Comprehensive Benchmark and Evaluation
Yihang Zheng
Yue Liu
Zhenghao Lin
Yi Luo
Xuanhe Zhou
Chen Lin
Jinsong Su
Guoliang Li
Shifu Li
ELM
103
2
0
05 Sep 2024
Dreaming is All You Need
Mingze Ni
Wei Liu
53
0
0
03 Sep 2024
Pre-Trained Language Models for Keyphrase Prediction: A Review
Muhammad Umair
Tangina Sultana
Young-Koo Lee
80
4
0
02 Sep 2024
Hound: Hunting Supervision Signals for Few and Zero Shot Node Classification on Text-attributed Graph
Yuxiang Wang
Xiao Yan
Shiyu Jin
Quanqing Xu
Chuanhui Yang
Yuanyuan Zhu
Chuang Hu
Bo Du
Jiawei Jiang
VLM
61
0
0
01 Sep 2024
EMP: Enhance Memory in Data Pruning
Jinying Xiao
Ping Li
Jie Nie
Zhe Tang
VLM
94
0
0
28 Aug 2024