Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
v1
v2 (latest)
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,518 papers shown
Title
A Depression Detection Method Based on Multi-Modal Feature Fusion Using Cross-Attention
Shengjie Li
Yinhao Xiao
69
1
0
02 Jul 2024
Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining
Qi Zhang
Tianqi Du
Haotian Huang
Yifei Wang
Yisen Wang
71
5
0
01 Jul 2024
Large Language Model Enhanced Knowledge Representation Learning: A Survey
Xin Wang
Zirui Chen
Haofen Wang
Leong Hou U
Zhao Li
Wenbin Guo
KELM
215
3
0
01 Jul 2024
A Comparative Study of Quality Evaluation Methods for Text Summarization
Huyen Nguyen
Haihua Chen
Lavanya Pobbathi
Junhua Ding
ELM
83
6
0
30 Jun 2024
LegalTurk Optimized BERT for Multi-Label Text Classification and NER
Farnaz Zeidi
Mehmet Fatih Amasyali
Çiğdem Erol
VLM
58
2
0
30 Jun 2024
When Search Engine Services meet Large Language Models: Visions and Challenges
Haoyi Xiong
Jiang Bian
Yuchen Li
Xuhong Li
Jundong Li
Shuaiqiang Wang
Dawei Yin
Sumi Helal
135
36
0
28 Jun 2024
The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning
Shaobo Cui
Zhijing Jin
Bernhard Schölkopf
Boi Faltings
CML
LRM
87
4
0
27 Jun 2024
Unveiling and Controlling Anomalous Attention Distribution in Transformers
Ruiqing Yan
Xingbo Du
Haoyu Deng
Linghan Zheng
Qiuzhuang Sun
Jifang Hu
Yuhang Shao
Penghao Jiang
Jinrong Jiang
Lian Zhao
62
1
0
26 Jun 2024
Zero-shot prompt-based classification: topic labeling in times of foundation models in German Tweets
Simon Münker
Kai Kugler
Achim Rettinger
VLM
68
1
0
26 Jun 2024
PharmaGPT: Domain-Specific Large Language Models for Bio-Pharmaceutical and Chemistry
Linqing Chen
Weilei Wang
Zilong Bai
Peng Xu
Yan Fang
...
Lisha Zhang
Fu Bian
Zhongkai Ye
Lidong Pei
Changyang Tu
AI4MH
LM&MA
107
3
0
26 Jun 2024
ViANLI: Adversarial Natural Language Inference for Vietnamese
Tin Van Huynh
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
64
0
0
25 Jun 2024
Knowledge Distillation in Automated Annotation: Supervised Text Classification with LLM-Generated Training Labels
Nicholas Pangakis
Samuel Wolken
88
19
0
25 Jun 2024
Evaluation of Language Models in the Medical Context Under Resource-Constrained Settings
Andrea Posada
Daniel Rueckert
Felix Meissen
Philip Muller
LM&MA
ELM
58
0
0
24 Jun 2024
SyROCCo: Enhancing Systematic Reviews using Machine Learning
Zheng Fang
Miguel Arana Catania
Felix-Anselm van Lier
Juliana Outes Velarde
Harry Bregazzi
Mara Airoldi
Eleanor Carter
Rob Procter
34
0
0
24 Jun 2024
Deepfake tweets automatic detection
Adam Frej
Adrian Kaminski
Piotr Marciniak
Szymon Szmajdzinski
Soveatin Kuntur
Anna Wroblewska
43
0
0
24 Jun 2024
Evaluating the Effectiveness of the Foundational Models for Q&A Classification in Mental Health care
Hassan Alhuzali
Ashwag Alasmari
AI4MH
79
2
0
23 Jun 2024
Text Serialization and Their Relationship with the Conventional Paradigms of Tabular Machine Learning
Kyoka Ono
Simon A. Lee
LMTD
52
8
0
19 Jun 2024
A Primal-Dual Framework for Transformers and Neural Networks
Tan M. Nguyen
Tam Nguyen
Nhat Ho
Andrea L. Bertozzi
Richard G. Baraniuk
Stanley J. Osher
ViT
70
14
0
19 Jun 2024
Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis
R. Teo
Tan M. Nguyen
91
4
0
19 Jun 2024
Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language Understanding
Ukyo Honda
Tatsushi Oka
Peinan Zhang
Masato Mita
90
1
0
17 Jun 2024
WellDunn: On the Robustness and Explainability of Language Models and Large Language Models in Identifying Wellness Dimensions
Seyedali Mohammadi
Edward Raff
Jinendra Malekar
Vedant Palit
Francis Ferraro
Manas Gaur
AI4MH
92
3
0
17 Jun 2024
GECOBench: A Gender-Controlled Text Dataset and Benchmark for Quantifying Biases in Explanations
Rick Wilming
Artur Dox
Hjalmar Schulz
Marta Oliveira
Benedict Clark
Stefan Haufe
102
2
0
17 Jun 2024
A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences
Leonardo Bertolazzi
Albert Gatt
Raffaella Bernardi
LRM
ELM
39
6
0
17 Jun 2024
HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model
Di Wang
Meiqi Hu
Yao Jin
Yuchun Miao
Jiaqi Yang
...
Lefei Zhang
Chen Wu
Di Lin
Dacheng Tao
Liangpei Zhang
164
27
0
17 Jun 2024
Improving Large Models with Small models: Lower Costs and Better Performance
Dong Chen
Shuo Zhang
Yueting Zhuang
Siliang Tang
Qidong Liu
Hua Wang
Mingliang Xu
96
6
0
15 Jun 2024
Exploring the Correlation between Human and Machine Evaluation of Simultaneous Speech Translation
Xiaoman Wang
Claudio Fantinuoli
46
1
0
14 Jun 2024
Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification
Martin Juan José Bucher
Marco Martini
ALM
AI4MH
125
36
0
12 Jun 2024
Adversarial Evasion Attack Efficiency against Large Language Models
João Vitorino
Eva Maia
Isabel Praça
AAML
72
2
0
12 Jun 2024
Defining and Detecting Vulnerability in Human Evaluation Guidelines: A Preliminary Study Towards Reliable NLG Evaluation
Jie Ruan
Wenqing Wang
Xiaojun Wan
AAML
ELM
80
6
0
12 Jun 2024
To be Continuous, or to be Discrete, Those are Bits of Questions
Yiran Wang
Masao Utiyama
80
4
0
12 Jun 2024
Autoregressive Pretraining with Mamba in Vision
Sucheng Ren
Xianhang Li
Haoqin Tu
Feng Wang
Fangxun Shu
...
L. Yang
Peng Wang
Heng Wang
Alan Yuille
Cihang Xie
Mamba
125
12
0
11 Jun 2024
Visual Representation Learning with Stochastic Frame Prediction
Huiwon Jang
Dongyoung Kim
Junsu Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
90
3
0
11 Jun 2024
Leveraging Large Language Models for Efficient Failure Analysis in Game Development
Leonardo Marini
Linus Gisslén
Alessandro Sestini
101
0
0
11 Jun 2024
COVID-19 Twitter Sentiment Classification Using Hybrid Deep Learning Model Based on Grid Search Methodology
Jitendra Tembhurne
Anant Agrawal
Kirtan Lakhotia
71
0
0
11 Jun 2024
The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More
O. Kitouni
Niklas Nolte
Diane Bouchacourt
Adina Williams
Mike Rabbat
Mark Ibrahim
LRM
CLL
101
12
0
07 Jun 2024
Creating an AI Observer: Generative Semantic Workspaces
Pavan Holur
Shreyas Rajesh
David Chong
V. Roychowdhury
40
0
0
07 Jun 2024
Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI Interpretation in Indian Courts
S. Nigam
Anurag Sharma
Danush Khanna
Noel Shallum
Kripabandhu Ghosh
Arnab Bhattacharya
ELM
AILaw
77
9
0
06 Jun 2024
Pre-trained Transformer Uncovers Meaningful Patterns in Human Mobility Data
Alameen Najjar
71
0
0
06 Jun 2024
Wings: Learning Multimodal LLMs without Text-only Forgetting
Yi-Kai Zhang
Shiyin Lu
Yang Li
Yanqing Ma
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
De-Chuan Zhan
Han-Jia Ye
VLM
128
10
0
05 Jun 2024
RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization
Jinge Wu
Abul Hasan
Honghan Wu
33
1
0
05 Jun 2024
Measure-Observe-Remeasure: An Interactive Paradigm for Differentially-Private Exploratory Analysis
Priyanka Nanayakkara
Hyeok Kim
Yifan Wu
Ali Sarvghad
Narges Mahyar
G. Miklau
Jessica Hullman
74
20
0
04 Jun 2024
Focus on the Core: Efficient Attention via Pruned Token Compression for Document Classification
Jungmin Yun
Mihyeon Kim
Youngbin Kim
113
9
0
03 Jun 2024
Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for Accurate Natural Language Task Modeling
Wrick Talukdar
Anjanava Biswas
52
5
0
03 Jun 2024
GAMedX: Generative AI-based Medical Entity Data Extractor Using Large Language Models
Mohammed-Khalil Ghali
Abdelrahman Farrag
Hajar Sakai
Hicham El Baz
Yu Jin
Sarah Lam
LM&MA
MedIm
81
9
0
31 May 2024
Ensemble Model With Bert,Roberta and Xlnet For Molecular property prediction
Junling Hu
72
1
0
30 May 2024
PathReasoner: Modeling Reasoning Path with Equivalent Extension for Logical Question Answering
Fangzhi Xu
Qika Lin
Tianzhe Zhao
Jiawei Han
Jun Liu
LRM
63
1
0
29 May 2024
Arithmetic Reasoning with LLM: Prolog Generation & Permutation
Xiaocheng Yang
Bingsen Chen
Yik-Cheung Tam
LRM
91
12
0
28 May 2024
FAIIR: Building Toward A Conversational AI Agent Assistant for Youth Mental Health Service Provision
Stephen Obadinma
Alia Lachana
M. Norman
Jocelyn Rankin
Joanna Yu
Xiaodan Zhu
Darren Mastropaolo
D. Pandya
Roxana Sultan
Elham Dolatabadi
AI4MH
115
1
0
28 May 2024
InversionView: A General-Purpose Method for Reading Information from Neural Activations
Xinting Huang
Madhur Panwar
Navin Goyal
Michael Hahn
98
5
0
27 May 2024
WirelessLLM: Empowering Large Language Models Towards Wireless Intelligence
Jiawei Shao
Jingwen Tong
Qiong Wu
Wei Guo
Zijian Li
Zehong Lin
Jun Zhang
92
40
0
27 May 2024
Previous
1
2
3
...
5
6
7
...
69
70
71
Next