ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,518 papers shown
Title
A Depression Detection Method Based on Multi-Modal Feature Fusion Using
  Cross-Attention
A Depression Detection Method Based on Multi-Modal Feature Fusion Using Cross-Attention
Shengjie Li
Yinhao Xiao
69
1
0
02 Jul 2024
Look Ahead or Look Around? A Theoretical Comparison Between
  Autoregressive and Masked Pretraining
Look Ahead or Look Around? A Theoretical Comparison Between Autoregressive and Masked Pretraining
Qi Zhang
Tianqi Du
Haotian Huang
Yifei Wang
Yisen Wang
71
5
0
01 Jul 2024
Large Language Model Enhanced Knowledge Representation Learning: A Survey
Large Language Model Enhanced Knowledge Representation Learning: A Survey
Xin Wang
Zirui Chen
Haofen Wang
Leong Hou U
Zhao Li
Wenbin Guo
KELM
215
3
0
01 Jul 2024
A Comparative Study of Quality Evaluation Methods for Text Summarization
A Comparative Study of Quality Evaluation Methods for Text Summarization
Huyen Nguyen
Haihua Chen
Lavanya Pobbathi
Junhua Ding
ELM
83
6
0
30 Jun 2024
LegalTurk Optimized BERT for Multi-Label Text Classification and NER
LegalTurk Optimized BERT for Multi-Label Text Classification and NER
Farnaz Zeidi
Mehmet Fatih Amasyali
Çiğdem Erol
VLM
58
2
0
30 Jun 2024
When Search Engine Services meet Large Language Models: Visions and
  Challenges
When Search Engine Services meet Large Language Models: Visions and Challenges
Haoyi Xiong
Jiang Bian
Yuchen Li
Xuhong Li
Jundong Li
Shuaiqiang Wang
Dawei Yin
Sumi Helal
135
36
0
28 Jun 2024
The Odyssey of Commonsense Causality: From Foundational Benchmarks to
  Cutting-Edge Reasoning
The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning
Shaobo Cui
Zhijing Jin
Bernhard Schölkopf
Boi Faltings
CMLLRM
87
4
0
27 Jun 2024
Unveiling and Controlling Anomalous Attention Distribution in
  Transformers
Unveiling and Controlling Anomalous Attention Distribution in Transformers
Ruiqing Yan
Xingbo Du
Haoyu Deng
Linghan Zheng
Qiuzhuang Sun
Jifang Hu
Yuhang Shao
Penghao Jiang
Jinrong Jiang
Lian Zhao
62
1
0
26 Jun 2024
Zero-shot prompt-based classification: topic labeling in times of
  foundation models in German Tweets
Zero-shot prompt-based classification: topic labeling in times of foundation models in German Tweets
Simon Münker
Kai Kugler
Achim Rettinger
VLM
68
1
0
26 Jun 2024
PharmaGPT: Domain-Specific Large Language Models for Bio-Pharmaceutical
  and Chemistry
PharmaGPT: Domain-Specific Large Language Models for Bio-Pharmaceutical and Chemistry
Linqing Chen
Weilei Wang
Zilong Bai
Peng Xu
Yan Fang
...
Lisha Zhang
Fu Bian
Zhongkai Ye
Lidong Pei
Changyang Tu
AI4MHLM&MA
107
3
0
26 Jun 2024
ViANLI: Adversarial Natural Language Inference for Vietnamese
ViANLI: Adversarial Natural Language Inference for Vietnamese
Tin Van Huynh
Kiet Van Nguyen
Ngan Luu-Thuy Nguyen
64
0
0
25 Jun 2024
Knowledge Distillation in Automated Annotation: Supervised Text
  Classification with LLM-Generated Training Labels
Knowledge Distillation in Automated Annotation: Supervised Text Classification with LLM-Generated Training Labels
Nicholas Pangakis
Samuel Wolken
88
19
0
25 Jun 2024
Evaluation of Language Models in the Medical Context Under
  Resource-Constrained Settings
Evaluation of Language Models in the Medical Context Under Resource-Constrained Settings
Andrea Posada
Daniel Rueckert
Felix Meissen
Philip Muller
LM&MAELM
58
0
0
24 Jun 2024
SyROCCo: Enhancing Systematic Reviews using Machine Learning
SyROCCo: Enhancing Systematic Reviews using Machine Learning
Zheng Fang
Miguel Arana Catania
Felix-Anselm van Lier
Juliana Outes Velarde
Harry Bregazzi
Mara Airoldi
Eleanor Carter
Rob Procter
34
0
0
24 Jun 2024
Deepfake tweets automatic detection
Deepfake tweets automatic detection
Adam Frej
Adrian Kaminski
Piotr Marciniak
Szymon Szmajdzinski
Soveatin Kuntur
Anna Wroblewska
43
0
0
24 Jun 2024
Evaluating the Effectiveness of the Foundational Models for Q&A
  Classification in Mental Health care
Evaluating the Effectiveness of the Foundational Models for Q&A Classification in Mental Health care
Hassan Alhuzali
Ashwag Alasmari
AI4MH
79
2
0
23 Jun 2024
Text Serialization and Their Relationship with the Conventional
  Paradigms of Tabular Machine Learning
Text Serialization and Their Relationship with the Conventional Paradigms of Tabular Machine Learning
Kyoka Ono
Simon A. Lee
LMTD
52
8
0
19 Jun 2024
A Primal-Dual Framework for Transformers and Neural Networks
A Primal-Dual Framework for Transformers and Neural Networks
Tan M. Nguyen
Tam Nguyen
Nhat Ho
Andrea L. Bertozzi
Richard G. Baraniuk
Stanley J. Osher
ViT
70
14
0
19 Jun 2024
Unveiling the Hidden Structure of Self-Attention via Kernel Principal
  Component Analysis
Unveiling the Hidden Structure of Self-Attention via Kernel Principal Component Analysis
R. Teo
Tan M. Nguyen
91
4
0
19 Jun 2024
Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to
  Address Shortcut Shifts in Natural Language Understanding
Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language Understanding
Ukyo Honda
Tatsushi Oka
Peinan Zhang
Masato Mita
90
1
0
17 Jun 2024
WellDunn: On the Robustness and Explainability of Language Models and
  Large Language Models in Identifying Wellness Dimensions
WellDunn: On the Robustness and Explainability of Language Models and Large Language Models in Identifying Wellness Dimensions
Seyedali Mohammadi
Edward Raff
Jinendra Malekar
Vedant Palit
Francis Ferraro
Manas Gaur
AI4MH
92
3
0
17 Jun 2024
GECOBench: A Gender-Controlled Text Dataset and Benchmark for
  Quantifying Biases in Explanations
GECOBench: A Gender-Controlled Text Dataset and Benchmark for Quantifying Biases in Explanations
Rick Wilming
Artur Dox
Hjalmar Schulz
Marta Oliveira
Benedict Clark
Stefan Haufe
102
2
0
17 Jun 2024
A Systematic Analysis of Large Language Models as Soft Reasoners: The
  Case of Syllogistic Inferences
A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences
Leonardo Bertolazzi
Albert Gatt
Raffaella Bernardi
LRMELM
39
6
0
17 Jun 2024
HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model
HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model
Di Wang
Meiqi Hu
Yao Jin
Yuchun Miao
Jiaqi Yang
...
Lefei Zhang
Chen Wu
Di Lin
Dacheng Tao
Liangpei Zhang
164
27
0
17 Jun 2024
Improving Large Models with Small models: Lower Costs and Better
  Performance
Improving Large Models with Small models: Lower Costs and Better Performance
Dong Chen
Shuo Zhang
Yueting Zhuang
Siliang Tang
Qidong Liu
Hua Wang
Mingliang Xu
96
6
0
15 Jun 2024
Exploring the Correlation between Human and Machine Evaluation of
  Simultaneous Speech Translation
Exploring the Correlation between Human and Machine Evaluation of Simultaneous Speech Translation
Xiaoman Wang
Claudio Fantinuoli
46
1
0
14 Jun 2024
Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot
  Generative AI Models in Text Classification
Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification
Martin Juan José Bucher
Marco Martini
ALMAI4MH
125
36
0
12 Jun 2024
Adversarial Evasion Attack Efficiency against Large Language Models
Adversarial Evasion Attack Efficiency against Large Language Models
João Vitorino
Eva Maia
Isabel Praça
AAML
72
2
0
12 Jun 2024
Defining and Detecting Vulnerability in Human Evaluation Guidelines: A
  Preliminary Study Towards Reliable NLG Evaluation
Defining and Detecting Vulnerability in Human Evaluation Guidelines: A Preliminary Study Towards Reliable NLG Evaluation
Jie Ruan
Wenqing Wang
Xiaojun Wan
AAMLELM
80
6
0
12 Jun 2024
To be Continuous, or to be Discrete, Those are Bits of Questions
To be Continuous, or to be Discrete, Those are Bits of Questions
Yiran Wang
Masao Utiyama
80
4
0
12 Jun 2024
Autoregressive Pretraining with Mamba in Vision
Autoregressive Pretraining with Mamba in Vision
Sucheng Ren
Xianhang Li
Haoqin Tu
Feng Wang
Fangxun Shu
...
L. Yang
Peng Wang
Heng Wang
Alan Yuille
Cihang Xie
Mamba
125
12
0
11 Jun 2024
Visual Representation Learning with Stochastic Frame Prediction
Visual Representation Learning with Stochastic Frame Prediction
Huiwon Jang
Dongyoung Kim
Junsu Kim
Jinwoo Shin
Pieter Abbeel
Younggyo Seo
90
3
0
11 Jun 2024
Leveraging Large Language Models for Efficient Failure Analysis in Game
  Development
Leveraging Large Language Models for Efficient Failure Analysis in Game Development
Leonardo Marini
Linus Gisslén
Alessandro Sestini
101
0
0
11 Jun 2024
COVID-19 Twitter Sentiment Classification Using Hybrid Deep Learning
  Model Based on Grid Search Methodology
COVID-19 Twitter Sentiment Classification Using Hybrid Deep Learning Model Based on Grid Search Methodology
Jitendra Tembhurne
Anant Agrawal
Kirtan Lakhotia
71
0
0
11 Jun 2024
The Factorization Curse: Which Tokens You Predict Underlie the Reversal
  Curse and More
The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More
O. Kitouni
Niklas Nolte
Diane Bouchacourt
Adina Williams
Mike Rabbat
Mark Ibrahim
LRMCLL
101
12
0
07 Jun 2024
Creating an AI Observer: Generative Semantic Workspaces
Creating an AI Observer: Generative Semantic Workspaces
Pavan Holur
Shreyas Rajesh
David Chong
V. Roychowdhury
40
0
0
07 Jun 2024
Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI
  Interpretation in Indian Courts
Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI Interpretation in Indian Courts
S. Nigam
Anurag Sharma
Danush Khanna
Noel Shallum
Kripabandhu Ghosh
Arnab Bhattacharya
ELMAILaw
77
9
0
06 Jun 2024
Pre-trained Transformer Uncovers Meaningful Patterns in Human Mobility
  Data
Pre-trained Transformer Uncovers Meaningful Patterns in Human Mobility Data
Alameen Najjar
71
0
0
06 Jun 2024
Wings: Learning Multimodal LLMs without Text-only Forgetting
Wings: Learning Multimodal LLMs without Text-only Forgetting
Yi-Kai Zhang
Shiyin Lu
Yang Li
Yanqing Ma
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
De-Chuan Zhan
Han-Jia Ye
VLM
128
10
0
05 Jun 2024
RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence
  Models for Abstractive Radiology Report Summarization
RadBARTsum: Domain Specific Adaption of Denoising Sequence-to-Sequence Models for Abstractive Radiology Report Summarization
Jinge Wu
Abul Hasan
Honghan Wu
33
1
0
05 Jun 2024
Measure-Observe-Remeasure: An Interactive Paradigm for
  Differentially-Private Exploratory Analysis
Measure-Observe-Remeasure: An Interactive Paradigm for Differentially-Private Exploratory Analysis
Priyanka Nanayakkara
Hyeok Kim
Yifan Wu
Ali Sarvghad
Narges Mahyar
G. Miklau
Jessica Hullman
74
20
0
04 Jun 2024
Focus on the Core: Efficient Attention via Pruned Token Compression for
  Document Classification
Focus on the Core: Efficient Attention via Pruned Token Compression for Document Classification
Jungmin Yun
Mihyeon Kim
Youngbin Kim
113
9
0
03 Jun 2024
Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for
  Accurate Natural Language Task Modeling
Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for Accurate Natural Language Task Modeling
Wrick Talukdar
Anjanava Biswas
52
5
0
03 Jun 2024
GAMedX: Generative AI-based Medical Entity Data Extractor Using Large
  Language Models
GAMedX: Generative AI-based Medical Entity Data Extractor Using Large Language Models
Mohammed-Khalil Ghali
Abdelrahman Farrag
Hajar Sakai
Hicham El Baz
Yu Jin
Sarah Lam
LM&MAMedIm
81
9
0
31 May 2024
Ensemble Model With Bert,Roberta and Xlnet For Molecular property
  prediction
Ensemble Model With Bert,Roberta and Xlnet For Molecular property prediction
Junling Hu
72
1
0
30 May 2024
PathReasoner: Modeling Reasoning Path with Equivalent Extension for
  Logical Question Answering
PathReasoner: Modeling Reasoning Path with Equivalent Extension for Logical Question Answering
Fangzhi Xu
Qika Lin
Tianzhe Zhao
Jiawei Han
Jun Liu
LRM
63
1
0
29 May 2024
Arithmetic Reasoning with LLM: Prolog Generation & Permutation
Arithmetic Reasoning with LLM: Prolog Generation & Permutation
Xiaocheng Yang
Bingsen Chen
Yik-Cheung Tam
LRM
91
12
0
28 May 2024
FAIIR: Building Toward A Conversational AI Agent Assistant for Youth Mental Health Service Provision
FAIIR: Building Toward A Conversational AI Agent Assistant for Youth Mental Health Service Provision
Stephen Obadinma
Alia Lachana
M. Norman
Jocelyn Rankin
Joanna Yu
Xiaodan Zhu
Darren Mastropaolo
D. Pandya
Roxana Sultan
Elham Dolatabadi
AI4MH
115
1
0
28 May 2024
InversionView: A General-Purpose Method for Reading Information from
  Neural Activations
InversionView: A General-Purpose Method for Reading Information from Neural Activations
Xinting Huang
Madhur Panwar
Navin Goyal
Michael Hahn
98
5
0
27 May 2024
WirelessLLM: Empowering Large Language Models Towards Wireless
  Intelligence
WirelessLLM: Empowering Large Language Models Towards Wireless Intelligence
Jiawei Shao
Jingwen Tong
Qiong Wu
Wei Guo
Zijian Li
Zehong Lin
Jun Zhang
92
40
0
27 May 2024
Previous
123...567...697071
Next