ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,520 papers shown
Title
Attention over pre-trained Sentence Embeddings for Long Document
  Classification
Attention over pre-trained Sentence Embeddings for Long Document Classification
Amine Abdaoui
Sourav Dutta
52
1
0
18 Jul 2023
Is Prompt-Based Finetuning Always Better than Vanilla Finetuning?
  Insights from Cross-Lingual Language Understanding
Is Prompt-Based Finetuning Always Better than Vanilla Finetuning? Insights from Cross-Lingual Language Understanding
Bolei Ma
Ercong Nie
Helmut Schmid
Hinrich Schütze
AAMLVLMLRM
97
9
0
15 Jul 2023
Improving BERT with Hybrid Pooling Network and Drop Mask
Improving BERT with Hybrid Pooling Network and Drop Mask
Qian Chen
Wen Wang
Qinglin Zhang
Chong Deng
Ma Yukun
Siqi Zheng
43
1
0
14 Jul 2023
Unsupervised Calibration through Prior Adaptation for Text
  Classification using Large Language Models
Unsupervised Calibration through Prior Adaptation for Text Classification using Large Language Models
Lautaro Estienne
Luciana Ferrer
Matías Vera
Pablo Piantanida
VLM
55
1
0
13 Jul 2023
Deep Network Approximation: Beyond ReLU to Diverse Activation Functions
Deep Network Approximation: Beyond ReLU to Diverse Activation Functions
Shijun Zhang
Jianfeng Lu
Hongkai Zhao
62
21
0
13 Jul 2023
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the
  Backbone
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone
Shraman Pramanick
Yale Song
Sayan Nag
Kevin Qinghong Lin
Hardik Shah
Mike Zheng Shou
Ramalingam Chellappa
Pengchuan Zhang
VLM
124
100
0
11 Jul 2023
Vacaspati: A Diverse Corpus of Bangla Literature
Vacaspati: A Diverse Corpus of Bangla Literature
Pramit Bhattacharyya
Joydeep Mondal
S. Maji
Arnab Bhattacharya
69
7
0
11 Jul 2023
Synthetic Dataset for Evaluating Complex Compositional Knowledge for
  Natural Language Inference
Synthetic Dataset for Evaluating Complex Compositional Knowledge for Natural Language Inference
Sushma A. Akoju
Robert Vacareanu
Haris Riaz
Eduardo Blanco
Mihai Surdeanu
NAICoGe
32
1
0
11 Jul 2023
ChatGPT for Digital Forensic Investigation: The Good, The Bad, and The
  Unknown
ChatGPT for Digital Forensic Investigation: The Good, The Bad, and The Unknown
Mark Scanlon
Frank Breitinger
Christopher J. Hargreaves
Jan-Niclas Hilgert
John W. Sheppard
SILM
33
72
0
10 Jul 2023
Advancements in Scientific Controllable Text Generation Methods
Advancements in Scientific Controllable Text Generation Methods
Arnav Goel
Medha Hira
Avinash Anand
Siddhesh Bangar
R. Shah
79
7
0
08 Jul 2023
Unveiling the Potential of Knowledge-Prompted ChatGPT for Enhancing Drug
  Trafficking Detection on Social Media
Unveiling the Potential of Knowledge-Prompted ChatGPT for Enhancing Drug Trafficking Detection on Social Media
Chuanbo Hu
Bin Liu
Xin Li
Yanfang Ye
35
4
0
07 Jul 2023
A Side-by-side Comparison of Transformers for English Implicit Discourse
  Relation Classification
A Side-by-side Comparison of Transformers for English Implicit Discourse Relation Classification
Bruce W. Lee
Bongseok Yang
J. Lee
76
0
0
07 Jul 2023
Evaluating Biased Attitude Associations of Language Models in an
  Intersectional Context
Evaluating Biased Attitude Associations of Language Models in an Intersectional Context
Shiva Omrani Sabbaghi
Robert Wolfe
Aylin Caliskan
73
25
0
07 Jul 2023
S2vNTM: Semi-supervised vMF Neural Topic Modeling
S2vNTM: Semi-supervised vMF Neural Topic Modeling
Weijie Xu
Jay Desai
Srinivasan H. Sengamedu
Xiaoyu Jiang
Francis Iannacci
VLM
68
1
0
06 Jul 2023
BLEURT Has Universal Translations: An Analysis of Automatic Metrics by
  Minimum Risk Training
BLEURT Has Universal Translations: An Analysis of Automatic Metrics by Minimum Risk Training
Yiming Yan
Tao Wang
Chengqi Zhao
Shujian Huang
Jiajun Chen
Mingxuan Wang
92
24
0
06 Jul 2023
SpaceNLI: Evaluating the Consistency of Predicting Inferences in Space
SpaceNLI: Evaluating the Consistency of Predicting Inferences in Space
Lasha Abzianidze
J. Zwarts
Yoad Winter
34
2
0
05 Jul 2023
Generative Job Recommendations with Large Language Model
Generative Job Recommendations with Large Language Model
Zhi Zheng
Zhaopeng Qiu
Xiao Hu
Likang Wu
Hengshu Zhu
Hui Xiong
53
22
0
05 Jul 2023
KDSTM: Neural Semi-supervised Topic Modeling with Knowledge Distillation
KDSTM: Neural Semi-supervised Topic Modeling with Knowledge Distillation
Weijie Xu
Xiaoyu Jiang
Jay Desai
Bin Han
Fuqin Yan
Francis Iannacci
BDL
88
3
0
04 Jul 2023
Deep Attention Q-Network for Personalized Treatment Recommendation
Deep Attention Q-Network for Personalized Treatment Recommendation
Simin Ma
Junghwan Lee
N. Serban
Shihao Yang
OffRL
77
6
0
04 Jul 2023
A Critical Re-evaluation of Benchmark Datasets for (Deep) Learning-Based
  Matching Algorithms
A Critical Re-evaluation of Benchmark Datasets for (Deep) Learning-Based Matching Algorithms
G. Papadakis
Nishadi Kirielle
Peter Christen
Themis Palpanas
97
8
0
03 Jul 2023
A Dual-Stream Recurrence-Attention Network With Global-Local Awareness
  for Emotion Recognition in Textual Dialog
A Dual-Stream Recurrence-Attention Network With Global-Local Awareness for Emotion Recognition in Textual Dialog
Jiang Li
Xiaoping Wang
Zhigang Zeng
68
4
0
02 Jul 2023
BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained
  Transformer
BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer
Z. Li
Shitou Zhang
Hai Zhao
Yifei Yang
Dongjie Yang
LM&MA
116
17
0
01 Jul 2023
iMETRE: Incorporating Markers of Entity Types for Relation Extraction
iMETRE: Incorporating Markers of Entity Types for Relation Extraction
Harsha Vardhan
Manav Chaudhary
44
3
0
30 Jun 2023
Information Extraction in Domain and Generic Documents: Findings from
  Heuristic-based and Data-driven Approaches
Information Extraction in Domain and Generic Documents: Findings from Heuristic-based and Data-driven Approaches
Shiyu Yuan
Carlo Lipizzi
31
2
0
30 Jun 2023
Transformers in Healthcare: A Survey
Transformers in Healthcare: A Survey
Subhash Nerella
S. Bandyopadhyay
Jiaqing Zhang
Miguel Contreras
Scott Siegel
...
Jessica Sena
B. Shickel
A. Bihorac
Kia Khezeli
Parisa Rashidi
MedImAI4CE
96
35
0
30 Jun 2023
SpATr: MoCap 3D Human Action Recognition based on Spiral Auto-encoder
  and Transformer Network
SpATr: MoCap 3D Human Action Recognition based on Spiral Auto-encoder and Transformer Network
Hamza Bouzid
Lahoucine Ballihi
ViT3DH
107
3
0
30 Jun 2023
Classifying Crime Types using Judgment Documents from Social Media
Haoxuan Xu
Zeyu He
Mengfan Shen
Songning Lai
Ziqiang Han
Yifan Peng
94
0
0
29 Jun 2023
ICSVR: Investigating Compositional and Syntactic Understanding in Video
  Retrieval Models
ICSVR: Investigating Compositional and Syntactic Understanding in Video Retrieval Models
Avinash Madasu
Vasudev Lal
CoGe
102
3
0
28 Jun 2023
MyCrunchGPT: A chatGPT assisted framework for scientific machine
  learning
MyCrunchGPT: A chatGPT assisted framework for scientific machine learning
Varun V. Kumar
Leonard Gleyzer
Adar Kahana
K. Shukla
George Karniadakis
AI4CE
88
14
0
27 Jun 2023
A Solution to CVPR'2023 AQTC Challenge: Video Alignment for Multi-Step
  Inference
A Solution to CVPR'2023 AQTC Challenge: Video Alignment for Multi-Step Inference
Chao Zhang
Shiwei Wu
Sirui Zhao
Tong Xu
Enhong Chen
61
0
0
26 Jun 2023
Switch-BERT: Learning to Model Multimodal Interactions by Switching
  Attention and Input
Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input
Qingpei Guo
Kaisheng Yao
Wei Chu
MLLM
45
5
0
25 Jun 2023
H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large
  Language Models
H2_22​O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
Zhenyu Zhang
Ying Sheng
Dinesh Manocha
Tianlong Chen
Lianmin Zheng
...
Yuandong Tian
Christopher Ré
Clark W. Barrett
Zhangyang Wang
Beidi Chen
VLM
194
315
0
24 Jun 2023
Emotion Flip Reasoning in Multiparty Conversations
Emotion Flip Reasoning in Multiparty Conversations
Shivani Kumar
Shubham Dudeja
Md. Shad Akhtar
Tanmoy Chakraborty
57
14
0
24 Jun 2023
Knowledge-Infused Self Attention Transformers
Knowledge-Infused Self Attention Transformers
Kaushik Roy
Yuxin Zi
Vignesh Narayanan
Manas Gaur
Amit P. Sheth
KELM
50
7
0
23 Jun 2023
First Place Solution to the CVPR'2023 AQTC Challenge: A
  Function-Interaction Centric Approach with Spatiotemporal Visual-Language
  Alignment
First Place Solution to the CVPR'2023 AQTC Challenge: A Function-Interaction Centric Approach with Spatiotemporal Visual-Language Alignment
Tom Tongjia Chen
Hongshan Yu
Zhengeng Yang
Ming Li
Zechuan Li
Jingwen Wang
Wei Miao
Wei Sun
Chen Chen
45
2
0
23 Jun 2023
Quantizable Transformers: Removing Outliers by Helping Attention Heads
  Do Nothing
Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing
Yelysei Bondarenko
Markus Nagel
Tijmen Blankevoort
MQ
123
93
0
22 Jun 2023
Sample Attackability in Natural Language Adversarial Attacks
Sample Attackability in Natural Language Adversarial Attacks
Vyas Raina
Mark Gales
SILM
110
1
0
21 Jun 2023
Give Us the Facts: Enhancing Large Language Models with Knowledge Graphs
  for Fact-aware Language Modeling
Give Us the Facts: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language Modeling
Lin F. Yang
Hongyang Chen
Zhao Li
Xiao Ding
Xindong Wu
KELM
114
93
0
20 Jun 2023
Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery
  Tickets from Large Models
Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models
A. Jaiswal
Shiwei Liu
Tianlong Chen
Ying Ding
Zhangyang Wang
VLM
115
21
0
18 Jun 2023
Investigating Masking-based Data Generation in Language Models
Investigating Masking-based Data Generation in Language Models
Edward Ma
61
0
0
16 Jun 2023
Pushing the Limits of ChatGPT on NLP Tasks
Pushing the Limits of ChatGPT on NLP Tasks
Xiaofei Sun
Linfeng Dong
Xiaoya Li
Zhen Wan
Shuhe Wang
...
Jiwei Li
Fei Cheng
Lingjuan Lyu
Leilei Gan
Guoyin Wang
AI4MHLRM
117
32
0
16 Jun 2023
Lexical Speaker Error Correction: Leveraging Language Models for Speaker
  Diarization Error Correction
Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction
Rohit Paturi
S. Srinivasan
Xiang Li
62
15
0
15 Jun 2023
Matching Pairs: Attributing Fine-Tuned Models to their Pre-Trained Large
  Language Models
Matching Pairs: Attributing Fine-Tuned Models to their Pre-Trained Large Language Models
Myles Foley
Ambrish Rawat
Taesung Lee
Yufang Hou
Gabriele Picco
Giulio Zizzo
DeLMO
138
6
0
15 Jun 2023
Relational Temporal Graph Reasoning for Dual-task Dialogue Language
  Understanding
Relational Temporal Graph Reasoning for Dual-task Dialogue Language Understanding
Bowen Xing
Ivor W. Tsang
70
15
0
15 Jun 2023
Anticipatory Music Transformer
Anticipatory Music Transformer
John Thickstun
David Leo Wright Hall
Chris Donahue
Percy Liang
77
16
0
14 Jun 2023
Research on Named Entity Recognition in Improved transformer with R-Drop
  structure
Research on Named Entity Recognition in Improved transformer with R-Drop structure
Weidong Ji
Yousheng Zhang
Guohui Zhou
Xu Wang
88
0
0
14 Jun 2023
Sociodemographic Bias in Language Models: A Survey and Forward Path
Sociodemographic Bias in Language Models: A Survey and Forward Path
Vipul Gupta
Pranav Narayanan Venkit
Shomir Wilson
R. Passonneau
97
23
0
13 Jun 2023
Adversarial Capsule Networks for Romanian Satire Detection and Sentiment
  Analysis
Adversarial Capsule Networks for Romanian Satire Detection and Sentiment Analysis
Sebastian-Vasile Echim
Ruazvan-Alexandru Smuadu
Andrei-Marius Avram
Dumitru-Clementin Cercel
Florin-Catalin Pop
57
5
0
13 Jun 2023
Rank-Aware Negative Training for Semi-Supervised Text Classification
Rank-Aware Negative Training for Semi-Supervised Text Classification
Ahmed Murtadha
Shengfeng Pan
Wen Bo
Jianlin Su
Xinxin Cao
Wenze Zhang
Yunfeng Liu
84
9
0
13 Jun 2023
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
Lorenzo Baraldi
Roberto Amoroso
Marcella Cornia
Lorenzo Baraldi
Andrea Pilzer
Rita Cucchiara
153
2
0
12 Jun 2023
Previous
123...141516...697071
Next