ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,520 papers shown
Title
The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder
  Models for More Efficient Code Classification
The EarlyBIRD Catches the Bug: On Exploiting Early Layers of Encoder Models for More Efficient Code Classification
Anastasiia Grishina
Max Hort
Leon Moonen
68
6
0
08 May 2023
Differentially Private Attention Computation
Differentially Private Attention Computation
Yeqi Gao
Zhao Song
Xin Yang
92
21
0
08 May 2023
Toward Adversarial Training on Contextualized Language Representation
Toward Adversarial Training on Contextualized Language Representation
Hongqiu Wu
Yang Liu
Han Shi
Haizhen Zhao
Hao Fei
AAML
54
14
0
08 May 2023
AnomalyBERT: Self-Supervised Transformer for Time Series Anomaly
  Detection using Data Degradation Scheme
AnomalyBERT: Self-Supervised Transformer for Time Series Anomaly Detection using Data Degradation Scheme
Yungi Jeong
Eu-Hui Yang
Jung Hyun Ryu
Imseong Park
Myung-joo Kang
ViTAI4TS
72
31
0
08 May 2023
Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing
  Important Tokens
Vcc: Scaling Transformers to 128K Tokens or More by Prioritizing Important Tokens
Zhanpeng Zeng
Cole Hawkins
Min-Fong Hong
Aston Zhang
Nikolaos Pappas
Vikas Singh
Shuai Zheng
72
8
0
07 May 2023
Refining the Responses of LLMs by Themselves
Refining the Responses of LLMs by Themselves
Tianqiang Yan
Tiansheng Xu
51
3
0
06 May 2023
Pre-training Language Model as a Multi-perspective Course Learner
Pre-training Language Model as a Multi-perspective Course Learner
Beiduo Chen
Shaohan Huang
Zi-qiang Zhang
Wu Guo
Zhen-Hua Ling
Haizhen Huang
Furu Wei
Weiwei Deng
Qi Zhang
61
0
0
06 May 2023
DiscoPrompt: Path Prediction Prompt Tuning for Implicit Discourse
  Relation Recognition
DiscoPrompt: Path Prediction Prompt Tuning for Implicit Discourse Relation Recognition
Chunkit Chan
Xin Liu
Cheng Jiayang
Zihan Li
Yangqiu Song
Ginny Wong
Simon See
82
31
0
06 May 2023
Adaptive loose optimization for robust question answering
Adaptive loose optimization for robust question answering
Jie Ma
Pinghui Wang
Ze-you Wang
Dechen Kong
Min Hu
Tingxu Han
Jun Liu
OOD
129
4
0
06 May 2023
Multi-grained Hypergraph Interest Modeling for Conversational
  Recommendation
Multi-grained Hypergraph Interest Modeling for Conversational Recommendation
Chenzhang Shang
Yupeng Hou
Wayne Xin Zhao
Yaliang Li
Jing Zhang
110
12
0
04 May 2023
VendorLink: An NLP approach for Identifying & Linking Vendor Migrants &
  Potential Aliases on Darknet Markets
VendorLink: An NLP approach for Identifying & Linking Vendor Migrants & Potential Aliases on Darknet Markets
V. Saxena
Nils Rethmeier
Gijs Van Dijck
Gerasimos Spanakis
40
6
0
04 May 2023
Using Language Models on Low-end Hardware
Using Language Models on Low-end Hardware
Silin Gao
Beatriz Borges
Saya Kanno
Antoine Bosselut
113
0
0
03 May 2023
FreeLM: Fine-Tuning-Free Language Model
FreeLM: Fine-Tuning-Free Language Model
Xiang Li
Xin Jiang
Xuying Meng
Aixin Sun
Yequan Wang
84
3
0
02 May 2023
Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in
  Language Models
Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models
Shuai Zhao
Jinming Wen
Anh Tuan Luu
Jiaqi Zhao
Jie Fu
SILM
174
99
0
02 May 2023
ArK: Augmented Reality with Knowledge Interactive Emergent Ability
ArK: Augmented Reality with Knowledge Interactive Emergent Ability
Qiuyuan Huang
Jinho Park
Abhinav Gupta
Paul N. Bennett
Ran Gong
...
Baolin Peng
O. Mohammed
C. Pal
Yejin Choi
Jianfeng Gao
122
6
0
01 May 2023
Multimodal Graph Transformer for Multimodal Question Answering
Multimodal Graph Transformer for Multimodal Question Answering
Xuehai He
Xin Eric Wang
88
9
0
30 Apr 2023
Calibration Error Estimation Using Fuzzy Binning
Calibration Error Estimation Using Fuzzy Binning
Geetanjali Bihani
Julia Taylor Rayz
215
2
0
30 Apr 2023
A Review of ChatGPT Applications in Education, Marketing, Software
  Engineering, and Healthcare: Benefits, Drawbacks, and Research Directions
A Review of ChatGPT Applications in Education, Marketing, Software Engineering, and Healthcare: Benefits, Drawbacks, and Research Directions
Mohammad Fraiwan
Natheer Khasawneh
106
48
0
29 Apr 2023
MMViT: Multiscale Multiview Vision Transformers
MMViT: Multiscale Multiview Vision Transformers
Yuchen Liu
Natasha Ong
Kaiyan Peng
Bo Xiong
Qifan Wang
...
Madian Khabsa
Kaiyue Yang
David C. Liu
Donald Williamson
Hanchao Yu
ViT
63
4
0
28 Apr 2023
DIAMANT: Dual Image-Attention Map Encoders For Medical Image
  Segmentation
DIAMANT: Dual Image-Attention Map Encoders For Medical Image Segmentation
Yousef Yeganeh
Azade Farshad
Peter Weinberger
Seyed-Ahmad Ahmadi
Ehsan Adeli
Nassir Navab
ViTMedIm
56
0
0
28 Apr 2023
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond
Jingfeng Yang
Hongye Jin
Ruixiang Tang
Xiaotian Han
Qizhang Feng
Haoming Jiang
Bing Yin
Helen Zhou
LM&MA
214
686
0
26 Apr 2023
Pre-trained Embeddings for Entity Resolution: An Experimental Analysis
  [Experiment, Analysis & Benchmark]
Pre-trained Embeddings for Entity Resolution: An Experimental Analysis [Experiment, Analysis & Benchmark]
Alexandros Zeakis
G. Papadakis
Dimitrios Skoutas
Manolis Koubarakis
78
39
0
24 Apr 2023
Learn What NOT to Learn: Towards Generative Safety in Chatbots
Learn What NOT to Learn: Towards Generative Safety in Chatbots
Leila Khalatbari
Yejin Bang
Jane Polak Scowcroft
Willy Chung
Saeedeh Ghadimi
Hossein Sameti
Pascale Fung
73
7
0
21 Apr 2023
Domain-specific Continued Pretraining of Language Models for Capturing
  Long Context in Mental Health
Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health
Shaoxiong Ji
Tianlin Zhang
Kailai Yang
Sophia Ananiadou
Min Zhang
Jörg Tiedemann
AI4MHALM
86
29
0
20 Apr 2023
MasakhaNEWS: News Topic Classification for African languages
MasakhaNEWS: News Topic Classification for African languages
David Ifeoluwa Adelani
Marek Masiak
Israel Abebe Azime
Jesujoba Oluwadara Alabi
A. Tonja
...
Moges Ahmed Mehamed
Evrard Ngabire
Jules Jules
Ivan Ssenkungu
Pontus Stenetorp
114
24
0
19 Apr 2023
SemEval 2023 Task 6: LegalEval - Understanding Legal Texts
SemEval 2023 Task 6: LegalEval - Understanding Legal Texts
Ashutosh Modi
Prathamesh Kalamkar
S. Karn
Aman Tiwari
Abhinav Joshi
Sai Kiran Tanikella
S. Guha
Sachin Malhan
Vivek Raghavan
ELMAILaw
55
42
0
19 Apr 2023
Exploring the Trade-Offs: Unified Large Language Models vs Local
  Fine-Tuned Models for Highly-Specific Radiology NLI Task
Exploring the Trade-Offs: Unified Large Language Models vs Local Fine-Tuned Models for Highly-Specific Radiology NLI Task
Zihao Wu
Lu Zhang
Chao-Yang Cao
Xiao-Xing Yu
Haixing Dai
...
Quanzheng Li
Dinggang Shen
Xiang Li
Dajiang Zhu
Tianming Liu
LM&MA
66
39
0
18 Apr 2023
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised
  Learning
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning
Zheng Lian
Haiyang Sun
Guoying Zhao
Kang Chen
Mingyu Xu
...
Meng Wang
Min Zhang
Guoying Zhao
Björn W. Schuller
Jianhua Tao
96
51
0
18 Apr 2023
GlobalMind: Global Multi-head Interactive Self-attention Network for
  Hyperspectral Change Detection
GlobalMind: Global Multi-head Interactive Self-attention Network for Hyperspectral Change Detection
Meiqi Hu
Chen Wu
Lefei Zhang
129
21
0
18 Apr 2023
Towards Better Instruction Following Language Models for Chinese:
  Investigating the Impact of Training Data and Evaluation
Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation
Yunjie Ji
Yan Gong
Yong Deng
Yiping Peng
Qiang Niu
Baochang Ma
Xiangang Li
ALMELM
102
25
0
16 Apr 2023
MisRoBÆRTa: Transformers versus Misinformation
MisRoBÆRTa: Transformers versus Misinformation
Ciprian-Octavian Truică
Elena Simona Apostol
66
39
0
16 Apr 2023
Fairness in Visual Clustering: A Novel Transformer Clustering Approach
Fairness in Visual Clustering: A Novel Transformer Clustering Approach
Xuan-Bac Nguyen
C. Duong
Marios Savvides
Kaushik Roy
Hugh Churchill
Khoa Luu
108
9
0
14 Apr 2023
Context-aware Coherent Speaking Style Prediction with Hierarchical
  Transformers for Audiobook Speech Synthesis
Context-aware Coherent Speaking Style Prediction with Hierarchical Transformers for Audiobook Speech Synthesis
Shunwei Lei
Yixuan Zhou
Liyang Chen
Zhiyong Wu
Shiyin Kang
Helen Meng
84
6
0
13 Apr 2023
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
A Cheaper and Better Diffusion Language Model with Soft-Masked Noise
Jiaao Chen
Aston Zhang
Mu Li
Alexander J. Smola
Diyi Yang
DiffM
88
21
0
10 Apr 2023
Similarity-Aware Multimodal Prompt Learning for Fake News Detection
Similarity-Aware Multimodal Prompt Learning for Fake News Detection
Ye Jiang
Xiaomin Yu
Yimin Wang
Xiaoman Xu
Xingyi Song
Diana Maynard
83
27
0
09 Apr 2023
Continual Graph Convolutional Network for Text Classification
Continual Graph Convolutional Network for Text Classification
Tiandeng Wu
Qijiong Liu
Yinhao Cao
yao. huang
Xiao-Ming Wu
Jiandong Ding
GNN
74
10
0
09 Apr 2023
Multi-class Categorization of Reasons behind Mental Disturbance in Long
  Texts
Multi-class Categorization of Reasons behind Mental Disturbance in Long Texts
Muskan Garg
AI4MH
47
2
0
08 Apr 2023
Pump It Up: Predict Water Pump Status using Attentive Tabular Learning
Pump It Up: Predict Water Pump Status using Attentive Tabular Learning
Karan Pathak
L. Shalini
35
0
0
08 Apr 2023
Gated Mechanism Enhanced Multi-Task Learning for Dialog Routing
Gated Mechanism Enhanced Multi-Task Learning for Dialog Routing
Ziming Huang
Zhuoxuan Jiang
Ke Min Wang
Juntao Li
Shanshan Feng
Xian-Ling Mao
MoE
282
0
0
07 Apr 2023
SSS at SemEval-2023 Task 10: Explainable Detection of Online Sexism
  using Majority Voted Fine-Tuned Transformers
SSS at SemEval-2023 Task 10: Explainable Detection of Online Sexism using Majority Voted Fine-Tuned Transformers
Sriya Rallabandi
Sanchit Singhal
Pratinav Seth
21
3
0
07 Apr 2023
Deep Learning for Opinion Mining and Topic Classification of Course
  Reviews
Deep Learning for Opinion Mining and Topic Classification of Course Reviews
Anna Koufakou
62
20
0
06 Apr 2023
Towards Interpretable Mental Health Analysis with Large Language Models
Towards Interpretable Mental Health Analysis with Large Language Models
Kailai Yang
Shaoxiong Ji
Tianlin Zhang
Qianqian Xie
Zi-Zhou Kuang
Sophia Ananiadou
ELMAI4MHLRM
121
61
0
06 Apr 2023
How to Design Translation Prompts for ChatGPT: An Empirical Study
How to Design Translation Prompts for ChatGPT: An Empirical Study
Yuan Gao
Ruili Wang
Feng Hou
69
46
0
05 Apr 2023
MEGClass: Extremely Weakly Supervised Text Classification via
  Mutually-Enhancing Text Granularities
MEGClass: Extremely Weakly Supervised Text Classification via Mutually-Enhancing Text Granularities
Priyanka Kargupta
Tanay Komarlu
Susik Yoon
Xuan Wang
Jiawei Han
98
8
0
04 Apr 2023
PromptAid: Prompt Exploration, Perturbation, Testing and Iteration using
  Visual Analytics for Large Language Models
PromptAid: Prompt Exploration, Perturbation, Testing and Iteration using Visual Analytics for Large Language Models
Aditi Mishra
Utkarsh Soni
Anjana Arunkumar
Jinbin Huang
Bum Chul Kwon
Chris Bryan
LRM
83
35
0
04 Apr 2023
Optimizing Group Utility in Itinerary Planning: A Strategic and Crowd-Aware Approach
Junhua Liu
Kwan Hui Lim
Kristin L. Wood
Menglin Li
115
0
0
04 Apr 2023
EDeR: A Dataset for Exploring Dependency Relations Between Events
EDeR: A Dataset for Exploring Dependency Relations Between Events
RUIQI LI
P. Haslum
Leyang Cui
67
0
0
04 Apr 2023
A Bibliometric Review of Large Language Models Research from 2017 to
  2023
A Bibliometric Review of Large Language Models Research from 2017 to 2023
Lizhou Fan
Lingyao Li
Zihui Ma
Sanggyu Lee
Huizi Yu
Libby Hemphill
117
158
0
03 Apr 2023
A Comparison of Document Similarity Algorithms
A Comparison of Document Similarity Algorithms
Nicholas Gahman
V. Elangovan
AI4TS
53
4
0
03 Apr 2023
Safety Analysis in the Era of Large Language Models: A Case Study of
  STPA using ChatGPT
Safety Analysis in the Era of Large Language Models: A Case Study of STPA using ChatGPT
Yi Qi
Xingyu Zhao
Siddartha Khastgir
Xiaowei Huang
80
16
0
03 Apr 2023
Previous
123...171819...697071
Next