ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,518 papers shown
Title
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for
  Controllable Language Generation
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation
Chengxing Jia
Pengyuan Wang
Ziniu Li
Yi-Chen Li
Zhilong Zhang
Nan Tang
Yang Yu
OffRL
64
2
0
27 May 2024
5W1H Extraction With Large Language Models
5W1H Extraction With Large Language Models
Yang Cao
Yangsong Lan
Feiyan Zhai
Piji Li
102
1
0
25 May 2024
How Well Do Deep Learning Models Capture Human Concepts? The Case of the
  Typicality Effect
How Well Do Deep Learning Models Capture Human Concepts? The Case of the Typicality Effect
Siddhartha K. Vemuri
Raj Sanjay Shah
Sashank Varma
VLM
80
5
0
25 May 2024
Text Generation: A Systematic Literature Review of Tasks, Evaluation,
  and Challenges
Text Generation: A Systematic Literature Review of Tasks, Evaluation, and Challenges
Jonas Becker
Jan Philip Wahle
Bela Gipp
Terry Ruas
122
11
0
24 May 2024
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision
  Models
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models
Byung-Kwan Lee
Chae Won Kim
Beomchan Park
Yonghyun Ro
MLLMLRM
142
21
0
24 May 2024
Adversarial Attacks on Hidden Tasks in Multi-Task Learning
Adversarial Attacks on Hidden Tasks in Multi-Task Learning
Yu Zhe
Rei Nagaike
Daiki Nishiyama
Kazuto Fukuchi
Jun Sakuma
AAML
67
0
0
24 May 2024
ARVideo: Autoregressive Pretraining for Self-Supervised Video
  Representation Learning
ARVideo: Autoregressive Pretraining for Self-Supervised Video Representation Learning
Sucheng Ren
Hongru Zhu
Chen Wei
Yijiang Li
Alan Yuille
Cihang Xie
AI4TSVGenSSL
83
2
0
24 May 2024
Machine Unlearning in Large Language Models
Machine Unlearning in Large Language Models
Saaketh Koundinya Gundavarapu
Shreya Agarwal
Arushi Arora
Chandana Thimmalapura Jagadeeshaiah
MU
33
0
0
24 May 2024
CEEBERT: Cross-Domain Inference in Early Exit BERT
CEEBERT: Cross-Domain Inference in Early Exit BERT
Divya J. Bajpai
M. Hanawal
LRM
79
5
0
23 May 2024
Bitune: Bidirectional Instruction-Tuning
Bitune: Bidirectional Instruction-Tuning
D. J. Kopiczko
Tijmen Blankevoort
Yuki Markus Asano
47
3
0
23 May 2024
Exploration of Attention Mechanism-Enhanced Deep Learning Models in the
  Mining of Medical Textual Data
Exploration of Attention Mechanism-Enhanced Deep Learning Models in the Mining of Medical Textual Data
Lingxi Xiao
Muqing Li
Yinqiu Feng
Meiqi Wang
Ziyi Zhu
Zexi Chen
111
16
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
333
54
0
23 May 2024
Dense Connector for MLLMs
Dense Connector for MLLMs
Huanjin Yao
Wenhao Wu
Taojiannan Yang
Yuxin Song
Mengxi Zhang
Haocheng Feng
Yifan Sun
Zhiheng Li
Wanli Ouyang
Jingdong Wang
MLLMVLM
102
25
0
22 May 2024
Do Language Models Enjoy Their Own Stories? Prompting Large Language
  Models for Automatic Story Evaluation
Do Language Models Enjoy Their Own Stories? Prompting Large Language Models for Automatic Story Evaluation
Cyril Chhun
Fabian M. Suchanek
Chloé Clavel
LRM
114
18
0
22 May 2024
Investigating Persuasion Techniques in Arabic: An Empirical Study
  Leveraging Large Language Models
Investigating Persuasion Techniques in Arabic: An Empirical Study Leveraging Large Language Models
Abdurahmman Alzahrani
Eyad Babkier
Faisal Yanbaawi
Firas Yanbaawi
Hassan Alhuzali
75
0
0
21 May 2024
CReMa: Crisis Response through Computational Identification and Matching
  of Cross-Lingual Requests and Offers Shared on Social Media
CReMa: Crisis Response through Computational Identification and Matching of Cross-Lingual Requests and Offers Shared on Social Media
Rabindra Lamsal
M. Read
S. Karunasekera
Muhammad Imran
62
3
0
20 May 2024
Multilingual Substitution-based Word Sense Induction
Multilingual Substitution-based Word Sense Induction
Denis Kokosinskii
Nikolay Arefyev
40
2
0
17 May 2024
DeepPavlov at SemEval-2024 Task 8: Leveraging Transfer Learning for
  Detecting Boundaries of Machine-Generated Texts
DeepPavlov at SemEval-2024 Task 8: Leveraging Transfer Learning for Detecting Boundaries of Machine-Generated Texts
Anastasia Voznyuk
Vasily Konovalov
DeLMO
92
5
0
17 May 2024
A Hybrid Deep Learning Framework for Stock Price Prediction Considering
  the Investor Sentiment of Online Forum Enhanced by Popularity
A Hybrid Deep Learning Framework for Stock Price Prediction Considering the Investor Sentiment of Online Forum Enhanced by Popularity
Huiyu Li
Junhua Hu
26
0
0
17 May 2024
Multi-Evidence based Fact Verification via A Confidential Graph Neural
  Network
Multi-Evidence based Fact Verification via A Confidential Graph Neural Network
Yuqing Lan
Zhenghao Liu
Yu Gu
Xiaoyuan Yi
Xiaohua Li
Liner Yang
Ge Yu
86
1
0
17 May 2024
Beyond Traditional Single Object Tracking: A Survey
Beyond Traditional Single Object Tracking: A Survey
Omar Abdelaziz
Mohamed Shehata
Mohamed Mohamed
123
1
0
16 May 2024
A survey on fairness of large language models in e-commerce: progress,
  application, and challenge
A survey on fairness of large language models in e-commerce: progress, application, and challenge
Qingyang Ren
Zilin Jiang
Jinghan Cao
Sijia Li
Chiqu Li
Yiyang Liu
Shuning Huo
Tiange He
Yuan Chen
AILawFaML
101
7
0
15 May 2024
A Survey on Transformers in NLP with Focus on Efficiency
A Survey on Transformers in NLP with Focus on Efficiency
Wazib Ansar
Saptarsi Goswami
Amlan Chakrabarti
MedIm
93
2
0
15 May 2024
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive
  Permutation for Scene Text Recognition
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
Honghui Chen
Yuhang Qiu
Jiabao Wang
Pingping Chen
Nam Ling
62
0
0
15 May 2024
Is Less More? Quality, Quantity and Context in Idiom Processing with
  Natural Language Models
Is Less More? Quality, Quantity and Context in Idiom Processing with Natural Language Models
Agne Knietaite
Adam Allsebrook
Anton Minkov
Adam Tomaszewski
Norbert Slinko
Richard Johnson
Thomas Pickard
Dylan Phelps
Aline Villavicencio
62
2
0
14 May 2024
G-SAP: Graph-based Structure-Aware Prompt Learning over Heterogeneous
  Knowledge for Commonsense Reasoning
G-SAP: Graph-based Structure-Aware Prompt Learning over Heterogeneous Knowledge for Commonsense Reasoning
Ruiting Dai
Yuqiao Tan
Lisi Mo
Shuang Liang
Guohao Huo
Jiayi Luo
Yao Cheng
ReLMRALMLRM
62
1
0
09 May 2024
Binary Hypothesis Testing for Softmax Models and Leverage Score Models
Binary Hypothesis Testing for Softmax Models and Leverage Score Models
Yeqi Gao
Yuzhou Gu
Zhao Song
75
0
0
09 May 2024
LingML: Linguistic-Informed Machine Learning for Enhanced Fake News
  Detection
LingML: Linguistic-Informed Machine Learning for Enhanced Fake News Detection
Jasraj Singh
Fang Liu
Hong Xu
Bee Chin Ng
Wei Zhang
AI4CE
52
0
0
07 May 2024
Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore
Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore
Junchao Wu
Runzhe Zhan
Derek F. Wong
Shu Yang
Xuebo Liu
Lidia S. Chao
Min Zhang
DeLMO
123
5
0
07 May 2024
Vietnamese AI Generated Text Detection
Vietnamese AI Generated Text Detection
Quang-Dan Tran
Van-Quan Nguyen
Quang-Huy Pham
K. B. T. Nguyen
Trong-Hop Do
DeLMO
42
1
0
06 May 2024
Large Language Models estimate fine-grained human color-concept
  associations
Large Language Models estimate fine-grained human color-concept associations
Kushin Mukherjee
Timothy T. Rogers
Karen B. Schloss
VLM
106
4
0
04 May 2024
What does the Knowledge Neuron Thesis Have to do with Knowledge?
What does the Knowledge Neuron Thesis Have to do with Knowledge?
Jingcheng Niu
Andrew Liu
Zining Zhu
Gerald Penn
115
38
0
03 May 2024
SoftMCL: Soft Momentum Contrastive Learning for Fine-grained
  Sentiment-aware Pre-training
SoftMCL: Soft Momentum Contrastive Learning for Fine-grained Sentiment-aware Pre-training
Jin Wang
Liang-Chih Yu
Xuejie Zhang
VLM
26
6
0
03 May 2024
Exploiting ChatGPT for Diagnosing Autism-Associated Language Disorders
  and Identifying Distinct Features
Exploiting ChatGPT for Diagnosing Autism-Associated Language Disorders and Identifying Distinct Features
Chuanbo Hu
Wenqi Li
Mindi Ruan
Xiangxu Yu
Lynn K. Paul
Shuo Wang
Xin Li
34
3
0
03 May 2024
Large Language Models for UAVs: Current State and Pathways to the Future
Large Language Models for UAVs: Current State and Pathways to the Future
Shumaila Javaid
Nasir Saeed
Bin He
96
24
0
02 May 2024
Enhancing Language Models for Financial Relation Extraction with Named
  Entities and Part-of-Speech
Enhancing Language Models for Financial Relation Extraction with Named Entities and Part-of-Speech
Menglin Li
Kwan Hui Lim
61
1
0
02 May 2024
Investigating Automatic Scoring and Feedback using Large Language Models
Investigating Automatic Scoring and Feedback using Large Language Models
G. Katuka
Alexander Gain
Yen-Yun Yu
AI4EdALM
66
3
0
01 May 2024
Guiding Attention in End-to-End Driving Models
Guiding Attention in End-to-End Driving Models
Diego Porres
Yi Xiao
Gabriel Villalonga
Alexandre Levy
Antonio M. López
66
0
0
30 Apr 2024
Better & Faster Large Language Models via Multi-token Prediction
Better & Faster Large Language Models via Multi-token Prediction
Fabian Gloeckle
Badr Youbi Idrissi
Baptiste Rozière
David Lopez-Paz
Gabriele Synnaeve
114
121
0
30 Apr 2024
Improving Disease Detection from Social Media Text via Self-Augmentation
  and Contrastive Learning
Improving Disease Detection from Social Media Text via Self-Augmentation and Contrastive Learning
Pervaiz Iqbal Khan
Andreas Dengel
Sheraz Ahmed
59
1
0
30 Apr 2024
QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question
  Answering
QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering
Ouyang Sheng
Jianzong Wang
Yong Zhang
Zhitao Li
Ziqi Liang
Xulong Zhang
Ning Cheng
Jing Xiao
41
0
0
30 Apr 2024
Transfer Learning Enhanced Single-choice Decision for Multi-choice
  Question Answering
Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering
Chenhao Cui
Yufan Jiang
Shuangzhi Wu
Zhoujun Li
FaML
60
0
0
27 Apr 2024
Temporal Scaling Law for Large Language Models
Temporal Scaling Law for Large Language Models
Yizhe Xiong
Xiansheng Chen
Xin Ye
Hui Chen
Zijia Lin
...
Zhenpeng Su
Wei Huang
Jianwei Niu
Jiawei Han
Guiguang Ding
120
10
0
27 Apr 2024
MER 2024: Semi-Supervised Learning, Noise Robustness, and
  Open-Vocabulary Multimodal Emotion Recognition
MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition
Zheng Lian
Haiyang Sun
Guoying Zhao
Zhuofan Wen
Siyuan Zhang
...
Bin Liu
Min Zhang
Guoying Zhao
Björn W. Schuller
Jianhua Tao
VLM
118
11
0
26 Apr 2024
Beyond ESM2: Graph-Enhanced Protein Sequence Modeling with Efficient
  Clustering
Beyond ESM2: Graph-Enhanced Protein Sequence Modeling with Efficient Clustering
Shu-Lin Jiao
Bingxuan Li
Lei Wang
Xiaojin Zhang
Wei Chen
J. Peng
Zhongyu Wei
56
2
0
24 Apr 2024
KS-LLM: Knowledge Selection of Large Language Models with Evidence
  Document for Question Answering
KS-LLM: Knowledge Selection of Large Language Models with Evidence Document for Question Answering
Xinxin Zheng
Feihu Che
Jinyang Wu
Shuai Zhang
Shuai Nie
Kang Liu
Jianhua Tao
RALMHILM
63
4
0
24 Apr 2024
PEACH: Pretrained-embedding Explanation Across Contextual and
  Hierarchical Structure
PEACH: Pretrained-embedding Explanation Across Contextual and Hierarchical Structure
Feiqi Cao
S. Han
Hyunsuk Chung
58
0
0
21 Apr 2024
Towards Universal Performance Modeling for Machine Learning Training on
  Multi-GPU Platforms
Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms
Zhongyi Lin
Ning Sun
Pallab Bhattacharya
Xizhou Feng
Louis Feng
John Douglas Owens
117
2
0
19 Apr 2024
Exploring the landscape of large language models: Foundations,
  techniques, and challenges
Exploring the landscape of large language models: Foundations, techniques, and challenges
M. Moradi
Ke Yan
David Colwell
Matthias Samwald
Rhona Asgari
OffRL
67
2
0
18 Apr 2024
Spatial Context-based Self-Supervised Learning for Handwritten Text Recognition
Spatial Context-based Self-Supervised Learning for Handwritten Text Recognition
Carlos Peñarrubia
Carlos Garrido-Munoz
J. J. Valero-Mas
Jorge Calvo-Zaragoza
207
2
0
17 Apr 2024
Previous
123...678...697071
Next