ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,518 papers shown
Title
The Effects of In-domain Corpus Size on pre-training BERT
The Effects of In-domain Corpus Size on pre-training BERT
Chris Sanchez
Zheyu Zhang
AI4CE
25
4
0
15 Dec 2022
Curriculum Learning Meets Weakly Supervised Modality Correlation
  Learning
Curriculum Learning Meets Weakly Supervised Modality Correlation Learning
Sijie Mai
Ya Sun
Haifeng Hu
101
3
0
15 Dec 2022
Efficient Pre-training of Masked Language Model via Concept-based
  Curriculum Masking
Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking
Mingyu Lee
Jun-Hyung Park
Junho Kim
Kang-Min Kim
SangKeun Lee
69
12
0
15 Dec 2022
Towards mapping the contemporary art world with ArtLM: an art-specific
  NLP model
Towards mapping the contemporary art world with ArtLM: an art-specific NLP model
Qinkai Chen
Mohamed El-Mennaoui
Antoine Fosset
Amine Rebei
Haoyang Cao
Philine Bouscasse
Christy Eóin O'Beirne
Sasha Shevchenko
Mathieu Rosenbaum
KELM
88
1
0
14 Dec 2022
Paraphrase Identification with Deep Learning: A Review of Datasets and
  Methods
Paraphrase Identification with Deep Learning: A Review of Datasets and Methods
Chao Zhou
Cheng Qiu
Daniel Ernesto Acuna
127
26
0
13 Dec 2022
RPN: A Word Vector Level Data Augmentation Algorithm in Deep Learning
  for Language Understanding
RPN: A Word Vector Level Data Augmentation Algorithm in Deep Learning for Language Understanding
Zheng Yuan
Xiaolong Zhang
Yue Wang
Xuecong Hou
Huiwen Xue
Zhuanzhe Zhao
Yongming Liu
118
1
0
12 Dec 2022
Automated ICD Coding using Extreme Multi-label Long Text
  Transformer-based Models
Automated ICD Coding using Extreme Multi-label Long Text Transformer-based Models
Leibo Liu
O. Perez-Concha
Anthony N. Nguyen
Vicki Bennett
Louisa R Jorm
80
19
0
12 Dec 2022
P-Transformer: Towards Better Document-to-Document Neural Machine
  Translation
P-Transformer: Towards Better Document-to-Document Neural Machine Translation
Yachao Li
Junhui Li
Jing Jiang
Shimin Tao
Hao Yang
Hao Fei
ViT
64
10
0
12 Dec 2022
Ensembling Transformers for Cross-domain Automatic Term Extraction
Ensembling Transformers for Cross-domain Automatic Term Extraction
T. Hanh
Matej Martinc
Andraz Pelicon
Antoine Doucet
Senja Pollak
42
6
0
12 Dec 2022
Position Embedding Needs an Independent Layer Normalization
Position Embedding Needs an Independent Layer Normalization
Runyi Yu
Zhennan Wang
Yinhuai Wang
Kehan Li
Yian Zhao
Jian Zhang
Guoli Song
Jie Chen
103
1
0
10 Dec 2022
All-to-key Attention for Arbitrary Style Transfer
All-to-key Attention for Arbitrary Style Transfer
Mingrui Zhu
Xiao He
N. Wang
Xiaoyu Wang
Xinbo Gao
82
23
0
08 Dec 2022
KATSum: Knowledge-aware Abstractive Text Summarization
KATSum: Knowledge-aware Abstractive Text Summarization
Guan-Hua Wang
Weihua Li
E. Lai
Jianhua Jiang
50
2
0
06 Dec 2022
QBERT: Generalist Model for Processing Questions
QBERT: Generalist Model for Processing Questions
Zhaozhen Xu
N. Cristianini
32
1
0
05 Dec 2022
MiLMo:Minority Multilingual Pre-trained Language Model
MiLMo:Minority Multilingual Pre-trained Language Model
Sisi Liu
Hanru Shi
Xinhe Yu
Wugedele Bao
Yuan Sun
Xiaobing Zhao
81
0
0
04 Dec 2022
Exploring Stochastic Autoregressive Image Modeling for Visual
  Representation
Exploring Stochastic Autoregressive Image Modeling for Visual Representation
Yu-Hang Qi
Fan Yang
Yousong Zhu
Yufei Liu
Liwei Wu
Rui Zhao
Wei Li
DiffM
57
13
0
03 Dec 2022
Event knowledge in large language models: the gap between the impossible
  and the unlikely
Event knowledge in large language models: the gap between the impossible and the unlikely
Carina Kauf
Anna A. Ivanova
Giulia Rambelli
Emmanuele Chersoni
Jingyuan Selena She
Zawad Chowdhury
Evelina Fedorenko
Alessandro Lenci
122
70
0
02 Dec 2022
SOLD: Sinhala Offensive Language Dataset
SOLD: Sinhala Offensive Language Dataset
Tharindu Ranasinghe
Isuri Anuradha
Damith Premasiri
Kanishka Silva
Hansi Hettiarachchi
Lasitha Uyangodage
Marcos Zampieri
106
8
0
01 Dec 2022
UniT3D: A Unified Transformer for 3D Dense Captioning and Visual
  Grounding
UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding
Dave Zhenyu Chen
Ronghang Hu
Xinlei Chen
Matthias Nießner
Angel X. Chang
120
54
0
01 Dec 2022
Language Model Pre-training on True Negatives
Language Model Pre-training on True Negatives
Zhuosheng Zhang
Hai Zhao
Masao Utiyama
Eiichiro Sumita
73
2
0
01 Dec 2022
Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images
Meng Wang
Kai-An Yu
Chun-Mei Feng
K. Zou
Yanyu Xu
Qingquan Meng
Rick Siow Mong Goh
Yong Liu
Huazhu Fu
MedIm
93
3
0
01 Dec 2022
A Commonsense-Infused Language-Agnostic Learning Framework for Enhancing
  Prediction of Political Polarity in Multilingual News Headlines
A Commonsense-Infused Language-Agnostic Learning Framework for Enhancing Prediction of Political Polarity in Multilingual News Headlines
Swati Swati
Adrian Mladenic Grobelnik
Dunja Mladenić
M. Grobelnik
80
3
0
01 Dec 2022
BudgetLongformer: Can we Cheaply Pretrain a SotA Legal Language Model
  From Scratch?
BudgetLongformer: Can we Cheaply Pretrain a SotA Legal Language Model From Scratch?
Joel Niklaus
Daniele Giofré
75
12
0
30 Nov 2022
Using Text Classification with a Bayesian Correction for Estimating
  Overreporting in the Creditor Reporting System on Climate Adaptation Finance
Using Text Classification with a Bayesian Correction for Estimating Overreporting in the Creditor Reporting System on Climate Adaptation Finance
Janos Borst
Thomas Wencker
A. Niekler
50
0
0
30 Nov 2022
Protein Language Models and Structure Prediction: Connection and
  Progression
Protein Language Models and Structure Prediction: Connection and Progression
Bozhen Hu
Jun Xia
Jiangbin Zheng
Cheng Tan
Yufei Huang
Yongjie Xu
Stan Z. Li
72
41
0
30 Nov 2022
Improving Commonsense in Vision-Language Models via Knowledge Graph
  Riddles
Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles
Shuquan Ye
Yujia Xie
Dongdong Chen
Yichong Xu
Lu Yuan
Chenguang Zhu
Jing Liao
VLM
66
12
0
29 Nov 2022
Model Extraction Attack against Self-supervised Speech Models
Model Extraction Attack against Self-supervised Speech Models
Tsung-Yuan Hsu
Chen-An Li
Tung-Yu Wu
Hung-yi Lee
48
1
0
29 Nov 2022
ClueWeb22: 10 Billion Web Documents with Visual and Semantic Information
ClueWeb22: 10 Billion Web Documents with Visual and Semantic Information
Arnold Overwijk
Chenyan Xiong
X. Liu
Cameron VandenBerg
Jamie Callan
3DV
39
16
0
29 Nov 2022
Survey on Self-Supervised Multimodal Representation Learning and
  Foundation Models
Survey on Self-Supervised Multimodal Representation Learning and Foundation Models
Sushil Thapa
AI4TSSSL
48
1
0
29 Nov 2022
Predicting Digital Asset Prices using Natural Language Processing: a
  survey
Predicting Digital Asset Prices using Natural Language Processing: a survey
Trang Tran
62
1
0
28 Nov 2022
Arguments to Key Points Mapping with Prompt-based Learning
Arguments to Key Points Mapping with Prompt-based Learning
Ahnaf Mozib Samin
Behrooz Nikandish
Jingyan Chen
AAML
48
2
0
28 Nov 2022
Understanding BLOOM: An empirical study on diverse NLP tasks
Understanding BLOOM: An empirical study on diverse NLP tasks
Parag Dakle
Sai Krishna Rallabandi
Preethi Raghavan
AI4CE
89
4
0
27 Nov 2022
ESIE-BERT: Enriching Sub-words Information Explicitly with BERT for
  Joint Intent Classification and SlotFilling
ESIE-BERT: Enriching Sub-words Information Explicitly with BERT for Joint Intent Classification and SlotFilling
Yutian Guo
Zhilong Xie
Xingyan Chen
Huangen Chen
Leilei Wang
Huaming Du
Shaopeng Wei
Yu Zhao
Qing Li
Ganglu Wu
109
10
0
27 Nov 2022
Deep representation learning: Fundamentals, Perspectives, Applications,
  and Open Challenges
Deep representation learning: Fundamentals, Perspectives, Applications, and Open Challenges
K. T. Baghaei
Amirreza Payandeh
Pooya Fayyazsanavi
Shahram Rahimi
Zhiqian Chen
Somayeh Bakhtiari Ramezani
FaMLAI4TS
69
6
0
27 Nov 2022
A Survey of Text Representation Methods and Their Genealogy
A Survey of Text Representation Methods and Their Genealogy
Philipp Siebers
Christian Janiesch
Patrick Zschech
AI4TS
33
9
0
26 Nov 2022
Asymmetric Cross-Scale Alignment for Text-Based Person Search
Asymmetric Cross-Scale Alignment for Text-Based Person Search
Zhong Ji
Junhua Hu
Deyin Liu
Yuan Wu
Ye Zhao
106
46
0
26 Nov 2022
Using Selective Masking as a Bridge between Pre-training and Fine-tuning
Using Selective Masking as a Bridge between Pre-training and Fine-tuning
Tanish Lad
Himanshu Maheshwari
Shreyas Kottukkal
R. Mamidi
81
3
0
24 Nov 2022
Question Answering and Question Generation for Finnish
Question Answering and Question Generation for Finnish
Ilmari Kylliäinen
R. Yangarber
31
5
0
24 Nov 2022
Indian Commercial Truck License Plate Detection and Recognition for
  Weighbridge Automation
Indian Commercial Truck License Plate Detection and Recognition for Weighbridge Automation
Siddharth Agrawal
Keyur D. Joshi
73
4
0
23 Nov 2022
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP
  benchmark for Polish
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
Lukasz Augustyniak
Kamil Tagowski
Albert Sawczyn
Denis Janiak
Roman Bartusiak
...
Arkadiusz Janz
Piotr Szymañski
M. Morzy
Tomasz Kajdanowicz
Maciej Piasecki
62
12
0
23 Nov 2022
Unified Multimodal Model with Unlikelihood Training for Visual Dialog
Unified Multimodal Model with Unlikelihood Training for Visual Dialog
Zihao Wang
Junli Wang
Changjun Jiang
MLLM
58
10
0
23 Nov 2022
Predicting the Type and Target of Offensive Social Media Posts in
  Marathi
Predicting the Type and Target of Offensive Social Media Posts in Marathi
Marcos Zampieri
Tharindu Ranasinghe
Mrinal Chaudhari
Saurabh Gaikwad
P. Krishna
Mayuresh Nene
Shrunali Paygude
74
24
0
22 Nov 2022
A Scope Sensitive and Result Attentive Model for Multi-Intent Spoken
  Language Understanding
A Scope Sensitive and Result Attentive Model for Multi-Intent Spoken Language Understanding
Lizhi Cheng
Wenmian Yang
Weijia Jia
59
10
0
22 Nov 2022
A Survey on Backdoor Attack and Defense in Natural Language Processing
A Survey on Backdoor Attack and Defense in Natural Language Processing
Xuan Sheng
Zhaoyang Han
Piji Li
Xiangmao Chang
SILM
71
21
0
22 Nov 2022
Evaluating the Knowledge Dependency of Questions
Evaluating the Knowledge Dependency of Questions
Hyeongdon Moon
Yoonseok Yang
Jamin Shin
Hangyeol Yu
Seunghyun Lee
Myeongho Jeong
Juneyoung Park
Minsam Kim
Seungtaek Choi
AI4Ed
63
11
0
21 Nov 2022
Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative
  Latent Attention
Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention
Zineng Tang
Jaemin Cho
Jie Lei
Joey Tianyi Zhou
VLM
84
9
0
21 Nov 2022
PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning
PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning
Xiangyang Zhu
Renrui Zhang
Bowei He
Ziyu Guo
Ziyao Zeng
Zipeng Qin
Shanghang Zhang
Peng Gao
VLM
111
149
0
21 Nov 2022
Deanthropomorphising NLP: Can a Language Model Be Conscious?
Deanthropomorphising NLP: Can a Language Model Be Conscious?
Matthew Shardlow
Piotr Przybyła
64
7
0
21 Nov 2022
AF Adapter: Continual Pretraining for Building Chinese Biomedical
  Language Model
AF Adapter: Continual Pretraining for Building Chinese Biomedical Language Model
Yongyu Yan
Kui Xue
Xiaoming Shi
Qi Ye
Jingping Liu
Tong Ruan
CLL
71
2
0
21 Nov 2022
Deep Learning on a Healthy Data Diet: Finding Important Examples for
  Fairness
Deep Learning on a Healthy Data Diet: Finding Important Examples for Fairness
A. Zayed
Prasanna Parthasarathi
Gonçalo Mordido
Hamid Palangi
Samira Shabanian
Sarath Chandar
54
22
0
20 Nov 2022
Artificial Interrogation for Attributing Language Models
Artificial Interrogation for Attributing Language Models
Farhan Dhanani
Muhammad Rafi
34
1
0
20 Nov 2022
Previous
123...212223...697071
Next