ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding
v1v2 (latest)

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXiv (abs)PDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,521 papers shown
Title
Few Shot Learning for Information Verification
Few Shot Learning for Information Verification
Usama Khalid
M. O. Beg
57
0
0
22 Feb 2021
Using Prior Knowledge to Guide BERT's Attention in Semantic Textual
  Matching Tasks
Using Prior Knowledge to Guide BERT's Attention in Semantic Textual Matching Tasks
Tingyu Xia
Yue Wang
Yuan Tian
Yi-Ju Chang
65
51
0
22 Feb 2021
Conditional Positional Encodings for Vision Transformers
Conditional Positional Encodings for Vision Transformers
Xiangxiang Chu
Zhi Tian
Bo Zhang
Xinlong Wang
Chunhua Shen
ViT
181
626
0
22 Feb 2021
ReINTEL Challenge 2020: Exploiting Transfer Learning Models for Reliable
  Intelligence Identification on Vietnamese Social Network Sites
ReINTEL Challenge 2020: Exploiting Transfer Learning Models for Reliable Intelligence Identification on Vietnamese Social Network Sites
Kim Thi-Thanh Nguyen
Kiet Van Nguyen
64
1
0
22 Feb 2021
UniT: Multimodal Multitask Learning with a Unified Transformer
UniT: Multimodal Multitask Learning with a Unified Transformer
Ronghang Hu
Amanpreet Singh
ViT
106
301
0
22 Feb 2021
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for
  Image Captioning
VisualGPT: Data-efficient Adaptation of Pretrained Language Models for Image Captioning
Jun Chen
Han Guo
Kai Yi
Boyang Albert Li
Mohamed Elhoseiny
VLM
166
227
0
20 Feb 2021
Multilingual Answer Sentence Reranking via Automatically Translated Data
Multilingual Answer Sentence Reranking via Automatically Translated Data
Thuy Vu
Alessandro Moschitti
66
5
0
20 Feb 2021
Analyzing Curriculum Learning for Sentiment Analysis along Task
  Difficulty, Pacing and Visualization Axes
Analyzing Curriculum Learning for Sentiment Analysis along Task Difficulty, Pacing and Visualization Axes
Anvesh Rao Vijjini
Kaveri Anuranjana
R. Mamidi
70
3
0
19 Feb 2021
Back Translation Survey for Improving Text Augmentation
Back Translation Survey for Improving Text Augmentation
Matt Ciolino
David Noever
Josh Kalin
60
0
0
19 Feb 2021
MUDES: Multilingual Detection of Offensive Spans
MUDES: Multilingual Detection of Offensive Spans
Tharindu Ranasinghe
Marcos Zampieri
83
41
0
18 Feb 2021
Introducing the Hidden Neural Markov Chain framework
Introducing the Hidden Neural Markov Chain framework
E. Azeraf
E. Monfrini
Emmanuel Vignon
W. Pieczynski
BDL
44
6
0
17 Feb 2021
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize
  Long-Tail Visual Concepts
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Soravit Changpinyo
P. Sharma
Nan Ding
Radu Soricut
VLM
570
1,143
0
17 Feb 2021
Highly Fast Text Segmentation With Pairwise Markov Chains
Highly Fast Text Segmentation With Pairwise Markov Chains
E. Azeraf
E. Monfrini
Emmanuel Vignon
W. Pieczynski
56
5
0
17 Feb 2021
COCO-LM: Correcting and Contrasting Text Sequences for Language Model
  Pretraining
COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
Yu Meng
Chenyan Xiong
Payal Bajaj
Saurabh Tiwary
Paul N. Bennett
Jiawei Han
Xia Song
182
206
0
16 Feb 2021
Boosting Low-Resource Biomedical QA via Entity-Aware Masking Strategies
Boosting Low-Resource Biomedical QA via Entity-Aware Masking Strategies
Gabriele Pergola
E. Kochkina
Lin Gui
Maria Liakata
Yulan He
145
32
0
16 Feb 2021
Exploring Transformers in Natural Language Generation: GPT, BERT, and
  XLNet
Exploring Transformers in Natural Language Generation: GPT, BERT, and XLNet
M. O. Topal
Anil Bas
Imke van Heerden
LLMAGAI4CE
73
91
0
16 Feb 2021
Improving speech recognition models with small samples for air traffic
  control systems
Improving speech recognition models with small samples for air traffic control systems
Yi Lin
Qin Li
Bo Yang
Zhen Yan
Huachun Tan
Zhengmao Chen
104
32
0
16 Feb 2021
TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale
  Language Models
TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models
Zhuohan Li
Siyuan Zhuang
Shiyuan Guo
Danyang Zhuo
Hao Zhang
Basel Alomair
Ion Stoica
MoE
104
125
0
16 Feb 2021
Have Attention Heads in BERT Learned Constituency Grammar?
Have Attention Heads in BERT Learned Constituency Grammar?
Ziyang Luo
58
6
0
16 Feb 2021
Within-Document Event Coreference with BERT-Based Contextualized Representations
Shafiuddin Rehan Ahmed
James H. Martin
20
0
0
15 Feb 2021
Overview of the TREC 2020 deep learning track
Overview of the TREC 2020 deep learning track
Nick Craswell
Bhaskar Mitra
Emine Yilmaz
Daniel Fernando Campos
141
389
0
15 Feb 2021
Improved Customer Transaction Classification using Semi-Supervised
  Knowledge Distillation
Improved Customer Transaction Classification using Semi-Supervised Knowledge Distillation
Rohan Sukumaran
31
2
0
15 Feb 2021
DOBF: A Deobfuscation Pre-Training Objective for Programming Languages
DOBF: A Deobfuscation Pre-Training Objective for Programming Languages
Baptiste Roziere
Marie-Anne Lachaux
Marc Szafraniec
Guillaume Lample
AI4CE
148
141
0
15 Feb 2021
Improved Bengali Image Captioning via deep convolutional neural network
  based encoder-decoder model
Improved Bengali Image Captioning via deep convolutional neural network based encoder-decoder model
Mohammad Faiyaz Khan
S. M. S. Shifath
Md. Saiful Islam
VLM
65
21
0
14 Feb 2021
CATE: Computation-aware Neural Architecture Encoding with Transformers
CATE: Computation-aware Neural Architecture Encoding with Transformers
Shen Yan
Kaiqiang Song
Z. Feng
Mi Zhang
81
28
0
14 Feb 2021
InsNet: An Efficient, Flexible, and Performant Insertion-based Text
  Generation Model
InsNet: An Efficient, Flexible, and Performant Insertion-based Text Generation Model
Sidi Lu
Tao Meng
Nanyun Peng
115
13
0
12 Feb 2021
Emoji-Based Transfer Learning for Sentiment Tasks
Emoji-Based Transfer Learning for Sentiment Tasks
Susann Boy
Dana Ruiter
Dietrich Klakow
39
2
0
12 Feb 2021
Less is More: ClipBERT for Video-and-Language Learning via Sparse
  Sampling
Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling
Jie Lei
Linjie Li
Luowei Zhou
Zhe Gan
Tamara L. Berg
Joey Tianyi Zhou
Jingjing Liu
CLIP
194
666
0
11 Feb 2021
Text Compression-aided Transformer Encoding
Text Compression-aided Transformer Encoding
Z. Li
Zhuosheng Zhang
Hai Zhao
Rui Wang
Kehai Chen
Masao Utiyama
Eiichiro Sumita
AI4CE
71
45
0
11 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy
  Text Supervision
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLMCLIP
562
3,917
0
11 Feb 2021
Multi-turn Dialogue Reading Comprehension with Pivot Turns and Knowledge
Multi-turn Dialogue Reading Comprehension with Pivot Turns and Knowledge
Zhuosheng Zhang
Junlong Li
Hai Zhao
79
24
0
10 Feb 2021
NewsBERT: Distilling Pre-trained Language Model for Intelligent News
  Application
NewsBERT: Distilling Pre-trained Language Model for Intelligent News Application
Chuhan Wu
Fangzhao Wu
Yang Yu
Tao Qi
Yongfeng Huang
Qi Liu
VLM
67
45
0
09 Feb 2021
Bias Out-of-the-Box: An Empirical Analysis of Intersectional
  Occupational Biases in Popular Generative Language Models
Bias Out-of-the-Box: An Empirical Analysis of Intersectional Occupational Biases in Popular Generative Language Models
Hannah Rose Kirk
Yennie Jun
Haider Iqbal
Elias Benussi
Filippo Volpin
F. Dreyer
Aleksandar Shtedritski
Yuki M. Asano
68
194
0
08 Feb 2021
Spoiler Alert: Using Natural Language Processing to Detect Spoilers in
  Book Reviews
Spoiler Alert: Using Natural Language Processing to Detect Spoilers in Book Reviews
Allen Bao
Marshall Ho
Saarthak Sangamnerkar
37
2
0
07 Feb 2021
CSS-LM: A Contrastive Framework for Semi-supervised Fine-tuning of
  Pre-trained Language Models
CSS-LM: A Contrastive Framework for Semi-supervised Fine-tuning of Pre-trained Language Models
Yusheng Su
Xu Han
Yankai Lin
Zhengyan Zhang
Zhiyuan Liu
Peng Li
Jie Zhou
Maosong Sun
73
10
0
07 Feb 2021
Unifying Vision-and-Language Tasks via Text Generation
Unifying Vision-and-Language Tasks via Text Generation
Jaemin Cho
Jie Lei
Hao Tan
Joey Tianyi Zhou
MLLM
406
547
0
04 Feb 2021
Generating images from caption and vice versa via CLIP-Guided Generative
  Latent Space Search
Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search
Federico A. Galatolo
M. G. Cimino
G. Vaglini
VLM
178
87
0
02 Feb 2021
"Is depression related to cannabis?": A knowledge-infused model for
  Entity and Relation Extraction with Limited Supervision
"Is depression related to cannabis?": A knowledge-infused model for Entity and Relation Extraction with Limited Supervision
Kaushik Roy
Usha Lokala
Vedant Khandelwal
A. Sheth
AI4MH
49
19
0
01 Feb 2021
Civil Rephrases Of Toxic Texts With Self-Supervised Transformers
Civil Rephrases Of Toxic Texts With Self-Supervised Transformers
Leo Laugier
John Pavlopoulos
Jeffrey Scott Sorensen
Lucas Dixon
94
48
0
01 Feb 2021
Decoupling the Role of Data, Attention, and Losses in Multimodal
  Transformers
Decoupling the Role of Data, Attention, and Losses in Multimodal Transformers
Lisa Anne Hendricks
John F. J. Mellor
R. Schneider
Jean-Baptiste Alayrac
Aida Nematzadeh
150
117
0
31 Jan 2021
Classification Models for Partially Ordered Sequences
Classification Models for Partially Ordered Sequences
Stephanie Ger
Diego Klabjan
J. Utke
26
0
0
31 Jan 2021
Combining pre-trained language models and structured knowledge
Combining pre-trained language models and structured knowledge
Pedro Colon-Hernandez
Catherine Havasi
Jason B. Alonso
Matthew Huggins
C. Breazeal
KELM
86
48
0
28 Jan 2021
A transformer based approach for fighting COVID-19 fake news
A transformer based approach for fighting COVID-19 fake news
S. M. S. Shifath
Mohammad Faiyaz Khan
Md. Saiful Islam
MedIm
66
23
0
28 Jan 2021
Tokens-to-Token ViT: Training Vision Transformers from Scratch on
  ImageNet
Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
Li-xin Yuan
Yunpeng Chen
Tao Wang
Weihao Yu
Yujun Shi
Zihang Jiang
Francis E. H. Tay
Jiashi Feng
Shuicheng Yan
ViT
216
1,957
0
28 Jan 2021
Semi-automatic Generation of Multilingual Datasets for Stance Detection
  in Twitter
Semi-automatic Generation of Multilingual Datasets for Stance Detection in Twitter
Elena Zotova
Rodrigo Agerri
German Rigau
60
22
0
28 Jan 2021
Identifying COVID-19 Fake News in Social Media
Identifying COVID-19 Fake News in Social Media
Tathagata Raha
Vijayasaradhi Indurthi
Aayush Upadhyaya
Jeevesh Kataria
Pramud Bommakanti
Vikram Keswani
Vasudeva Varma
GNNMedIm
59
12
0
28 Jan 2021
LESA: Linguistic Encapsulation and Semantic Amalgamation Based
  Generalised Claim Detection from Online Content
LESA: Linguistic Encapsulation and Semantic Amalgamation Based Generalised Claim Detection from Online Content
Shreya Gupta
Parantak Singh
Megha Sundriyal
Md. Shad Akhtar
Tanmoy Chakraborty
149
27
0
28 Jan 2021
Explaining Natural Language Processing Classifiers with Occlusion and
  Language Modeling
Explaining Natural Language Processing Classifiers with Occlusion and Language Modeling
David Harbecke
AAML
55
2
0
28 Jan 2021
Scheduled Sampling in Vision-Language Pretraining with Decoupled
  Encoder-Decoder Network
Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network
Yehao Li
Yingwei Pan
Ting Yao
Jingwen Chen
Tao Mei
VLM
95
53
0
27 Jan 2021
KoreALBERT: Pretraining a Lite BERT Model for Korean Language
  Understanding
KoreALBERT: Pretraining a Lite BERT Model for Korean Language Understanding
HyunJae Lee
Jaewoong Yoon
Bonggyu Hwang
Seongho Joe
Seungjai Min
Youngjune Gwon
SSeg
58
16
0
27 Jan 2021
Previous
123...484950...697071
Next