ResearchTrend.AI

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,520 papers shown
Title
SelfKG: Self-Supervised Entity Alignment in Knowledge Graphs
Xiao Liu
Haoyun Hong
Xinghao Wang
Zeyi Chen
Evgeny Kharlamov
Yuxiao Dong
Jie Tang
SSL
94
73
0
02 Mar 2022
PKGM: A Pre-trained Knowledge Graph Model for E-commerce Application
Wen Zhang
Chi-Man Wong
Ganqiang Ye
Bo Wen
Hongting Zhou
Wei Zhang
Hua-zeng Chen
47
0
0
02 Mar 2022
Two-Level Supervised Contrastive Learning for Response Selection in Multi-Turn Dialogue
Wentao Zhang
Shuangyin Xu
Haoran Huang
73
2
0
01 Mar 2022
E-LANG: Energy-Based Joint Inferencing of Super and Swift Language Models
Mohammad Akbari
Amin Banitalebi-Dehkordi
Yong Zhang
69
8
0
01 Mar 2022
Improving Performance of Automated Essay Scoring by using back-translation essays and adjusted scores
You-Jin Jong
Yong-Jin Kim
Ok-Chol Ri
50
7
0
01 Mar 2022
Semantic Sentence Composition Reasoning for Multi-Hop Question Answering
Qianglong Chen
LRM
62
2
0
01 Mar 2022
TraceNet: Tracing and Locating the Key Elements in Sentiment Analysis
Qinghua Zhao
Shuai Ma
46
0
0
28 Feb 2022
COMPASS: a Creative Support System that Alerts Novelists to the Unnoticed Missing Contents
Yusuke Mori
Hiroaki Yamane
Ryohei Shimizu
Yusuke Mukuta
Tatsuya Harada
55
6
0
26 Feb 2022
Automated Identification of Toxic Code Reviews Using ToxiCR
Jaydeb Sarker
Asif Kamal Turzo
Mingyou Dong
Amiangshu Bosu
73
35
0
26 Feb 2022
Deep Understanding based Multi-Document Machine Reading Comprehension
Feiliang Ren
Yongkang Liu
Bochao Li
Zhibo Wang
Yu Guo
Shilei Liu
Huimin Wu
Jiaqi Wang
Chunchao Liu
Bingchao Wang
32
2
0
25 Feb 2022
NoisyTune: A Little Noise Can Help You Finetune Pretrained Language Models Better
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
Xing Xie
77
60
0
24 Feb 2022
Short-answer scoring with ensembles of pretrained language models
Christopher M. Ormerod
67
8
0
23 Feb 2022
Zero-shot Cross-lingual Transfer of Prompt-based Tuning with a Unified Multilingual Prompt
Lianzhe Huang
Shuming Ma
Dongdong Zhang
Furu Wei
Houfeng Wang
VLM LRM
96
32
0
23 Feb 2022
Utilizing Out-Domain Datasets to Enhance Multi-Task Citation Analysis
Dominique Mercier
Syed Tahseen Raza Rizvi
Vikas Rajashekar
Sheraz Ahmed
Andreas Dengel
55
1
0
22 Feb 2022
Hierarchical Interpretation of Neural Text Classification
Hanqi Yan
Lin Gui
Yulan He
107
14
0
20 Feb 2022
Evaluating the Construct Validity of Text Embeddings with Application to Survey Questions
Qixiang Fang
D. Nguyen
Daniel L. Oberski
96
12
0
18 Feb 2022
VLP: A Survey on Vision-Language Pre-training
Feilong Chen
Duzhen Zhang
Minglun Han
Xiuyi Chen
Jing Shi
Shuang Xu
Bo Xu
VLM
183
227
0
18 Feb 2022
'Beach' to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube
Krithika Ramesh
Ashiqur R. KhudaBukhsh
Sumeet Kumar
55
5
0
17 Feb 2022
SAITS: Self-Attention-based Imputation for Time Series
Wenjie Du
David Cote
Yang Liu
AI4TS
116
270
0
17 Feb 2022
The NLP Task Effectiveness of Long-Range Transformers
Guanghui Qin
Yukun Feng
Benjamin Van Durme
67
30
0
16 Feb 2022
Tomayto, Tomahto. Beyond Token-level Answer Equivalence for Question Answering Evaluation
Jannis Bulian
Christian Buck
Wojciech Gajewski
Benjamin Boerschinger
Tal Schuster
105
47
0
15 Feb 2022
Threats to Pre-trained Language Models: Survey and Taxonomy
Shangwei Guo
Chunlong Xie
Jiwei Li
Lingjuan Lyu
Tianwei Zhang
PILM
57
32
0
14 Feb 2022
A Differential Entropy Estimator for Training Neural Networks
Georg Pichler
Pierre Colombo
Malik Boudiaf
Günther Koliander
Pablo Piantanida
170
23
0
14 Feb 2022
PQuAD: A Persian Question Answering Dataset
Kasra Darvishi
Newsha Shahbodagh
Zahra Abbasiantaeb
S. Momtazi
36
19
0
13 Feb 2022
Double-Barreled Question Detection at Momentive
Peng Jiang
K. S. Muppalla
Qingyue Wei
C. N. Gopal
Chun Wang
18
0
0
12 Feb 2022
FedQAS: Privacy-aware machine reading comprehension with federated learning
Addi Ait-Mlouk
Sadi Alawadi
Salman Toor
Andreas Hellander
62
11
0
09 Feb 2022
Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations
Yu Meng
Yunyi Zhang
Jiaxin Huang
Yu Zhang
Jiawei Han
105
58
0
09 Feb 2022
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
Alexei Baevski
Wei-Ning Hsu
Qiantong Xu
Arun Babu
Jiatao Gu
Michael Auli
SSL VLM ViT
170
863
0
07 Feb 2022
Universal Spam Detection using Transfer Learning of BERT Model
Vijay Srinivas Tida
Sonya Hsu
91
50
0
07 Feb 2022
Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval
Jinpeng Wang
Bin Chen
Dongliang Liao
Ziyun Zeng
Gongfu Li
Shutao Xia
Jin Xu
74
8
0
07 Feb 2022
Conversational Agents: Theory and Applications
M. Wahde
M. Virgolin
LLMAG
68
26
0
07 Feb 2022
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Peng Wang
An Yang
Rui Men
Junyang Lin
Shuai Bai
Zhikang Li
Jianxin Ma
Chang Zhou
Jingren Zhou
Hongxia Yang
MLLM ObjD
258
884
0
07 Feb 2022
Multilingual Hate Speech and Offensive Content Detection using Modified Cross-entropy Loss
Arka Mitra
Priyanshu Sankhala
36
6
0
05 Feb 2022
A Benchmark Corpus for the Detection of Automatically Generated Text in Academic Publications
Vijini Liyanage
Davide Buscaldi
A. Nazarenko
DeLMO
60
26
0
04 Feb 2022
Pre-Trained Language Models for Interactive Decision-Making
Shuang Li
Xavier Puig
Chris Paxton
Yilun Du
Clinton Jia Wang
...
Anima Anandkumar
Jacob Andreas
Igor Mordatch
Antonio Torralba
Yuke Zhu
LM&Ro
135
264
0
03 Feb 2022
Relative Position Prediction as Pre-training for Text Encoders
Rickard Brüel-Gabrielsson
Chris Scarvelis
75
1
0
02 Feb 2022
GatorTron: A Large Clinical Language Model to Unlock Patient Information from Unstructured Electronic Health Records
Xi Yang
Aokun Chen
Nima M. Pournejatian
Hoo-Chang Shin
Kaleb E. Smith
...
Duane A. Mitchell
W. Hogan
E. Shenkman
Jiang Bian
Yonghui Wu
AI4MH LM&MA
102
553
0
02 Feb 2022
Context-Aware Discrimination Detection in Job Vacancies using Computational Language Models
S. Vethman
A. Adhikari
M. D. Boer
J. V. Genabeek
C. Veenman
46
2
0
02 Feb 2022
Regression Transformer: Concurrent sequence regression and generation for molecular language modeling
Jannis Born
Matteo Manica
114
97
0
01 Feb 2022
WebFormer: The Web-page Transformer for Structure Information Extraction
Qifan Wang
Yi Fang
Anirudh Ravula
Fuli Feng
Xiaojun Quan
Dongfang Liu
ViT
202
68
0
01 Feb 2022
Stock2Vec: An Embedding to Improve Predictive Models for Companies
Ziruo Yi
Tingsong Xiao
Kaz-Onyeakazi Ijeoma
Ratnam Cheran
Yuvraj Baweja
Phillip Nelson
AIFin
54
3
0
27 Jan 2022
Team Yao at Factify 2022: Utilizing Pre-trained Models and Co-attention Networks for Multi-Modal Fact Verification
Wei-Yao Wang
Chao-Han Huck Yang
44
10
0
26 Jan 2022
Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection
Suchin Gururangan
Dallas Card
Sarah K. Drier
E. K. Gade
Leroy Z. Wang
Zeyu Wang
Luke Zettlemoyer
Noah A. Smith
270
81
0
25 Jan 2022
OntoProtein: Protein Pretraining With Gene Ontology Embedding
Ningyu Zhang
Zhen Bi
Xiaozhuan Liang
Shuyang Cheng
Haosen Hong
Shumin Deng
J. Lian
Qiang Zhang
Huajun Chen
194
99
0
23 Jan 2022
Dual Contrastive Learning: Text Classification via Label-Aware Data Augmentation
Qianben Chen
Richong Zhang
Yaowei Zheng
Yongyi Mao
SSL
81
70
0
21 Jan 2022
Text Style Transfer for Bias Mitigation using Masked Language Modeling
E. Tokpo
T. Calders
64
36
0
21 Jan 2022
Identifying Adversarial Attacks on Text Classifiers
Zhouhang Xie
Jonathan Brophy
Adam Noack
Wencong You
Kalyani Asthana
Carter Perkins
Sabrina Reis
Sameer Singh
Daniel Lowd
AAML
84
10
0
21 Jan 2022
AutoDistill: an End-to-End Framework to Explore and Distill Hardware-Efficient Language Models
Xiaofan Zhang
Zongwei Zhou
Deming Chen
Yu Emma Wang
81
11
0
21 Jan 2022
LaMDA: Language Models for Dialog Applications
R. Thoppilan
Daniel De Freitas
Jamie Hall
Noam M. Shazeer
Apoorv Kulshreshtha
...
Blaise Aguera-Arcas
Claire Cui
M. Croak
Ed H. Chi
Quoc Le
ALM
154
1,606
0
20 Jan 2022
Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation
Zhuoyuan Mao
Chenhui Chu
Sadao Kurohashi
44
7
0
20 Jan 2022