ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.08237
  4. Cited By
XLNet: Generalized Autoregressive Pretraining for Language Understanding

XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
ArXivPDFHTML

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 1,511 papers shown
Title
Detection of Propaganda Techniques in Visuo-Lingual Metaphor in Memes
Detection of Propaganda Techniques in Visuo-Lingual Metaphor in Memes
Sunil Gundapu
R. Mamidi
20
2
0
03 May 2022
Logiformer: A Two-Branch Graph Transformer Network for Interpretable
  Logical Reasoning
Logiformer: A Two-Branch Graph Transformer Network for Interpretable Logical Reasoning
Fangzhi Xu
Jun Liu
Qika Lin
Yudai Pan
Lingling Zhang
34
24
0
02 May 2022
EasyNLP: A Comprehensive and Easy-to-use Toolkit for Natural Language
  Processing
EasyNLP: A Comprehensive and Easy-to-use Toolkit for Natural Language Processing
Chengyu Wang
Minghui Qiu
Chen Shi
Taolin Zhang
Tingting Liu
Lei Li
Rongxiang Weng
Ming Wang
Jun Huang
W. Lin
30
21
0
30 Apr 2022
QRelScore: Better Evaluating Generated Questions with Deeper
  Understanding of Context-aware Relevance
QRelScore: Better Evaluating Generated Questions with Deeper Understanding of Context-aware Relevance
Xiaoqiang Wang
Bang Liu
Siliang Tang
Lingfei Wu
35
9
0
29 Apr 2022
Where in the World is this Image? Transformer-based Geo-localization in
  the Wild
Where in the World is this Image? Transformer-based Geo-localization in the Wild
Shraman Pramanick
E. Nowara
Joshua Gleason
Carlos D. Castillo
Rama Chellappa
ViT
21
30
0
29 Apr 2022
Detecting Textual Adversarial Examples Based on Distributional
  Characteristics of Data Representations
Detecting Textual Adversarial Examples Based on Distributional Characteristics of Data Representations
Na Liu
Mark Dras
Wei Emma Zhang
AAML
24
6
0
29 Apr 2022
Tragedy Plus Time: Capturing Unintended Human Activities from
  Weakly-labeled Videos
Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos
Arnav Chakravarthy
Zhiyuan Fang
Yezhou Yang
40
2
0
28 Apr 2022
Systematic Literature Review: Anti-Phishing Defences and Their
  Application to Before-the-click Phishing Email Detection
Systematic Literature Review: Anti-Phishing Defences and Their Application to Before-the-click Phishing Email Detection
T. Wood
Vitor Basto-Fernandes
E. Boiten
I. Yevseyeva
AAML
24
2
0
27 Apr 2022
Process Knowledge-infused Learning for Suicidality Assessment on Social
  Media
Process Knowledge-infused Learning for Suicidality Assessment on Social Media
Kaushik Roy
Manas Gaur
Qi Zhang
Amit P. Sheth
25
16
0
26 Apr 2022
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
Yufei Xu
Jing Zhang
Qiming Zhang
Dacheng Tao
ViT
34
517
0
26 Apr 2022
A Review on Text-Based Emotion Detection -- Techniques, Applications,
  Datasets, and Future Directions
A Review on Text-Based Emotion Detection -- Techniques, Applications, Datasets, and Future Directions
Sheetal Kusal
S. Patil
J. Choudrie
K. Kotecha
D. Vora
I. Pappas
19
24
0
26 Apr 2022
EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance
  Text Classification
EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification
Minyi Zhao
Lu Zhang
Yi Xu
Jiandong Ding
Jihong Guan
Shuigeng Zhou
VLM
49
10
0
24 Apr 2022
Grad-SAM: Explaining Transformers via Gradient Self-Attention Maps
Grad-SAM: Explaining Transformers via Gradient Self-Attention Maps
Oren Barkan
Edan Hauon
Avi Caciularu
Ori Katz
Itzik Malkiel
Omri Armstrong
Noam Koenigstein
39
37
0
23 Apr 2022
Efficient Pipeline Planning for Expedited Distributed DNN Training
Efficient Pipeline Planning for Expedited Distributed DNN Training
Ziyue Luo
Xiaodong Yi
Guoping Long
Shiqing Fan
Chuan Wu
Jun Yang
Wei Lin
36
16
0
22 Apr 2022
Towards an Enhanced Understanding of Bias in Pre-trained Neural Language
  Models: A Survey with Special Emphasis on Affective Bias
Towards an Enhanced Understanding of Bias in Pre-trained Neural Language Models: A Survey with Special Emphasis on Affective Bias
Anoop Kadan
Manjary P.Gangan
Deepak P
L. LajishV.
AI4CE
48
10
0
21 Apr 2022
TEAM-Atreides at SemEval-2022 Task 11: On leveraging data augmentation
  and ensemble to recognize complex Named Entities in Bangla
TEAM-Atreides at SemEval-2022 Task 11: On leveraging data augmentation and ensemble to recognize complex Named Entities in Bangla
Nazia Tasnim
Md. Istiak Hossain Shihab
Asif Sushmit
Steven Bethard
Farig Sadeque
37
1
0
21 Apr 2022
UMass PCL at SemEval-2022 Task 4: Pre-trained Language Model Ensembles
  for Detecting Patronizing and Condescending Language
UMass PCL at SemEval-2022 Task 4: Pre-trained Language Model Ensembles for Detecting Patronizing and Condescending Language
David Koleczek
Alexander Scarlatos
Siddha Makarand Karkare
Preshma Linet Pereira
29
0
0
18 Apr 2022
Nested Named Entity Recognition as Holistic Structure Parsing
Nested Named Entity Recognition as Holistic Structure Parsing
Yifei Yang
Z. Li
Hai Zhao
30
0
0
17 Apr 2022
mGPT: Few-Shot Learners Go Multilingual
mGPT: Few-Shot Learners Go Multilingual
Oleh Shliazhko
Alena Fenogenova
Maria Tikhonova
Vladislav Mikhailov
Anastasia Kozlova
Tatiana Shavrina
56
149
0
15 Apr 2022
SNP2Vec: Scalable Self-Supervised Pre-Training for Genome-Wide
  Association Study
SNP2Vec: Scalable Self-Supervised Pre-Training for Genome-Wide Association Study
Samuel Cahyawijaya
Tiezheng Yu
Zihan Liu
Tiffany Mak
Xiaopu Zhou
N. Ip
Pascale Fung
23
8
0
14 Apr 2022
METRO: Efficient Denoising Pretraining of Large Scale Autoencoding
  Language Models with Model Generated Signals
METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals
Payal Bajaj
Chenyan Xiong
Guolin Ke
Xiaodong Liu
Di He
Saurabh Tiwary
Tie-Yan Liu
Paul N. Bennett
Xia Song
Jianfeng Gao
52
32
0
13 Apr 2022
Impossible Triangle: What's Next for Pre-trained Language Models?
Impossible Triangle: What's Next for Pre-trained Language Models?
Chenguang Zhu
Michael Zeng
24
1
0
13 Apr 2022
A Call for Clarity in Beam Search: How It Works and When It Stops
A Call for Clarity in Beam Search: How It Works and When It Stops
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Dragomir R. Radev
Yejin Choi
Noah A. Smith
31
7
0
11 Apr 2022
Towards Understanding Large-Scale Discourse Structures in Pre-Trained
  and Fine-Tuned Language Models
Towards Understanding Large-Scale Discourse Structures in Pre-Trained and Fine-Tuned Language Models
Patrick Huber
Giuseppe Carenini
20
11
0
08 Apr 2022
Contextual Representation Learning beyond Masked Language Modeling
Contextual Representation Learning beyond Masked Language Modeling
Zhiyi Fu
Wangchunshu Zhou
Jingjing Xu
Hao Zhou
Lei Li
31
25
0
08 Apr 2022
Using Decision Tree as Local Interpretable Model in Autoencoder-based
  LIME
Using Decision Tree as Local Interpretable Model in Autoencoder-based LIME
Niloofar Ranjbar
Reza Safabakhsh
FAtt
18
5
0
07 Apr 2022
Domain Specific Fine-tuning of Denoising Sequence-to-Sequence Models for
  Natural Language Summarization
Domain Specific Fine-tuning of Denoising Sequence-to-Sequence Models for Natural Language Summarization
Brydon Parker
A. Sokolov
Mahtab Ahmed
Matt Kalebic
S. Koçak
Ofer Shai
27
1
0
06 Apr 2022
Inducing Positive Perspectives with Text Reframing
Inducing Positive Perspectives with Text Reframing
Caleb Ziems
Minzhi Li
Anthony Zhang
Diyi Yang
DiffM
36
36
0
06 Apr 2022
Long Movie Clip Classification with State-Space Video Models
Long Movie Clip Classification with State-Space Video Models
Md. Mohaiminul Islam
Gedas Bertasius
VLM
56
102
0
04 Apr 2022
Graph Enhanced BERT for Query Understanding
Graph Enhanced BERT for Query Understanding
Juanhui Li
Yao Ma
Weizhen Zeng
Suqi Cheng
Jiliang Tang
Shuaiqiang Wang
Dawei Yin
29
7
0
03 Apr 2022
Exploiting Local and Global Features in Transformer-based Extreme
  Multi-label Text Classification
Exploiting Local and Global Features in Transformer-based Extreme Multi-label Text Classification
Ruohong Zhang
Yau-Shian Wang
Yiming Yang
Tom Vu
Li Lei
30
2
0
02 Apr 2022
Efficient comparison of sentence embeddings
Efficient comparison of sentence embeddings
Spyros Zoupanos
Stratis Kolovos
Athanasios Kanavos
Orestis Papadimitriou
M. Maragoudakis
11
11
0
02 Apr 2022
CharacterBERT and Self-Teaching for Improving the Robustness of Dense
  Retrievers on Queries with Typos
CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos
Shengyao Zhuang
Guido Zuccon
OOD
27
30
0
01 Apr 2022
Auto-MLM: Improved Contrastive Learning for Self-supervised
  Multi-lingual Knowledge Retrieval
Auto-MLM: Improved Contrastive Learning for Self-supervised Multi-lingual Knowledge Retrieval
Wenshen Xu
M. Maimaiti
Yuanhang Zheng
Xin Tang
Ji Zhang
RALM
SSL
16
2
0
30 Mar 2022
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic
  Speaker Verification
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification
Yang Zhang
Zhiqiang Lv
Haibin Wu
Shanshan Zhang
Pengfei Hu
Zhiyong Wu
Hung-yi Lee
Helen Meng
ViT
39
131
0
29 Mar 2022
Parameter-efficient Model Adaptation for Vision Transformers
Parameter-efficient Model Adaptation for Vision Transformers
Xuehai He
Chunyuan Li
Pengchuan Zhang
Jianwei Yang
Xinze Wang
35
85
0
29 Mar 2022
ANNA: Enhanced Language Representation for Question Answering
ANNA: Enhanced Language Representation for Question Answering
Changwook Jun
Hansol Jang
Myoseop Sim
Hyun Kim
Jooyoung Choi
Kyungkoo Min
Kyunghoon Bae
31
6
0
28 Mar 2022
Reinforcement Learning with Action-Free Pre-Training from Videos
Reinforcement Learning with Action-Free Pre-Training from Videos
Younggyo Seo
Kimin Lee
Stephen James
Pieter Abbeel
SSL
OnRL
23
119
0
25 Mar 2022
Email Summarization to Assist Users in Phishing Identification
Email Summarization to Assist Users in Phishing Identification
Amir Kashapov
Tingmin Wu
A. Abuadbba
Carsten Rudolph
11
16
0
24 Mar 2022
Ensembling and Knowledge Distilling of Large Sequence Taggers for
  Grammatical Error Correction
Ensembling and Knowledge Distilling of Large Sequence Taggers for Grammatical Error Correction
M. Tarnavskyi
Artem Chernodub
Kostiantyn Omelianchuk
3DV
25
24
0
24 Mar 2022
ERNIE-SPARSE: Learning Hierarchical Efficient Transformer Through
  Regularized Self-Attention
ERNIE-SPARSE: Learning Hierarchical Efficient Transformer Through Regularized Self-Attention
Yang Liu
Jiaxiang Liu
L. Chen
Yuxiang Lu
Shi Feng
Zhida Feng
Yu Sun
Hao Tian
Huancheng Wu
Hai-feng Wang
36
9
0
23 Mar 2022
Towards Expressive Speaking Style Modelling with Hierarchical Context
  Information for Mandarin Speech Synthesis
Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis
Shunwei Lei
Yixuan Zhou
Liyang Chen
Zhiyong Wu
Shiyin Kang
Helen Meng
28
12
0
23 Mar 2022
Transformer based ensemble for emotion detection
Transformer based ensemble for emotion detection
Aditya Kane
Shantanu Patankar
Sahil Khose
Neeraja Kirtane
41
9
0
22 Mar 2022
Factual Consistency of Multilingual Pretrained Language Models
Factual Consistency of Multilingual Pretrained Language Models
Constanza Fierro
Anders Søgaard
HILM
27
15
0
22 Mar 2022
How does the pre-training objective affect what large language models
  learn about linguistic properties?
How does the pre-training objective affect what large language models learn about linguistic properties?
Ahmed Alajrami
Nikolaos Aletras
32
20
0
20 Mar 2022
Pretraining with Artificial Language: Studying Transferable Knowledge in
  Language Models
Pretraining with Artificial Language: Studying Transferable Knowledge in Language Models
Ryokan Ri
Yoshimasa Tsuruoka
37
27
0
19 Mar 2022
DP-KB: Data Programming with Knowledge Bases Improves Transformer Fine
  Tuning for Answer Sentence Selection
DP-KB: Data Programming with Knowledge Bases Improves Transformer Fine Tuning for Answer Sentence Selection
Nic Jedema
Thuy Vu
Manish Gupta
Alessandro Moschitti
22
1
0
17 Mar 2022
Leveraging Adversarial Examples to Quantify Membership Information
  Leakage
Leveraging Adversarial Examples to Quantify Membership Information Leakage
Ganesh Del Grosso
Hamid Jalalzai
Georg Pichler
C. Palamidessi
Pablo Piantanida
MIACV
44
21
0
17 Mar 2022
elBERto: Self-supervised Commonsense Learning for Question Answering
elBERto: Self-supervised Commonsense Learning for Question Answering
Xunlin Zhan
Yuan Li
Xiao Dong
Xiaodan Liang
Zhiting Hu
Lawrence Carin
SSL
RALM
LRM
29
7
0
17 Mar 2022
EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with
  Large-Scale Pre-Training
EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training
Yuxian Gu
Jiaxin Wen
Hao Sun
Yi Song
Pei Ke
...
Zheng Zhang
Jianzhu Yao
Lei Liu
Xiaoyan Zhu
Minlie Huang
26
55
0
17 Mar 2022
Previous
123...111213...293031
Next