ResearchTrend.AI
XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE
arXiv: 1906.08237

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,524 papers shown
Is Attention always needed? A Case Study on Language Identification from Speech
A. Mandal
Santanu Pal
Indranil Dutta
Mahidas Bhattacharya
S. Naskar
48
6
0
05 Oct 2021
Autoregressive Diffusion Models
Emiel Hoogeboom
Alexey A. Gritsenko
Jasmijn Bastings
Ben Poole
Rianne van den Berg
Tim Salimans
DiffM
134
155
0
05 Oct 2021
Investigating the Impact of Pre-trained Language Models on Dialog Evaluation
Chen Zhang
L. F. D’Haro
Yiming Chen
Thomas Friedrichs
Haizhou Li
66
5
0
05 Oct 2021
A Survey On Neural Word Embeddings
Erhan Sezerer
Selma Tekir
AI4TS
90
13
0
05 Oct 2021
Classification of hierarchical text using geometric deep learning: the case of clinical trials corpus
Sohrab Ferdowsi
Nikolay Borissov
J. Knafou
P. Amini
Douglas Teodoro
33
7
0
04 Oct 2021
Revisiting Self-Training for Few-Shot Learning of Language Model
Yiming Chen
Yan Zhang
Chen Zhang
Grandee Lee
Ran Cheng
Haizhou Li
71
42
0
04 Oct 2021
Scheduling Optimization Techniques for Neural Network Training
Hyungjun Oh
Junyeol Lee
HyeongJu Kim
Jiwon Seo
55
1
0
03 Oct 2021
Swiss-Judgment-Prediction: A Multilingual Legal Judgment Prediction Benchmark
Joel Niklaus
Ilias Chalkidis
Matthias Sturmer
ELM
AILaw
67
70
0
02 Oct 2021
ProTo: Program-Guided Transformer for Program-Guided Tasks
Zelin Zhao
Karan Samel
Binghong Chen
Le Song
ViT
LM&Ro
100
30
0
02 Oct 2021
Fast Multi-Resolution Transformer Fine-tuning for Extreme Multi-label Text Classification
Jiong Zhang
Wei-Cheng Chang
Hsiang-Fu Yu
Inderjit S. Dhillon
117
103
0
01 Oct 2021
Low Frequency Names Exhibit Bias and Overfitting in Contextualizing Language Models
Robert Wolfe
Aylin Caliskan
125
51
0
01 Oct 2021
A Survey of Knowledge Enhanced Pre-trained Models
Jian Yang
Xinyu Hu
Gang Xiao
Yulong Shen
KELM
109
6
0
01 Oct 2021
Focused Contrastive Training for Test-based Constituency Analysis
Benjamin Roth
Erion Çano
31
0
0
30 Sep 2021
Fine-tuning wav2vec2 for speaker recognition
Nik Vaessen
David A. van Leeuwen
116
109
0
30 Sep 2021
First to Possess His Statistics: Data-Free Model Extraction Attack on Tabular Data
Masataka Tasumi
Kazuki Iwahana
Naoto Yanai
Katsunari Shishido
Toshiya Shimizu
Yuji Higuchi
I. Morikawa
Jun Yajima
AAML
78
4
0
30 Sep 2021
Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System
Yixuan Su
Lei Shu
Elman Mansimov
Arshit Gupta
Deng Cai
Yi-An Lai
Yi Zhang
229
195
0
29 Sep 2021
Multimodal Emotion Recognition with High-level Speech and Text Features
M. R. Makiuchi
Kuniaki Uto
Koichi Shinoda
85
72
0
29 Sep 2021
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
Hu Xu
Gargi Ghosh
Po-Yao (Bernie) Huang
Dmytro Okhonko
Armen Aghajanyan
Florian Metze
Luke Zettlemoyer
Christoph Feichtenhofer
CLIP
VLM
335
584
0
28 Sep 2021
What to Prioritize? Natural Language Processing for the Development of a Modern Bug Tracking Solution in Hardware Development
T. Do
Markus Dobler
Niklas Kühl
32
0
0
28 Sep 2021
TURINGBENCH: A Benchmark Environment for Turing Test in the Age of Neural Text Generation
Adaku Uchendu
Zeyu Ma
Thai Le
Rui Zhang
Dongwon Lee
DeLMO
115
127
0
27 Sep 2021
VQA-MHUG: A Gaze Dataset to Study Multimodal Neural Attention in Visual Question Answering
Ekta Sood
Fabian Kögel
Florian Strohm
Prajit Dhar
Andreas Bulling
67
19
0
27 Sep 2021
Context-guided Triple Matching for Multiple Choice Question Answering
Xun Yao
Junlong Ma
Xinrong Hu
Junping Liu
Jie Yang
Wanqing Li
66
2
0
27 Sep 2021
Understanding and Overcoming the Challenges of Efficient Transformer Quantization
Yelysei Bondarenko
Markus Nagel
Tijmen Blankevoort
MQ
83
146
0
27 Sep 2021
Multiplicative Position-aware Transformer Models for Language Understanding
Zhiheng Huang
Davis Liang
Peng Xu
Bing Xiang
36
1
0
27 Sep 2021
Improving Question Answering Performance Using Knowledge Distillation and Active Learning
Yasaman Boreshban
Seyed Morteza Mirbostani
Gholamreza Ghassem-Sani
Seyed Abolghasem Mirroshandel
Shahin Amiriparian
83
16
0
26 Sep 2021
Curb Your Carbon Emissions: Benchmarking Carbon Emissions in Machine Translation
Mirza Yusuf
Praatibh Surana
Gauri Gupta
Krithika Ramesh
86
8
0
26 Sep 2021
Entity Linking Meets Deep Learning: Techniques and Solutions
Wei Shen
Yuhan Li
Yinan Liu
Jiawei Han
Jianyong Wang
Xiaojie Yuan
124
53
0
26 Sep 2021
One-shot Key Information Extraction from Document with Deep Partial Graph Matching
Minghong Yao
Zhiguang Liu
Liangwei Wang
Houqiang Li
Liansheng Zhuang
118
5
0
26 Sep 2021
Parallel Refinements for Lexically Constrained Text Generation with BART
Xingwei He
83
43
0
26 Sep 2021
DziriBERT: a Pre-trained Language Model for the Algerian Dialect
Amine Abdaoui
Mohamed Berrimi
Mourad Oussalah
A. Moussaoui
99
45
0
25 Sep 2021
Finetuning Transformer Models to Build ASAG System
Mithun Thakkar
15
2
0
25 Sep 2021
More Than Reading Comprehension: A Survey on Datasets and Metrics of Textual Question Answering
Yang Bai
D. Wang
166
10
0
25 Sep 2021
Pushing on Text Readability Assessment: A Transformer Meets Handcrafted Linguistic Features
Bruce W. Lee
Yoonna Jang
J. Lee
VLM
101
83
0
25 Sep 2021
Monolingual and Cross-Lingual Acceptability Judgments with the Italian CoLA corpus
Daniela Trotta
R. Guarasci
Elisa Leonardelli
Sara Tonelli
103
31
0
24 Sep 2021
Lacking the embedding of a word? Look it up into a traditional dictionary
Elena Sofia Ruzzetti
Leonardo Ranaldi
Michele Mastromattei
Francesca Fallucchi
Fabio Massimo Zanzotto
61
15
0
24 Sep 2021
Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialogue System
Libo Qin
Tianbao Xie
Shijue Huang
Qiguang Chen
Xiao Xu
Wanxiang Che
114
20
0
23 Sep 2021
Conditional Poisson Stochastic Beam Search
Clara Meister
Afra Amini
Tim Vieira
Ryan Cotterell
83
10
0
22 Sep 2021
BFClass: A Backdoor-free Text Classification Framework
Zichao Li
Dheeraj Mekala
Chengyu Dong
Jingbo Shang
SILM
108
28
0
22 Sep 2021
Small-Bench NLP: Benchmark for small single GPU trained models in Natural Language Processing
K. Kanakarajan
Bhuvana Kundumani
Malaikannan Sankarasubbu
ALM
MoE
62
5
0
22 Sep 2021
MiRANews: Dataset and Benchmarks for Multi-Resource-Assisted News Summarization
Xinnuo Xu
Ondrej Dusek
Shashi Narayan
Verena Rieser
Ioannis Konstas
HILM
74
6
0
22 Sep 2021
Role of Language Relatedness in Multilingual Fine-tuning of Language Models: A Case Study in Indo-Aryan Languages
Tejas I. Dhamecha
V. Rudramurthy
Samarth Bharadwaj
Karthik Sankaranarayanan
P. Bhattacharyya
95
26
0
22 Sep 2021
Digital Signal Processing Using Deep Neural Networks
Brian Shevitski
Y. Watkins
Nicole Man
Michael Girard
AI4CE
88
4
0
21 Sep 2021
AutoGCL: Automated Graph Contrastive Learning via Learnable View Generators
Yihang Yin
Qingzhong Wang
Siyu Huang
Haoyi Xiong
Xiang Zhang
112
156
0
21 Sep 2021
RAIL-KD: RAndom Intermediate Layer Mapping for Knowledge Distillation
Md. Akmal Haidar
Nithin Anchuri
Mehdi Rezagholizadeh
Abbas Ghaddar
Philippe Langlais
Pascal Poupart
111
22
0
21 Sep 2021
BERT Has Uncommon Sense: Similarity Ranking for Word Sense BERTology
Luke Gessler
Nathan Schneider
66
7
0
20 Sep 2021
DisCoDisCo at the DISRPT2021 Shared Task: A System for Discourse Segmentation, Classification, and Connective Detection
Luke Gessler
Shabnam Behzad
Yang Liu
Siyao Peng
Yilun Zhu
Amir Zeldes
93
33
0
20 Sep 2021
Towards Zero-Label Language Learning
Zirui Wang
Adams Wei Yu
Orhan Firat
Yuan Cao
SyDa
251
105
0
19 Sep 2021
Navigating the Kaleidoscope of COVID-19 Misinformation Using Deep Learning
Yuanzhi Chen
Mohammad Rashedul Hasan
60
4
0
19 Sep 2021
Augmenting semantic lexicons using word embeddings and transfer learning
Thayer Alshaabi
C. V. Oort
M. Fudolig
M. V. Arnold
C. Danforth
P. Dodds
80
4
0
18 Sep 2021
Primer: Searching for Efficient Transformers for Language Modeling
David R. So
Wojciech Mańke
Hanxiao Liu
Zihang Dai
Noam M. Shazeer
Quoc V. Le
VLM
285
156
0
17 Sep 2021