XLNet: Generalized Autoregressive Pretraining for Language Understanding

19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
    AI4CE

Papers citing "XLNet: Generalized Autoregressive Pretraining for Language Understanding"

50 / 3,520 papers shown
TopicBERT for Energy Efficient Document Classification
Yatin Chaudhary
Pankaj Gupta
Khushbu Saxena
Vivek Kulkarni
Thomas Runkler
Hinrich Schütze
75
21
0
15 Oct 2020
Positioning yourself in the maze of Neural Text Generation: A Task-Agnostic Survey
Khyathi Chandu
A. Black
76
0
0
14 Oct 2020
Text Classification Using Label Names Only: A Language Model Self-Training Approach
Yu Meng
Yunyi Zhang
Jiaxin Huang
Chenyan Xiong
Heng Ji
Chao Zhang
Jiawei Han
VLM
88
76
0
14 Oct 2020
Summarize, Outline, and Elaborate: Long-Text Generation via Hierarchical Supervision from Extractive Summaries
Xiaofei Sun
Zijun Sun
Yuxian Meng
Jiwei Li
Chun Fan
61
20
0
14 Oct 2020
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search
Gyuwan Kim
Kyunghyun Cho
96
98
0
14 Oct 2020
Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision
Hao Tan
Joey Tianyi Zhou
CLIP
89
121
0
14 Oct 2020
With Little Power Comes Great Responsibility
Dallas Card
Peter Henderson
Urvashi Khandelwal
Robin Jia
Kyle Mahowald
Dan Jurafsky
281
119
0
13 Oct 2020
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy J. Lin
Rodrigo Nogueira
Andrew Yates
VLM
397
628
0
13 Oct 2020
Interpreting Attention Models with Human Visual Attention in Machine Reading Comprehension
Ekta Sood
Simon Tannert
Diego Frassinelli
Andreas Bulling
Ngoc Thang Vu
HAI
75
57
0
13 Oct 2020
Aspect-based Document Similarity for Research Papers
Malte Ostendorff
Terry Ruas
Till Blume
Bela Gipp
Georg Rehm
105
27
0
13 Oct 2020
CAPT: Contrastive Pre-Training for Learning Denoised Sequence Representations
Fuli Luo
Pengcheng Yang
Shicheng Li
Xuancheng Ren
Xu Sun
VLM, SSL
73
16
0
13 Oct 2020
RGCL at SemEval-2020 Task 6: Neural Approaches to Definition Extraction
Tharindu Ranasinghe
Alistair Plum
Constantin Orasan
R. Mitkov
NAI
21
2
0
13 Oct 2020
BRUMS at SemEval-2020 Task 12 : Transformer based Multilingual Offensive Language Identification in Social Media
Tharindu Ranasinghe
Hansi Hettiarachchi
60
20
0
13 Oct 2020
BRUMS at SemEval-2020 Task 3: Contextualised Embeddings for Predicting the (Graded) Effect of Context in Word Similarity
Hansi Hettiarachchi
Tharindu Ranasinghe
53
14
0
13 Oct 2020
X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models
Zhengbao Jiang
Antonios Anastasopoulos
Jun Araki
Haibo Ding
Graham Neubig
HILM, KELM
98
144
0
13 Oct 2020
Incorporating BERT into Parallel Sequence Decoding with Adapters
Junliang Guo
Zhirui Zhang
Linli Xu
Hao-Ran Wei
Boxing Chen
Enhong Chen
113
69
0
13 Oct 2020
BERT-EMD: Many-to-Many Layer Mapping for BERT Compression with Earth Mover's Distance
Jianquan Li
Xiaokang Liu
Honghong Zhao
Ruifeng Xu
Min Yang
Yaohong Jin
111
54
0
13 Oct 2020
Improving Self-supervised Pre-training via a Fully-Explored Masked Language Model
Ming Zheng
Dinghan Shen
Yelong Shen
Weizhu Chen
Lin Xiao
SSL
29
4
0
12 Oct 2020
Measuring and Reducing Gendered Correlations in Pre-trained Models
Kellie Webster
Xuezhi Wang
Ian Tenney
Alex Beutel
Emily Pitler
Ellie Pavlick
Jilin Chen
Ed Chi
Slav Petrov
FaML
117
260
0
12 Oct 2020
Chatbot Interaction with Artificial Intelligence: Human Data Augmentation with T5 and Language Transformer Ensemble for Text Classification
Jordan J. Bird
Anikó Ekárt
Diego Resende Faria
61
60
0
12 Oct 2020
PECOS: Prediction for Enormous and Correlated Output Spaces
Hsiang-Fu Yu
Kai Zhong
Jiong Zhang
Wei-Cheng Chang
Inderjit S. Dhillon
130
85
0
12 Oct 2020
Webly Supervised Image Classification with Metadata: Automatic Noisy Label Correction via Visual-Semantic Graph
Jingkang Yang
Weirong Chen
Xue Jiang
Xiaopeng Yan
Huabin Zheng
Wayne Zhang
NoLa
77
13
0
12 Oct 2020
On the Complementary Nature of Knowledge Graph Embedding, Fine Grain Entity Types, and Language Modeling
Rajat Patel
Francis Ferraro
40
1
0
12 Oct 2020
Counterfactual Variable Control for Robust and Interpretable Question Answering
S. Yu
Yulei Niu
Shuohang Wang
Jing Jiang
Qianru Sun
AAML, OOD
93
9
0
12 Oct 2020
Meta-Context Transformers for Domain-Specific Response Generation
Debanjana Kar
Suranjana Samanta
A. Azad
44
1
0
12 Oct 2020
Pre-trained Language Model Based Active Learning for Sentence Matching
Guirong Bai
Shizhu He
Kang Liu
Jun Zhao
Zaiqing Nie
112
10
0
12 Oct 2020
Neural, Symbolic and Neural-Symbolic Reasoning on Knowledge Graphs
Jing Zhang
Bo Chen
Lingxi Zhang
Xirui Ke
Haipeng Ding
NAI
112
3
0
12 Oct 2020
Quantitative Argument Summarization and Beyond: Cross-Domain Key Point Analysis
Roy Bar-Haim
Yoav Kantor
Lilach Eden
Roni Friedman
Dan Lahav
Noam Slonim
82
47
0
11 Oct 2020
InfoMiner at WNUT-2020 Task 2: Transformer-based Covid-19 Informative Tweet Extraction
Hansi Hettiarachchi
Tharindu Ranasinghe
MedIm
36
21
0
11 Oct 2020
SMYRF: Efficient Attention using Asymmetric Clustering
Giannis Daras
Nikita Kitaev
Augustus Odena
A. Dimakis
106
46
0
11 Oct 2020
SJTU-NICT's Supervised and Unsupervised Neural Machine Translation Systems for the WMT20 News Translation Task
Z. Li
Hai Zhao
Rui Wang
Kehai Chen
Masao Utiyama
Eiichiro Sumita
66
15
0
11 Oct 2020
Contrastive Representation Learning: A Framework and Review
Phúc H. Lê Khắc
Graham Healy
Alan F. Smeaton
SSL, AI4TS
330
722
0
10 Oct 2020
On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks
Stephen Mussmann
Robin Jia
Percy Liang
83
15
0
10 Oct 2020
Automated Concatenation of Embeddings for Structured Prediction
Xinyu Wang
Yong Jiang
Nguyen Bach
Tao Wang
Zhongqiang Huang
Fei Huang
Kewei Tu
109
177
0
10 Oct 2020
What Do Position Embeddings Learn? An Empirical Study of Pre-Trained Language Model Positional Encoding
Yu-An Wang
Yun-Nung Chen
SSL
59
95
0
10 Oct 2020
Adversarial Self-Supervised Data-Free Distillation for Text Classification
Xinyin Ma
Yongliang Shen
Gongfan Fang
Chen Chen
Chenghao Jia
Weiming Lu
124
24
0
10 Oct 2020
Recursive Top-Down Production for Sentence Generation with Latent Trees
Shawn Tan
Songlin Yang
Timothy J. O'Donnell
Alessandro Sordoni
Aaron Courville
47
4
0
09 Oct 2020
Multichannel Generative Language Model: Learning All Possible Factorizations Within and Across Channels
Harris Chan
J. Kiros
William Chan
LRM
23
0
0
09 Oct 2020
TurboTransformers: An Efficient GPU Serving System For Transformer Models
Jiarui Fang
Yang Yu
Chen-liang Zhao
Jie Zhou
86
140
0
09 Oct 2020
Plug-and-Play Conversational Models
Andrea Madotto
Etsuko Ishii
Zhaojiang Lin
Sumanth Dathathri
Pascale Fung
86
51
0
09 Oct 2020
Masked ELMo: An evolution of ELMo towards fully contextual RNN language models
Grégory Senay
Emmanuelle Salin
34
2
0
08 Oct 2020
Deep Learning Meets Projective Clustering
Alaa Maalouf
Harry Lang
Daniela Rus
Dan Feldman
113
9
0
08 Oct 2020
An Empirical Study on Model-agnostic Debiasing Strategies for Robust Natural Language Inference
Tianyu Liu
Xin Zheng
Xiaoan Ding
Baobao Chang
Zhifang Sui
73
25
0
08 Oct 2020
Improving Attention Mechanism with Query-Value Interaction
Chuhan Wu
Fangzhao Wu
Tao Qi
Yongfeng Huang
43
4
0
08 Oct 2020
Assessing Phrasal Representation and Composition in Transformers
Lang-Chi Yu
Allyson Ettinger
CoGe
90
68
0
08 Oct 2020
Discriminatively-Tuned Generative Classifiers for Robust Natural Language Inference
Xiaoan Ding
Tianyu Liu
Baobao Chang
Zhifang Sui
Kevin Gimpel
85
8
0
08 Oct 2020
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition
Yun He
Ziwei Zhu
Yin Zhang
Qin Chen
James Caverlee
AI4MH
87
109
0
08 Oct 2020
PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge
Yun He
Zhuoer Wang
Yin Zhang
Ruihong Huang
James Caverlee
51
23
0
08 Oct 2020
A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks
Nikunj Saunshi
Sadhika Malladi
Sanjeev Arora
87
89
0
07 Oct 2020
A Self-supervised Approach for Semantic Indexing in the Context of COVID-19 Pandemic
Nima Ebadi
Peyman Najafirad
OOD
42
2
0
07 Oct 2020