Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
v1
v2 (latest)
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,518 papers shown
Title
Exploring Benefits of Transfer Learning in Neural Machine Translation
Tom Kocmi
60
17
0
06 Jan 2020
oLMpics -- On what Language Model Pre-training Captures
Alon Talmor
Yanai Elazar
Yoav Goldberg
Jonathan Berant
LRM
107
304
0
31 Dec 2019
Clinical XLNet: Modeling Sequential Clinical Notes and Predicting Prolonged Mechanical Ventilation
Kexin Huang
Abhishek Singh
Sitong Chen
E. Moseley
Chih-ying Deng
Naomi George
C. Lindvall
119
59
0
27 Dec 2019
Is Attention All What You Need? -- An Empirical Investigation on Convolution-Based Active Memory and Self-Attention
Thomas D. Dowdell
Hongyu Zhang
36
4
0
27 Dec 2019
Multi-Graph Transformer for Free-Hand Sketch Recognition
Peng Xu
Chaitanya K. Joshi
Xavier Bresson
ViT
115
87
0
24 Dec 2019
A Multimodal Target-Source Classifier with Attention Branches to Understand Ambiguous Instructions for Fetching Daily Objects
A. Magassouba
K. Sugiura
Hisashi Kawai
81
9
0
23 Dec 2019
Learning and Evaluating Contextual Embedding of Source Code
Aditya Kanade
Petros Maniatis
Gogul Balakrishnan
Kensen Shi
ELM
87
77
0
21 Dec 2019
Are Transformers universal approximators of sequence-to-sequence functions?
Chulhee Yun
Srinadh Bhojanapalli
A. S. Rawat
Sashank J. Reddi
Sanjiv Kumar
140
358
0
20 Dec 2019
End-to-end Named Entity Recognition and Relation Extraction using Pre-trained Language Models
John Giorgi
Xindi Wang
Nicola Sahar
W. Shin
Gary D. Bader
Bo Wang
76
38
0
20 Dec 2019
Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language Model
Wenhan Xiong
Jingfei Du
William Yang Wang
Veselin Stoyanov
SSL
KELM
105
201
0
20 Dec 2019
Asymmetrical Hierarchical Networks with Attentive Interactions for Interpretable Review-Based Recommendation
Xin Dong
Jingchao Ni
Wei Cheng
Zhengzhang Chen
Bo Zong
Dongjin Song
Yanchi Liu
Haifeng Chen
Gerard de Melo
144
55
0
18 Dec 2019
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
Jingqing Zhang
Yao-Min Zhao
Mohammad Saleh
Peter J. Liu
RALM
3DGS
313
2,057
0
18 Dec 2019
Curriculum Learning Strategies for IR: An Empirical Study on Conversation Response Ranking
Gustavo Penha
C. Hauff
77
24
0
18 Dec 2019
Chinese Named Entity Recognition Augmented with Lexicon Memory
Yi Zhou
Xiaoqing Zheng
Xuanjing Huang
29
5
0
17 Dec 2019
Cross-Lingual Ability of Multilingual BERT: An Empirical Study
Karthikeyan K
Zihan Wang
Stephen D. Mayhew
Dan Roth
LRM
98
340
0
17 Dec 2019
WaLDORf: Wasteless Language-model Distillation On Reading-comprehension
J. Tian
A. Kreuzer
Pai-Hung Chen
Hans-Martin Will
VLM
60
3
0
13 Dec 2019
Spatial-Temporal Self-Attention Network for Flow Prediction
Haoxing Lin
Weijia Jia
Yiping Sun
Yongjian You
3DPC
AI4TS
62
8
0
13 Dec 2019
Extending Machine Language Models toward Human-Level Language Understanding
James L. McClelland
Felix Hill
Maja R. Rudolph
Jason Baldridge
Hinrich Schütze
LRM
78
35
0
12 Dec 2019
FlauBERT: Unsupervised Language Model Pre-training for French
Hang Le
Loïc Vial
Jibril Frej
Vincent Segonne
Maximin Coavoux
Benjamin Lecouteux
A. Allauzen
Benoît Crabbé
Laurent Besacier
D. Schwab
AI4CE
111
401
0
11 Dec 2019
Automatic Spanish Translation of the SQuAD Dataset for Multilingual Question Answering
C. Carrino
Marta R. Costa-jussá
José A. R. Fonollosa
69
89
0
11 Dec 2019
Zero-shot Text Classification With Generative Language Models
Raul Puri
Bryan Catanzaro
VLM
81
106
0
10 Dec 2019
Learning Norms from Stories: A Prior for Value Aligned Agents
Spencer Frazier
Md Sultan al Nahian
Mark O. Riedl
Brent Harrison
73
39
0
07 Dec 2019
Personalized Patent Claim Generation and Measurement
Jieh-Sheng Lee
25
4
0
07 Dec 2019
Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline
Vishvak Murahari
Dhruv Batra
Devi Parikh
Abhishek Das
VLM
111
117
0
05 Dec 2019
12-in-1: Multi-Task Vision and Language Representation Learning
Jiasen Lu
Vedanuj Goswami
Marcus Rohrbach
Devi Parikh
Stefan Lee
VLM
ObjD
131
481
0
05 Dec 2019
Natural Alpha Embeddings
Riccardo Volpi
Luigi Malagò
53
5
0
04 Dec 2019
An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering
Shayne Longpre
Yi Lu
Zhucheng Tu
Christopher DuBois
71
70
0
04 Dec 2019
Acquiring Knowledge from Pre-trained Model to Neural Machine Translation
Rongxiang Weng
Heng Yu
Shujian Huang
Shanbo Cheng
Weihua Luo
93
67
0
04 Dec 2019
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition
Shaoshi Ling
Yuzong Liu
Julian Salazar
Katrin Kirchhoff
SSL
86
139
0
03 Dec 2019
BERT for Large-scale Video Segment Classification with Test-time Augmentation
Tianqi Liu
Qizhan Shao
57
4
0
02 Dec 2019
Leveraging Contextual Embeddings for Detecting Diachronic Semantic Shift
Matej Martinc
Petra Kralj Novak
Senja Pollak
80
72
0
02 Dec 2019
EDA: Enriching Emotional Dialogue Acts using an Ensemble of Neural Annotators
Chandrakant Bothe
C. Weber
S. Magg
S. Wermter
57
10
0
02 Dec 2019
Bimodal Speech Emotion Recognition Using Pre-Trained Language Models
Verena Heusser
Niklas Freymuth
Stefan Constantin
A. Waibel
92
26
0
29 Nov 2019
Inducing Relational Knowledge from BERT
Zied Bouraoui
Jose Camacho-Collados
Steven Schockaert
98
167
0
28 Nov 2019
Multi-label Classification for Automatic Tag Prediction in the Context of Programming Challenges
Bianca Iancu
Gabriele Mazzola
Kyriakos Psarakis
Panagiotis Soilis
21
5
0
27 Nov 2019
Taking a Stance on Fake News: Towards Automatic Disinformation Assessment via Deep Bidirectional Transformer Language Models for Stance Detection
Chris Dulhanty
Jason L. Deglint
Ibrahim Ben Daya
A. Wong
49
22
0
27 Nov 2019
Evaluating Commonsense in Pre-trained Language Models
Xuhui Zhou
Yue Zhang
Leyang Cui
Dandan Huang
AI4MH
LRM
88
185
0
27 Nov 2019
Word-Class Embeddings for Multiclass Text Classification
Alejandro Moreo
Andrea Esuli
Fabrizio Sebastiani
33
36
0
26 Nov 2019
Pre-Training of Deep Bidirectional Protein Sequence Representations with Structural Information
Seonwoo Min
Seunghyun Park
Siwon Kim
Hyun-Soo Choi
Byunghan Lee
Sungroh Yoon
SSL
73
63
0
25 Nov 2019
FairyTED: A Fair Rating Predictor for TED Talk Data
Rupam Acharyya
Shouman Das
Ankani Chattoraj
Md. Iftekhar Tanveer
41
12
0
25 Nov 2019
Unsupervised Domain Adaptation of Language Models for Reading Comprehension
Kosuke Nishida
Kyosuke Nishida
Itsumi Saito
Hisako Asano
J. Tomita
102
26
0
25 Nov 2019
End-to-End Trainable Non-Collaborative Dialog System
Yu Li
Kun Qian
Weiyan Shi
Zhou Yu
87
46
0
25 Nov 2019
Invenio: Discovering Hidden Relationships Between Tasks/Domains Using Structured Meta Learning
Sameeksha Katoch
Kowshik Thopalli
Jayaraman J. Thiagarajan
Pavan Turaga
A. Spanias
39
4
0
24 Nov 2019
A Transformer-based approach to Irony and Sarcasm detection
Rolandos Alexandros Potamias
Georgios Siolas
A. Stafylopatis
47
213
0
23 Nov 2019
Factorized Multimodal Transformer for Multimodal Sequential Learning
Amir Zadeh
Chengfeng Mao
Kelly Shi
Yiwei Zhang
Paul Pu Liang
Soujanya Poria
Louis-Philippe Morency
69
45
0
22 Nov 2019
Learning Multi-level Dependencies for Robust Word Recognition
Z. Wang
Hui Liu
Jiliang Tang
Songfan Yang
Gale Yan Huang
Zitao Liu
67
8
0
22 Nov 2019
Outside the Box: Abstraction-Based Monitoring of Neural Networks
T. Henzinger
Anna Lukina
Christian Schilling
AAML
93
59
0
20 Nov 2019
Global Greedy Dependency Parsing
Z. Li
Zhao Hai
Kevin Parnow
113
31
0
20 Nov 2019
Towards non-toxic landscapes: Automatic toxic comment detection using DNN
Ashwin Geet D'Sa
Irina Illina
Dominique Fohr
56
22
0
19 Nov 2019
DARB: A Density-Aware Regular-Block Pruning for Deep Neural Networks
Ao Ren
Tao Zhang
Yuhao Wang
Sheng Lin
Peiyan Dong
Yen-kuang Chen
Yuan Xie
Yanzhi Wang
78
11
0
19 Nov 2019
Previous
1
2
3
...
66
67
68
69
70
71
Next