1906.08237
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,518 papers shown
AVA: an Automatic eValuation Approach to Question Answering Systems
Thuy Vu
Alessandro Moschitti
49
13
0
02 May 2020
Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer
Jieyu Zhao
Subhabrata Mukherjee
Saghar Hosseini
Kai-Wei Chang
Ahmed Hassan Awadallah
96
91
0
02 May 2020
DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering
Qingqing Cao
H. Trivedi
A. Balasubramanian
Niranjan Balasubramanian
86
68
0
02 May 2020
DagoBERT: Generating Derivational Morphology with a Pretrained Language Model
Valentin Hofmann
J. Pierrehumbert
Hinrich Schütze
100
2
0
02 May 2020
On Faithfulness and Factuality in Abstractive Summarization
Joshua Maynez
Shashi Narayan
Bernd Bohnet
Ryan T. McDonald
HILM
98
1,048
0
02 May 2020
KLEJ: Comprehensive Benchmark for Polish Language Understanding
Piotr Rybak
Robert Mroczkowski
Janusz Tracz
Ireneusz Gawlik
ELM
73
84
0
01 May 2020
POINTER: Constrained Progressive Text Generation via Insertion-based Generative Pre-training
Yizhe Zhang
Guoyin Wang
Chunyuan Li
Zhe Gan
Chris Brockett
Bill Dolan
84
30
0
01 May 2020
Self-supervised Knowledge Triplet Learning for Zero-shot Question Answering
Pratyay Banerjee
Chitta Baral
90
65
0
01 May 2020
HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training
Linjie Li
Yen-Chun Chen
Yu Cheng
Zhe Gan
Licheng Yu
Jingjing Liu
MLLM
VLM
OffRL
AI4TS
133
507
0
01 May 2020
Why and when should you pool? Analyzing Pooling in Recurrent Architectures
Pratyush Maini
Keshav Kolluru
Danish Pruthi
Mausam
66
6
0
01 May 2020
Hide-and-Seek: A Template for Explainable AI
Thanos Tagaris
A. Stafylopatis
26
6
0
30 Apr 2020
Template Guided Text Generation for Task-Oriented Dialogue
Mihir Kale
Abhinav Rastogi
59
12
0
30 Apr 2020
Word Rotator's Distance
Sho Yokoi
Ryo Takahashi
Reina Akama
Jun Suzuki
Kentaro Inui
OT
75
59
0
30 Apr 2020
Segatron: Segment-Aware Transformer for Language Modeling and Understanding
Richard He Bai
Peng Shi
Jimmy J. Lin
Yuqing Xie
Luchen Tan
Kun Xiong
Wen Gao
Ming Li
38
8
0
30 Apr 2020
Recipes for Adapting Pre-trained Monolingual and Multilingual Models to Machine Translation
Asa Cooper Stickland
Xian Li
Marjan Ghazvininejad
101
46
0
30 Apr 2020
Analyzing the Surprising Variability in Word Embedding Stability Across Languages
Laura Burdick
Jonathan K. Kummerfeld
Rada Mihalcea
44
9
0
30 Apr 2020
Enriched Pre-trained Transformers for Joint Slot Filling and Intent Detection
Momchil Hardalov
Ivan Koychev
Preslav Nakov
VLM
47
17
0
30 Apr 2020
Character-Level Translation with Self-attention
Yingqiang Gao
Nikola I. Nikolov
Yuhuang Hu
Richard H. R. Hahnloser
46
27
0
30 Apr 2020
Look at the First Sentence: Position Bias in Question Answering
Miyoung Ko
Jinhyuk Lee
Hyunjae Kim
Gangwoo Kim
Jaewoo Kang
FaML
OOD
80
100
0
30 Apr 2020
memeBot: Towards Automatic Image Meme Generation
Aadhavan Sadasivam
K. Gunasekar
H. Davulcu
Yezhou Yang
29
10
0
30 Apr 2020
RikiNet: Reading Wikipedia Pages for Natural Question Answering
Dayiheng Liu
Yeyun Gong
Jie Fu
Yu Yan
Jiusheng Chen
Daxin Jiang
Jiancheng Lv
Nan Duan
RALM
94
55
0
30 Apr 2020
TAVAT: Token-Aware Virtual Adversarial Training for Language Understanding
Linyang Li
Xipeng Qiu
78
17
0
30 Apr 2020
An Empirical Study of Pre-trained Transformers for Arabic Information Extraction
Wuwei Lan
Yang Chen
Wei Xu
Alan Ritter
39
4
0
30 Apr 2020
"The Boating Store Had Its Best Sail Ever": Pronunciation-attentive Contextualized Pun Recognition
Yichao Zhou
Jyun-Yu Jiang
Jieyu Zhao
Kai-Wei Chang
Wei Wang
35
13
0
29 Apr 2020
The Effect of Natural Distribution Shift on Question Answering Models
John Miller
K. Krauth
Benjamin Recht
Ludwig Schmidt
OOD
105
145
0
29 Apr 2020
General Purpose Text Embeddings from Pre-trained Language Models for Scalable Inference
Jingfei Du
Myle Ott
Haoran Li
Xing Zhou
Veselin Stoyanov
AI4CE
66
10
0
29 Apr 2020
Exploiting Structured Knowledge in Text via Graph-Guided Representation Learning
Tao Shen
Yi Mao
Pengcheng He
Guodong Long
Adam Trischler
Weizhu Chen
84
63
0
29 Apr 2020
Zero-Shot Learning and its Applications from Autonomous Vehicles to COVID-19 Diagnosis: A Review
Mahdi Rezaei
Mahsa Shahidi
113
55
0
29 Apr 2020
Adversarial Subword Regularization for Robust Neural Machine Translation
Jungsoo Park
Mujeen Sung
Jinhyuk Lee
Jaewoo Kang
64
8
0
29 Apr 2020
Enhancing Answer Boundary Detection for Multilingual Machine Reading Comprehension
Fei Yuan
Linjun Shou
X. Bai
Ming Gong
Yaobo Liang
Nan Duan
Yan Fu
Daxin Jiang
88
23
0
29 Apr 2020
Benchmarking Robustness of Machine Reading Comprehension Models
Chenglei Si
Ziqing Yang
Yiming Cui
Wentao Ma
Ting Liu
Shijin Wang
ELM
AAML
112
42
0
29 Apr 2020
Knowledgeable Dialogue Reading Comprehension on Key Turns
Junlong Li
Zhuosheng Zhang
Hai Zhao
71
1
0
29 Apr 2020
BURT: BERT-inspired Universal Representation from Twin Structure
Yian Li
Hai Zhao
40
0
0
29 Apr 2020
Span-based Localizing Network for Natural Language Video Localization
Hao Zhang
Aixin Sun
Wei Jing
Qiufeng Wang
110
316
0
29 Apr 2020
Revisiting Pre-Trained Models for Chinese Natural Language Processing
Yiming Cui
Wanxiang Che
Ting Liu
Bing Qin
Shijin Wang
Guoping Hu
104
705
0
29 Apr 2020
Exploring Self-attention for Image Recognition
Hengshuang Zhao
Jiaya Jia
V. Koltun
SSL
100
790
0
28 Apr 2020
The Curse of Performance Instability in Analysis Datasets: Consequences, Source, and Suggestions
Xiang Zhou
Yixin Nie
Hao Tan
Joey Tianyi Zhou
111
41
0
28 Apr 2020
Conversational Word Embedding for Retrieval-Based Dialog System
Wentao Ma
Yiming Cui
Ting Liu
Dong Wang
Shijin Wang
Guoping Hu
45
5
0
28 Apr 2020
UXLA: A Robust Unsupervised Data Augmentation Framework for Zero-Resource Cross-Lingual NLP
M Saiful Bari
Tasnim Mohiuddin
Shafiq Joty
85
24
0
28 Apr 2020
SCELMo: Source Code Embeddings from Language Models
Rafael-Michael Karampatsis
Charles Sutton
67
53
0
28 Apr 2020
The Impact of the Mini-batch Size on the Variance of Gradients in Stochastic Gradient Descent
Xin-Yao Qian
Diego Klabjan
ODL
72
36
0
27 Apr 2020
DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference
Ji Xin
Raphael Tang
Jaejun Lee
Yaoliang Yu
Jimmy J. Lin
65
377
0
27 Apr 2020
ColBERT: Using BERT Sentence Embedding in Parallel Neural Networks for Computational Humor
Issa Annamoradnejad
Gohar Zoghi
83
26
0
27 Apr 2020
The Gutenberg Dialogue Dataset
Richard Csaky
Gábor Recski
84
14
0
27 Apr 2020
Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting
Sanyuan Chen
Yutai Hou
Yiming Cui
Wanxiang Che
Ting Liu
Xiangzhan Yu
KELM
CLL
140
226
0
27 Apr 2020
Masking as an Efficient Alternative to Finetuning for Pretrained Language Models
Mengjie Zhao
Tao R. Lin
Fei Mi
Martin Jaggi
Hinrich Schütze
77
121
0
26 Apr 2020
Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical Encoder for Long-Form Document Matching
Liu Yang
Mingyang Zhang
Cheng Li
Michael Bendersky
Marc Najork
96
89
0
26 Apr 2020
MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification
Jiaao Chen
Zichao Yang
Diyi Yang
VLM
103
365
0
25 Apr 2020
Quantifying the Contextualization of Word Representations with Semantic Class Probing
Mengjie Zhao
Philipp Dufter
Yadollah Yaghoobzadeh
Hinrich Schütze
83
27
0
25 Apr 2020
How Does NLP Benefit Legal System: A Summary of Legal Artificial Intelligence
Haoxiang Zhong
Chaojun Xiao
Cunchao Tu
Tianyang Zhang
Zhiyuan Liu
Maosong Sun
AILaw
140
307
0
25 Apr 2020