Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08237
Cited By
v1
v2 (latest)
XLNet: Generalized Autoregressive Pretraining for Language Understanding
19 June 2019
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"XLNet: Generalized Autoregressive Pretraining for Language Understanding"
50 / 3,518 papers shown
Title
New Protocols and Negative Results for Textual Entailment Data Collection
Samuel R. Bowman
J. Palomaki
Livio Baldini Soares
Emily Pitler
70
7
0
24 Apr 2020
Template-Based Question Generation from Retrieved Sentences for Improved Unsupervised Question Answering
Alexander R. Fabbri
Patrick Ng
Zhiguo Wang
Ramesh Nallapati
Bing Xiang
82
77
0
24 Apr 2020
Learning the grammar of drug prescription: recurrent neural network grammars for medication information extraction in clinical texts
Ivan Lerner
Jordan Jouffroy
Anita Burgun
A. Neuraz
50
9
0
24 Apr 2020
Supervised Contrastive Learning
Prannay Khosla
Piotr Teterwak
Chen Wang
Aaron Sarna
Yonglong Tian
Phillip Isola
Aaron Maschinot
Ce Liu
Dilip Krishnan
SSL
214
4,604
0
23 Apr 2020
Semi-Supervised Models via Data Augmentationfor Classifying Interactive Affective Responses
Jiaao Chen
Yuwei Wu
Diyi Yang
61
18
0
23 Apr 2020
Don't Stop Pretraining: Adapt Language Models to Domains and Tasks
Suchin Gururangan
Ana Marasović
Swabha Swayamdipta
Kyle Lo
Iz Beltagy
Doug Downey
Noah A. Smith
VLM
AI4CE
CLL
210
2,452
0
23 Apr 2020
MT-Clinical BERT: Scaling Clinical Information Extraction with Multitask Learning
Andriy Mulyar
Bridget T. McInnes
76
56
0
21 Apr 2020
Experience Grounds Language
Yonatan Bisk
Ari Holtzman
Jesse Thomason
Jacob Andreas
Yoshua Bengio
...
Angeliki Lazaridou
Jonathan May
Aleksandr Nisnevich
Nicolas Pinto
Joseph P. Turian
124
361
0
21 Apr 2020
Train No Evil: Selective Masking for Task-Guided Pre-Training
Yuxian Gu
Zhengyan Zhang
Xiaozhi Wang
Zhiyuan Liu
Maosong Sun
141
59
0
21 Apr 2020
StereoSet: Measuring stereotypical bias in pretrained language models
Moin Nadeem
Anna Bethke
Siva Reddy
103
1,027
0
20 Apr 2020
MPNet: Masked and Permuted Pre-training for Language Understanding
Kaitao Song
Xu Tan
Tao Qin
Jianfeng Lu
Tie-Yan Liu
111
1,142
0
20 Apr 2020
CheXbert: Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT
Akshay Smit
Saahil Jain
Pranav Rajpurkar
Anuj Pareek
A. Ng
M. Lungren
MedIm
89
333
0
20 Apr 2020
Adversarial Training for Large Neural Language Models
Xiaodong Liu
Hao Cheng
Pengcheng He
Weizhu Chen
Yu Wang
Hoifung Poon
Jianfeng Gao
AAML
94
186
0
20 Apr 2020
Fine-tuning Multi-hop Question Answering with Hierarchical Graph Network
Guanming Xiong
128
0
0
20 Apr 2020
Augmented Curation of Unstructured Clinical Notes from a Massive EHR System Reveals Specific Phenotypic Signature of Impending COVID-19 Diagnosis
F. Shweta
K. Murugadoss
S. Awasthi
A. Venkatakrishnan
Arjun Puranik
...
G. Gores
A. Williams
J. Halamka
V. Soundararajan
A. Badley
53
26
0
17 Apr 2020
Highway Transformer: Self-Gating Enhanced Self-Attentive Networks
Yekun Chai
Jin Shuo
Xinwen Hou
48
17
0
17 Apr 2020
Transform and Tell: Entity-Aware News Image Captioning
Alasdair Tran
A. Mathews
Lexing Xie
VLM
60
97
0
17 Apr 2020
A Survey of Document Grounded Dialogue Systems (DGDS)
Longxuan Ma
Weinan Zhang
Mingda Li
Ting Liu
75
19
0
17 Apr 2020
SongNet: Rigid Formats Controlled Text Generation
Piji Li
Haisong Zhang
Xiaojiang Liu
Shuming Shi
144
54
0
17 Apr 2020
How recurrent networks implement contextual processing in sentiment analysis
Niru Maheswaranathan
David Sussillo
46
23
0
17 Apr 2020
SPECTER: Document-level Representation Learning using Citation-informed Transformers
Arman Cohan
Sergey Feldman
Iz Beltagy
Doug Downey
Daniel S. Weld
AI4TS
135
561
0
15 Apr 2020
Coreferential Reasoning Learning for Language Representation
Deming Ye
Yankai Lin
Jiaju Du
Zhenghao Liu
Peng Li
Maosong Sun
Zhiyuan Liu
87
179
0
15 Apr 2020
CrisisBench: Benchmarking Crisis-related Social Media Datasets for Humanitarian Information Processing
Firoj Alam
Hassan Sajjad
Muhammad Imran
Ferda Ofli
24
14
0
14 Apr 2020
Weight Poisoning Attacks on Pre-trained Models
Keita Kurita
Paul Michel
Graham Neubig
AAML
SILM
145
457
0
14 Apr 2020
Robustly Pre-trained Neural Model for Direct Temporal Relation Extraction
Hong Guan
Jianfu Li
Hua Xu
M. Devarakonda
15
11
0
13 Apr 2020
CLUE: A Chinese Language Understanding Evaluation Benchmark
Liang Xu
Hai Hu
Xuanwei Zhang
Lu Li
Chenjie Cao
...
Cong Yue
Xinrui Zhang
Zhen-Yi Yang
Kyle Richardson
Zhenzhong Lan
ELM
110
388
0
13 Apr 2020
ProFormer: Towards On-Device LSH Projection Based Transformers
Chinnadhurai Sankar
Sujith Ravi
Zornitsa Kozareva
77
9
0
13 Apr 2020
Explaining Question Answering Models through Text Generation
Veronica Latcinnik
Jonathan Berant
LRM
101
51
0
12 Apr 2020
Unsupervised Commonsense Question Answering with Self-Talk
Vered Shwartz
Peter West
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
ReLM
SSL
AI4MH
LRM
72
263
0
11 Apr 2020
Identifying Distributional Perspective Differences from Colingual Groups
Yufei Tian
Tuhin Chakrabarty
Fred Morstatter
Nanyun Peng
29
3
0
10 Apr 2020
Multimodal Categorization of Crisis Events in Social Media
Mahdi Abavisani
Liwei Wu
Shengli Hu
Joel R. Tetreault
A. Jaimes
98
88
0
10 Apr 2020
Improving Readability for Automatic Speech Recognition Transcription
Junwei Liao
Sefik Emre Eskimez
Liyang Lu
Yu Shi
Ming Gong
Linjun Shou
Hong Qu
Michael Zeng
67
56
0
09 Apr 2020
MetaSleepLearner: A Pilot Study on Fast Adaptation of Bio-signals-Based Sleep Stage Classifier to New Individual Subject Using Meta-Learning
Nannapas Banluesombatkul
Pichayoot Ouppaphan
Pitshaporn Leelaarporn
Payongkit Lakhan
Busarakum Chaitusaney
...
Ekapol Chuangsuwanich
Wei Chen
Huy Phan
Nat Dilokthanakul
Theerawit Wilaiprasitporn
101
1
0
08 Apr 2020
Structure-Level Knowledge Distillation For Multilingual Sequence Labeling
Xinyu Wang
Yong Jiang
Nguyen Bach
Tao Wang
Fei Huang
Kewei Tu
96
36
0
08 Apr 2020
Exploring Versatile Generative Language Model Via Parameter-Efficient Transfer Learning
Zhaojiang Lin
Andrea Madotto
Pascale Fung
105
163
0
08 Apr 2020
CALM: Continuous Adaptive Learning for Language Modeling
Kristjan Arumae
Parminder Bhatia
CLL
KELM
29
6
0
08 Apr 2020
Downstream Model Design of Pre-trained Language Model for Relation Extraction Task
Cheng-rong Li
Ye Tian
72
36
0
08 Apr 2020
DialBERT: A Hierarchical Pre-Trained Model for Conversation Disentanglement
Tianda Li
Jia-Chen Gu
Xiao-Dan Zhu
Quan Liu
Zhenhua Ling
Zhiming Su
Si Wei
70
28
0
08 Apr 2020
Towards Evaluating the Robustness of Chinese BERT Classifiers
Wei Ping
Boyuan Pan
Xin Li
Yue Liu
AAML
77
8
0
07 Apr 2020
Byte Pair Encoding is Suboptimal for Language Model Pretraining
Kaj Bostrom
Greg Durrett
71
214
0
07 Apr 2020
A Sentence Cloze Dataset for Chinese Machine Reading Comprehension
Yiming Cui
Ting Liu
Ziqing Yang
Zhipeng Chen
Wentao Ma
Wanxiang Che
Shijin Wang
Guoping Hu
75
19
0
07 Apr 2020
Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots
Jia-Chen Gu
Tianda Li
Quan Liu
Zhenhua Ling
Zhiming Su
Si Wei
Xiao-Dan Zhu
66
164
0
07 Apr 2020
Enhancing Review Comprehension with Domain-Specific Commonsense
Aaron Traylor
Chen Chen
Behzad Golshan
Xiaolan Wang
Yuliang Li
Yoshihiko Suhara
Jinfeng Li
Çağatay Demiralp
W. Tan
13
1
0
06 Apr 2020
"You are grounded!": Latent Name Artifacts in Pre-trained Language Models
Vered Shwartz
Rachel Rudinger
Oyvind Tafjord
53
51
0
06 Apr 2020
Multi-Step Inference for Reasoning Over Paragraphs
Jiangming Liu
Matt Gardner
Shay B. Cohen
Mirella Lapata
ReLM
LRM
49
18
0
06 Apr 2020
MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
Zhiqing Sun
Hongkun Yu
Xiaodan Song
Renjie Liu
Yiming Yang
Denny Zhou
MQ
136
820
0
06 Apr 2020
Evaluating Models' Local Decision Boundaries via Contrast Sets
Matt Gardner
Yoav Artzi
Victoria Basmova
Jonathan Berant
Ben Bogin
...
Sanjay Subramanian
Reut Tsarfaty
Eric Wallace
Ally Zhang
Ben Zhou
ELM
120
84
0
06 Apr 2020
Deep Learning Based Text Classification: A Comprehensive Review
Shervin Minaee
Nal Kalchbrenner
Min Zhang
Narjes Nikzad
M. Asgari-Chenaghlu
Jianfeng Gao
AILaw
VLM
AI4TS
116
1,115
0
06 Apr 2020
Continual Domain-Tuning for Pretrained Language Models
Subendhu Rongali
Abhyuday N. Jagannatha
Bhanu Pratap Singh Rawat
Hong-ye Yu
CLL
KELM
50
7
0
05 Apr 2020
Syntax-driven Iterative Expansion Language Models for Controllable Text Generation
Noe Casas
José A. R. Fonollosa
Marta R. Costa-jussá
54
11
0
05 Apr 2020
Previous
1
2
3
...
63
64
65
...
69
70
71
Next