Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.03762
Cited By
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Attention Is All You Need"
50 / 18,459 papers shown
Title
Generalized Data Augmentation for Low-Resource Translation
Mengzhou Xia
X. Kong
Antonios Anastasopoulos
Graham Neubig
18
119
0
10 Jun 2019
Adversarial Mahalanobis Distance-based Attentive Song Recommender for Automatic Playlist Continuation
Thanh-Binh Tran
Renee Sweeney
Kyumin Lee
36
32
0
08 Jun 2019
Leveraging BERT for Extractive Text Summarization on Lectures
Derek Miller
16
242
0
07 Jun 2019
Assessing incrementality in sequence-to-sequence models
Dennis Ulmer
Dieuwke Hupkes
Elia Bruni
AI4TS
12
5
0
07 Jun 2019
Building a Production Model for Retrieval-Based Chatbots
Kyle Swanson
L. Yu
C. Fox
Jeremy Wohlwend
Tao Lei
32
11
0
07 Jun 2019
Analyzing the Structure of Attention in a Transformer Language Model
Jesse Vig
Yonatan Belinkov
30
357
0
07 Jun 2019
Shared-Private Bilingual Word Embeddings for Neural Machine Translation
Xuebo Liu
Derek F. Wong
Yang Liu
Lidia S. Chao
Tong Xiao
Jingbo Zhu
35
37
0
07 Jun 2019
Learning Adaptive Classifiers Synthesis for Generalized Few-Shot Learning
Han-Jia Ye
Hexiang Hu
De-Chuan Zhan
25
59
0
07 Jun 2019
Multi-scale self-guided attention for medical image segmentation
Ashish Sinha
Jose Dolz
SSeg
31
413
0
07 Jun 2019
FSPool: Learning Set Representations with Featurewise Sort Pooling
Yan Zhang
Jonathon S. Hare
Adam Prugel-Bennett
19
75
0
06 Jun 2019
Visualizing and Measuring the Geometry of BERT
Andy Coenen
Emily Reif
Ann Yuan
Been Kim
Adam Pearce
F. Viégas
Martin Wattenberg
MILM
43
415
0
06 Jun 2019
When Does Label Smoothing Help?
Rafael Müller
Simon Kornblith
Geoffrey E. Hinton
UQCV
61
1,912
0
06 Jun 2019
Towards Interpretable Reinforcement Learning Using Attention Augmented Agents
Alex Mott
Daniel Zoran
Mike Chrzanowski
Daan Wierstra
Danilo Jimenez Rezende
26
188
0
06 Jun 2019
Unsupervised Pivot Translation for Distant Languages
Yichong Leng
Xu Tan
Tao Qin
Xiang-Yang Li
Tie-Yan Liu
33
30
0
06 Jun 2019
Robust Neural Machine Translation with Doubly Adversarial Inputs
Yong Cheng
Lu Jiang
Wolfgang Macherey
AAML
30
254
0
06 Jun 2019
Extracting Symptoms and their Status from Clinical Conversations
Nan Du
Kai Chen
Anjuli Kannan
Linh Tran
Yuhui Chen
Izhak Shafran
20
68
0
05 Jun 2019
Large-Scale Multi-Label Text Classification on EU Legislation
Ilias Chalkidis
Manos Fergadiotis
Prodromos Malakasiotis
Ion Androutsopoulos
AILaw
19
213
0
05 Jun 2019
Sequential Neural Networks as Automata
William Merrill
23
74
0
04 Jun 2019
KERMIT: Generative Insertion-Based Modeling for Sequences
William Chan
Nikita Kitaev
Kelvin Guu
Mitchell Stern
Jakob Uszkoreit
VLM
23
65
0
04 Jun 2019
Sequence Tagging with Contextual and Non-Contextual Subword Representations: A Multilingual Evaluation
Benjamin Heinzerling
Michael Strube
10
35
0
04 Jun 2019
Generative Adversarial Networks in Computer Vision: A Survey and Taxonomy
Zhengwei Wang
Qi She
T. Ward
MedIm
EGVM
29
90
0
04 Jun 2019
Face Parsing with RoI Tanh-Warping
Jinpeng Lin
Hao Yang
Dong Chen
Ming Zeng
Fang Wen
Lu Yuan
3DH
CVBM
38
77
0
04 Jun 2019
Lattice-Based Transformer Encoder for Neural Machine Translation
Fengshun Xiao
Jiangtong Li
Zhao Hai
Rui Wang
Kehai Chen
34
42
0
04 Jun 2019
RTHN: A RNN-Transformer Hierarchical Network for Emotion Cause Extraction
Rui Xia
Mengran Zhang
Zixiang Ding
11
103
0
04 Jun 2019
Transcoding compositionally: using attention to find more generalizable solutions
K. Korrel
Dieuwke Hupkes
Verna Dankers
Elia Bruni
30
31
0
04 Jun 2019
Coherent Comment Generation for Chinese Articles with a Graph-to-Sequence Model
Wei Li
Jingjing Xu
Yancheng He
Shengli Yan
Yunfang Wu
Xu Sun
19
47
0
04 Jun 2019
Learning Attention-based Embeddings for Relation Prediction in Knowledge Graphs
Deepak Nathani
Jatin Chauhan
Charu Sharma
Manohar Kaul
19
482
0
04 Jun 2019
Detecting Local Insights from Global Labels: Supervised & Zero-Shot Sequence Labeling via a Convolutional Decomposition
A. Schmaltz
24
8
0
04 Jun 2019
Episodic Memory in Lifelong Language Learning
Cyprien de Masson dÁutume
Sebastian Ruder
Lingpeng Kong
Dani Yogatama
CLL
KELM
34
281
0
03 Jun 2019
Masked Non-Autoregressive Image Captioning
Junlong Gao
Xi Meng
Shiqi Wang
Xia Li
Shanshe Wang
Siwei Ma
Wen Gao
19
36
0
03 Jun 2019
BAYHENN: Combining Bayesian Deep Learning and Homomorphic Encryption for Secure DNN Inference
Peichen Xie
Bingzhe Wu
Guangyu Sun
BDL
FedML
11
33
0
03 Jun 2019
Efficient 8-Bit Quantization of Transformer Neural Machine Language Translation Model
Aishwarya Bhandare
Vamsi Sripathi
Deepthi Karkada
Vivek V. Menon
Sun Choi
Kushal Datta
V. Saletore
MQ
27
130
0
03 Jun 2019
A Survey of Natural Language Generation Techniques with a Focus on Dialogue Systems - Past, Present and Future Directions
Sashank Santhanam
Samira Shaikh
3DV
31
52
0
02 Jun 2019
Generating Diverse High-Fidelity Images with VQ-VAE-2
Ali Razavi
Aaron van den Oord
Oriol Vinyals
DRL
BDL
21
1,770
0
02 Jun 2019
Domain Adaptation of Neural Machine Translation by Lexicon Induction
Junjie Hu
Mengzhou Xia
Graham Neubig
J. Carbonell
27
75
0
02 Jun 2019
Adversarial Generation and Encoding of Nested Texts
A. Rozental
GAN
16
0
0
01 Jun 2019
Do Human Rationales Improve Machine Explanations?
Julia Strout
Ye Zhang
Raymond J. Mooney
19
57
0
31 May 2019
Investigating an Effective Character-level Embedding in Korean Sentence Classification
Won Ik Cho
Seokhwan Kim
N. Kim
28
8
0
31 May 2019
Point Clouds Learning with Attention-based Graph Convolution Networks
Zhuyang Xie
Junzhou Chen
B. Peng
3DPC
19
53
0
31 May 2019
A Lightweight Recurrent Network for Sequence Modeling
Biao Zhang
Rico Sennrich
27
7
0
30 May 2019
Assessing The Factual Accuracy of Generated Text
Ben Goodrich
Vinay Rao
Mohammad Saleh
Peter J. Liu
HILM
35
185
0
30 May 2019
Graph Normalizing Flows
Jenny Liu
Aviral Kumar
Jimmy Ba
J. Kiros
Kevin Swersky
BDL
GNN
AI4CE
27
155
0
30 May 2019
Hierarchical Transformers for Multi-Document Summarization
Yang Liu
Mirella Lapata
13
294
0
30 May 2019
Neural Consciousness Flow
Xiaoran Xu
Wei Feng
Zhiqing Sun
Zhihong Deng
GNN
AI4CE
27
2
0
30 May 2019
Semantically Conditioned Dialog Response Generation via Hierarchical Disentangled Self-Attention
Wenhu Chen
Jianshu Chen
Pengda Qin
Xifeng Yan
William Yang Wang
25
129
0
30 May 2019
A Simple but Effective Method to Incorporate Multi-turn Context with BERT for Conversational Machine Comprehension
Yasuhito Ohsugi
Itsumi Saito
Kyosuke Nishida
Hisako Asano
J. Tomita
30
43
0
30 May 2019
Adversarial Sub-sequence for Text Generation
Xingyuan Chen
Yanzhe Li
Peng Jin
Jiuhua Zhang
Xinyu Dai
Jiajun Chen
Gang Song
GAN
35
5
0
30 May 2019
Attention: A Big Surprise for Cross-Domain Person Re-Identification
Haijun Liu
Jian Cheng
Shiguang Wang
Wen Wang
OOD
21
9
0
30 May 2019
A Generalized Framework of Sequence Generation with Application to Undirected Sequence Models
Elman Mansimov
Alex Jinpeng Wang
Sean Welleck
Kyunghyun Cho
AIMat
28
46
0
29 May 2019
Unsupervised Paraphrasing without Translation
Aurko Roy
David Grangier
BDL
LRM
11
61
0
29 May 2019
Previous
1
2
3
...
357
358
359
...
368
369
370
Next