1706.03762
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Papers citing
"Attention Is All You Need"
50 / 18,803 papers shown
Title
Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language
Yusuke Yasuda
Xin Wang
Shinji Takaki
Junichi Yamagishi
22
86
0
29 Oct 2018
Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection
Yuhan Shen
Ke-Xin He
Weiqiang Zhang
11
17
0
29 Oct 2018
Integrating Transformer and Paraphrase Rules for Sentence Simplification
Sanqiang Zhao
Rui Meng
Daqing He
Andi Saptono
B. Parmanto
35
120
0
26 Oct 2018
TarMAC: Targeted Multi-Agent Communication
Abhishek Das
Théophile Gervet
Joshua Romoff
Dhruv Batra
Devi Parikh
Michael G. Rabbat
Joelle Pineau
22
378
0
26 Oct 2018
Engaging Image Captioning Via Personality
Kurt Shuster
Samuel Humeau
Hexiang Hu
Antoine Bordes
Jason Weston
40
149
0
25 Oct 2018
Band Selection from Hyperspectral Images Using Attention-based Convolutional Neural Networks
Hanadi El Achi
Lukasz Tulczyjew
Michal Marcinkiewicz
Z. Kanaan
25
23
0
24 Oct 2018
Multi-Head Attention with Disagreement Regularization
Jian Li
Zhaopeng Tu
Baosong Yang
Michael R. Lyu
Tong Zhang
27
145
0
24 Oct 2018
AUNet: Attention-guided dense-upsampling networks for breast mass segmentation in whole mammograms
Hui Sun
Cheng Li
Boqiang Liu
Hairong Zheng
David Dagan Feng
Shanshan Wang
27
120
0
24 Oct 2018
Area Attention
Yang Li
Lukasz Kaiser
Samy Bengio
Si Si
31
20
0
23 Oct 2018
Graph Convolutional Reinforcement Learning
Jiechuan Jiang
Chen Dun
Tiejun Huang
Zongqing Lu
GNN
11
335
0
22 Oct 2018
Named Entity Disambiguation using Deep Learning on Graphs
A. Cetoli
Mohammad Akbari
Stefano Bragaglia
Andrew D. O'Harney
Marc Sloan
19
19
0
22 Oct 2018
Abstractive Summarization Using Attentive Neural Techniques
Jacob Krantz
Jugal Kalita
AIMat
19
11
0
20 Oct 2018
Sequential Context Encoding for Duplicate Removal
Lu Qi
Shu Liu
Jianping Shi
Jiaya Jia
25
23
0
20 Oct 2018
Supervising strong learners by amplifying weak experts
Paul Christiano
Buck Shlegeris
Dario Amodei
27
114
0
19 Oct 2018
STACL: Simultaneous Translation with Implicit Anticipation and Controllable Latency using Prefix-to-Prefix Framework
Mingbo Ma
Liang Huang
Hao Xiong
Renjie Zheng
Kaibo Liu
...
Zhongjun He
Hairong Liu
Xing Li
Hua Wu
Haifeng Wang
24
30
0
19 Oct 2018
Micro-Browsing Models for Search Snippets
Muhammad Asiful Islam
R. Srikant
Sugato Basu
LRM
16
2
0
18 Oct 2018
Semantic Parsing for Task Oriented Dialog using Hierarchical Representations
S. Gupta
Rushin Shah
Mrinal Mohit
Anuj Kumar
M. Lewis
31
202
0
18 Oct 2018
An Analysis of Attention Mechanisms: The Case of Word Sense Disambiguation in Neural Machine Translation
Gongbo Tang
Rico Sennrich
Joakim Nivre
27
84
0
17 Oct 2018
Sequence to Sequence Mixture Model for Diverse Machine Translation
Xuanli He
Gholamreza Haffari
Mohammad Norouzi
20
57
0
17 Oct 2018
SCALE-Sim: Systolic CNN Accelerator Simulator
A. Samajdar
Yuhao Zhu
P. Whatmough
Matthew Mattina
Tushar Krishna
30
137
0
16 Oct 2018
The CoNLL–SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection
Ryan Cotterell
Christo Kirov
John Sylak-Glassman
Géraldine Walther
Ekaterina Vylomova
...
Garrett Nicolai
Miikka Silfverberg
David Yarowsky
Jason Eisner
Mans Hulden
23
148
0
16 Oct 2018
Sequence-to-Sequence Acoustic Modeling for Voice Conversion
Jing-Xuan Zhang
Zhenhua Ling
Li-Juan Liu
Yuan Jiang
Lirong Dai
19
129
0
16 Oct 2018
Quasi-hyperbolic momentum and Adam for deep learning
Jerry Ma
Denis Yarats
ODL
84
129
0
16 Oct 2018
Super Characters: A Conversion from Sentiment Classification to Image Classification
Baohua Sun
Ling Yang
Patrick Dong
Wenhan Zhang
Jason Dong
Charles Young
25
32
0
15 Oct 2018
Robust Neural Machine Translation with Joint Textual and Phonetic Embedding
Hairong Liu
Mingbo Ma
Liang Huang
Hao Xiong
Zhongjun He
AAML
46
59
0
15 Oct 2018
Trellis Networks for Sequence Modeling
Shaojie Bai
J. Zico Kolter
V. Koltun
25
145
0
15 Oct 2018
MeanSum: A Neural Model for Unsupervised Multi-document Abstractive Summarization
Eric Chu
Peter J. Liu
20
19
0
12 Oct 2018
PointGrow: Autoregressively Learned Point Cloud Generation with Self-Attention
Yongbin Sun
Yue Wang
Ziwei Liu
J. Siegel
Sanjay E. Sarma
3DPC
31
195
0
12 Oct 2018
U-Net: Machine Reading Comprehension with Unanswerable Questions
Fu Sun
Linyang Li
Xipeng Qiu
Yang Liu
30
47
0
12 Oct 2018
Bayesian Deep Convolutional Networks with Many Channels are Gaussian Processes
Roman Novak
Lechao Xiao
Jaehoon Lee
Yasaman Bahri
Greg Yang
Jiri Hron
Daniel A. Abolafia
Jeffrey Pennington
Jascha Narain Sohl-Dickstein
UQCV
BDL
25
307
0
11 Oct 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
103
93,232
0
11 Oct 2018
End-to-End Content and Plan Selection for Data-to-Text Generation
Sebastian Gehrmann
Falcon Z. Dai
H. Elder
Alexander M. Rush
28
70
0
10 Oct 2018
Towards Two-Dimensional Sequence to Sequence Model in Neural Machine Translation
Parnia Bahar
Christopher Brix
Hermann Ney
19
24
0
09 Oct 2018
Improving the Transformer Translation Model with Document-Level Context
Jiacheng Zhang
Huanbo Luan
Maosong Sun
Feifei Zhai
Jingfang Xu
Min Zhang
Yang Liu
51
250
0
08 Oct 2018
CHOPT: Automated Hyperparameter Optimization Framework for Cloud-Based Machine Learning Platforms
Jingwoong Kim
Minkyu Kim
Heungseok Park
Ernar Kusdavletov
Dongjun Lee
A. Kim
Ji-Hoon Kim
Jung-Woo Ha
Nako Sung
34
14
0
08 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
Md Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM
3DV
45
761
0
06 Oct 2018
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Shariq Iqbal
Fei Sha
6
738
0
05 Oct 2018
CanvasGAN: A simple baseline for text to image generation by incrementally patching a canvas
Amanpreet Singh
Sharan Agrawal
DiffM
31
5
0
05 Oct 2018
Integrating Weakly Supervised Word Sense Disambiguation into Neural Machine Translation
X. Pu
Nikolaos Pappas
James Henderson
Andrei Popescu-Belis
19
40
0
05 Oct 2018
Learning Depth with Convolutional Spatial Propagation Network
Xinjing Cheng
Peng Wang
Ruigang Yang
3DV
3DPC
MDE
52
311
0
04 Oct 2018
Seq2Slate: Re-ranking and Slate Optimization with RNNs
Irwan Bello
Sayali Kulkarni
Sagar Jain
Craig Boutilier
Ed H. Chi
Elad Eban
Xiyang Luo
Alan Mackey
Ofer Meshi
33
91
0
04 Oct 2018
WAIC, but Why? Generative Ensembles for Robust Anomaly Detection
Hyun-Jae Choi
Eric Jang
Alexander A. Alemi
OODD
20
82
0
02 Oct 2018
Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks
Juho Lee
Yoonho Lee
Jungtaek Kim
Adam R. Kosiorek
Seungjin Choi
Yee Whye Teh
23
274
0
01 Oct 2018
Interpretable Spatio-temporal Attention for Video Action Recognition
Lili Meng
Bo Zhao
B. Chang
Gao Huang
Wei Sun
Fred Tung
Leonid Sigal
33
83
0
01 Oct 2018
Phrase-Based Attentions
Phi Xuan Nguyen
Chenyu You
14
8
0
30 Sep 2018
Adaptive Input Representations for Neural Language Modeling
Alexei Baevski
Michael Auli
32
388
0
28 Sep 2018
Crowd-Robot Interaction: Crowd-aware Robot Navigation with Attention-based Deep Reinforcement Learning
Changan Chen
Yuejiang Liu
S. Kreiss
Alexandre Alahi
HAI
44
502
0
24 Sep 2018
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
49
670
0
21 Sep 2018
Capacity Control of ReLU Neural Networks by Basis-path Norm
Shuxin Zheng
Qi Meng
Huishuai Zhang
Wei-neng Chen
Nenghai Yu
Tie-Yan Liu
24
23
0
19 Sep 2018
Multi-task Learning with Sample Re-weighting for Machine Reading Comprehension
Yichong Xu
Xiaodong Liu
Yelong Shen
Jingjing Liu
Jianfeng Gao
23
51
0
18 Sep 2018