Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.03762
Cited By
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Attention Is All You Need"
38 / 19,538 papers shown
Title
ATRank: An Attention-Based User Behavior Modeling Framework for Recommendation
Chang Zhou
Jinze Bai
Junshuai Song
Xiaofei Liu
Zhengchao Zhao
Xiusi Chen
Jun Gao
HAI
41
307
0
17 Nov 2017
Image Matters: Visually modeling user behaviors using Advanced Model Server
T. Ge
Liqin Zhao
Guorui Zhou
Keyu Chen
Shuying Liu
...
Sui Huang
Qing Cui
Xiaoqiang Zhu
Yu Zhang
Kun Gai
32
41
0
17 Nov 2017
FusionNet: Fusing via Fully-Aware Attention with Application to Machine Comprehension
Hsin-Yuan Huang
Chenguang Zhu
Yelong Shen
Weizhu Chen
FedML
38
183
0
16 Nov 2017
Classical Structured Prediction Losses for Sequence to Sequence Learning
Sergey Edunov
Myle Ott
Michael Auli
David Grangier
MarcÁurelio Ranzato
AIMat
56
185
0
14 Nov 2017
QuickEdit: Editing Text & Translations by Crossing Words Out
David Grangier
Michael Auli
KELM
31
10
0
13 Nov 2017
Few-Shot Learning with Graph Neural Networks
Victor Garcia Satorras
Joan Bruna
GNN
54
1,230
0
10 Nov 2017
Attend and Diagnose: Clinical Time Series Analysis using Attention Models
Huan-Zhi Song
Deepta Rajan
Jayaraman J. Thiagarajan
A. Spanias
MLAU
52
448
0
10 Nov 2017
Attentional Pooling for Action Recognition
Rohit Girdhar
Deva Ramanan
30
319
0
04 Nov 2017
Fixing a Broken ELBO
Alexander A. Alemi
Ben Poole
Ian S. Fischer
Joshua V. Dillon
Rif A. Saurous
Kevin Patrick Murphy
DRL
BDL
39
80
0
01 Nov 2017
Paraphrase Generation with Deep Reinforcement Learning
Zichao Li
Xin Jiang
Lifeng Shang
Hang Li
OffRL
27
213
0
01 Nov 2017
Phase Conductor on Multi-layered Attentions for Machine Comprehension
R. Liu
Wei Wei
Weiguang Mao
M. Chikina
40
22
0
28 Oct 2017
Social Attention: Modeling Attention in Human Crowds
Anirudh Vemula
Katharina Muelling
Jean Oh
HAI
48
632
0
12 Oct 2017
Improving Lexical Choice in Neural Machine Translation
Toan Q. Nguyen
David Chiang
29
86
0
03 Oct 2017
Attentive Convolution: Equipping CNNs with RNN-style Attention Mechanisms
Wenpeng Yin
Hinrich Schütze
38
41
0
02 Oct 2017
Application of a Hybrid Bi-LSTM-CRF model to the task of Russian Named Entity Recognition
L. T. Anh
M. Y. Arkhipov
M. Burtsev
18
37
0
27 Sep 2017
Generating Sentences by Editing Prototypes
Kelvin Guu
Tatsunori B. Hashimoto
Yonatan Oren
Percy Liang
30
316
0
26 Sep 2017
DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding
Tao Shen
Dinesh Manocha
Guodong Long
Jing Jiang
Shirui Pan
Chengqi Zhang
16
749
0
14 Sep 2017
Natural Language Inference over Interaction Space
Yichen Gong
Heng Luo
Jian Zhang
26
264
0
13 Sep 2017
Deep Learning Techniques for Music Generation -- A Survey
Jean-Pierre Briot
Gaëtan Hadjeres
F. Pachet
MGen
42
297
0
05 Sep 2017
Squeeze-and-Excitation Networks
Jie Hu
Li Shen
Samuel Albanie
Gang Sun
Enhua Wu
171
26,077
0
05 Sep 2017
Revisiting the Effectiveness of Off-the-shelf Temporal Modeling Approaches for Large-scale Video Classification
Yunlong Bian
Chuang Gan
Xiao-Chang Liu
Fu Li
Xiang Long
Yandong Li
Heng Qi
Jie Zhou
Shilei Wen
Yuanqing Lin
18
48
0
12 Aug 2017
Recent Trends in Deep Learning Based Natural Language Processing
Tom Young
Devamanyu Hazarika
Soujanya Poria
Min Zhang
37
2,828
0
09 Aug 2017
Dual Supervised Learning
Yingce Xia
Tao Qin
Wei-neng Chen
Jiang Bian
Nenghai Yu
Tie-Yan Liu
SSL
32
143
0
03 Jul 2017
VAIN: Attentional Multi-agent Predictive Modeling
Yedid Hoshen
GNN
41
237
0
19 Jun 2017
One Model To Learn Them All
Lukasz Kaiser
Aidan Gomez
Noam M. Shazeer
Ashish Vaswani
Niki Parmar
Llion Jones
Jakob Uszkoreit
VLM
ViT
30
333
0
16 Jun 2017
Depthwise Separable Convolutions for Neural Machine Translation
Lukasz Kaiser
Aidan Gomez
François Chollet
41
278
0
09 Jun 2017
Reinforced Mnemonic Reader for Machine Reading Comprehension
Minghao Hu
Yuxing Peng
Zhen Huang
Xipeng Qiu
Furu Wei
Ming Zhou
RALM
AIMat
19
69
0
08 May 2017
Japanese Sentiment Classification using a Tree-Structured Long Short-Term Memory with Attention
R. Miyazaki
Mamoru Komachi
38
2
0
04 Apr 2017
Improving Neural Machine Translation with Conditional Sequence Generative Adversarial Nets
Zhen-Le Yang
Wei Chen
Feng Wang
Bo Xu
GAN
AI4CE
41
169
0
15 Mar 2017
Online Meta-learning by Parallel Algorithm Competition
Stefan Elfwing
E. Uchibe
Kenji Doya
31
22
0
24 Feb 2017
Symbolic, Distributed and Distributional Representations for Natural Language Processing in the Era of Deep Learning: a Survey
L. Ferrone
Fabio Massimo Zanzotto
39
37
0
02 Feb 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
109
1,505
0
25 Jan 2017
Boosting Neural Machine Translation
Dakun Zhang
Jungi Kim
Josep Crego
Jean Senellart
AI4CE
23
26
0
19 Dec 2016
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model
Marcella Cornia
Lorenzo Baraldi
G. Serra
Rita Cucchiara
40
550
0
29 Nov 2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhehuai Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
718
6,754
0
26 Sep 2016
Quantifying the probable approximation error of probabilistic inference programs
Marco F. Cusumano-Towner
Vikash K. Mansinghka
33
7
0
31 May 2016
Impact of Power System Partitioning on the Efficiency of Distributed Multi-Step Optimization
Dongliang Chen
A. Bucchiarone
Zhihan Lv
25
12
0
31 May 2016
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
223
7,934
0
17 Aug 2015
Previous
1
2
3
...
389
390
391