Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.03762
Cited By
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Attention Is All You Need"
50 / 18,000 papers shown
Title
Hierarchical Autoregressive Image Models with Auxiliary Decoders
J. Fauw
Sander Dieleman
Karen Simonyan
GAN
30
37
0
06 Mar 2019
Selective Sensor Fusion for Neural Visual-Inertial Odometry
Changhao Chen
Stefano Rosa
Yishu Miao
Chris Xiaoxuan Lu
Wei Wu
Andrew Markham
A. Trigoni
22
132
0
04 Mar 2019
VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation
Manoj Kumar
Mohammad Babaeizadeh
D. Erhan
Chelsea Finn
Sergey Levine
Laurent Dinh
Durk Kingma
VGen
25
131
0
04 Mar 2019
Collaborative Spatio-temporal Feature Learning for Video Action Recognition
Chong Li
Qiaoyong Zhong
Di Xie
Shiliang Pu
27
82
0
04 Mar 2019
Calibration of Encoder Decoder Models for Neural Machine Translation
Aviral Kumar
Sunita Sarawagi
27
98
0
03 Mar 2019
Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning
Ruo-Ze Liu
Haifeng Guo
Xiaozhong Ji
Yang Yu
Zhen-Jia Pang
Zitai Xiao
Yuzhou Wu
Tong Lu
OffRL
19
13
0
02 Mar 2019
Outcome-Driven Clustering of Acute Coronary Syndrome Patients using Multi-Task Neural Network with Attention
Eryu Xia
Xin Du
Jing Mei
Wen Sun
Suijun Tong
...
Jian Sheng
Jian Li
Changsheng Ma
Jianzeng Dong
Shaochun Li
12
10
0
01 Mar 2019
Chinese-Japanese Unsupervised Neural Machine Translation Using Sub-character Level Information
Longtu Zhang
Mamoru Komachi
14
10
0
01 Mar 2019
Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data
Wei-Ye Zhao
Liang Wang
Kewei Shen
Ruoyu Jia
Jingming Liu
19
210
0
01 Mar 2019
Massively Multilingual Neural Machine Translation
Roee Aharoni
Melvin Johnson
Orhan Firat
LRM
AI4CE
17
482
0
28 Feb 2019
Infer Your Enemies and Know Yourself, Learning in Real-Time Bidding with Partially Observable Opponents
Manxing Du
Alexander I. Cowen-Rivers
Ying Wen
Phu Sakulwongtana
Jun Wang
M. Brorsson
R. State
16
1
0
28 Feb 2019
Representation Learning for Recommender Systems with Application to the Scientific Literature
Robin Brochier
16
5
0
28 Feb 2019
Link Prediction with Mutual Attention for Text-Attributed Networks
Robin Brochier
Adrien Guille
Julien Velcin
16
12
0
28 Feb 2019
Deep learning in bioinformatics: introduction, application, and perspective in big data era
Yu Li
Chao Huang
Lizhong Ding
Zhongxiao Li
Yijie Pan
Xin Gao
AI4CE
24
295
0
28 Feb 2019
BERT for Joint Intent Classification and Slot Filling
Qian Chen
Zhu Zhuo
Wen Wang
VLM
16
545
0
28 Feb 2019
Financial series prediction using Attention LSTM
Sangyeon Kim
Myung-joo Kang
AI4TS
HAI
23
51
0
28 Feb 2019
Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions
Omid Rohanian
Shiva Taslimipoor
Samaneh Kouchaki
L. Ha
R. Mitkov
27
26
0
27 Feb 2019
Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers
Baihan Lin
16
2
0
27 Feb 2019
Still a Pain in the Neck: Evaluating Text Representations on Lexical Composition
Vered Shwartz
Ido Dagan
CoGe
27
78
0
27 Feb 2019
Attributes-aided Part Detection and Refinement for Person Re-identification
Shuzhao Li
Huimin Yu
Wei Huang
Jing Zhang
30
52
0
27 Feb 2019
Multilingual Neural Machine Translation with Knowledge Distillation
Xu Tan
Yi Ren
Di He
Tao Qin
Zhou Zhao
Tie-Yan Liu
20
248
0
27 Feb 2019
EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs
A. Pareja
Giacomo Domeniconi
Jie Chen
Tengfei Ma
Toyotaro Suzumura
H. Kanezashi
Tim Kaler
Tao B. Schardl
Charles E. Leisersen
GNN
52
1,041
0
26 Feb 2019
Attention is not Explanation
Sarthak Jain
Byron C. Wallace
FAtt
31
1,299
0
26 Feb 2019
The State of Sparsity in Deep Neural Networks
Trevor Gale
Erich Elsen
Sara Hooker
33
745
0
25 Feb 2019
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
Gi-Cheon Kang
Jaeseo Lim
Byoung-Tak Zhang
22
72
0
25 Feb 2019
Attentional Encoder Network for Targeted Sentiment Classification
Youwei Song
Jiahai Wang
Tao Jiang
Zhiyue Liu
Yanghui Rao
6
275
0
25 Feb 2019
Star-Transformer
Qipeng Guo
Xipeng Qiu
Pengfei Liu
Yunfan Shao
Xiangyang Xue
Zheng-Wei Zhang
27
262
0
25 Feb 2019
Enhancing Clinical Concept Extraction with Contextual Embeddings
Yuqi Si
Jingqi Wang
Hua Xu
Kirk Roberts
AI4MH
26
285
0
22 Feb 2019
Improving Multilingual Sentence Embedding using Bi-directional Dual Encoder with Additive Margin Softmax
Yinfei Yang
Gustavo Hernández Ábrego
Steve Yuan
Mandy Guo
Qinlan Shen
Daniel Cer
Yun-hsuan Sung
B. Strope
R. Kurzweil
52
115
0
22 Feb 2019
Non-Autoregressive Machine Translation with Auxiliary Regularization
Yiren Wang
Fei Tian
Di He
Tao Qin
ChengXiang Zhai
Tie-Yan Liu
16
158
0
22 Feb 2019
Deep Discriminative Representation Learning with Attention Map for Scene Classification
Jun Yu Li
Daoyu Lin
Yang Wang
Guangluan Xu
C. Ding
28
81
0
21 Feb 2019
Audio-Linguistic Embeddings for Spoken Sentences
Albert Haque
Michelle Guo
Prateek Verma
Li Fei-Fei
28
51
0
20 Feb 2019
Mixture Models for Diverse Machine Translation: Tricks of the Trade
T. Shen
Myle Ott
Michael Auli
MarcÁurelio Ranzato
MoE
33
148
0
20 Feb 2019
Semantic Neural Machine Translation using AMR
Linfeng Song
D. Gildea
Yue Zhang
Zhiguo Wang
Jinsong Su
22
141
0
19 Feb 2019
Context-Aware Self-Attention Networks
Baosong Yang
Jian Li
Derek F. Wong
Lidia S. Chao
Xing Wang
Zhaopeng Tu
39
113
0
15 Feb 2019
Situation-Aware Pedestrian Trajectory Prediction with Spatio-Temporal Attention Model
Sirin Haddad
Meiqing Wu
He Wei
S. Lam
21
56
0
13 Feb 2019
Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning
Frederik Benzing
M. Gauy
Asier Mujika
A. Martinsson
Angelika Steger
23
22
0
11 Feb 2019
Unsupervised Polyglot Text To Speech
Eliya Nachmani
Lior Wolf
13
42
0
06 Feb 2019
Fine-Grained Temporal Relation Extraction
Siddharth Vashishtha
Benjamin Van Durme
A. White
NAI
31
62
0
04 Feb 2019
Improving Question Answering with External Knowledge
Xiaoman Pan
Kai Sun
Dian Yu
Jianshu Chen
Heng Ji
Claire Cardie
Dong Yu
KELM
19
66
0
03 Feb 2019
Multi-Task Deep Neural Networks for Natural Language Understanding
Xiaodong Liu
Pengcheng He
Weizhu Chen
Jianfeng Gao
AI4CE
24
1,261
0
31 Jan 2019
End-to-End Learned Early Classification of Time Series for In-Season Crop Type Mapping
M. Rußwurm
Nicolas Courty
Rémi Emonet
Sébastien Lefèvre
D. Tuia
R. Tavenard
AI4TS
25
57
0
30 Jan 2019
Latent Normalizing Flows for Discrete Sequences
Zachary M. Ziegler
Alexander M. Rush
BDL
DRL
24
122
0
29 Jan 2019
Visual Rhythm Prediction with Feature-Aligning Network
Yutong Xie
Haiyang Wang
Yan Hao
Zihao Xu
32
5
0
29 Jan 2019
Glyce: Glyph-vectors for Chinese Character Representations
Yuxian Meng
Wei Wu
Fei Wang
Xiaoya Li
Ping Nie
J. Mei
Muyu Li
Qinghong Han
Xiaofei Sun
Jiwei Li
VLM
27
190
0
29 Jan 2019
Conditioning by adaptive sampling for robust design
David H. Brookes
Hahnbeom Park
Jennifer Listgarten
21
193
0
29 Jan 2019
Evaluating Word Embedding Models: Methods and Experimental Results
Bin Wang
Angela Wang
Fenxiao Chen
Yun Cheng Wang
C.-C. Jay Kuo
ELM
18
260
0
28 Jan 2019
Semantic Relation Classification via Bidirectional LSTM Networks with Entity-aware Attention using Latent Entity Typing
Joohong Lee
Sang-gyu Seo
Y. Choi
30
116
0
23 Jan 2019
Hypergraph Convolution and Hypergraph Attention
S. Bai
Feihu Zhang
Philip Torr
GNN
26
612
0
23 Jan 2019
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents
Thomas Wolf
Victor Sanh
Julien Chaumond
Clement Delangue
40
493
0
23 Jan 2019
Previous
1
2
3
...
352
353
354
...
358
359
360
Next