1706.03762
Cited By
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Papers citing
"Attention Is All You Need"
50 / 18,000 papers shown
Title
Membership Inference Attacks on Sequence-to-Sequence Models: Is My Data In Your Machine Translation System?
Sorami Hisamoto
Matt Post
Kevin Duh
MIACV
SLR
28
106
0
11 Apr 2019
Large-Scale Long-Tailed Recognition in an Open World
Ziwei Liu
Zhongqi Miao
Xiaohang Zhan
Jiayun Wang
Boqing Gong
Stella X. Yu
45
1,135
0
10 Apr 2019
Just Jump: Dynamic Neighborhood Aggregation in Graph Neural Networks
Matthias Fey
GNN
19
47
0
09 Apr 2019
Multi-Agent Tensor Fusion for Contextual Trajectory Prediction
Tianyang Zhao
Yifei Xu
Mathew Monfort
Wongun Choi
Chris L. Baker
Yibiao Zhao
Yizhou Wang
Ying Nian Wu
21
393
0
09 Apr 2019
Bilingual-GAN: A Step Towards Parallel Text Generation
Ahmad Rashid
Alan Do-Omri
Md. Akmal Haidar
Qun Liu
Mehdi Rezagholizadeh
16
208
0
09 Apr 2019
Software and application patterns for explanation methods
Maximilian Alber
33
11
0
09 Apr 2019
Towards Universal Object Detection by Domain Attention
Xudong Wang
Zhaowei Cai
Dashan Gao
Nuno Vasconcelos
OOD
25
193
0
09 Apr 2019
Semantic Graph Convolutional Networks for 3D Human Pose Regression
Long Zhao
Xi Peng
Yu Tian
Mubbasir Kapadia
Dimitris N. Metaxas
3DH
16
505
0
06 Apr 2019
PoMo: Generating Entity-Specific Post-Modifiers in Context
Jun Seok Kang
Robert L. Logan IV
Zewei Chu
Yang Chen
Dheeru Dua
Kevin Gimpel
Sameer Singh
Niranjan Balasubramanian
28
11
0
05 Apr 2019
Convolutional Self-Attention Networks
Baosong Yang
Longyue Wang
Derek F. Wong
Lidia S. Chao
Zhaopeng Tu
24
124
0
05 Apr 2019
Modeling Recurrence for Transformer
Jie Hao
Xing Wang
Baosong Yang
Longyue Wang
Jinfeng Zhang
Zhaopeng Tu
45
85
0
05 Apr 2019
NL-FIIT at SemEval-2019 Task 9: Neural Model Ensemble for Suggestion Mining
Samuel Pecar
Marian Simko
Maria Bielikova
14
7
0
05 Apr 2019
Neural Networks for Modeling Source Code Edits
Rui Zhao
David Bieber
Kevin Swersky
Daniel Tarlow
19
12
0
04 Apr 2019
Crowd Transformer Network
Viresh Ranjan
M. Shah
Minh Hoai Nguyen
ViT
35
9
0
04 Apr 2019
A Learned Representation for Scalable Vector Graphics
Raphael Gontijo-Lopes
David R Ha
Douglas Eck
Jonathon Shlens
GAN
OCL
33
113
0
04 Apr 2019
Dialogue Act Classification with Context-Aware Self-Attention
Vipul Raheja
Joel R. Tetreault
22
102
0
04 Apr 2019
ReWE: Regressing Word Embeddings for Regularization of Neural Machine Translation Systems
Inigo Jauregi Unanue
E. Z. Borzeshi
Nazanin Esmaili
Massimo Piccardi
27
8
0
04 Apr 2019
Guiding Extractive Summarization with Question-Answering Rewards
Kristjan Arumae
Fei Liu
31
33
0
04 Apr 2019
75 Languages, 1 Model: Parsing Universal Dependencies Universally
Dan Kondratyuk
Milan Straka
30
263
0
03 Apr 2019
Cross-lingual transfer learning for spoken language understanding
Q. Do
Judith Gaspers
32
20
0
03 Apr 2019
Attentive Mimicking: Better Word Embeddings by Attending to Informative Contexts
Timo Schick
Hinrich Schütze
19
47
0
02 Apr 2019
Analysing Mathematical Reasoning Abilities of Neural Models
D. Saxton
Edward Grefenstette
Felix Hill
Pushmeet Kohli
LRM
33
418
0
02 Apr 2019
Recent Advances in Natural Language Inference: A Survey of Benchmarks, Resources, and Approaches
Shane Storks
Qiaozi Gao
J. Chai
21
128
0
02 Apr 2019
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual neTworks
Cheng-I Jeff Lai
Nanxin Chen
Jesús Villalba
Najim Dehak
AAML
37
158
0
01 Apr 2019
Video Object Segmentation using Space-Time Memory Networks
Seoung Wug Oh
Joon-Young Lee
N. Xu
Seon Joo Kim
VOS
23
701
0
01 Apr 2019
Distant Supervision Relation Extraction with Intra-Bag and Inter-Bag Attentions
Zhiquan Ye
Zhenhua Ling
14
125
0
30 Mar 2019
CUTIE: Learning to Understand Documents with Convolutional Universal Text Information Extractor
Xiaohui Zhao
Endi Niu
Zhuo Wu
Xiaoguang Wang
26
56
0
29 Mar 2019
A Large-Scale Multi-Length Headline Corpus for Analyzing Length-Constrained Headline Generation Model Evaluation
Yuta Hitomi
Yuya Taguchi
Hideaki Tamori
Ko Kikuta
Jiro Nishitoba
Naoaki Okazaki
Kentaro Inui
Manabu Okumura
31
9
0
28 Mar 2019
Interoperability and machine-to-machine translation model with mappings to machine learning tasks
Jacob Nilsson
Fredrik Sandin
J. Delsing
AI4CE
39
18
0
26 Mar 2019
On Measuring Social Biases in Sentence Encoders
Chandler May
Alex Jinpeng Wang
Shikha Bordia
Samuel R. Bowman
Rachel Rudinger
42
591
0
25 Mar 2019
Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph
Yao-Hung Hubert Tsai
S. Divvala
Louis-Philippe Morency
Ruslan Salakhutdinov
Ali Farhadi
27
103
0
25 Mar 2019
Periphery-Fovea Multi-Resolution Driving Model guided by Human Attention
Ye Xia
Jinkyu Kim
John F. Canny
K. Zipser
D. Whitney
24
51
0
24 Mar 2019
Neural Abstractive Text Summarization and Fake News Detection
S. Esmaeilzadeh
Gao Xian Peh
Angela Xu
20
25
0
24 Mar 2019
Table understanding in structured documents
Martin Holecek
A. Hoskovec
P. Baudis
Pavel Klinger
LMTD
17
27
0
22 Mar 2019
Progressive Sparse Local Attention for Video object detection
Chaoxu Guo
Bin Fan
Jie Gu
Qian Zhang
Shiming Xiang
V. Prinet
Chunhong Pan
18
85
0
21 Mar 2019
Linguistic Knowledge and Transferability of Contextual Representations
Nelson F. Liu
Matt Gardner
Yonatan Belinkov
Matthew E. Peters
Noah A. Smith
52
719
0
21 Mar 2019
Selective Attention for Context-aware Neural Machine Translation
Sameen Maruf
André F. T. Martins
Gholamreza Haffari
20
175
0
21 Mar 2019
Neutron: An Implementation of the Transformer Translation Model and its Variants
Hongfei Xu
Qiuhui Liu
32
19
0
18 Mar 2019
Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition
Johannes Michael
R. Labahn
Tobias Grüning
Jochen Zöllner
21
112
0
18 Mar 2019
Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-grained Image Recognition
Heliang Zheng
Jianlong Fu
Zhengjun Zha
Jiebo Luo
15
382
0
14 Mar 2019
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement
W. Kool
H. V. Hoof
Max Welling
71
215
0
14 Mar 2019
Episodic Memory Reader: Learning What to Remember for Question Answering from Streaming Data
Moonsu Han
Minki Kang
Hyunwoo Jung
Sung Ju Hwang
RALM
27
19
0
14 Mar 2019
To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks
Matthew E. Peters
Sebastian Ruder
Noah A. Smith
39
433
0
14 Mar 2019
Learning Parallax Attention for Stereo Image Super-Resolution
Longguang Wang
Yingqian Wang
Zhengfa Liang
Zaiping Lin
Jungang Yang
W. An
Yulan Guo
SupR
29
249
0
14 Mar 2019
Maybe Deep Neural Networks are the Best Choice for Modeling Source Code
Rafael-Michael Karampatsis
Charles Sutton
29
54
0
13 Mar 2019
DeepOBS: A Deep Learning Optimizer Benchmark Suite
Frank Schneider
Lukas Balles
Philipp Hennig
ODL
30
71
0
13 Mar 2019
Neural Network Model Extraction Attacks in Edge Devices by Hearing Architectural Hints
Xing Hu
Ling Liang
Lei Deng
Shuangchen Li
Xinfeng Xie
Yu Ji
Yufei Ding
Chang Liu
T. Sherwood
Yuan Xie
AAML
MLAU
21
36
0
10 Mar 2019
Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks
Kuan Fang
Alexander Toshev
Li Fei-Fei
Silvio Savarese
OffRL
13
200
0
09 Mar 2019
Fast Prototyping a Dialogue Comprehension System for Nurse-Patient Conversations on Symptom Monitoring
Zhengyuan Liu
Jia Hui Hazel Lim
Nur Farah Ain Binte Sahimi
Shao Chuen Tong
Sharon Ong
...
M. Macdonald
Savitha Ramasamy
Pavitra Krishnaswamy
W. Chow
Nancy F. Chen
23
24
0
08 Mar 2019
SR-LSTM: State Refinement for LSTM towards Pedestrian Trajectory Prediction
Pu Zhang
Wanli Ouyang
Pengfei Zhang
Jianru Xue
Nanning Zheng
33
454
0
07 Mar 2019