Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.03762
Cited By
v1
v2
v3
v4
v5
v6
v7 (latest)
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Attention Is All You Need"
50 / 2,193 papers shown
Title
Pretrained Language Models for Dialogue Generation with Multiple Input Sources
Yu Cao
Wei Bi
Meng Fang
Dacheng Tao
146
29
0
15 Oct 2020
DeepRemaster: Temporal Source-Reference Attention Networks for Comprehensive Video Enhancement
S. Iizuka
E. Simo-Serra
143
39
0
18 Sep 2020
How Have We Reacted To The COVID-19 Pandemic? Analyzing Changing Indian Emotions Through The Lens of Twitter
Rajdeep Mukherjee
S. Poddar
Atharva Naik
Soham Dasgupta
76
5
0
20 Aug 2020
Neural Networks and Quantum Field Theory
James Halverson
Anindita Maiti
Keegan Stoner
74
78
0
19 Aug 2020
S^3-Rec: Self-Supervised Learning for Sequential Recommendation with Mutual Information Maximization
Kun Zhou
Haibo Wang
Wayne Xin Zhao
Yutao Zhu
Sirui Wang
Fuzheng Zhang
Zhongyuan Wang
Ji-Rong Wen
137
820
0
18 Aug 2020
MIDAS: Multi-agent Interaction-aware Decision-making with Adaptive Strategies for Urban Autonomous Navigation
Xiaoyi Chen
Pratik Chaudhari
121
4
0
17 Aug 2020
DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act Recognition and Sentiment Classification
Libo Qin
Wanxiang Che
Yangming Li
Minheng Ni
Ting Liu
86
94
0
16 Aug 2020
Wavelet Denoising and Attention-based RNN-ARIMA Model to Predict Forex Price
Zhiwen Zeng
Matloob Khushi
41
29
0
16 Aug 2020
Revisiting Low Resource Status of Indian Languages in Machine Translation
Jerin Philip
Shashank Siripragada
Vinay P. Namboodiri
C. V. Jawahar
61
28
0
11 Aug 2020
Describe What to Change: A Text-guided Unsupervised Image-to-Image Translation Approach
Yahui Liu
Marco De Nadai
Deng Cai
Huayang Li
Xavier Alameda-Pineda
N. Sebe
Bruno Lepri
84
59
0
10 Aug 2020
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Wen-Chin Huang
Tomoki Hayashi
Yi-Chiao Wu
Hirokazu Kameoka
Tomoki Toda
103
40
0
07 Aug 2020
Adversarial Examples on Object Recognition: A Comprehensive Survey
A. Serban
E. Poll
Joost Visser
AAML
99
73
0
07 Aug 2020
Fine-grained Iterative Attention Network for TemporalLanguage Localization in Videos
Xiaoye Qu
Peng Tang
Zhikang Zhou
Yu Cheng
Jianfeng Dong
Pan Zhou
77
92
0
06 Aug 2020
Match
2
^2
2
: A Matching over Matching Model for Similar Question Identification
Zizhen Wang
Yixing Fan
Jiafeng Guo
Liu Yang
Ruqing Zhang
Yanyan Lan
Xueqi Cheng
Hui Jiang
Xiaozhao Wang
115
15
0
21 Jun 2020
Class-Attentive Diffusion Network for Semi-Supervised Classification
Jongin Lim
Daeho Um
H. Chang
D. Jo
J. Choi
178
14
0
18 Jun 2020
Categorical Normalizing Flows via Continuous Transformations
Phillip Lippe
E. Gavves
BDL
98
43
0
17 Jun 2020
FFR v1.1: Fon-French Neural Machine Translation
Bonaventure F. P. Dossou
Chris C. Emezue
64
26
0
14 Jun 2020
Learning the Travelling Salesperson Problem Requires Rethinking Generalization
Chaitanya K. Joshi
Quentin Cappart
Louis-Martin Rousseau
T. Laurent
189
120
0
12 Jun 2020
Learning Graph Models for Retrosynthesis Prediction
Vignesh Ram Somnath
Charlotte Bunne
Connor W. Coley
Andreas Krause
Regina Barzilay
70
92
0
12 Jun 2020
Ansor: Generating High-Performance Tensor Programs for Deep Learning
Lianmin Zheng
Chengfan Jia
Minmin Sun
Zhao Wu
Cody Hao Yu
...
Jun Yang
Danyang Zhuo
Koushik Sen
Joseph E. Gonzalez
Ion Stoica
142
402
0
11 Jun 2020
Disentangled Non-Local Neural Networks
Minghao Yin
Zhuliang Yao
Yue Cao
Xiu Li
Zheng Zhang
Stephen Lin
Han Hu
118
328
0
11 Jun 2020
Pointer Graph Networks
Petar Velivcković
Lars Buesing
Matthew Overlan
Razvan Pascanu
Oriol Vinyals
Charles Blundell
GNN
103
62
0
11 Jun 2020
Revisiting Few-sample BERT Fine-tuning
Tianyi Zhang
Felix Wu
Arzoo Katiyar
Kilian Q. Weinberger
Yoav Artzi
171
445
0
10 Jun 2020
AMEIR: Automatic Behavior Modeling, Interaction Exploration and MLP Investigation in the Recommender System
Pengyu Zhao
Kecheng Xiao
Yuanxing Zhang
Kaigui Bian
Wei Yan
93
16
0
10 Jun 2020
Self-Supervised Reinforcement Learning for Recommender Systems
Xin Xin
Alexandros Karatzoglou
Ioannis Arapakis
J. Jose
SSL
OffRL
139
202
0
10 Jun 2020
Extrapolation for Large-batch Training in Deep Learning
Tao R. Lin
Lingjing Kong
Sebastian U. Stich
Martin Jaggi
77
36
0
10 Jun 2020
Knowing your FATE: Friendship, Action and Temporal Explanations for User Engagement Prediction on Social Apps
Xianfeng Tang
Yozen Liu
Neil Shah
Xiaolin Shi
P. Mitra
Suhang Wang
AI4TS
91
44
0
10 Jun 2020
Simplify-then-Translate: Automatic Preprocessing for Black-Box Machine Translation
Sneha Mehta
Bahareh Azarnoush
Boris Chen
Avneesh Singh Saluja
Vinith Misra
Ballav Bihani
Ritwik K. Kumar
75
17
0
22 May 2020
Neuroevolution of Self-Interpretable Agents
Yujin Tang
Duong Nguyen
David R Ha
106
113
0
18 Mar 2020
PowerNorm: Rethinking Batch Normalization in Transformers
Sheng Shen
Z. Yao
A. Gholami
Michael W. Mahoney
Kurt Keutzer
BDL
86
16
0
17 Mar 2020
Monocular Depth Estimation Based On Deep Learning: An Overview
Chaoqiang Zhao
Qiyu Sun
Chongzhen Zhang
Yang Tang
Feng Qian
MDE
203
254
0
14 Mar 2020
Channel Interaction Networks for Fine-Grained Image Categorization
Yu Gao
Xintong Han
Xun Wang
Weilin Huang
Matthew R. Scott
141
159
0
11 Mar 2020
Diverse and Admissible Trajectory Forecasting through Multimodal Context Understanding
Seonguk Park
Gyubok Lee
Manoj Bhat
Jimin Seo
Minseok Kang
Jonathan M Francis
Ashwin R. Jadhav
Paul Pu Liang
Louis-Philippe Morency
189
120
0
06 Mar 2020
Benchmarking Graph Neural Networks
Vijay Prakash Dwivedi
Chaitanya K. Joshi
Anh Tuan Luu
T. Laurent
Yoshua Bengio
Xavier Bresson
457
950
0
02 Mar 2020
PhoBERT: Pre-trained language models for Vietnamese
Dat Quoc Nguyen
A. Nguyen
226
355
0
02 Mar 2020
A Dataset Independent Set of Baselines for Relation Prediction in Argument Mining
O. Cocarascu
Elena Cabrio
S. Villata
Francesca Toni
78
7
0
14 Feb 2020
Localized Flood DetectionWith Minimal Labeled Social Media Data Using Transfer Learning
Neha Singh
Nirmalya Roy
A. Gangopadhyay
71
6
0
10 Feb 2020
Can Monolingual Pretrained Models Help Cross-Lingual Classification?
Zewen Chi
Li Dong
Furu Wei
Xian-Ling Mao
Heyan Huang
LRM
VLM
98
13
0
10 Nov 2019
Sentence Meta-Embeddings for Unsupervised Semantic Textual Similarity
Nina Poerner
Ulli Waltinger
Hinrich Schütze
AI4TS
168
20
0
09 Nov 2019
How Decoding Strategies Affect the Verifiability of Generated Text
Luca Massarelli
Fabio Petroni
Aleksandra Piktus
Myle Ott
Tim Rocktaschel
Vassilis Plachouras
Fabrizio Silvestri
Sebastian Riedel
109
50
0
09 Nov 2019
On the Relationship between Self-Attention and Convolutional Layers
Jean-Baptiste Cordonnier
Andreas Loukas
Martin Jaggi
116
535
0
08 Nov 2019
Lipschitz Constrained Parameter Initialization for Deep Transformers
Hongfei Xu
Qiuhui Liu
Josef van Genabith
Deyi Xiong
Jingyi Zhang
ODL
79
26
0
08 Nov 2019
Hierarchical Contextualized Representation for Named Entity Recognition
Ying Luo
Fengshun Xiao
Zhao Hai
101
129
0
06 Nov 2019
Fast Transformer Decoding: One Write-Head is All You Need
Noam M. Shazeer
159
475
0
06 Nov 2019
Learning to Fix Build Errors with Graph2Diff Neural Networks
Daniel Tarlow
Subhodeep Moitra
Andrew Rice
Zimin Chen
Pierre-Antoine Manzagol
Charles Sutton
E. Aftandilian
GNN
107
63
0
04 Nov 2019
MRNN: A Multi-Resolution Neural Network with Duplex Attention for Document Retrieval in the Context of Question Answering
Tolgahan Cakaloglu
Xiaowei Xu
85
2
0
03 Nov 2019
Question Answering for Privacy Policies: Combining Computational and Legal Perspectives
Abhilasha Ravichander
A. Black
Shomir Wilson
Thomas B. Norton
Norman M. Sadeh
AILaw
95
112
0
03 Nov 2019
Dreem Open Datasets: Multi-Scored Sleep Datasets to compare Human and Automated sleep staging
Antoine Guillot
F. Sauvet
E. During
Valentin Thorey
105
107
0
31 Oct 2019
Discourse-Aware Neural Extractive Text Summarization
Jiacheng Xu
Zhe Gan
Yu Cheng
Jingjing Liu
BDL
142
282
0
30 Oct 2019
Inducing brain-relevant bias in natural language processing models
Dan Schwartz
Mariya Toneva
Leila Wehbe
51
80
0
29 Oct 2019
Previous
1
2
3
...
42
43
44
Next