Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.03762
Cited By
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Attention Is All You Need"
50 / 19,025 papers shown
Title
BERT for Joint Intent Classification and Slot Filling
Qian Chen
Zhu Zhuo
Wen Wang
VLM
31
546
0
28 Feb 2019
Financial series prediction using Attention LSTM
Sangyeon Kim
Myung-joo Kang
AI4TS
HAI
28
52
0
28 Feb 2019
Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions
Omid Rohanian
Shiva Taslimipoor
Samaneh Kouchaki
L. Ha
R. Mitkov
27
26
0
27 Feb 2019
Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers
Baihan Lin
21
2
0
27 Feb 2019
Still a Pain in the Neck: Evaluating Text Representations on Lexical Composition
Vered Shwartz
Ido Dagan
CoGe
27
79
0
27 Feb 2019
Attributes-aided Part Detection and Refinement for Person Re-identification
Shuzhao Li
Huimin Yu
Wei Huang
Jing Zhang
35
52
0
27 Feb 2019
Multilingual Neural Machine Translation with Knowledge Distillation
Xu Tan
Yi Ren
Di He
Tao Qin
Zhou Zhao
Tie-Yan Liu
25
248
0
27 Feb 2019
EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs
A. Pareja
Giacomo Domeniconi
Jie Chen
Tengfei Ma
Toyotaro Suzumura
H. Kanezashi
Tim Kaler
Tao B. Schardl
Charles E. Leisersen
GNN
52
1,043
0
26 Feb 2019
Attention is not Explanation
Sarthak Jain
Byron C. Wallace
FAtt
31
1,301
0
26 Feb 2019
The State of Sparsity in Deep Neural Networks
Trevor Gale
Erich Elsen
Sara Hooker
33
747
0
25 Feb 2019
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
Gi-Cheon Kang
Jaeseo Lim
Byoung-Tak Zhang
22
72
0
25 Feb 2019
Attentional Encoder Network for Targeted Sentiment Classification
Youwei Song
Jiahai Wang
Tao Jiang
Zhiyue Liu
Yanghui Rao
14
275
0
25 Feb 2019
Star-Transformer
Qipeng Guo
Xipeng Qiu
Pengfei Liu
Yunfan Shao
Xiangyang Xue
Zheng Zhang
27
262
0
25 Feb 2019
Enhancing Clinical Concept Extraction with Contextual Embeddings
Yuqi Si
Jingqi Wang
Hua Xu
Kirk Roberts
AI4MH
29
286
0
22 Feb 2019
Improving Multilingual Sentence Embedding using Bi-directional Dual Encoder with Additive Margin Softmax
Yinfei Yang
Gustavo Hernández Ábrego
Steve Yuan
Mandy Guo
Qinlan Shen
Daniel Cer
Yun-hsuan Sung
B. Strope
R. Kurzweil
52
115
0
22 Feb 2019
Non-Autoregressive Machine Translation with Auxiliary Regularization
Yiren Wang
Fei Tian
Di He
Tao Qin
ChengXiang Zhai
Tie-Yan Liu
24
158
0
22 Feb 2019
Deep Discriminative Representation Learning with Attention Map for Scene Classification
Jun Yu Li
Daoyu Lin
Yang Wang
Guangluan Xu
C. Ding
33
81
0
21 Feb 2019
Audio-Linguistic Embeddings for Spoken Sentences
Albert Haque
Michelle Guo
Prateek Verma
Li Fei-Fei
28
51
0
20 Feb 2019
Mixture Models for Diverse Machine Translation: Tricks of the Trade
T. Shen
Myle Ott
Michael Auli
MarcÁurelio Ranzato
MoE
33
148
0
20 Feb 2019
Semantic Neural Machine Translation using AMR
Linfeng Song
D. Gildea
Yue Zhang
Zhiguo Wang
Jinsong Su
27
141
0
19 Feb 2019
Context-Aware Self-Attention Networks
Baosong Yang
Jian Li
Derek F. Wong
Lidia S. Chao
Xing Wang
Zhaopeng Tu
39
113
0
15 Feb 2019
Situation-Aware Pedestrian Trajectory Prediction with Spatio-Temporal Attention Model
Sirin Haddad
Meiqing Wu
He Wei
S. Lam
21
56
0
13 Feb 2019
Optimal Kronecker-Sum Approximation of Real Time Recurrent Learning
Frederik Benzing
M. Gauy
Asier Mujika
A. Martinsson
Angelika Steger
23
23
0
11 Feb 2019
Unsupervised Polyglot Text To Speech
Eliya Nachmani
Lior Wolf
19
42
0
06 Feb 2019
Fine-Grained Temporal Relation Extraction
Siddharth Vashishtha
Benjamin Van Durme
A. White
NAI
33
62
0
04 Feb 2019
Improving Question Answering with External Knowledge
Xiaoman Pan
Kai Sun
Dian Yu
Jianshu Chen
Heng Ji
Claire Cardie
Dong Yu
KELM
19
66
0
03 Feb 2019
Multi-Task Deep Neural Networks for Natural Language Understanding
Xiaodong Liu
Pengcheng He
Weizhu Chen
Jianfeng Gao
AI4CE
60
1,262
0
31 Jan 2019
Learning and Evaluating General Linguistic Intelligence
Dani Yogatama
Cyprien de Masson dÁutume
Jerome T. Connor
Tomás Kociský
Mike Chrzanowski
...
Angeliki Lazaridou
Wang Ling
Lei Yu
Chris Dyer
Phil Blunsom
ELM
AI4CE
33
209
0
31 Jan 2019
End-to-End Learned Early Classification of Time Series for In-Season Crop Type Mapping
M. Rußwurm
Nicolas Courty
Rémi Emonet
Sébastien Lefèvre
D. Tuia
R. Tavenard
AI4TS
25
57
0
30 Jan 2019
Latent Normalizing Flows for Discrete Sequences
Zachary M. Ziegler
Alexander M. Rush
BDL
DRL
27
123
0
29 Jan 2019
Visual Rhythm Prediction with Feature-Aligning Network
Yutong Xie
Haiyang Wang
Yan Hao
Zihao Xu
32
5
0
29 Jan 2019
Glyce: Glyph-vectors for Chinese Character Representations
Yuxian Meng
Wei Wu
Fei Wang
Xiaoya Li
Ping Nie
J. Mei
Muyu Li
Qinghong Han
Xiaofei Sun
Jiwei Li
VLM
27
190
0
29 Jan 2019
Conditioning by adaptive sampling for robust design
David H. Brookes
Hahnbeom Park
Jennifer Listgarten
26
193
0
29 Jan 2019
Activation Adaptation in Neural Networks
Farnoush Farhadi
V. Nia
Andrea Lodi
AI4CE
29
14
0
28 Jan 2019
Evaluating Word Embedding Models: Methods and Experimental Results
Bin Wang
Angela Wang
Fenxiao Chen
Yun Cheng Wang
C.-C. Jay Kuo
ELM
27
260
0
28 Jan 2019
Semantic Relation Classification via Bidirectional LSTM Networks with Entity-aware Attention using Latent Entity Typing
Joohong Lee
Sang-gyu Seo
Y. Choi
33
116
0
23 Jan 2019
Hypergraph Convolution and Hypergraph Attention
S. Bai
Feihu Zhang
Philip Torr
GNN
31
613
0
23 Jan 2019
TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents
Thomas Wolf
Victor Sanh
Julien Chaumond
Clement Delangue
40
493
0
23 Jan 2019
Pedestrian Attribute Recognition: A Survey
Tianlin Li
Shaofei Zheng
Rui Yang
Aihua Zheng
Zhe Chen
Jin Tang
Bin Luo
CVBM
30
127
0
22 Jan 2019
Cross-lingual Language Model Pretraining
Guillaume Lample
Alexis Conneau
25
2,712
0
22 Jan 2019
Transfer Meets Hybrid: A Synthetic Approach for Cross-Domain Collaborative Filtering with Text
Guangneng Hu
Yu Zhang
Qiang Yang
29
81
0
22 Jan 2019
Adversarial Attacks on Deep Learning Models in Natural Language Processing: A Survey
W. Zhang
Quan Z. Sheng
A. Alhazmi
Chenliang Li
AAML
24
57
0
21 Jan 2019
Explainable Failure Predictions with RNN Classifiers based on Time Series Data
I. Giurgiu
Anika Schumann
AI4TS
11
8
0
20 Jan 2019
Visual Entailment: A Novel Task for Fine-Grained Image Understanding
Ning Xie
Farley Lai
Derek Doran
Asim Kadav
CoGe
56
322
0
20 Jan 2019
Attentive Neural Processes
Hyunjik Kim
A. Mnih
Jonathan Richard Schwarz
M. Garnelo
S. M. Ali Eslami
Dan Rosenbaum
Oriol Vinyals
Yee Whye Teh
54
429
0
17 Jan 2019
Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads
Myeongjae Jeon
Shivaram Venkataraman
Amar Phanishayee
Junjie Qian
Wencong Xiao
Fan Yang
GNN
27
349
0
17 Jan 2019
Learning from Dialogue after Deployment: Feed Yourself, Chatbot!
Braden Hancock
Antoine Bordes
Pierre-Emmanuel Mazaré
Jason Weston
39
190
0
16 Jan 2019
Sentence transition matrix: An efficient approach that preserves sentence semantics
Myeongjun Jang
Pilsung Kang
19
2
0
16 Jan 2019
Exploiting Synchronized Lyrics And Vocal Features For Music Emotion Detection
Loreto Parisi
Simone Francia
Silvio Olivastri
Maria Stella Tavella
26
11
0
15 Jan 2019
Global-to-local Memory Pointer Networks for Task-Oriented Dialogue
Chien-Sheng Wu
R. Socher
Caiming Xiong
24
165
0
15 Jan 2019
Previous
1
2
3
...
372
373
374
...
379
380
381
Next