ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.03762
  4. Cited By
Attention Is All You Need

Attention Is All You Need

12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
    3DV
ArXivPDFHTML

Papers citing "Attention Is All You Need"

50 / 17,643 papers shown
Title
Video Object Segmentation using Space-Time Memory Networks
Video Object Segmentation using Space-Time Memory Networks
Seoung Wug Oh
Joon-Young Lee
N. Xu
Seon Joo Kim
VOS
23
699
0
01 Apr 2019
Distant Supervision Relation Extraction with Intra-Bag and Inter-Bag
  Attentions
Distant Supervision Relation Extraction with Intra-Bag and Inter-Bag Attentions
Zhiquan Ye
Zhenhua Ling
12
125
0
30 Mar 2019
CUTIE: Learning to Understand Documents with Convolutional Universal
  Text Information Extractor
CUTIE: Learning to Understand Documents with Convolutional Universal Text Information Extractor
Xiaohui Zhao
Endi Niu
Zhuo Wu
Xiaoguang Wang
26
55
0
29 Mar 2019
A Large-Scale Multi-Length Headline Corpus for Analyzing
  Length-Constrained Headline Generation Model Evaluation
A Large-Scale Multi-Length Headline Corpus for Analyzing Length-Constrained Headline Generation Model Evaluation
Yuta Hitomi
Yuya Taguchi
Hideaki Tamori
Ko Kikuta
Jiro Nishitoba
Naoaki Okazaki
Kentaro Inui
Manabu Okumura
31
9
0
28 Mar 2019
Interoperability and machine-to-machine translation model with mappings
  to machine learning tasks
Interoperability and machine-to-machine translation model with mappings to machine learning tasks
Jacob Nilsson
Fredrik Sandin
J. Delsing
AI4CE
39
18
0
26 Mar 2019
On Measuring Social Biases in Sentence Encoders
On Measuring Social Biases in Sentence Encoders
Chandler May
Alex Jinpeng Wang
Shikha Bordia
Samuel R. Bowman
Rachel Rudinger
42
591
0
25 Mar 2019
Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph
Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph
Yao-Hung Hubert Tsai
S. Divvala
Louis-Philippe Morency
Ruslan Salakhutdinov
Ali Farhadi
27
103
0
25 Mar 2019
Periphery-Fovea Multi-Resolution Driving Model guided by Human Attention
Periphery-Fovea Multi-Resolution Driving Model guided by Human Attention
Ye Xia
Jinkyu Kim
John F. Canny
K. Zipser
D. Whitney
24
51
0
24 Mar 2019
Neural Abstractive Text Summarization and Fake News Detection
Neural Abstractive Text Summarization and Fake News Detection
S. Esmaeilzadeh
Gao Xian Peh
Angela Xu
20
25
0
24 Mar 2019
Table understanding in structured documents
Table understanding in structured documents
Martin Holecek
A. Hoskovec
P. Baudis
Pavel Klinger
LMTD
15
27
0
22 Mar 2019
Progressive Sparse Local Attention for Video object detection
Progressive Sparse Local Attention for Video object detection
Chaoxu Guo
Bin Fan
Jie Gu
Qian Zhang
Shiming Xiang
V. Prinet
Chunhong Pan
16
85
0
21 Mar 2019
Linguistic Knowledge and Transferability of Contextual Representations
Linguistic Knowledge and Transferability of Contextual Representations
Nelson F. Liu
Matt Gardner
Yonatan Belinkov
Matthew E. Peters
Noah A. Smith
52
718
0
21 Mar 2019
Selective Attention for Context-aware Neural Machine Translation
Selective Attention for Context-aware Neural Machine Translation
Sameen Maruf
André F. T. Martins
Gholamreza Haffari
20
175
0
21 Mar 2019
Neutron: An Implementation of the Transformer Translation Model and its
  Variants
Neutron: An Implementation of the Transformer Translation Model and its Variants
Hongfei Xu
Qiuhui Liu
32
19
0
18 Mar 2019
Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition
Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition
Johannes Michael
R. Labahn
Tobias Grüning
Jochen Zöllner
21
112
0
18 Mar 2019
Looking for the Devil in the Details: Learning Trilinear Attention
  Sampling Network for Fine-grained Image Recognition
Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-grained Image Recognition
Heliang Zheng
Jianlong Fu
Zhengjun Zha
Jiebo Luo
13
382
0
14 Mar 2019
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for
  Sampling Sequences Without Replacement
Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement
W. Kool
H. V. Hoof
Max Welling
71
215
0
14 Mar 2019
Episodic Memory Reader: Learning What to Remember for Question Answering
  from Streaming Data
Episodic Memory Reader: Learning What to Remember for Question Answering from Streaming Data
Moonsu Han
Minki Kang
Hyunwoo Jung
Sung Ju Hwang
RALM
27
19
0
14 Mar 2019
To Tune or Not to Tune? Adapting Pretrained Representations to Diverse
  Tasks
To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks
Matthew E. Peters
Sebastian Ruder
Noah A. Smith
33
433
0
14 Mar 2019
Learning Parallax Attention for Stereo Image Super-Resolution
Learning Parallax Attention for Stereo Image Super-Resolution
Longguang Wang
Yingqian Wang
Zhengfa Liang
Zaiping Lin
Jungang Yang
W. An
Yulan Guo
SupR
29
249
0
14 Mar 2019
Maybe Deep Neural Networks are the Best Choice for Modeling Source Code
Maybe Deep Neural Networks are the Best Choice for Modeling Source Code
Rafael-Michael Karampatsis
Charles Sutton
26
54
0
13 Mar 2019
DeepOBS: A Deep Learning Optimizer Benchmark Suite
DeepOBS: A Deep Learning Optimizer Benchmark Suite
Frank Schneider
Lukas Balles
Philipp Hennig
ODL
28
71
0
13 Mar 2019
Neural Network Model Extraction Attacks in Edge Devices by Hearing
  Architectural Hints
Neural Network Model Extraction Attacks in Edge Devices by Hearing Architectural Hints
Xing Hu
Ling Liang
Lei Deng
Shuangchen Li
Xinfeng Xie
Yu Ji
Yufei Ding
Chang Liu
T. Sherwood
Yuan Xie
AAML
MLAU
21
36
0
10 Mar 2019
Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks
Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks
Kuan Fang
Alexander Toshev
Li Fei-Fei
Silvio Savarese
OffRL
13
199
0
09 Mar 2019
Fast Prototyping a Dialogue Comprehension System for Nurse-Patient
  Conversations on Symptom Monitoring
Fast Prototyping a Dialogue Comprehension System for Nurse-Patient Conversations on Symptom Monitoring
Zhengyuan Liu
Jia Hui Hazel Lim
Nur Farah Ain Binte Sahimi
Shao Chuen Tong
Sharon Ong
...
M. Macdonald
Savitha Ramasamy
Pavitra Krishnaswamy
W. Chow
Nancy F. Chen
23
24
0
08 Mar 2019
SR-LSTM: State Refinement for LSTM towards Pedestrian Trajectory
  Prediction
SR-LSTM: State Refinement for LSTM towards Pedestrian Trajectory Prediction
Pu Zhang
Wanli Ouyang
Pengfei Zhang
Jianru Xue
Nanning Zheng
33
453
0
07 Mar 2019
Hierarchical Autoregressive Image Models with Auxiliary Decoders
Hierarchical Autoregressive Image Models with Auxiliary Decoders
J. Fauw
Sander Dieleman
Karen Simonyan
GAN
30
37
0
06 Mar 2019
Selective Sensor Fusion for Neural Visual-Inertial Odometry
Selective Sensor Fusion for Neural Visual-Inertial Odometry
Changhao Chen
Stefano Rosa
Yishu Miao
Chris Xiaoxuan Lu
Wei Wu
Andrew Markham
A. Trigoni
22
132
0
04 Mar 2019
VideoFlow: A Conditional Flow-Based Model for Stochastic Video
  Generation
VideoFlow: A Conditional Flow-Based Model for Stochastic Video Generation
Manoj Kumar
Mohammad Babaeizadeh
D. Erhan
Chelsea Finn
Sergey Levine
Laurent Dinh
Durk Kingma
VGen
25
131
0
04 Mar 2019
Collaborative Spatio-temporal Feature Learning for Video Action
  Recognition
Collaborative Spatio-temporal Feature Learning for Video Action Recognition
Chong Li
Qiaoyong Zhong
Di Xie
Shiliang Pu
27
82
0
04 Mar 2019
Calibration of Encoder Decoder Models for Neural Machine Translation
Calibration of Encoder Decoder Models for Neural Machine Translation
Aviral Kumar
Sunita Sarawagi
24
98
0
03 Mar 2019
Efficient Reinforcement Learning for StarCraft by Abstract Forward
  Models and Transfer Learning
Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning
Ruo-Ze Liu
Haifeng Guo
Xiaozhong Ji
Yang Yu
Zhen-Jia Pang
Zitai Xiao
Yuzhou Wu
Tong Lu
OffRL
19
13
0
02 Mar 2019
Outcome-Driven Clustering of Acute Coronary Syndrome Patients using
  Multi-Task Neural Network with Attention
Outcome-Driven Clustering of Acute Coronary Syndrome Patients using Multi-Task Neural Network with Attention
Eryu Xia
Xin Du
Jing Mei
Wen Sun
Suijun Tong
...
Jian Sheng
Jian Li
Changsheng Ma
Jianzeng Dong
Shaochun Li
12
10
0
01 Mar 2019
Chinese-Japanese Unsupervised Neural Machine Translation Using
  Sub-character Level Information
Chinese-Japanese Unsupervised Neural Machine Translation Using Sub-character Level Information
Longtu Zhang
Mamoru Komachi
12
10
0
01 Mar 2019
Improving Grammatical Error Correction via Pre-Training a Copy-Augmented
  Architecture with Unlabeled Data
Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data
Wei-Ye Zhao
Liang Wang
Kewei Shen
Ruoyu Jia
Jingming Liu
16
210
0
01 Mar 2019
Massively Multilingual Neural Machine Translation
Massively Multilingual Neural Machine Translation
Roee Aharoni
Melvin Johnson
Orhan Firat
LRM
AI4CE
17
482
0
28 Feb 2019
Infer Your Enemies and Know Yourself, Learning in Real-Time Bidding with
  Partially Observable Opponents
Infer Your Enemies and Know Yourself, Learning in Real-Time Bidding with Partially Observable Opponents
Manxing Du
Alexander I. Cowen-Rivers
Ying Wen
Phu Sakulwongtana
Jun Wang
M. Brorsson
R. State
16
1
0
28 Feb 2019
Representation Learning for Recommender Systems with Application to the
  Scientific Literature
Representation Learning for Recommender Systems with Application to the Scientific Literature
Robin Brochier
16
5
0
28 Feb 2019
Link Prediction with Mutual Attention for Text-Attributed Networks
Link Prediction with Mutual Attention for Text-Attributed Networks
Robin Brochier
Adrien Guille
Julien Velcin
14
12
0
28 Feb 2019
Deep learning in bioinformatics: introduction, application, and
  perspective in big data era
Deep learning in bioinformatics: introduction, application, and perspective in big data era
Yu Li
Chao Huang
Lizhong Ding
Zhongxiao Li
Yijie Pan
Xin Gao
AI4CE
24
295
0
28 Feb 2019
BERT for Joint Intent Classification and Slot Filling
BERT for Joint Intent Classification and Slot Filling
Qian Chen
Zhu Zhuo
Wen Wang
VLM
16
545
0
28 Feb 2019
Bridging the Gap: Attending to Discontinuity in Identification of
  Multiword Expressions
Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions
Omid Rohanian
Shiva Taslimipoor
Samaneh Kouchaki
L. Ha
R. Mitkov
27
26
0
27 Feb 2019
Regularity Normalization: Neuroscience-Inspired Unsupervised Attention
  across Neural Network Layers
Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers
Baihan Lin
16
2
0
27 Feb 2019
Still a Pain in the Neck: Evaluating Text Representations on Lexical
  Composition
Still a Pain in the Neck: Evaluating Text Representations on Lexical Composition
Vered Shwartz
Ido Dagan
CoGe
27
78
0
27 Feb 2019
Attributes-aided Part Detection and Refinement for Person
  Re-identification
Attributes-aided Part Detection and Refinement for Person Re-identification
Shuzhao Li
Huimin Yu
Wei Huang
Jing Zhang
30
52
0
27 Feb 2019
Multilingual Neural Machine Translation with Knowledge Distillation
Multilingual Neural Machine Translation with Knowledge Distillation
Xu Tan
Yi Ren
Di He
Tao Qin
Zhou Zhao
Tie-Yan Liu
20
248
0
27 Feb 2019
EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs
EvolveGCN: Evolving Graph Convolutional Networks for Dynamic Graphs
A. Pareja
Giacomo Domeniconi
Jie Chen
Tengfei Ma
Toyotaro Suzumura
H. Kanezashi
Tim Kaler
Tao B. Schardl
Charles E. Leisersen
GNN
52
1,041
0
26 Feb 2019
Attention is not Explanation
Attention is not Explanation
Sarthak Jain
Byron C. Wallace
FAtt
31
1,299
0
26 Feb 2019
The State of Sparsity in Deep Neural Networks
The State of Sparsity in Deep Neural Networks
Trevor Gale
Erich Elsen
Sara Hooker
21
743
0
25 Feb 2019
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
Dual Attention Networks for Visual Reference Resolution in Visual Dialog
Gi-Cheon Kang
Jaeseo Lim
Byoung-Tak Zhang
22
72
0
25 Feb 2019
Previous
123...345346347...351352353
Next