Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.03762
Cited By
v1
v2
v3
v4
v5
v6
v7 (latest)
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Attention Is All You Need"
50 / 27,337 papers shown
Title
BERT4GCN: Using BERT Intermediate Layers to Augment GCN for Aspect-based Sentiment Classification
Zeguan Xiao
Jiarun Wu
Qingliang Chen
Congjian Deng
70
76
0
01 Oct 2021
Building an Efficient and Effective Retrieval-based Dialogue System via Mutual Learning
Chongyang Tao
Jiazhan Feng
Chang Liu
Juntao Li
Xiubo Geng
Daxin Jiang
RALM
68
6
0
01 Oct 2021
Simulated annealing for optimization of graphs and sequences
Xianggen Liu
Pengyong Li
Fandong Meng
Hao Zhou
Huasong Zhong
Jie Zhou
Lili Mou
Sen Song
102
19
0
01 Oct 2021
Unsupervised Few-Shot Action Recognition via Action-Appearance Aligned Meta-Adaptation
Jay Patravali
Gaurav Mittal
Ye Yu
Fuxin Li
Mei Chen
92
19
0
30 Sep 2021
Multi-granular Legal Topic Classification on Greek Legislation
C. Papaloukas
Ilias Chalkidis
Konstantinos Athinaios
D. Pantazi
Manolis Koubarakis
AILaw
80
25
0
30 Sep 2021
Inducing Transformer's Compositional Generalization Ability via Auxiliary Sequence Prediction Tasks
Yichen Jiang
Joey Tianyi Zhou
140
26
0
30 Sep 2021
SlovakBERT: Slovak Masked Language Model
Matúš Pikuliak
Stefan Grivalsky
Martin Konopka
Miroslav Blšták
Martin Tamajka
Viktor Bachratý
Marian Simko
Pavol Balázik
Michal Trnka
Filip Uhlárik
66
27
0
30 Sep 2021
Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments
Sonia Raychaudhuri
Saim Wani
Shivansh Patel
Unnat Jain
Angel X. Chang
LM&Ro
87
54
0
30 Sep 2021
PortaSpeech: Portable and High-Quality Generative Text-to-Speech
Yi Ren
Jinglin Liu
Zhou Zhao
122
79
0
30 Sep 2021
Multi-Modal Sarcasm Detection Based on Contrastive Attention Mechanism
Xiaoqiang Zhang
Ying Chen
Guang-ying Li
88
12
0
30 Sep 2021
A Review of Text Style Transfer using Deep Learning
Martina Toshevska
Sonja Gievska
CLIP
92
45
0
30 Sep 2021
Towards Efficient Post-training Quantization of Pre-trained Language Models
Haoli Bai
Lu Hou
Lifeng Shang
Xin Jiang
Irwin King
Michael R. Lyu
MQ
147
48
0
30 Sep 2021
SCIMAT: Science and Mathematics Dataset
Neeraj Kollepara
Snehith Kumar Chatakonda
Kiran Ravish
23
1
0
30 Sep 2021
Structural Persistence in Language Models: Priming as a Window into Abstract Language Representations
Arabella J. Sinclair
Jaap Jumelet
Willem H. Zuidema
Raquel Fernández
114
41
0
30 Sep 2021
MobTCast: Leveraging Auxiliary Trajectory Forecasting for Human Mobility Prediction
Hao Xue
Flora D. Salim
Yongli Ren
Nuria Oliver
73
71
0
30 Sep 2021
GT U-Net: A U-Net Like Group Transformer Network for Tooth Root Segmentation
Yunxiang Li
Shuai Wang
Jun Wang
G. Zeng
Wenjun Liu
Qianni Zhang
Qun Jin
Yaqi Wang
ViT
MedIm
72
49
0
30 Sep 2021
Introducing the DOME Activation Functions
Mohamed E. Hussein
Wael AbdAlmageed
57
1
0
30 Sep 2021
Emergency Vehicles Audio Detection and Localization in Autonomous Driving
Hongyi Sun
Xinyi Liu
Kecheng Xu
Jinghao Miao
Qi Luo
59
20
0
30 Sep 2021
Bitcoin Transaction Strategy Construction Based on Deep Reinforcement Learning
Fengrui Liu
Yang Li
Baitong Li
Jiaxin Li
Huiyang Xie
53
45
0
30 Sep 2021
Unlocking the potential of deep learning for marine ecology: overview, applications, and outlook
M. G. Olsen
K. Halvorsen
Lei Jiao
Kristian Muri Knausgård
A. H. Martin
M. Moyano
Rebekah A. Oomen
J. H. Rasmussen
T. K. Sørdalen
Susanna Huneide Thorbjørnsen
62
27
0
29 Sep 2021
Subdimensional Expansion Using Attention-Based Learning For Multi-Agent Path Finding
Lakshay Virmani
Z. Ren
Sivakumar Rathinam
Howie Choset
64
3
0
29 Sep 2021
Programmable Spectral Filter Arrays using Phase Spatial Light Modulator
Vishwanath Saragadam
Vijay Rengarajan
Ryuichi Tadano
Tuo Zhuang
H. Oyaizu
J. Murayama
Aswin C. Sankaranarayanan
63
1
0
29 Sep 2021
FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition
Yichong Leng
Xu Tan
Rui Wang
Linchen Zhu
Jin Xu
...
Linquan Liu
Tao Qin
Xiang-Yang Li
Ed Lin
Tie-Yan Liu
129
42
0
29 Sep 2021
UFO-ViT: High Performance Linear Vision Transformer without Softmax
Jeonggeun Song
ViT
173
21
0
29 Sep 2021
Infrared Small-Dim Target Detection with Transformer under Complex Backgrounds
Fangcen Liu
Chenqiang Gao
Fangge Chen
Deyu Meng
W. Zuo
Xinbo Gao
ViT
81
38
0
29 Sep 2021
Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch
Jakob Poncelet
Hugo Van hamme
SSL
56
1
0
29 Sep 2021
A Systematic Survey of Deep Learning-based Single-Image Super-Resolution
Juncheng Li
Zehua Pei
Wenjie Li
Guangwei Gao
Longguang Wang
Yingqian Wang
T. Zeng
143
48
0
29 Sep 2021
Localizing Objects with Self-Supervised Transformers and no Labels
Oriane Siméoni
Gilles Puy
Huy V. Vo
Simon Roburin
Spyros Gidaris
Andrei Bursuc
P. Pérez
Renaud Marlet
Jean Ponce
ViT
260
203
0
29 Sep 2021
Sequential Deep Learning Architectures for Anomaly Detection in Virtual Network Function Chains
Chung-Ung Lee
Jibum Hong
DongNyeong Heo
Heeyoul Choi
AI4TS
33
6
0
29 Sep 2021
Hierarchical Character Tagger for Short Text Spelling Error Correction
Mengyi Gao
Canran Xu
Peng Shi
VLM
3DV
72
6
0
29 Sep 2021
Road Network Guided Fine-Grained Urban Traffic Flow Inference
Lingbo Liu
Mengmeng Liu
Guanbin Li
Ziyi Wu
Junfan Lin
Liang Lin
AI4TS
123
12
0
29 Sep 2021
Improved Xception with Dual Attention Mechanism and Feature Fusion for Face Forgery Detection
Hao Lin
Weiqi Luo
Kangkang Wei
Minglin Liu
CVBM
AAML
3DPC
68
18
0
29 Sep 2021
Vitruvion: A Generative Model of Parametric CAD Sketches
Ari Seff
Wenda Zhou
Nick Richardson
Ryan P. Adams
70
66
0
29 Sep 2021
Visually Grounded Concept Composition
Bowen Zhang
Hexiang Hu
Linlu Qiu
Peter Shaw
Fei Sha
CoGe
122
6
0
29 Sep 2021
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
Hu Xu
Gargi Ghosh
Po-Yao (Bernie) Huang
Dmytro Okhonko
Armen Aghajanyan
Florian Metze
Luke Zettlemoyer
Florian Metze Luke Zettlemoyer Christoph Feichtenhofer
CLIP
VLM
317
582
0
28 Sep 2021
Shaking Syntactic Trees on the Sesame Street: Multilingual Probing with Controllable Perturbations
Ekaterina Taktasheva
Vladislav Mikhailov
Ekaterina Artemova
91
13
0
28 Sep 2021
Symbolic Brittleness in Sequence Models: on Systematic Generalization in Symbolic Mathematics
Sean Welleck
Peter West
Jize Cao
Yejin Choi
146
31
0
28 Sep 2021
Text2Brain: Synthesis of Brain Activation Maps from Free-form Text Query
G. Ngo
Minh Le Nguyen
Nancy F. Chen
M. Sabuncu
57
6
0
28 Sep 2021
One to rule them all: Towards Joint Indic Language Hate Speech Detection
Mehar Bhatia
Tenzin Singhay Bhotia
Akshat Agarwal
Prakash Ramesh
Shubham Gupta
Kumar Shridhar
F. Laumann
Ayushman Dash
89
16
0
28 Sep 2021
CIDEr-R: Robust Consensus-based Image Description Evaluation
G. O. D. Santos
Esther Luna Colombini
Sandra Avila
81
30
0
28 Sep 2021
Cross-lingual Intermediate Fine-tuning improves Dialogue State Tracking
Nikita Moghe
Mark Steedman
Alexandra Birch
105
13
0
28 Sep 2021
DEBOSH: Deep Bayesian Shape Optimization
Nikita Durasov
Artem Lukoyanov
Jonathan Donier
Pascal Fua
UQCV
AI4CE
105
15
0
28 Sep 2021
Active Learning for Argument Mining: A Practical Approach
Nikolai Solmsdorf
Dietrich Trautmann
Hinrich Schütze
HAI
30
1
0
28 Sep 2021
SafetyNet: Safe planning for real-world self-driving vehicles using machine-learned policies
Matt Vitelli
Yan-Xia Chang
Yawei Ye
Maciej Wołczyk
Bla.zej Osiñski
Moritz Niendorf
Hugo Grimmett
Qiangui Huang
Ashesh Jain
Peter Ondruska
73
80
0
28 Sep 2021
PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided MCTS Decoding
Antoine Chaffin
Vincent Claveau
Ewa Kijak
78
38
0
28 Sep 2021
A Strong Baseline for the VIPriors Data-Efficient Image Classification Challenge
Björn Barz
Lorenzo Brigato
Luca Iocchi
Joachim Denzler
VLM
69
2
0
28 Sep 2021
Extracting Attentive Social Temporal Excitation for Sequential Recommendation
Yunzhe Li
Yue Ding
Bo Chen
Xin Xin
Yule Wang
Yuxiang Shi
Ruiming Tang
Dong Wang
AI4TS
42
15
0
28 Sep 2021
An Offline Deep Reinforcement Learning for Maintenance Decision-Making
H. Khorasgani
Haiyan Wang
Chetan Gupta
Ahmed K. Farahat
KELM
OffRL
70
5
0
28 Sep 2021
Runtime Safety Assurance for Learning-enabled Control of Autonomous Driving Vehicles
Shengduo Chen
Yao Sun
Dachuan Li
Qiang Wang
Qi Hao
J. Sifakis
79
18
0
28 Sep 2021
Urban Driver: Learning to Drive from Real-world Demonstrations Using Policy Gradients
Oliver Scheel
Luca Bergamini
Maciej Wołczyk
Bla.zej Osiñski
Peter Ondruska
87
110
0
27 Sep 2021
Previous
1
2
3
...
355
356
357
...
545
546
547
Next