1706.03762
Cited By
v7 (latest)
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Papers citing "Attention Is All You Need"
50 / 26,952 papers shown
Detecting Logical Relation In Contract Clauses
Alexandre Yukio Ichida
Felipe Meneguzzi
38
1
0
02 Nov 2021
A Recommendation System to Enhance Midwives' Capacities in Low-Income Countries
Anna Guitart
A. Heydari
Eniola Olaleye
Jelena Ljubicic
Ana Fernández del Río
África Periánez
Lauren Bellhouse
31
6
0
02 Nov 2021
PatchGame: Learning to Signal Mid-level Patches in Referential Games
Kamal Gupta
Gowthami Somepalli
Anubhav Gupta
Vinoj Jayasundara
Matthias Zwicker
Abhinav Shrivastava
74
4
0
02 Nov 2021
Recent Advances in End-to-End Automatic Speech Recognition
Jinyu Li
VLM
158
377
0
02 Nov 2021
Relational Self-Attention: What's Missing in Attention for Video Understanding
Manjin Kim
Heeseung Kwon
Chunyu Wang
Suha Kwak
Minsu Cho
ViT
83
29
0
02 Nov 2021
LogLAB: Attention-Based Labeling of Log Data Anomalies via Weak Supervision
Thorsten Wittkopp
Philipp Wiesner
Dominik Scheinert
Alexander Acker
59
11
0
02 Nov 2021
Trajectory Prediction with Graph-based Dual-scale Context Fusion
Lu Zhang
Peiliang Li
Jing Chen
Shaojie Shen
101
30
0
02 Nov 2021
Zero-Shot Translation using Diffusion Models
Eliya Nachmani
Shaked Dovrat
DiffM
VLM
77
9
0
02 Nov 2021
Synthesizing Speech from Intracranial Depth Electrodes using an Encoder-Decoder Framework
Jonas Köhler
Maarten C. Ottenhoff
Sophocles Goulis
Miguel Angrick
A. Colon
Louis Wagner
S. Tousseyn
P. Kubben
Christian Herff
52
28
0
02 Nov 2021
Callee: Recovering Call Graphs for Binaries with Transfer and Contrastive Learning
Wenyu Zhu
Zhiyao Feng
Zihan Zhang
Jian-jun Chen
Zhijian Ou
Min Yang
Chao Zhang
AAML
49
8
0
02 Nov 2021
A Review of Dialogue Systems: From Trained Monkeys to Stochastic Parrots
Atharv Singh Patlan
Shiven Tripathi
S. Korde
OffRL
53
3
0
02 Nov 2021
DeepParticle: learning invariant measure by a deep neural network minimizing Wasserstein distance on data generated from an interacting particle method
Zhongjian Wang
Jack Xin
Zhiwen Zhang
81
15
0
02 Nov 2021
Can Vision Transformers Perform Convolution?
Shanda Li
Xiangning Chen
Di He
Cho-Jui Hsieh
ViT
110
21
0
02 Nov 2021
Federated Split Vision Transformer for COVID-19 CXR Diagnosis using Task-Agnostic Training
Sangjoon Park
Gwanghyun Kim
Jeongsol Kim
Boah Kim
Jong Chul Ye
ViT
FedML
MedIm
95
30
0
02 Nov 2021
Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity
Peter Wu
Jiatong Shi
Yifan Zhong
Shinji Watanabe
A. Black
52
8
0
02 Nov 2021
Multi network InfoMax: A pre-training method involving graph convolutional networks
Usman Mahmood
Z. Fu
Vince D. Calhoun
Sergey Plis
AI4CE
24
1
0
01 Nov 2021
Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey
Bonan Min
Hayley L Ross
Elior Sulem
Amir Pouran Ben Veyseh
Thien Huu Nguyen
Oscar Sainz
Eneko Agirre
Ilana Heinz
Dan Roth
LM&MA
VLM
AI4CE
189
1,093
0
01 Nov 2021
Identifying causal relations in tweets using deep learning: Use case on diabetes-related tweets from 2017-2021
Adrian Ahne
Vivek Khetan
X. Tannier
Md Imbesat Hassan Rizvi
T. Czernichow
Francisco Orchard
Charline Bour
Andy E. Fano
G. Fagherazzi
CML
46
2
0
01 Nov 2021
Kernel Deformed Exponential Families for Sparse Continuous Attention
Alexander Moreno
Supriya Nagesh
Zhenke Wu
Walter Dempsey
James M. Rehg
35
1
0
01 Nov 2021
Transformers for prompt-level EMA non-response prediction
Supriya Nagesh
Alexander Moreno
Stephanie M Carpenter
Jamie Yap
Soujanya Chatterjee
...
Santosh Kumar
Cho Lam
D. Wetter
Inbal Nahum-Shani
James M. Rehg
27
1
0
01 Nov 2021
Cross-lingual Hate Speech Detection using Transformer Models
Teodor Tita
A. Zubiaga
46
13
0
01 Nov 2021
Structure Information is the Key: Self-Attention RoI Feature Extractor in 3D Object Detection
Diankun Zhang
Zhijie Zheng
Xueting Bi
Xiaojun Liu
3DPC
64
1
0
01 Nov 2021
Enhanced Language Representation with Label Knowledge for Span Extraction
Pan Yang
Xin Cong
Zhenyu Sun
Xingwu Liu
52
29
0
01 Nov 2021
Benchmarks for Corruption Invariant Person Re-identification
Minghui Chen
Zhiqiang Wang
Feng Zheng
69
26
0
01 Nov 2021
Large-Scale Deep Learning Optimizations: A Comprehensive Survey
Xiaoxin He
Fuzhao Xue
Xiaozhe Ren
Yang You
83
15
0
01 Nov 2021
Exploring Non-Autoregressive End-To-End Neural Modeling For English Mispronunciation Detection And Diagnosis
Hsin-Wei Wang
Bi-Cheng Yan
Hsuan-Sheng Chiu
Yung-Chang Hsu
Berlin Chen
108
7
0
01 Nov 2021
Dense Prediction with Attentive Feature Aggregation
Yung-Hsu Yang
Thomas E. Huang
Min Sun
Samuel Rota Buló
Peter Kontschieder
Feng Yu
86
7
0
01 Nov 2021
Outlining and Filling: Hierarchical Query Graph Generation for Answering Complex Questions over Knowledge Graphs
Yongrui Chen
Huiying Li
Guilin Qi
Tianxing Wu
Tenggou Wang
119
24
0
01 Nov 2021
Accounting for Dependencies in Deep Learning Based Multiple Instance Learning for Whole Slide Imaging
Andriy Myronenko
Ziyue Xu
Dong Yang
H. Roth
Daguang Xu
122
51
0
01 Nov 2021
Unsupervised Domain Adaptation with Adapter
Rongsheng Zhang
Yinhe Zheng
Xiaoxi Mao
Minlie Huang
49
18
0
01 Nov 2021
RMNA: A Neighbor Aggregation-Based Knowledge Graph Representation Learning Model Using Rule Mining
Ling-Hao Chen
Jun Cui
Xing Tang
Chao Song
Y. Qian
Yansheng Li
Yongjun Zhang
33
0
0
01 Nov 2021
SADGA: Structure-Aware Dual Graph Aggregation Network for Text-to-SQL
Ruichu Cai
Jinjie Yuan
Boyan Xu
Zhifeng Hao
104
64
0
01 Nov 2021
VSEC: Transformer-based Model for Vietnamese Spelling Correction
Dinh-Truong Do
Nguyen Ha Thanh
Thang Bui
Dinh-Hieu Vo
54
12
0
01 Nov 2021
A Systematic Investigation of Commonsense Knowledge in Large Language Models
Xiang Lorraine Li
A. Kuncoro
Jordan Hoffmann
Cyprien de Masson d'Autume
Phil Blunsom
Aida Nematzadeh
LRM
101
59
0
31 Oct 2021
FinEAS: Financial Embedding Analysis of Sentiment
Asier Gutiérrez-Fandiño
M. N. Alonso
P. Kolm
Jordi Armengol-Estapé
AIFin
26
6
0
31 Oct 2021
Hierarchical Deep Residual Reasoning for Temporal Moment Localization
Ziyang Ma
Xianjing Han
Xuemeng Song
Yiran Cui
Liqiang Nie
56
9
0
31 Oct 2021
FANS: Fusing ASR and NLU for on-device SLU
Martin H. Radfar
Athanasios Mouchtaris
Siegfried Kunzmann
Ariya Rastrow
75
12
0
31 Oct 2021
Efficiently Modeling Long Sequences with Structured State Spaces
Albert Gu
Karan Goel
Christopher Ré
229
1,843
0
31 Oct 2021
Real-time Speaker counting in a cocktail party scenario using Attention-guided Convolutional Neural Network
Midia Yousefi
John H. L. Hansen
37
10
0
30 Oct 2021
Cross-Modality Fusion Transformer for Multispectral Object Detection
Q. Fang
D. Han
Zhaokui Wang
ViT
90
155
0
30 Oct 2021
Magic Pyramid: Accelerating Inference with Early Exiting and Token Pruning
Xuanli He
I. Keivanloo
Yi Xu
Xiang He
Belinda Zeng
Santosh Rajagopalan
Trishul Chilimbi
81
19
0
30 Oct 2021
PatchFormer: An Efficient Point Transformer with Patch Attention
Zhang Cheng
Haocheng Wan
Xinyi Shen
Zizhao Wu
3DPC
117
68
0
30 Oct 2021
Context Meta-Reinforcement Learning via Neuromodulation
Eseoghene Ben-Iwhiwhu
Jeffery Dick
Nicholas A. Ketz
Praveen K. Pilly
Andrea Soltoggio
OffRL
118
13
0
30 Oct 2021
Data-Based Models for Hurricane Evolution Prediction: A Deep Learning Approach
Rikhi Bose
A. Pintar
E. Simiu
55
0
0
30 Oct 2021
Cross-attention conformer for context modeling in speech enhancement for ASR
A. Narayanan
Chung-Cheng Chiu
Tom O'Malley
Quan Wang
Yanzhang He
63
14
0
30 Oct 2021
Measuring a Texts Fairness Dimensions Using Machine Learning Based on Social Psychological Factors
A. Izzidien
D. Stillwell
28
0
0
29 Oct 2021
Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers
Sharath Adavanne
Archontis Politis
Tuomas Virtanen
72
17
0
29 Oct 2021
Visual Keyword Spotting with Attention
Prajwal K R
Liliane Momeni
Triantafyllos Afouras
Andrew Zisserman
72
13
0
29 Oct 2021
Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains
Christian Gumbsch
Martin Volker Butz
Georg Martius
AI4CE
78
22
0
29 Oct 2021
Transformer Ensembles for Sexism Detection
Lily Davies
Marta Baldracchi
C. Borella
K. Perifanos
ViT
24
3
0
29 Oct 2021