1706.03762
Attention Is All You Need
12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
Papers citing "Attention Is All You Need"
50 / 27,337 papers shown
Title
Learning Domain Specific Language Models for Automatic Speech Recognition through Machine Translation
Saurav Jha
85
1
0
21 Sep 2021
DS-Net++: Dynamic Weight Slicing for Efficient Inference in CNNs and Transformers
Changlin Li
Guangrun Wang
Bing Wang
Xiaodan Liang
Zhihui Li
Xiaojun Chang
96
9
0
21 Sep 2021
LOTR: Face Landmark Localization Using Localization Transformer
Ukrit Watchareeruetai
Benjaphan Sommanna
Sanjana Jain
Pavit Noinongyao
Ankush Ganguly
Aubin Samacoits
Samuel W. F. Earp
Nakarin Sritrakool
ViT
85
13
0
21 Sep 2021
Learning Kernel-Smoothed Machine Translation with Retrieved Examples
Qingnan Jiang
Mingxuan Wang
Jun Cao
Shanbo Cheng
Shujian Huang
Lei Li
91
33
0
21 Sep 2021
Multi-Domain Few-Shot Learning and Dataset for Agricultural Applications
Sai Vidyaranya Nuthalapati
Anirudh Tunga
84
37
0
21 Sep 2021
Survey: Transformer based Video-Language Pre-training
Ludan Ruan
Qin Jin
VLM
ViT
125
45
0
21 Sep 2021
Chemical-Reaction-Aware Molecule Representation Learning
Hongwei Wang
Weijian Li
Xiaomeng Jin
Kyunghyun Cho
Heng Ji
Jiawei Han
Martin D. Burke
183
62
0
21 Sep 2021
iRNN: Integer-only Recurrent Neural Network
Eyyub Sari
Vanessa Courville
V. Nia
MQ
85
4
0
20 Sep 2021
Well Googled is Half Done: Multimodal Forecasting of New Fashion Product Sales with Image-based Google Trends
Geri Skenderi
Christian Joppi
Matteo Denitto
Marco Cristani
AI4TS
91
26
0
20 Sep 2021
Transforming Fake News: Robust Generalisable News Classification Using Transformers
C. Blackledge
Amir Atapour-Abarghouei
67
14
0
20 Sep 2021
Neural Distance Embeddings for Biological Sequences
Gabriele Corso
Rex Ying
Michal Pándy
Petar Veličković
J. Leskovec
Pietro Lio
70
40
0
20 Sep 2021
MFEViT: A Robust Lightweight Transformer-based Network for Multimodal 2D+3D Facial Expression Recognition
Hanting Li
Ming-Fa Sui
Zhaoqing Zhu
Feng Zhao
ViT
76
3
0
20 Sep 2021
BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese
Nguyen Luong Tran
Duong Minh Le
Dat Quoc Nguyen
70
55
0
20 Sep 2021
BERT Cannot Align Characters
Antonis Maronikolakis
Philipp Dufter
Hinrich Schütze
55
0
0
20 Sep 2021
Recommender systems based on graph embedding techniques: A comprehensive review
Yue Deng
121
25
0
20 Sep 2021
Audio-Visual Speech Recognition is Worth 32×32×8 Voxels
Dmitriy Serdyuk
Otavio Braga
Olivier Siohan
ViT
87
7
0
20 Sep 2021
Dyadformer: A Multi-modal Transformer for Long-Range Modeling of Dyadic Interactions
D. Curto
Albert Clapés
Javier Selva
Sorina Smeureanu
Julio C. S. Jacques Junior
...
G. Guilera
D. Leiva
T. Moeslund
Sergio Escalera
Cristina Palmero
73
30
0
20 Sep 2021
A Survey of Text Games for Reinforcement Learning informed by Natural Language
P. Osborne
Heido Nomm
André Freitas
AI4CE
101
24
0
20 Sep 2021
Few-Shot Emotion Recognition in Conversation with Sequential Prototypical Networks
Gaël Guibon
Matthieu Labeau
Hélène Flamein
Luce Lefeuvre
Chloé Clavel
81
34
0
20 Sep 2021
CUNI systems for WMT21: Multilingual Low-Resource Translation for Indo-European Languages Shared Task
Josef Jon
Michal Novák
João Paulo Aires
Dušan Variš
Ondrej Bojar
40
3
0
20 Sep 2021
CUNI systems for WMT21: Terminology translation Shared Task
Josef Jon
Michal Novák
João Paulo Aires
Dušan Variš
Ondrej Bojar
47
4
0
20 Sep 2021
Latexify Math: Mathematical Formula Markup Revision to Assist Collaborative Editing in Math Q&A Sites
Suyu Ma
Chunyang Chen
Hourieh Khalajzadeh
J. Grundy
HAI
AIMat
36
5
0
20 Sep 2021
Conditional probing: measuring usable information beyond a baseline
John Hewitt
Kawin Ethayarajh
Percy Liang
Christopher D. Manning
90
57
0
19 Sep 2021
Unified and Multilingual Author Profiling for Detecting Haters
Ipek Baris Schlicht
Angel Felipe Magnossão de Paula
67
6
0
19 Sep 2021
Capsule networks with non-iterative cluster routing
Zhihao Zhao
Samuel Cheng
24
10
0
19 Sep 2021
Multi-Task Learning in Natural Language Processing: An Overview
Shijie Chen
Yu Zhang
Qiang Yang
AIMat
145
113
0
19 Sep 2021
Do Long-Range Language Models Actually Use Long-Range Context?
Simeng Sun
Kalpesh Krishna
Andrew Mattarella-Micke
Mohit Iyyer
RALM
91
84
0
19 Sep 2021
Hierarchical Relation-Guided Type-Sentence Alignment for Long-Tail Relation Extraction with Distant Supervision
Tao Shen
Guodong Long
Tao Shen
Jing Jiang
74
1
0
19 Sep 2021
SDTP: Semantic-aware Decoupled Transformer Pyramid for Dense Image Prediction
Zekun Li
Yufan Liu
Bing Li
Weiming Hu
Kebin Wu
Chengwei Peng
ViT
71
23
0
18 Sep 2021
AutoInit: Analytic Signal-Preserving Weight Initialization for Neural Networks
G. Bingham
Risto Miikkulainen
ODL
74
4
0
18 Sep 2021
UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery
Libo Wang
Rui Li
Ce Zhang
Shenghui Fang
Chenxi Duan
Xiaoliang Meng
P. M. Atkinson
ViT
123
682
0
18 Sep 2021
Complex Temporal Question Answering on Knowledge Graphs
Zhen Jia
Soumajit Pramanik
Rishiraj Saha Roy
Gerhard Weikum
267
113
0
18 Sep 2021
Weakly Supervised Explainable Phrasal Reasoning with Neural Fuzzy Logic
Zijun Wu
Zi Xuan Zhang
Atharva Naik
Zhijian Mei
Mauajama Firdaus
Lili Mou
LRM
NAI
75
14
0
18 Sep 2021
Towards Joint Intent Detection and Slot Filling via Higher-order Attention
Dongsheng Chen
Zhiqi Huang
Xian Wu
Shen Ge
Yuexian Zou
75
22
0
18 Sep 2021
DyLex: Incorporating Dynamic Lexicons into BERT for Sequence Labeling
Baojun Wang
Zhao Zhang
Kun Xu
Guang-Yuan Hao
Yuyang Zhang
Lifeng Shang
Linlin Li
Xiao Chen
Xin Jiang
Qun Liu
82
6
0
18 Sep 2021
Structured Pattern Pruning Using Regularization
Dongju Park
Geunghee Lee
101
0
0
18 Sep 2021
Long-Range Modeling of Source Code Files with eWASH: Extended Window Access by Syntax Hierarchy
Colin B. Clement
Shuai Lu
Xiaoyu Liu
Michele Tufano
Dawn Drain
Nan Duan
Neel Sundaresan
Alexey Svyatkovskiy
94
27
0
17 Sep 2021
RnG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question Answering
Xi Ye
Semih Yavuz
Kazuma Hashimoto
Yingbo Zhou
Caiming Xiong
222
148
0
17 Sep 2021
Primer: Searching for Efficient Transformers for Language Modeling
David R. So
Wojciech Mańke
Hanxiao Liu
Zihang Dai
Noam M. Shazeer
Quoc V. Le
VLM
277
156
0
17 Sep 2021
Classification-based Quality Estimation: Small and Efficient Models for Real-world Applications
Shuo Sun
Ahmed El-Kishky
Vishrav Chaudhary
James Cross
Francisco Guzmán
Lucia Specia
64
1
0
17 Sep 2021
A review and experimental evaluation of deep learning methods for MRI reconstruction
Arghya Pal
Yogesh Rathi
3DV
111
47
0
17 Sep 2021
Diverse Generation from a Single Video Made Possible
Niv Haim
Ben Feinstein
Niv Granot
Assaf Shocher
Shai Bagon
Tali Dekel
Michal Irani
DiffM
VGen
107
19
0
17 Sep 2021
Does Commonsense help in detecting Sarcasm?
Somnath Basu Roy Chowdhury
Snigdha Chaturvedi
46
10
0
17 Sep 2021
Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation
Feilong Chen
Fandong Meng
Xiuyi Chen
Peng Li
Jie Zhou
99
23
0
17 Sep 2021
GoG: Relation-aware Graph-over-Graph Network for Visual Dialog
Feilong Chen
Xiuyi Chen
Fandong Meng
Peng Li
Jie Zhou
145
35
0
17 Sep 2021
Cross Modification Attention Based Deliberation Model for Image Captioning
Zheng Lian
Yanan Zhang
Haichang Li
Rui Wang
Xiaohui Hu
64
5
0
17 Sep 2021
Expression Snippet Transformer for Robust Video-based Facial Expression Recognition
Yuanyuan Liu
Wenbin Wang
Chuanxu Feng
Haoyu Zhang
Zhe Chen
Yibing Zhan
ViT
79
65
0
17 Sep 2021
From Known to Unknown: Knowledge-guided Transformer for Time-Series Sales Forecasting in Alibaba
Xinyuan Qi
Kai Hou
Tong Liu
Zhongzhong Yu
Sihao Hu
Wenwu Ou
AI4TS
90
20
0
17 Sep 2021
CodeQA: A Question Answering Dataset for Source Code Comprehension
Chenxiao Liu
Xiaojun Wan
107
29
0
17 Sep 2021
Adaptive Hierarchical Dual Consistency for Semi-Supervised Left Atrium Segmentation on Cross-Domain Data
Jun Chen
Heye Zhang
R. Mohiaddin
Tom Wong
D. Firmin
J. Keegan
Guang Yang
91
37
0
17 Sep 2021