ResearchTrend.AI
Attention Is All You Need

12 June 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
    3DV

Papers citing "Attention Is All You Need"

50 / 18,521 papers shown
Title
DFSMN-SAN with Persistent Memory Model for Automatic Speech Recognition
Zhao You
Dan Su
Jie Chen
Chao Weng
Dong Yu
28
13
0
28 Oct 2019
Input-Cell Attention Reduces Vanishing Saliency of Recurrent Neural Networks
Aya Abdelsalam Ismail
Mohamed K. Gunady
L. Pessoa
H. C. Bravo
S. Feizi
AI4TS
25
50
0
27 Oct 2019
An Adaptive and Momental Bound Method for Stochastic Learning
Jianbang Ding
Xuancheng Ren
Ruixuan Luo
Xu Sun
ODL
19
46
0
27 Oct 2019
Pre-train and Learn: Preserve Global Information for Graph Neural Networks
Danhao Zhu
Xinyu Dai
Jiajun Chen
21
23
0
27 Oct 2019
PRNet: Self-Supervised Learning for Partial-to-Partial Registration
Yue Wang
Justin Solomon
SSL
3DPC
28
379
0
27 Oct 2019
Do Sentence Interactions Matter? Leveraging Sentence Level Representations for Fake News Classification
Vaibhav Vaibhav
Raghuram Mandyam Annasamy
Eduard H. Hovy
GNN
13
66
0
27 Oct 2019
ViGGO: A Video Game Corpus for Data-To-Text Generation in Open-Domain Conversation
Juraj Juraska
Kevin K. Bowden
M. Walker
21
42
0
26 Oct 2019
Fair Generative Modeling via Weak Supervision
Kristy Choi
Aditya Grover
Trisha Singh
Rui Shu
Stefano Ermon
36
133
0
26 Oct 2019
Data Augmentation for Skin Lesion using Self-Attention based Progressive Generative Adversarial Network
Ibrahim Saad Ali
Mamdouh Farouk Mohamed
Y. B. Mahdy
GAN
MedIm
20
119
0
25 Oct 2019
FineText: Text Classification via Attention-based Language Model Fine-tuning
Yunzhe Tao
Saurabh Gupta
Satyapriya Krishna
Xiong Zhou
Orchid Majumder
Vineet Khare
21
3
0
25 Oct 2019
Improving Graph Attention Networks with Large Margin-based Constraints
Guangtao Wang
Rex Ying
Jing-ling Huang
J. Leskovec
22
80
0
25 Oct 2019
On the Cross-lingual Transferability of Monolingual Representations
Mikel Artetxe
Sebastian Ruder
Dani Yogatama
28
777
0
25 Oct 2019
Meta-Learning with Dynamic-Memory-Based Prototypical Network for Few-Shot Event Detection
Shumin Deng
Ningyu Zhang
Jiaojian Kang
Yichi Zhang
Wei Zhang
Huajun Chen
31
130
0
25 Oct 2019
SpeechBERT: An Audio-and-text Jointly Learned Language Model for End-to-end Spoken Question Answering
Yung-Sung Chuang
Chi-Liang Liu
Hung-yi Lee
Lin-shan Lee
AuLLM
27
39
0
25 Oct 2019
Fast Structured Decoding for Sequence Models
Zhiqing Sun
Zhuohan Li
Haoqing Wang
Zi Lin
Di He
Zhihong Deng
27
122
0
25 Oct 2019
HUBERT Untangles BERT to Improve Transfer across NLP Tasks
M. Moradshahi
Hamid Palangi
M. Lam
P. Smolensky
Jianfeng Gao
26
16
0
25 Oct 2019
Towards Online End-to-end Transformer Automatic Speech Recognition
E. Tsunoo
Yosuke Kashiwagi
Toshiyuki Kumakura
Shinji Watanabe
22
32
0
25 Oct 2019
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders
Andy T. Liu
Shu-Wen Yang
Po-Han Chi
Po-Chun Hsu
Hung-yi Lee
SSL
45
372
0
25 Oct 2019
Neurlux: Dynamic Malware Analysis Without Feature Engineering
Chani Jindal
Christopher Salls
H. Aghakhani
Keith Long
Christopher Kruegel
Giovanni Vigna
15
62
0
24 Oct 2019
Cross-Lingual Vision-Language Navigation
An Yan
Xinze Wang
Jiangtao Feng
Lei Li
William Yang Wang
LM&Ro
32
16
0
24 Oct 2019
Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition
Zheng Lian
J. Tao
Bin Liu
Jian Huang
SSL
22
17
0
24 Oct 2019
U-Time: A Fully Convolutional Network for Time Series Segmentation Applied to Sleep Staging
Mathias Perslev
M. Jensen
S. Darkner
P. Jennum
Christian Igel
AI4TS
25
243
0
24 Oct 2019
Anchor Diffusion for Unsupervised Video Object Segmentation
Zhao Yang
Qiang Wang
Luca Bertinetto
Weiming Hu
S. Bai
Philip Torr
VOS
43
115
0
24 Oct 2019
Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning
Patrik Reizinger
Marton Szemenyei
31
16
0
23 Oct 2019
Hierarchical Transformers for Long Document Classification
R. Pappagari
Piotr Żelasko
Jesús Villalba
Yishay Carmiel
Najim Dehak
22
239
0
23 Oct 2019
Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model
Oleksii Hrinchuk
Mariya Popova
Boris Ginsburg
VLM
20
87
0
23 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
129
19,529
0
23 Oct 2019
Generative Pre-Training for Speech with Autoregressive Predictive Coding
Yu-An Chung
James R. Glass
SSL
29
173
0
23 Oct 2019
Controlling the Output Length of Neural Machine Translation
Surafel Melaku Lakew
Mattia Antonino Di Gangi
Marcello Federico
17
67
0
23 Oct 2019
A Transformer with Interleaved Self-attention and Convolution for Hybrid Acoustic Models
Liang Lu
19
4
0
23 Oct 2019
Deja-vu: Double Feature Presentation and Iterated Loss in Deep Transformer Networks
Andros Tjandra
Chunxi Liu
Frank Zhang
Xiaohui Zhang
Yongqiang Wang
Gabriel Synnaeve
Satoshi Nakamura
Geoffrey Zweig
ViT
25
44
0
23 Oct 2019
Robust Neural Machine Translation for Clean and Noisy Speech Transcripts
Mattia Antonino Di Gangi
Robert Enyedi
A. Brusadin
Marcello Federico
31
25
0
22 Oct 2019
Beyond Human Parts: Dual Part-Aligned Representations for Person Re-Identification
Jianyuan Guo
Yuhui Yuan
Lang Huang
Chao Zhang
J. Yao
Kai Han
30
182
0
22 Oct 2019
Depth-Adaptive Transformer
Maha Elbayad
Jiatao Gu
Edouard Grave
Michael Auli
19
186
0
22 Oct 2019
Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI
Alejandro Barredo Arrieta
Natalia Díaz Rodríguez
Javier Del Ser
Adrien Bennetot
Siham Tabik
...
S. Gil-Lopez
Daniel Molina
Richard Benjamins
Raja Chatila
Francisco Herrera
XAI
41
6,119
0
22 Oct 2019
Sequence-to-sequence Singing Synthesis Using the Feed-forward Transformer
Merlijn Blaauw
J. Bonada
27
55
0
22 Oct 2019
Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Dongwei Jiang
Xiaoning Lei
Wubo Li
Ne Luo
Yuxuan Hu
Wei Zou
Xiangang Li
24
99
0
22 Oct 2019
Transformer-based Acoustic Modeling for Hybrid Speech Recognition
Yongqiang Wang
Abdel-rahman Mohamed
Duc Le
Chunxi Liu
Alex Xiao
...
Xiaohui Zhang
Frank Zhang
Christian Fuegen
Geoffrey Zweig
M. Seltzer
16
248
0
22 Oct 2019
Discriminative Neural Clustering for Speaker Diarisation
Qiujia Li
Florian Kreyssig
Chao Zhang
P. Woodland
11
44
0
22 Oct 2019
Learning to Make Generalizable and Diverse Predictions for Retrosynthesis
Benson Chen
T. Shen
Tommi Jaakkola
Regina Barzilay
24
46
0
21 Oct 2019
Domain-agnostic Question-Answering with Adversarial Training
Seanie Lee
Donggyu Kim
Jangwon Park
OOD
35
72
0
21 Oct 2019
A Neural Entity Coreference Resolution Review
Nikolaos Stylianou
I. Vlahavas
24
38
0
21 Oct 2019
Localization of Fake News Detection via Multitask Transfer Learning
Jan Christian Blaise Cruz
Julianne Agatha Tan
C. Cheng
23
33
0
21 Oct 2019
Constructing Artificial Data for Fine-tuning for Low-Resource Biomedical Text Tagging with Applications in PICO Annotation
Gaurav Singh
Zahra Sabet
John Shawe-Taylor
James Thomas
26
7
0
21 Oct 2019
Discovering the Compositional Structure of Vector Representations with Role Learning Networks
Paul Soulos
R. Thomas McCoy
Tal Linzen
P. Smolensky
CoGe
29
43
0
21 Oct 2019
Findings of the NLP4IF-2019 Shared Task on Fine-Grained Propaganda Detection
Giovanni Da San Martino
Alberto Barrón-Cedeño
Preslav Nakov
22
80
0
20 Oct 2019
LinesToFacePhoto: Face Photo Generation from Lines with Conditional Self-Attention Generative Adversarial Network
Yuhang Li
Xiao Chen
Feng Wu
Zhengjun Zha
CVBM
GAN
27
65
0
20 Oct 2019
Unsupervised High-Resolution Depth Learning From Videos With Dual Networks
Junsheng Zhou
Yuwang Wang
K. Qin
Wenjun Zeng
MDE
29
71
0
20 Oct 2019
Personalized Graph Neural Networks with Attention Mechanism for Session-Aware Recommendation
Mengqi Zhang
Shu Wu
Meng Gao
Xin Jiang
Ke Xu
Liang Wang
27
43
0
20 Oct 2019
XL-Editor: Post-editing Sentences with XLNet
Yong-Siang Shih
Wei-Cheng Chang
Yiming Yang
KELM
22
11
0
19 Oct 2019