ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1810.07595
  4. Cited By
An Analysis of Attention Mechanisms: The Case of Word Sense
  Disambiguation in Neural Machine Translation

An Analysis of Attention Mechanisms: The Case of Word Sense Disambiguation in Neural Machine Translation

17 October 2018
Gongbo Tang
Rico Sennrich
Joakim Nivre
ArXivPDFHTML

Papers citing "An Analysis of Attention Mechanisms: The Case of Word Sense Disambiguation in Neural Machine Translation"

41 / 41 papers shown
Title
StreamAtt: Direct Streaming Speech-to-Text Translation with
  Attention-based Audio History Selection
StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection
Sara Papi
Marco Gaido
Matteo Negri
L. Bentivogli
79
4
0
10 Jun 2024
SBAAM! Eliminating Transcript Dependency in Automatic Subtitling
SBAAM! Eliminating Transcript Dependency in Automatic Subtitling
Marco Gaido
Sara Papi
Matteo Negri
Mauro Cettolo
L. Bentivogli
43
1
0
17 May 2024
Code-Switching with Word Senses for Pretraining in Neural Machine
  Translation
Code-Switching with Word Senses for Pretraining in Neural Machine Translation
Vivek Iyer
Edoardo Barba
Alexandra Birch
Jeff Z. Pan
Roberto Navigli
20
3
0
21 Oct 2023
AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide
  for Simultaneous Speech Translation
AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation
Sara Papi
Marco Turchi
Matteo Negri
35
20
0
19 May 2023
Learning Homographic Disambiguation Representation for Neural Machine
  Translation
Learning Homographic Disambiguation Representation for Neural Machine Translation
Weixuan Wang
Wei Peng
Qun Liu
30
0
0
12 Apr 2023
Compositional Zero-Shot Domain Transfer with Text-to-Text Models
Compositional Zero-Shot Domain Transfer with Text-to-Text Models
Fangyu Liu
Qianchu Liu
Shruthi Bannur
Fernando Pérez-García
Naoto Usuyama
...
A. Nori
Hoifung Poon
Javier Alvarez-Valle
Ozan Oktay
Stephanie L. Hyland
VLM
51
6
0
23 Mar 2023
Attention as a Guide for Simultaneous Speech Translation
Attention as a Guide for Simultaneous Speech Translation
Sara Papi
Matteo Negri
Marco Turchi
26
30
0
15 Dec 2022
Can Transformer be Too Compositional? Analysing Idiom Processing in
  Neural Machine Translation
Can Transformer be Too Compositional? Analysing Idiom Processing in Neural Machine Translation
Verna Dankers
Christopher G. Lucas
Ivan Titov
38
36
0
30 May 2022
Building on Huang et al. GlossBERT for Word Sense Disambiguation
Building on Huang et al. GlossBERT for Word Sense Disambiguation
Nikhil Patel
James Hale
Kanika Jindal
Apoorva Sharma
Yichun Yu
11
2
0
14 Dec 2021
Incorporating Residual and Normalization Layers into Analysis of Masked
  Language Models
Incorporating Residual and Normalization Layers into Analysis of Masked Language Models
Goro Kobayashi
Tatsuki Kuribayashi
Sho Yokoi
Kentaro Inui
160
46
0
15 Sep 2021
Inspecting the concept knowledge graph encoded by modern language models
Inspecting the concept knowledge graph encoded by modern language models
Carlos Aspillaga
Marcelo Mendoza
Alvaro Soto
27
13
0
27 May 2021
An empirical analysis of phrase-based and neural machine translation
An empirical analysis of phrase-based and neural machine translation
Hamidreza Ghader
29
1
0
04 Mar 2021
Gender Bias in Multilingual Neural Machine Translation: The Architecture
  Matters
Gender Bias in Multilingual Neural Machine Translation: The Architecture Matters
Marta R. Costa-jussá
Carlos Escolano
Christine Basta
Javier Ferrando
Roser Batlle-Roca
Ksenia Kharitonova
22
18
0
24 Dec 2020
Machine Learning for Detecting Data Exfiltration: A Review
Machine Learning for Detecting Data Exfiltration: A Review
Bushra Sabir
Faheem Ullah
M. Babar
R. Gaire
AAML
19
31
0
17 Dec 2020
Understanding Pure Character-Based Neural Machine Translation: The Case
  of Translating Finnish into English
Understanding Pure Character-Based Neural Machine Translation: The Case of Translating Finnish into English
Gongbo Tang
Rico Sennrich
Joakim Nivre
30
7
0
06 Nov 2020
Focus on the present: a regularization method for the ASR source-target
  attention layer
Focus on the present: a regularization method for the ASR source-target attention layer
Nanxin Chen
Piotr Żelasko
Jesús Villalba
Najim Dehak
15
3
0
02 Nov 2020
Analyzing the Source and Target Contributions to Predictions in Neural
  Machine Translation
Analyzing the Source and Target Contributions to Predictions in Neural Machine Translation
Elena Voita
Rico Sennrich
Ivan Titov
19
85
0
21 Oct 2020
It's not a Non-Issue: Negation as a Source of Error in Machine
  Translation
It's not a Non-Issue: Negation as a Source of Error in Machine Translation
Md Mosharaf Hossain
Antonios Anastasopoulos
Eduardo Blanco
Alexis Palmer
16
25
0
12 Oct 2020
Linguistic Profiling of a Neural Language Model
Linguistic Profiling of a Neural Language Model
Alessio Miaschi
D. Brunato
F. Dell’Orletta
Giulia Venturi
36
46
0
05 Oct 2020
Pruning Redundant Mappings in Transformer Models via Spectral-Normalized
  Identity Prior
Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity Prior
Zi Lin
Jeremiah Zhe Liu
Ziao Yang
Nan Hua
Dan Roth
30
46
0
05 Oct 2020
Alleviating the Inequality of Attention Heads for Neural Machine
  Translation
Alleviating the Inequality of Attention Heads for Neural Machine Translation
Zewei Sun
Shujian Huang
Xinyu Dai
Jiajun Chen
21
7
0
21 Sep 2020
Fine-grained Human Evaluation of Transformer and Recurrent Approaches to
  Neural Machine Translation for English-to-Chinese
Fine-grained Human Evaluation of Transformer and Recurrent Approaches to Neural Machine Translation for English-to-Chinese
Yuying Ye
Antonio Toral
13
7
0
15 Jun 2020
Disentangled Non-Local Neural Networks
Disentangled Non-Local Neural Networks
Minghao Yin
Zhuliang Yao
Yue Cao
Xiu Li
Zheng-Wei Zhang
Stephen Lin
Han Hu
17
327
0
11 Jun 2020
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel
  Corpora
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora
Lifeng Han
Gareth J. F. Jones
Alan F. Smeaton
16
14
0
21 May 2020
Attention is Not Only a Weight: Analyzing Transformers with Vector Norms
Attention is Not Only a Weight: Analyzing Transformers with Vector Norms
Goro Kobayashi
Tatsuki Kuribayashi
Sho Yokoi
Kentaro Inui
30
15
0
21 Apr 2020
Neural Machine Translation: A Review and Survey
Neural Machine Translation: A Review and Survey
Felix Stahlberg
3DV
AI4TS
MedIm
25
312
0
04 Dec 2019
What do you mean, BERT? Assessing BERT as a Distributional Semantics
  Model
What do you mean, BERT? Assessing BERT as a Distributional Semantics Model
Timothee Mickus
Denis Paperno
Mathieu Constant
Kees van Deemter
26
45
0
13 Nov 2019
Interrogating the Explanatory Power of Attention in Neural Machine
  Translation
Interrogating the Explanatory Power of Attention in Neural Machine Translation
Pooya Moradi
Nishant Kambhatla
Anoop Sarkar
21
16
0
30 Sep 2019
Jointly Learning to Align and Translate with Transformer Models
Jointly Learning to Align and Translate with Transformer Models
Sarthak Garg
Stephan Peitz
Udhyakumar Nallasamy
Matthias Paulik
11
172
0
04 Sep 2019
Encoders Help You Disambiguate Word Senses in Neural Machine Translation
Encoders Help You Disambiguate Word Senses in Neural Machine Translation
Gongbo Tang
Rico Sennrich
Joakim Nivre
19
22
0
30 Aug 2019
Hard but Robust, Easy but Sensitive: How Encoder and Decoder Perform in
  Neural Machine Translation
Hard but Robust, Easy but Sensitive: How Encoder and Decoder Perform in Neural Machine Translation
Tianyu He
Xu Tan
Tao Qin
25
12
0
17 Aug 2019
On Identifiability in Transformers
On Identifiability in Transformers
Gino Brunner
Yang Liu
Damian Pascual
Oliver Richter
Massimiliano Ciaramita
Roger Wattenhofer
ViT
30
186
0
12 Aug 2019
Understanding Neural Machine Translation by Simplification: The Case of
  Encoder-free Models
Understanding Neural Machine Translation by Simplification: The Case of Encoder-free Models
Gongbo Tang
Rico Sennrich
Joakim Nivre
AI4CE
24
14
0
18 Jul 2019
Saliency-driven Word Alignment Interpretation for Neural Machine
  Translation
Saliency-driven Word Alignment Interpretation for Neural Machine Translation
Shuoyang Ding
Hainan Xu
Philipp Koehn
30
55
0
25 Jun 2019
Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy
  Lifting, the Rest Can Be Pruned
Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, the Rest Can Be Pruned
Elena Voita
David Talbot
F. Moiseev
Rico Sennrich
Ivan Titov
33
1,105
0
23 May 2019
An Empirical Study of Spatial Attention Mechanisms in Deep Networks
An Empirical Study of Spatial Attention Mechanisms in Deep Networks
Xizhou Zhu
Dazhi Cheng
Zheng-Wei Zhang
Stephen Lin
Jifeng Dai
40
403
0
11 Apr 2019
Augmenting Neural Machine Translation with Knowledge Graphs
Augmenting Neural Machine Translation with Knowledge Graphs
Diego Moussallem
Mihael Arcan
A. N. Ngomo
P. Buitelaar
20
21
0
23 Feb 2019
Context in Neural Machine Translation: A Review of Models and
  Evaluations
Context in Neural Machine Translation: A Review of Models and Evaluations
Andrei Popescu-Belis
MedIm
10
28
0
25 Jan 2019
Analysis Methods in Neural Language Processing: A Survey
Analysis Methods in Neural Language Processing: A Survey
Yonatan Belinkov
James R. Glass
39
547
0
21 Dec 2018
Six Challenges for Neural Machine Translation
Six Challenges for Neural Machine Translation
Philipp Koehn
Rebecca Knowles
AAML
AIMat
224
1,209
0
12 Jun 2017
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,929
0
17 Aug 2015
1