ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1409.0473
  4. Cited By
Neural Machine Translation by Jointly Learning to Align and Translate

Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
    AIMat
ArXivPDFHTML

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 6,328 papers shown
Title
The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering
The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering
Anupam Pandey
Deepjyoti Bodo
Arpan Phukan
Asif Ekbal
51
0
0
13 Jan 2025
TFLAG:Towards Practical APT Detection via Deviation-Aware Learning on Temporal Provenance Graph
TFLAG:Towards Practical APT Detection via Deviation-Aware Learning on Temporal Provenance Graph
Wenhan Jiang
Tingting Chai
Hongri Liu
Kai Wang
Hongke Zhang
49
0
0
13 Jan 2025
Iconicity in Large Language Models
Iconicity in Large Language Models
Anna Marklová
Jiří Milička
Leonid Ryvkin
Ľudmila Lacková Bennet
Libuše Kormaníková
46
0
0
10 Jan 2025
On Creating A Brain-To-Text Decoder
On Creating A Brain-To-Text Decoder
Zenon Lamprou
Yashar Moshfeghi
43
0
0
10 Jan 2025
Koopman Learning with Episodic Memory
Koopman Learning with Episodic Memory
William T. Redman
Dean Huang
M. Fonoberova
Igor Mezić
46
0
0
08 Jan 2025
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Nikita Neveditsin
Pawan Lingras
V. Mago
LM&MA
63
4
0
08 Jan 2025
CORD: Generalizable Cooperation via Role Diversity
CORD: Generalizable Cooperation via Role Diversity
Kanefumi Matsuyama
Kefan Su
Jiangxing Wang
Deheng Ye
Zongqing Lu
45
0
0
04 Jan 2025
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers
Markus J. Buehler
AI4CE
37
1
0
04 Jan 2025
Diversity Optimization for Travelling Salesman Problem via Deep Reinforcement Learning
Qi Li
Zhiguang Cao
Yining Ma
Yaoxin Wu
Yue-jiao Gong
55
0
0
03 Jan 2025
Kolmogorov GAM Networks are all you need!
Sarah Polson
Vadim Sokolov
39
0
0
03 Jan 2025
Decoupling Knowledge and Reasoning in Transformers: A Modular Architecture with Generalized Cross-Attention
Zhenyu Guo
Wenguang Chen
51
0
0
01 Jan 2025
Deep Kalman Filters Can Filter
Deep Kalman Filters Can Filter
Blanka Hovart
Anastasis Kratsios
Yannick Limmer
Xuwei Yang
61
1
0
31 Dec 2024
Towards Visual Grounding: A Survey
Towards Visual Grounding: A Survey
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
67
4
0
31 Dec 2024
Out-of-distribution generalization via composition: a lens through induction heads in Transformers
Out-of-distribution generalization via composition: a lens through induction heads in Transformers
Jiajun Song
Zhuoyan Xu
Yiqiao Zhong
88
5
0
31 Dec 2024
Symbolic Disentangled Representations for Images
Symbolic Disentangled Representations for Images
Alexandr Korchemnyi
A. Kovalev
Aleksandr I. Panov
OCL
53
0
0
31 Dec 2024
The Jungle of Generative Drug Discovery: Traps, Treasures, and Ways Out
Rıza Özçelik
F. Grisoni
50
0
0
24 Dec 2024
Reconsidering SMT Over NMT for Closely Related Languages: A Case Study
  of Persian-Hindi Pair
Reconsidering SMT Over NMT for Closely Related Languages: A Case Study of Persian-Hindi Pair
Waisullah Yousofi
Pushpak Bhattacharyya
84
0
0
22 Dec 2024
Sensitive Image Classification by Vision Transformers
Sensitive Image Classification by Vision Transformers
Hanxian He
Campbell Wilson
Thanh Thi Nguyen
Janis Dalins
ViT
89
0
0
21 Dec 2024
Reframing Image Difference Captioning with BLIP2IDC and Synthetic
  Augmentation
Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation
Gautier Evennou
Antoine Chaffin
Vivien Chappelier
Ewa Kijak
DiffM
89
0
0
20 Dec 2024
Mention Attention for Pronoun Translation
Mention Attention for Pronoun Translation
Gongbo Tang
Christian Hardmeier
118
0
0
19 Dec 2024
On the Use of Deep Learning Models for Semantic Clone Detection
On the Use of Deep Learning Models for Semantic Clone Detection
Subroto Nag Pinku
Debajyoti Mondal
C. Roy
79
3
0
19 Dec 2024
Knowledge Distillation in RNN-Attention Models for Early Prediction of
  Student Performance
Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance
Sukrit Leelaluk
Cheng Tang
Valdemar Švábenský
Atsushi Shimada
83
1
0
19 Dec 2024
Language verY Rare for All
Language verY Rare for All
Ibrahim Merad
Amos Wolf
Ziad Mazzawi
Yannick Léo
77
0
0
18 Dec 2024
Development of an End-to-end Machine Learning System with Application to
  In-app Purchases
Development of an End-to-end Machine Learning System with Application to In-app Purchases
Dionysios Varelas
Elena Bonan
Lewis Anderson
Anders Englesson
Christoffer Åhrling
Adrian Chmielewski-Anders
OffRL
92
0
0
16 Dec 2024
A comprehensive GeoAI review: Progress, Challenges and Outlooks
A comprehensive GeoAI review: Progress, Challenges and Outlooks
Anasse Boutayeb
Iyad Lahsen-cherif
Ahmed El Khadimi
89
0
0
16 Dec 2024
Learning Latent Spaces for Domain Generalization in Time Series
  Forecasting
Learning Latent Spaces for Domain Generalization in Time Series Forecasting
Songgaojun Deng
Maarten de Rijke
CML
AI4TS
OOD
BDL
73
0
0
15 Dec 2024
The Superalignment of Superhuman Intelligence with Large Language Models
The Superalignment of Superhuman Intelligence with Large Language Models
Minlie Huang
Yingkang Wang
Shiyao Cui
Pei Ke
J. Tang
126
1
0
15 Dec 2024
One Pixel is All I Need
One Pixel is All I Need
Deng Siqin
Zhou Xiaoyi
ViT
235
0
0
14 Dec 2024
Training LayoutLM from Scratch for Efficient Named-Entity Recognition in
  the Insurance Domain
Training LayoutLM from Scratch for Efficient Named-Entity Recognition in the Insurance Domain
Benno Uthayasooriyar
A. Ly
Franck Vermet
Caio Corro
76
0
0
12 Dec 2024
A Self-guided Multimodal Approach to Enhancing Graph Representation
  Learning for Alzheimer's Diseases
A Self-guided Multimodal Approach to Enhancing Graph Representation Learning for Alzheimer's Diseases
Zhepeng Wang
Runxue Bao
Yawen Wu
Guodong Liu
Lei Yang
Liang Zhan
Feng Zheng
Weiwen Jiang
Yanfu Zhang
86
0
0
09 Dec 2024
Beyond [cls]: Exploring the true potential of Masked Image Modeling representations
Beyond [cls]: Exploring the true potential of Masked Image Modeling representations
Marcin Przewiȩźlikowski
Randall Balestriero
Wojciech Jasiński
Marek 'Smieja
Bartosz Zieliñski
74
0
0
04 Dec 2024
Towards Fault Tolerance in Multi-Agent Reinforcement Learning
Towards Fault Tolerance in Multi-Agent Reinforcement Learning
Yuchen Shi
Huaxin Pei
Liang Feng
Yi Zhang
D. Yao
75
0
0
30 Nov 2024
Does Self-Attention Need Separate Weights in Transformers?
Md. Kowsher
Nusrat Jahan Prottasha
Chun-Nam Yu
O. Garibay
Niloofar Yousefi
282
0
0
30 Nov 2024
Twisted Convolutional Networks (TCNs): Enhancing Feature Interactions for Non-Spatial Data Classification
Junbo Jacob Lian
67
0
0
29 Nov 2024
Towards Santali Linguistic Inclusion: Building the First
  Santali-to-English Translation Model using mT5 Transformer and Data
  Augmentation
Towards Santali Linguistic Inclusion: Building the First Santali-to-English Translation Model using mT5 Transformer and Data Augmentation
Syed Mohammed Mostaque Billah
Ateya Ahmed Subarna
Sudipta Nandi Sarna
Ahmad Shawkat Wasit
Anika Fariha
Asif Sushmit
Arig Yousuf Sadeque
67
0
0
29 Nov 2024
An Extensive Evaluation of Factual Consistency in Large Language Models
  for Data-to-Text Generation
An Extensive Evaluation of Factual Consistency in Large Language Models for Data-to-Text Generation
Joy Mahapatra
Utpal Garain
HILM
ALM
69
1
0
28 Nov 2024
Neural Networks Use Distance Metrics
Neural Networks Use Distance Metrics
Alan Oursland
64
0
0
26 Nov 2024
Unsupervised Event Outlier Detection in Continuous Time
Unsupervised Event Outlier Detection in Continuous Time
Somjit Nath
Yik Chau Lui
Siqi Liu
AI4TS
75
0
0
25 Nov 2024
Leveraging the Power of MLLMs for Gloss-Free Sign Language Translation
Leveraging the Power of MLLMs for Gloss-Free Sign Language Translation
Jungeun Kim
Hyeongwoo Jeon
Jongseong Bae
Ha Young Kim
SLR
90
0
0
25 Nov 2024
Transforming NLU with Babylon: A Case Study in Development of Real-time,
  Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru
  Ordering
Transforming NLU with Babylon: A Case Study in Development of Real-time, Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru Ordering
Mostafa Varzaneh
Pooja Voladoddi
Tanmay Bakshi
Uma Gunturi
75
0
0
22 Nov 2024
NMT-Obfuscator Attack: Ignore a sentence in translation with only one
  word
NMT-Obfuscator Attack: Ignore a sentence in translation with only one word
Sahar Sadrizadeh
César Descalzo
Ljiljana Dolamic
P. Frossard
AAML
81
0
0
19 Nov 2024
Forecasting Application Counts in Talent Acquisition Platforms:
  Harnessing Multimodal Signals using LMs
Forecasting Application Counts in Talent Acquisition Platforms: Harnessing Multimodal Signals using LMs
Md. Ahsanul Kabir
Kareem E. Abdelfatah
Shushan He
M. Korayem
Mohammad Al Hasan
AI4TS
75
0
0
19 Nov 2024
Transcending Language Boundaries: Harnessing LLMs for Low-Resource Language Translation
Peng Shu
Jianfei Chen
Zichen Liu
Haoran Wang
Zihao Wu
...
Constance Owl
Xiaoming Zhai
Ninghao Liu
Claudio Saunt
Tianming Liu
40
6
0
18 Nov 2024
An exploration of the effect of quantisation on energy consumption and
  inference time of StarCoder2
An exploration of the effect of quantisation on energy consumption and inference time of StarCoder2
Pepijn de Reus
Ana Oprescu
Jelle Zuidema
MQ
92
1
0
15 Nov 2024
On the Shortcut Learning in Multilingual Neural Machine Translation
On the Shortcut Learning in Multilingual Neural Machine Translation
Wenxuan Wang
Wenxiang Jiao
Jen-tse Huang
Zhaopeng Tu
Michael R. Lyu
223
1
0
15 Nov 2024
Multidimensional Byte Pair Encoding: Shortened Sequences for Improved
  Visual Data Generation
Multidimensional Byte Pair Encoding: Shortened Sequences for Improved Visual Data Generation
Tim Elsner
Paula Usinger
Julius Nehring-Wirxel
Gregor Kobsik
Victor Czech
Yanjiang He
I. Lim
Leif Kobbelt
39
1
0
15 Nov 2024
Neural Operators Can Play Dynamic Stackelberg Games
Neural Operators Can Play Dynamic Stackelberg Games
Guillermo Alvarez
Ibrahim Ekren
Anastasis Kratsios
Xuwei Yang
35
0
0
14 Nov 2024
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic
  Survey
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey
Longxuan Ma
Mingda Li
Weinan Zhang
Jiapeng Li
Ting Liu
48
16
0
14 Nov 2024
More Expressive Attention with Negative Weights
More Expressive Attention with Negative Weights
Ang Lv
Ruobing Xie
Shuaipeng Li
Jiayi Liao
Xingchen Sun
Zhanhui Kang
Di Wang
Rui Yan
42
0
0
11 Nov 2024
Understanding Scaling Laws with Statistical and Approximation Theory for
  Transformer Neural Networks on Intrinsically Low-dimensional Data
Understanding Scaling Laws with Statistical and Approximation Theory for Transformer Neural Networks on Intrinsically Low-dimensional Data
Alex Havrilla
Wenjing Liao
39
8
0
11 Nov 2024
Previous
12345...125126127
Next