Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1409.0473
Cited By
v1
v2
v3
v4
v5
v6
v7 (latest)
Neural Machine Translation by Jointly Learning to Align and Translate
1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Neural Machine Translation by Jointly Learning to Align and Translate"
50 / 8,379 papers shown
Title
Koopman Learning with Episodic Memory
William T. Redman
Dean Huang
M. Fonoberova
Igor Mezić
93
0
0
08 Jan 2025
CORD: Generalizable Cooperation via Role Diversity
Kanefumi Matsuyama
Kefan Su
Jiangxing Wang
Deheng Ye
Zongqing Lu
106
0
0
04 Jan 2025
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers
Markus J. Buehler
AI4CE
154
3
0
04 Jan 2025
Kolmogorov GAM Networks are all you need!
Sarah Polson
Vadim Sokolov
68
0
0
03 Jan 2025
Diversity Optimization for Travelling Salesman Problem via Deep Reinforcement Learning
Qi Li
Zhiguang Cao
Yining Ma
Yaoxin Wu
Yue-Jiao Gong
96
0
0
03 Jan 2025
Decoupling Knowledge and Reasoning in Transformers: A Modular Architecture with Generalized Cross-Attention
Zhenyu Guo
Wenguang Chen
78
0
0
01 Jan 2025
Symbolic Disentangled Representations for Images
Alexandr Korchemnyi
A. Kovalev
Aleksandr I. Panov
OCL
129
0
0
31 Dec 2024
Out-of-distribution generalization via composition: a lens through induction heads in Transformers
Jiajun Song
Zhuoyan Xu
Yiqiao Zhong
158
10
0
31 Dec 2024
Towards Visual Grounding: A Survey
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
282
5
0
31 Dec 2024
The Jungle of Generative Drug Discovery: Traps, Treasures, and Ways Out
Rıza Özçelik
F. Grisoni
86
0
0
24 Dec 2024
Reconsidering SMT Over NMT for Closely Related Languages: A Case Study of Persian-Hindi Pair
Waisullah Yousofi
Pushpak Bhattacharyya
116
0
0
22 Dec 2024
Sensitive Image Classification by Vision Transformers
Hanxian He
Campbell Wilson
Thanh Thi Nguyen
Janis Dalins
ViT
117
0
0
21 Dec 2024
Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation
Gautier Evennou
Antoine Chaffin
Vivien Chappelier
Ewa Kijak
DiffM
125
0
0
20 Dec 2024
Mention Attention for Pronoun Translation
Gongbo Tang
Christian Hardmeier
145
0
0
19 Dec 2024
On the Use of Deep Learning Models for Semantic Clone Detection
Subroto Nag Pinku
Debajyoti Mondal
C. Roy
107
3
0
19 Dec 2024
Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance
Sukrit Leelaluk
Cheng Tang
Valdemar Švábenský
Atsushi Shimada
119
1
0
19 Dec 2024
Language verY Rare for All
Ibrahim Merad
Amos Wolf
Ziad Mazzawi
Yannick Léo
110
0
0
18 Dec 2024
Development of an End-to-end Machine Learning System with Application to In-app Purchases
Dionysios Varelas
Elena Bonan
Lewis Anderson
Anders Englesson
Christoffer Åhrling
Adrian Chmielewski-Anders
OffRL
180
0
0
16 Dec 2024
A comprehensive GeoAI review: Progress, Challenges and Outlooks
Anasse Boutayeb
Iyad Lahsen-cherif
Ahmed El Khadimi
102
0
0
16 Dec 2024
Learning Latent Spaces for Domain Generalization in Time Series Forecasting
Songgaojun Deng
Maarten de Rijke
CML
AI4TS
OOD
BDL
122
0
0
15 Dec 2024
The Superalignment of Superhuman Intelligence with Large Language Models
Minlie Huang
Yingkang Wang
Shiyao Cui
Pei Ke
J. Tang
176
1
0
15 Dec 2024
One Pixel is All I Need
Deng Siqin
Zhou Xiaoyi
ViT
453
0
0
14 Dec 2024
Training LayoutLM from Scratch for Efficient Named-Entity Recognition in the Insurance Domain
Benno Uthayasooriyar
A. Ly
Franck Vermet
Caio Corro
94
0
0
12 Dec 2024
A Self-guided Multimodal Approach to Enhancing Graph Representation Learning for Alzheimer's Diseases
Zhepeng Wang
Runxue Bao
Yawen Wu
Guodong Liu
Lei Yang
Liang Zhan
Feng Zheng
Weiwen Jiang
Yanfu Zhang
183
0
0
09 Dec 2024
Beyond [cls]: Exploring the true potential of Masked Image Modeling representations
Marcin Przewiȩźlikowski
Randall Balestriero
Wojciech Jasiński
Marek 'Smieja
Bartosz Zieliñski
223
1
0
04 Dec 2024
Towards Fault Tolerance in Multi-Agent Reinforcement Learning
Yuchen Shi
Huaxin Pei
Liang Feng
Jianming Hu
Dingyi Yao
110
0
0
30 Nov 2024
Does Self-Attention Need Separate Weights in Transformers?
Md. Kowsher
Nusrat Jahan Prottasha
Chun-Nam Yu
O. Garibay
Niloofar Yousefi
547
1
0
30 Nov 2024
Twisted Convolutional Networks (TCNs): Enhancing Feature Interactions for Non-Spatial Data Classification
Junbo Jacob Lian
74
0
0
29 Nov 2024
Towards Santali Linguistic Inclusion: Building the First Santali-to-English Translation Model using mT5 Transformer and Data Augmentation
Syed Mohammed Mostaque Billah
Ateya Ahmed Subarna
Sudipta Nandi Sarna
Ahmad Shawkat Wasit
Anika Fariha
Asif Sushmit
Arig Yousuf Sadeque
82
0
0
29 Nov 2024
An Extensive Evaluation of Factual Consistency in Large Language Models for Data-to-Text Generation
Joy Mahapatra
Utpal Garain
HILM
ALM
100
2
0
28 Nov 2024
Neural Networks Use Distance Metrics
Alan Oursland
79
0
0
26 Nov 2024
Unsupervised Event Outlier Detection in Continuous Time
Somjit Nath
Yik Chau Lui
Siqi Liu
AI4TS
116
0
0
25 Nov 2024
Leveraging the Power of MLLMs for Gloss-Free Sign Language Translation
Jungeun Kim
Hyeongwoo Jeon
Jongseong Bae
Ha Young Kim
SLR
122
0
0
25 Nov 2024
Transforming NLU with Babylon: A Case Study in Development of Real-time, Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru Ordering
Mostafa Varzaneh
Pooja Voladoddi
Tanmay Bakshi
Uma Gunturi
102
0
0
22 Nov 2024
NMT-Obfuscator Attack: Ignore a sentence in translation with only one word
Sahar Sadrizadeh
César Descalzo
Ljiljana Dolamic
P. Frossard
AAML
115
0
0
19 Nov 2024
Forecasting Application Counts in Talent Acquisition Platforms: Harnessing Multimodal Signals using LMs
Md. Ahsanul Kabir
Kareem E. Abdelfatah
Shushan He
M. Korayem
Mohammad Al Hasan
AI4TS
84
0
0
19 Nov 2024
Transcending Language Boundaries: Harnessing LLMs for Low-Resource Language Translation
Peng Shu
Jianfei Chen
Ziqiang Liu
Haoran Wang
Zihao Wu
...
Constance Owl
Xiaoming Zhai
Ninghao Liu
Claudio Saunt
Tianming Liu
89
8
0
18 Nov 2024
An exploration of the effect of quantisation on energy consumption and inference time of StarCoder2
Pepijn de Reus
Ana Oprescu
Jelle Zuidema
MQ
140
1
0
15 Nov 2024
On the Shortcut Learning in Multilingual Neural Machine Translation
Wenxuan Wang
Wenxiang Jiao
Jen-tse Huang
Zhaopeng Tu
Michael R. Lyu
446
1
0
15 Nov 2024
Multidimensional Byte Pair Encoding: Shortened Sequences for Improved Visual Data Generation
Tim Elsner
Paula Usinger
Julius Nehring-Wirxel
Gregor Kobsik
Victor Czech
Yanjiang He
I. Lim
Leif Kobbelt
91
1
0
15 Nov 2024
Neural Operators Can Play Dynamic Stackelberg Games
Guillermo Alvarez
Ibrahim Ekren
Anastasis Kratsios
Xuwei Yang
68
0
0
14 Nov 2024
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey
Longxuan Ma
Mingda Li
Weinan Zhang
Jiapeng Li
Ting Liu
124
17
0
14 Nov 2024
More Expressive Attention with Negative Weights
Ang Lv
Ruobing Xie
Shuaipeng Li
Jiayi Liao
Xingwu Sun
Zhanhui Kang
Di Wang
Rui Yan
120
1
0
11 Nov 2024
Understanding Scaling Laws with Statistical and Approximation Theory for Transformer Neural Networks on Intrinsically Low-dimensional Data
Alex Havrilla
Wenjing Liao
87
12
0
11 Nov 2024
CULL-MT: Compression Using Language and Layer pruning for Machine Translation
Pedram Rostami
M. Dousti
97
1
0
10 Nov 2024
Trends, Challenges, and Future Directions in Deep Learning for Glaucoma: A Systematic Review
Mahtab Faraji
Homa Rashidisabet
George R. Nahass
R. Chan
Thasarat S Vajaranant
Darvin Yi
80
0
0
07 Nov 2024
Pruning Literals for Highly Efficient Explainability at Word Level
Rohan Kumar Yadav
Bimal Bhattarai
Abhik Jana
Lei Jiao
Seid Muhie Yimam
56
0
0
07 Nov 2024
LASER: Attention with Exponential Transformation
Sai Surya Duvvuri
Inderjit Dhillon
50
1
0
05 Nov 2024
Grouped Discrete Representation for Object-Centric Learning
Rongzhen Zhao
V. Wang
Arno Solin
Joni Pajarinen
BDL
OCL
82
1
0
04 Nov 2024
BiT-MamSleep: Bidirectional Temporal Mamba for EEG Sleep Staging
Xinliang Zhou
Yuzhe Han
Zhenpeng Chen
Chenyu Liu
Yi Ding
Ziyu Jia
Yang Liu
Mamba
68
1
0
03 Nov 2024
Previous
1
2
3
4
5
6
...
166
167
168
Next