ResearchTrend.AI
Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
    AIMat

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 8,379 papers shown
Training-free and Adaptive Sparse Attention for Efficient Long Video Generation
Yifei Xia
Suhan Ling
Fangcheng Fu
Yijiao Wang
Huixia Li
Xuefeng Xiao
Tengjiao Wang
VGen
147
11
0
28 Feb 2025
Learning to Substitute Components for Compositional Generalization
Zechao Li
Gangwei Jiang
Chenwang Wu
Ying Wei
Defu Lian
Enhong Chen
114
0
0
28 Feb 2025
Attend or Perish: Benchmarking Attention in Algorithmic Reasoning
Michal Spiegel
Michal Štefánik
Marek Kadlcík
Josef Kuchař
110
0
0
28 Feb 2025
Representing Signs as Signs: One-Shot ISLR to Facilitate Functional Sign Language Technologies
Toon Vandendriessche
Mathieu De Coster
Annelies Lejon
J. Dambre
SLR
133
0
0
27 Feb 2025
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
Shaharukh Khan
Ayush Tarun
Ali Faraz
Palash Kamble
Vivek Dahiya
Praveen Kumar Pokala
Ashish Kulkarni
Chandra Khatri
Abhinav Ravi
Shubham Agarwal
439
1
0
27 Feb 2025
Revisiting Kernel Attention with Correlated Gaussian Process Representation
Long Minh Bui
Tho Tran Huu
Duy-Tung Dinh
T. Nguyen
Trong Nghia Hoang
127
2
0
27 Feb 2025
A HEART for the environment: Transformer-Based Spatiotemporal Modeling for Air Quality Prediction
Norbert Bodendorfer
137
1
0
26 Feb 2025
Multiview graph dual-attention deep learning and contrastive learning for multi-criteria recommender systems
Saman Forouzandeh
P. Krivitsky
Rohitash Chandra
89
0
0
26 Feb 2025
Integrating Biological and Machine Intelligence: Attention Mechanisms in Brain-Computer Interfaces
Jing Wang
Weishan Ye
Jialin He
Li Zhang
G. Huang
Zhuliang Yu
Zhen Liang
108
0
0
26 Feb 2025
Application of Attention Mechanism with Bidirectional Long Short-Term Memory (BiLSTM) and CNN for Human Conflict Detection using Computer Vision
Erick da Silva Farias
Eduardo Palhares Junior
82
0
0
25 Feb 2025
Self-Adjust Softmax
Chuanyang Zheng
Yihang Gao
Guoxuan Chen
Han Shi
Jing Xiong
Xiaozhe Ren
Chao Huang
Xin Jiang
Zhiyu Li
Yu Li
81
1
0
25 Feb 2025
Recurrent Neural Networks for Dynamic VWAP Execution: Adaptive Trading Strategies with Temporal Kolmogorov-Arnold Networks
Remi Genet
148
1
0
25 Feb 2025
TabulaTime: A Novel Multimodal Deep Learning Framework for Advancing Acute Coronary Syndrome Prediction through Environmental and Clinical Data Integration
Xin Zhang
Liangxiu Han
Stephen White
Saad Hassan
Philip A Kalra
James Ritchie
Carl Diver
Jennie Shorley
112
1
0
24 Feb 2025
GraphFM: Graph Factorization Machines for Feature Interaction Modeling
Shu Wu
Zekun Li
Yunyue Su
Zeyu Cui
Xiaoyu Zhang
Liang Wang
285
23
0
24 Feb 2025
Neural Attention: A Novel Mechanism for Enhanced Expressive Power in Transformer Models
Andrew DiGiugno
Ausif Mahmood
108
0
0
24 Feb 2025
AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms
Feiyang Chen
Yu Cheng
Lei Wang
Yuqing Xia
Ziming Miao
...
Fan Yang
Jinbao Xue
Zhi Yang
M. Yang
H. Chen
127
1
0
24 Feb 2025
Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences
Yangshijie Zhang
AAML
86
0
0
24 Feb 2025
SR-LLM: Rethinking the Structured Representation in Large Language Model
Jiahuan Zhang
Tianheng Wang
Hanqing Wu
Ziyi Huang
Yulong Wu
Dongbai Chen
Linfeng Song
Yue Zhang
Guozheng Rao
Kaicheng Yu
85
1
0
21 Feb 2025
Data Attribution for Text-to-Image Models by Unlearning Synthesized Images
Sheng-Yu Wang
Aaron Hertzmann
Alexei A. Efros
Jun-Yan Zhu
Richard Zhang
TDI
209
3
0
21 Feb 2025
Connecting the geometry and dynamics of many-body complex systems with message passing neural operators
N. Gabriel
N. Johnson
George Em Karniadakis
AI4CE
115
0
0
21 Feb 2025
A Survey of Model Architectures in Information Retrieval
Zhichao Xu
Fengran Mo
Zhiqi Huang
Crystina Zhang
Puxuan Yu
Bei Wang
Jimmy J. Lin
Vivek Srikumar
KELM
3DV
182
2
0
21 Feb 2025
Quantum Recurrent Neural Networks with Encoder-Decoder for Time-Dependent Partial Differential Equations
Yuan Chen
Abdul Khaliq
Khaled M. Furati
AI4CE
205
0
0
20 Feb 2025
From Features to Graphs: Exploring Graph Structures and Pairwise Interactions via GNNs
Phaphontee Yamchote
Saw Nay Htet Win
Chainarong Amornbunchornvej
Thanapon Noraset
FAtt
140
0
0
19 Feb 2025
Generalized Attention Flow: Feature Attribution for Transformer Models via Maximum Flow
Behrooz Azarkhalili
Maxwell Libbrecht
79
0
0
14 Feb 2025
Theoretical Benefit and Limitation of Diffusion Language Model
Guhao Feng
Yihan Geng
Jian Guan
Wei Wu
Liwei Wang
Di He
DiffM
152
1
0
13 Feb 2025
A Deep Inverse-Mapping Model for a Flapping Robotic Wing
Hadar Sharvit
Raz Karl
Tsevi Beatus
102
0
0
13 Feb 2025
Handwritten Text Recognition: A Survey
Carlos Garrido-Munoz
Antonio Ríos-Vila
Jorge Calvo-Zaragoza
137
0
0
12 Feb 2025
Enhanced Load Forecasting with GAT-LSTM: Leveraging Grid and Temporal Features
Ugochukwu Orji
Çiçek Güven
Dan Stowell
AI4TS
64
0
0
12 Feb 2025
Beyond Literal Token Overlap: Token Alignability for Multilinguality
Katharina Hämmerl
Tomasz Limisiewicz
Jindrich Libovický
Alexander Fraser
73
0
0
10 Feb 2025
A Multimodal PDE Foundation Model for Prediction and Scientific Text Descriptions
Elisa Negrini
Yuxuan Liu
Liu Yang
Stanley Osher
Hayden Schaeffer
AI4CE
148
0
0
09 Feb 2025
Invizo: Arabic Handwritten Document Optical Character Recognition Solution
Alhossien Waly
Bassant Tarek
Ali Feteha
Rewan Yehia
Gasser Amr
Walid Gomaa
Ahmed M. Fares
146
0
0
07 Feb 2025
Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers
Adam Stooke
Rohit Prabhavalkar
K. Sim
P. M. Mengibar
187
0
0
06 Feb 2025
Distribution Transformers: Fast Approximate Bayesian Inference With On-The-Fly Prior Adaptation
George Whittle
Juliusz Ziomek
Jacob Rawling
Michael A. Osborne
210
4
0
04 Feb 2025
Fine-tuning Language Models for Recipe Generation: A Comparative Analysis and Benchmark Study
Anneketh Vij
Changhao Liu
Rahul Anil Nair
Theo Ho
Edward Shi
Ayan Bhowmick
134
1
0
04 Feb 2025
A comparison of translation performance between DeepL and Supertext
Alex Flückiger
Chantal Amrhein
Tim Graf
Frédéric Odermatt
Martin Pömsl
Philippe Schläpfer
Florian Schottmann
Samuel Läubli
ELM
128
0
0
04 Feb 2025
Efficient Language Modeling for Low-Resource Settings with Hybrid RNN-Transformer Architectures
Gabriel Lindenmaier
Sean Papay
Sebastian Padó
141
0
0
02 Feb 2025
Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities
Rebecca Mobbs
Dimitrios Makris
Vasileios Argyriou
67
0
0
02 Feb 2025
PolarQuant: Leveraging Polar Transformation for Efficient Key Cache Quantization and Decoding Acceleration
Songhao Wu
Ang Lv
Xiao Feng
Yanzhe Zhang
Xun Zhang
Guojun Yin
Wei Lin
Rui Yan
MQ
91
1
0
01 Feb 2025
Abstractive Text Summarization for Bangla Language Using NLP and Machine Learning Approaches
Asif Ahammad Miazee
Tonmoy Roy
Md Robiul Islam
Yeamin Safat
CVBM
41
0
0
28 Jan 2025
Efficient and Interpretable Neural Networks Using Complex Lehmer Transform
M. Ataei
Xiaogang Wang
92
0
0
28 Jan 2025
ZETA: Leveraging Z-order Curves for Efficient Top-k Attention
Qiuhao Zeng
Jerry Huang
Peng Lu
Gezheng Xu
Boxing Chen
Charles Ling
Boyu Wang
195
3
0
24 Jan 2025
A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference
Duc Hau Nguyen
Pascale Sébillot
128
5
0
23 Jan 2025
Infinite Time Turing Machines and their Applications
Rukmal Weerawarana
Maxwell Braun
AI4CE
15
0
0
22 Jan 2025
Extend Adversarial Policy Against Neural Machine Translation via Unknown Token
Wei Zou
Shujian Huang
Jiajun Chen
AAML
115
0
0
21 Jan 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection
Pengcheng Zhao
Zhixian He
Fuwei Zhang
Shujin Lin
Fan Zhou
136
2
0
18 Jan 2025
The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering
Anupam Pandey
Deepjyoti Bodo
Arpan Phukan
Asif Ekbal
150
0
0
13 Jan 2025
TFLAG: Towards Practical APT Detection via Deviation-Aware Learning on Temporal Provenance Graph
Wenhan Jiang
Tingting Chai
Hongri Liu
Kai Wang
Hongke Zhang
83
0
0
13 Jan 2025
Iconicity in Large Language Models
Anna Marklová
Jiří Milička
Leonid Ryvkin
Ľudmila Lacková Bennet
Libuše Kormaníková
89
0
0
10 Jan 2025
On Creating A Brain-To-Text Decoder
Zenon Lamprou
Yashar Moshfeghi
80
0
0
10 Jan 2025
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Nikita Neveditsin
Pawan Lingras
V. Mago
LM&MA
117
5
0
08 Jan 2025