ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1409.0473
  4. Cited By
Neural Machine Translation by Jointly Learning to Align and Translate

Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
    AIMat
ArXivPDFHTML

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 5,946 papers shown
Title
Learning to Dissipate Energy in Oscillatory State-Space Models
Learning to Dissipate Energy in Oscillatory State-Space Models
Jared Boyer
T. Konstantin Rusch
Daniela Rus
4
0
0
17 May 2025
Security through the Eyes of AI: How Visualization is Shaping Malware Detection
Security through the Eyes of AI: How Visualization is Shaping Malware Detection
Matteo Brosolo
A. Aazami
R. Agarwal
M. Prabhakaran
S. Nicolazzo
Antonino Nocera
V. P.
AAML
34
0
0
12 May 2025
Learning Penalty for Optimal Partitioning via Automatic Feature Extraction
Learning Penalty for Optimal Partitioning via Automatic Feature Extraction
Tung L Nguyen
T. D. Hocking
16
0
0
12 May 2025
Putting It All into Context: Simplifying Agents with LCLMs
Putting It All into Context: Simplifying Agents with LCLMs
Mingjian Jiang
Yangjun Ruan
Luis Lastras
Pavan Kapanipathi
Tatsunori Hashimoto
LLMAG
31
0
0
12 May 2025
Generative Models for Long Time Series: Approximately Equivariant Recurrent Network Structures for an Adjusted Training Scheme
Generative Models for Long Time Series: Approximately Equivariant Recurrent Network Structures for an Adjusted Training Scheme
Ruwen Fulek
Markus Lange-Hegermann
AI4TS
40
0
0
08 May 2025
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer
Young-Hu Park
R.-H. Park
Hyung-Min Park
49
0
0
07 May 2025
QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach
QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach
Shouyang Dong
Yuanbo Wen
Jun Bi
Di Huang
Jiaming Guo
...
Yifan Hao
Xuehai Zhou
Tianshi Chen
Qi Guo
Yunji Chen
32
0
0
04 May 2025
A Comprehensive Analysis of Adversarial Attacks against Spam Filters
A Comprehensive Analysis of Adversarial Attacks against Spam Filters
Esra Hotoğlu
Sevil Sen
Burcu Can
AAML
29
0
0
04 May 2025
GeloVec: Higher Dimensional Geometric Smoothing for Coherent Visual Feature Extraction in Image Segmentation
GeloVec: Higher Dimensional Geometric Smoothing for Coherent Visual Feature Extraction in Image Segmentation
Boris Kriuk
Matey Yordanov
35
0
0
02 May 2025
Harnessing Structured Knowledge: A Concept Map-Based Approach for High-Quality Multiple Choice Question Generation with Effective Distractors
Harnessing Structured Knowledge: A Concept Map-Based Approach for High-Quality Multiple Choice Question Generation with Effective Distractors
Nicy Scaria
Silvester John Joseph Kennedy
Diksha Seth
Ananya Thakur
Deepak N. Subramani
AI4Ed
23
0
0
02 May 2025
SA-GAT-SR: Self-Adaptable Graph Attention Networks with Symbolic Regression for high-fidelity material property prediction
SA-GAT-SR: Self-Adaptable Graph Attention Networks with Symbolic Regression for high-fidelity material property prediction
Liu Junchi
Tang Ying
Tretiak Sergei
Duan Wenhui
Zhou Liujiang
38
0
0
01 May 2025
Polysemy of Synthetic Neurons Towards a New Type of Explanatory Categorical Vector Spaces
Polysemy of Synthetic Neurons Towards a New Type of Explanatory Categorical Vector Spaces
Michael Pichat
William Pogrund
Paloma Pichat
Judicael Poumay
Armanouche Gasparian
Samuel Demarchi
Martin Corbet
Alois Georgeon
Michael Veillet-Guillem
MILM
29
0
0
30 Apr 2025
Hierarchical Multi-Label Generation with Probabilistic Level-Constraint
Hierarchical Multi-Label Generation with Probabilistic Level-Constraint
Linqing Chen
Weilei Wang
Wentao Wu
Hanmeng Zhong
37
0
0
30 Apr 2025
A comparative study of deep learning and ensemble learning to extend the horizon of traffic forecasting
A comparative study of deep learning and ensemble learning to extend the horizon of traffic forecasting
Xiao Zheng
Saeed Asadi Bagloee
Majid Sarvi
AI4TS
43
0
0
30 Apr 2025
Leveraging Depth Maps and Attention Mechanisms for Enhanced Image Inpainting
Leveraging Depth Maps and Attention Mechanisms for Enhanced Image Inpainting
Jin Hyun Park
Harine Choi
Praewa Pitiphat
54
0
0
29 Apr 2025
Jekyll-and-Hyde Tipping Point in an AI's Behavior
Jekyll-and-Hyde Tipping Point in an AI's Behavior
Neil F. Johnson
Frank Yingjie Huo
46
0
0
29 Apr 2025
Softpick: No Attention Sink, No Massive Activations with Rectified Softmax
Softpick: No Attention Sink, No Massive Activations with Rectified Softmax
Zayd Muhammad Kawakibi Zuhri
Erland Hilman Fuadi
Alham Fikri Aji
33
0
0
29 Apr 2025
Exploiting Inter-Sample Correlation and Intra-Sample Redundancy for Partially Relevant Video Retrieval
Exploiting Inter-Sample Correlation and Intra-Sample Redundancy for Partially Relevant Video Retrieval
Junlong Ren
Gangjian Zhang
Y. Hu
Jian Shu
Haoran Wang
29
0
0
28 Apr 2025
Hierarchical Reinforcement Learning in Multi-Goal Spatial Navigation with Autonomous Mobile Robots
Hierarchical Reinforcement Learning in Multi-Goal Spatial Navigation with Autonomous Mobile Robots
Brendon Johnson
Alfredo Weitzenfeld
29
0
0
26 Apr 2025
Spatial Speech Translation: Translating Across Space With Binaural Hearables
Spatial Speech Translation: Translating Across Space With Binaural Hearables
Tuochao Chen
Qirui Wang
Runlin He
Shyam Gollakota
31
0
0
25 Apr 2025
The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs
The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs
Piotr Nawrot
Robert Li
Renjie Huang
Sebastian Ruder
Kelly Marchisio
E. Ponti
39
0
0
24 Apr 2025
Low-Resource Neural Machine Translation Using Recurrent Neural Networks and Transfer Learning: A Case Study on English-to-Igbo
Low-Resource Neural Machine Translation Using Recurrent Neural Networks and Transfer Learning: A Case Study on English-to-Igbo
Ocheme Anthony Ekle
Biswarup Das
34
0
0
24 Apr 2025
A Novel Hybrid Approach Using an Attention-Based Transformer + GRU Model for Predicting Cryptocurrency Prices
A Novel Hybrid Approach Using an Attention-Based Transformer + GRU Model for Predicting Cryptocurrency Prices
Esam Mahdi
C. Martin-Barreiro
X. Cabezas
AI4TS
34
0
0
23 Apr 2025
GADS: A Super Lightweight Model for Head Pose Estimation
GADS: A Super Lightweight Model for Head Pose Estimation
Menan Velayuthan
Asiri Gawesha
Purushoth Velayuthan
N. Kodagoda
D. Kasthurirathna
Pradeepa Samarasinghe
3DH
36
0
0
22 Apr 2025
Quantitative Clustering in Mean-Field Transformer Models
Quantitative Clustering in Mean-Field Transformer Models
Shi Chen
Zhengjiang Lin
Yury Polyanskiy
Philippe Rigollet
38
0
0
20 Apr 2025
From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs
From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs
Jiliang Ni
Jiachen Pu
Zhongyi Yang
Kun Zhou
Hui Wang
Xiaoliang Xiao
Dakui Wang
Xin Li
Jingfeng Luo
Conggang Hu
37
0
0
18 Apr 2025
Learning to Attribute with Attention
Learning to Attribute with Attention
Benjamin Cohen-Wang
Yung-Sung Chuang
Aleksander Madry
30
0
0
18 Apr 2025
Explainable Scene Understanding with Qualitative Representations and Graph Neural Networks
Explainable Scene Understanding with Qualitative Representations and Graph Neural Networks
Nassim Belmecheri
A. Gotlieb
Nadjib Lazaar
Helge Spieker
GNN
51
0
0
17 Apr 2025
Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations
Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations
Yiyou Sun
Y. Gai
Lijie Chen
Abhilasha Ravichander
Yejin Choi
D. Song
HILM
57
0
0
17 Apr 2025
Simplifying Graph Transformers
Simplifying Graph Transformers
Liheng Ma
Soumyasundar Pal
Yingxue Zhang
Philip Torr
Mark J. Coates
28
0
0
17 Apr 2025
Sparks of Science: Hypothesis Generation Using Structured Paper Data
Sparks of Science: Hypothesis Generation Using Structured Paper Data
Charles OÑeill
Tirthankar Ghosal
Roberta Răileanu
Mike Walmsley
Thang Bui
Kevin Schawinski
I. Ciucă
LRM
56
0
0
17 Apr 2025
SemDiff: Generating Natural Unrestricted Adversarial Examples via Semantic Attributes Optimization in Diffusion Models
SemDiff: Generating Natural Unrestricted Adversarial Examples via Semantic Attributes Optimization in Diffusion Models
Zeyu Dai
Shengcai Liu
Rui He
Jiahao Wu
Ning Lu
Wenqi Fan
Qing Li
Ke Tang
DiffM
AAML
38
0
0
16 Apr 2025
Clarifying Ambiguities: on the Role of Ambiguity Types in Prompting Methods for Clarification Generation
Clarifying Ambiguities: on the Role of Ambiguity Types in Prompting Methods for Clarification Generation
Anfu Tang
Laure Soulier
Vincent Guigue
LRM
82
0
0
16 Apr 2025
Multilingual Contextualization of Large Language Models for Document-Level Machine Translation
Multilingual Contextualization of Large Language Models for Document-Level Machine Translation
Miguel Moura Ramos
Patrick Fernandes
Sweta Agrawal
André F.T. Martins
71
0
0
16 Apr 2025
DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis
DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis
Efthymios Georgiou
V. Katsouros
Yannis Avrithis
Alexandros Potamianos
24
1
0
15 Apr 2025
Pay Attention to What and Where? Interpretable Feature Extractor in Vision-based Deep Reinforcement Learning
Pay Attention to What and Where? Interpretable Feature Extractor in Vision-based Deep Reinforcement Learning
Tien Pham
Angelo Cangelosi
31
0
0
14 Apr 2025
Siamese Network with Dual Attention for EEG-Driven Social Learning: Bridging the Human-Robot Gap in Long-Tail Autonomous Driving
Siamese Network with Dual Attention for EEG-Driven Social Learning: Bridging the Human-Robot Gap in Long-Tail Autonomous Driving
Xiaoshan Zhou
Carol Menassa
V. Kamat
29
0
0
14 Apr 2025
Ordinary Least Squares as an Attention Mechanism
Ordinary Least Squares as an Attention Mechanism
Philippe Goulet Coulombe
29
0
0
13 Apr 2025
Bidirectional Linear Recurrent Models for Sequence-Level Multisource Fusion
Bidirectional Linear Recurrent Models for Sequence-Level Multisource Fusion
Qisai Liu
Zhanhong Jiang
Joshua R. Waite
Chao Liu
Aditya Balu
S. Sarkar
AI4TS
29
0
0
11 Apr 2025
Hardware Design and Security Needs Attention: From Survey to Path Forward
Hardware Design and Security Needs Attention: From Survey to Path Forward
Sujan Ghimire
Muhtasim Alam Chowdhury
B. S. Latibari
M. Mamun
Jaeden Wolf Carpenter
Benjamin Tan
Hammond Pearce
Pratik Satam
Soheil Salehi
3DV
48
0
0
11 Apr 2025
SRVP: Strong Recollection Video Prediction Model Using Attention-Based Spatiotemporal Correlation Fusion
SRVP: Strong Recollection Video Prediction Model Using Attention-Based Spatiotemporal Correlation Fusion
Yuseon Kim
Kyongseok Park
36
0
0
10 Apr 2025
SaRoHead: A Dataset for Satire Detection in Romanian Multi-Domain News Headlines
SaRoHead: A Dataset for Satire Detection in Romanian Multi-Domain News Headlines
Mihnea-Alexandru Vîrlan
Razvan-Alexandru Smadu
Dumitru-Clementin Cercel
26
0
0
10 Apr 2025
PROPEL: Supervised and Reinforcement Learning for Large-Scale Supply Chain Planning
PROPEL: Supervised and Reinforcement Learning for Large-Scale Supply Chain Planning
Vahid Eghbal Akhlaghi
Reza Zandehshahvar
Pascal Van Hentenryck
31
0
0
10 Apr 2025
Analogical Learning for Cross-Scenario Generalization: Framework and Application to Intelligent Localization
Analogical Learning for Cross-Scenario Generalization: Framework and Application to Intelligent Localization
Zirui Chen
Zhaoyang Zhang
Ziqing Xing
Ridong Li
Zhaohui Yang
Richeng Jin
Chongwen Huang
YuZhi Yang
Mérouane Debbah
26
1
0
09 Apr 2025
NNN: Next-Generation Neural Networks for Marketing Mix Modeling
NNN: Next-Generation Neural Networks for Marketing Mix Modeling
Thomas Mulc
Mike Anderson
Paul Cubre
Huikun Zhang
Ivy Liu
Saket Kumar
155
0
0
08 Apr 2025
Separator Injection Attack: Uncovering Dialogue Biases in Large Language Models Caused by Role Separators
Separator Injection Attack: Uncovering Dialogue Biases in Large Language Models Caused by Role Separators
Xitao Li
Haoran Wang
Jiang Wu
Ting Liu
AAML
26
0
0
08 Apr 2025
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention
Gleb Rodionov
Roman Garipov
Alina Shutova
George Yakushev
Vage Egiazarian
Anton Sinitsin
Denis Kuznedelev
Dan Alistarh
LRM
32
2
0
08 Apr 2025
High-Resource Translation:Turning Abundance into Accessibility
High-Resource Translation:Turning Abundance into Accessibility
Abhiram Reddy Yanampally
24
0
0
08 Apr 2025
Capturing AI's Attention: Physics of Repetition, Hallucination, Bias and Beyond
Capturing AI's Attention: Physics of Repetition, Hallucination, Bias and Beyond
Frank Yingjie Huo
Neil F. Johnson
62
1
0
06 Apr 2025
Deep learning for music generation. Four approaches and their comparative evaluation
Deep learning for music generation. Four approaches and their comparative evaluation
Razvan Paroiu
Stefan Trausan-Matu
MGen
64
0
0
03 Apr 2025
1234...117118119
Next