ResearchTrend.AI
Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
    AIMat

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 8,358 papers shown
Better Language Model Inversion by Compactly Representing Next-Token Distributions
Murtaza Nazir
Matthew Finlayson
John X. Morris
Xiang Ren
Swabha Swayamdipta
5
0
0
20 Jun 2025
A Hybrid DeBERTa and Gated Broad Learning System for Cyberbullying Detection in English Text
Devesh Kumar
7
0
0
19 Jun 2025
Detecting Hard-Coded Credentials in Software Repositories via LLMs
Chidera Biringa
Gökhan Kul
17
0
0
16 Jun 2025
AlphaEvolve: A coding agent for scientific and algorithmic discovery
Alexander Novikov
Ngan Vu
Marvin Eisenberger
Emilien Dupont
Po-Sen Huang
...
George Holland
Alex Davies
Sebastian Nowozin
Pushmeet Kohli
Matej Balog
43
17
0
16 Jun 2025
Bridging the Digital Divide: Small Language Models as a Pathway for Physics and Photonics Education in Underdeveloped Regions
Asghar Ghorbani
Hanieh Fattahi
7
0
0
14 Jun 2025
Manager: Aggregating Insights from Unimodal Experts in Two-Tower VLMs and MLLMs
Xiao Xu
L. Qin
Wanxiang Che
Min-Yen Kan
MoEVLM
30
0
0
13 Jun 2025
FAD-Net: Frequency-Domain Attention-Guided Diffusion Network for Coronary Artery Segmentation using Invasive Coronary Angiography
Nan Mu
Ruiqi Song
Xiaoning Li
Zhihui Xu
Jingfeng Jiang
Chen Zhao
MedIm
69
0
0
13 Jun 2025
Interior-Point Vanishing Problem in Semidefinite Relaxations for Neural Network Verification
Ryota Ueda
Takami Sato
Ken Kobayashi
Kazuhide Nakata
AAML
98
0
0
12 Jun 2025
Principled Approaches for Extending Neural Architectures to Function Spaces for Operator Learning
Julius Berner
Miguel Liu-Schiaffini
Jean Kossaifi
Valentin Duruisseaux
Boris Bonev
Kamyar Azizzadenesheli
A. Anandkumar
AI4CE
116
0
0
12 Jun 2025
Enhancing Medical Dialogue Generation through Knowledge Refinement and Dynamic Prompt Adjustment
Hongda Sun
Jiaren Peng
Wenzhong Yang
Liang He
Bo Du
Rui Yan
MedIm
112
0
0
12 Jun 2025
Survival Analysis as Imprecise Classification with Trainable Kernels
A. Konstantinov
Vlada A. Efremenko
Lev V. Utkin
35
0
0
11 Jun 2025
Analyzing Emotions in Bangla Social Media Comments Using Machine Learning and LIME
Bidyarthi Paul
SM Musfiqur Rahman
Dipta Biswas
Md. Ziaul Hasan
Md. Zahid Hossain
34
0
0
11 Jun 2025
Mutual-Supervised Learning for Sequential-to-Parallel Code Translation
Changxin Ke
Rui Zhang
Shuo Wang
Li Ding
Guangli Li
...
Jiaming Guo
Chenxi Wang
Ling Li
Qi Guo
Y. Chen
28
0
0
11 Jun 2025
BF-Max: an Efficient Bit Flipping Decoder with Predictable Decoding Failure Rate
Alessio Baldelli
Marco Baldi
F. Chiaraluce
Paolo Santini
115
0
0
11 Jun 2025
Quantifying Mix Network Privacy Erosion with Generative Models
Vasilios Mavroudis
Tariq Elahi
23
0
0
10 Jun 2025
TSRec: Enhancing Repeat-Aware Recommendation from a Temporal-Sequential Perspective
Shigang Quan
Shui Liu
Zhenzhe Zheng
Fan Wu
18
6
0
10 Jun 2025
TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration
Weiya Li
Junjie Chen
Bei Li
Boyang Liu
Zichen Wen
...
Xiaoqian Liu
Anping Liu
Huajie Liu
Hu Song
Linfeng Zhang
LLMAG
31
0
0
10 Jun 2025
Multilingual Hate Speech Detection in Social Media Using Translation-Based Approaches with Large Language Models
Muhammad Usman
Muhammad Ahmad
Moein Shahiki Tash
Irina Gelbukh
Rolando Quintero Tellez
Grigori Sidorov
13
0
0
09 Jun 2025
Hierarchical Lexical Graph for Enhanced Multi-Hop Retrieval
Abdellah Ghassel
Ian Robinson
Gabriel Tanase
Hal Cooper
Bryan Thompson
Zhen Han
V. Ioannidis
Soji Adeshina
Huzefa Rangwala
RALM
13
0
0
09 Jun 2025
Beyond the Sentence: A Survey on Context-Aware Machine Translation with Large Language Models
Ramakrishna Appicharla
Baban Gain
Santanu Pal
Asif Ekbal
LRM
12
0
0
09 Jun 2025
Controllable Coupled Image Generation via Diffusion Models
Chenfei Yuan
Nanshan Jia
Hangqi Li
Peter W. Glynn
Zeyu Zheng
DiffM
21
0
0
07 Jun 2025
Transformative or Conservative? Conservation laws for ResNets and Transformers
Sibylle Marcotte
Rémi Gribonval
Gabriel Peyré
35
0
0
06 Jun 2025
Power Law Guided Dynamic Sifting for Efficient Attention
Nirav Koley
Prajwal Singhania
A. Bhatele
155
0
0
05 Jun 2025
A Novel Transformer-Based Method for Full Lower-Limb Joint Angles and Moments Prediction in Gait Using sEMG and IMU data
Farshad Haghgoo Daryakenari
Tara Farizeh
103
0
0
05 Jun 2025
Log-Linear Attention
Han Guo
Songlin Yang
Tarushii Goel
Eric P. Xing
Tri Dao
Yoon Kim
Mamba
148
1
0
05 Jun 2025
MesaNet: Sequence Modeling by Locally Optimal Test-Time Training
J. Oswald
Nino Scherrer
Seijin Kobayashi
Luca Versari
Songlin Yang
...
Guillaume Lajoie
Charlotte Frenkel
Razvan Pascanu
Blaise Agüera y Arcas
João Sacramento
94
1
0
05 Jun 2025
Average Calibration Losses for Reliable Uncertainty in Medical Image Segmentation
Theodore Barfoot
Luis C. Garcia-Peraza-Herrera
Samet Akcay
Ben Glocker
Tom Vercauteren
UQCV
127
0
0
04 Jun 2025
MVAN: Multi-View Attention Networks for Fake News Detection on Social Media
Shiwen Ni
Jiawen Li
Hung-Yu Kao
65
61
0
02 Jun 2025
Natural, Artificial, and Human Intelligences
E. Pothos
Dominic Widdows
14
0
0
02 Jun 2025
Blending Complementary Memory Systems in Hybrid Quadratic-Linear Transformers
Kazuki Irie
Morris Yau
Samuel J. Gershman
20
0
0
31 May 2025
Channel-Imposed Fusion: A Simple yet Effective Method for Medical Time Series Classification
Ming Hu
Jianfu Yin
Mingyu Dou
Yuqi Wang
Ruochen Dang
Siyi Liang
Cong Hu
Yao Wang
Bingliang Hu
Quan Wang
AI4TS
21
0
0
31 May 2025
Improving Language and Modality Transfer in Translation by Character-level Modeling
Ioannis Tsiamas
David Dale
Marta R. Costa-jussá
15
1
0
30 May 2025
Explainable Depression Detection using Masked Hard Instance Mining
Patawee Prakrankamanant
Shinji Watanabe
Ekapol Chuangsuwanich
28
0
0
30 May 2025
Two failure modes of deep transformers and how to avoid them: a unified theory of signal propagation at initialisation
Alessio Giorlandino
Sebastian Goldt
16
0
0
30 May 2025
A Survey of Generative Categories and Techniques in Multimodal Large Language Models
Longzhen Han
Awes Mubarak
Almas Baimagambetov
Nikolaos Polatidis
Thar Baker
LRM
27
0
0
29 May 2025
ATLAS: Learning to Optimally Memorize the Context at Test Time
Ali Behrouz
Zeman Li
Praneeth Kacham
Majid Daliri
Yuan Deng
Peilin Zhong
Meisam Razaviyayn
Vahab Mirrokni
78
2
0
29 May 2025
Two-Stage Feature Generation with Transformer and Reinforcement Learning
Wanfu Gao
Zengyao Man
Zebin He
Yuhao Tang
Jun Gao
Kunpeng Liu
11
0
0
28 May 2025
Identifying Super Spreaders in Multilayer Networks
Michał Czuba
Mateusz Stolarski
Adam Piróg
Piotr Bielak
Piotr Bródka
31
0
0
27 May 2025
A Physics-Augmented GraphGPS Framework for the Reconstruction of 3D Riemann Problems from Sparse Data
Rami Cassia
Rich Kerswell
AI4CE
59
0
0
27 May 2025
VoiceStar: Robust Zero-Shot Autoregressive TTS with Duration Control and Extrapolation
Puyuan Peng
Shang-Wen Li
Abdelrahman Mohamed
David Harwath
18
0
0
26 May 2025
Classification of assembly tasks combining multiple primitive actions using Transformers and xLSTMs
Miguel Neves
Pedro Neto
22
0
0
23 May 2025
Low-Resource NMT: A Case Study on the Written and Spoken Languages in Hong Kong
Hei Yi Mak
Tan Lee
29
7
0
23 May 2025
Selection Mechanisms for Sequence Modeling using Linear State Space Models
Umberto Casti
Sandro Zampieri
Fabio Pasqualetti
Mamba
99
0
0
23 May 2025
A Multi-Head Attention Soft Random Forest for Interpretable Patient No-Show Prediction
Ninda Nurseha Amalina
Kwadwo Boateng Ofori-Amanfo
Heungjo An
154
0
0
22 May 2025
Tversky Neural Networks: Psychologically Plausible Deep Learning with Differentiable Tversky Similarity
M. Doumbouya
Dan Jurafsky
Christopher D. Manning
FedML
15
0
0
21 May 2025
A Framework for Non-Linear Attention via Modern Hopfield Networks
Ahmed Farooq
8
0
0
21 May 2025
SUS backprop: linear backpropagation algorithm for long inputs in transformers
Sergey Pankov
Georges Harik
110
0
0
21 May 2025
TransBench: Benchmarking Machine Translation for Industrial-Scale Applications
Haijun Li
Tianqi Shi
Zifu Shang
Yuxuan Han
Xueyu Zhao
...
Longyue Wang
Gongbo Tang
Weihua Luo
Zhao Xu
Kaifu Zhang
ELM
53
0
0
20 May 2025
Combining the Best of Both Worlds: A Method for Hybrid NMT and LLM Translation
Zhanglin Wu
Daimeng Wei
Xiaoyu Chen
Hengchao Shang
Jiaxin Guo
Zongyao Li
Yuanchang Luo
Jinlong Yang
Zhiqiang Rao
Hao Yang
42
0
0
19 May 2025
Lightweight Transformer via Unrolling of Mixed Graph Algorithms for Traffic Forecast
Ji Qi
Tam Thuc Do
Mingxiao Liu
Zhuoshi Pan
Yuzhe Li
Gene Cheung
H. Vicky Zhao
AI4TS
26
0
0
19 May 2025