ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1409.0473
  4. Cited By
Neural Machine Translation by Jointly Learning to Align and Translate

Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
    AIMat
ArXivPDFHTML

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 6,757 papers shown
Title
Nebula: Self-Attention for Dynamic Malware Analysis
Nebula: Self-Attention for Dynamic Malware Analysis
Dmitrijs Trizna
Christian Scano
Battista Biggio
Fabio Roli
29
13
0
19 Sep 2023
Weakly Supervised Reasoning by Neuro-Symbolic Approaches
Weakly Supervised Reasoning by Neuro-Symbolic Approaches
Xianggen Liu
Zhengdong Lu
Lili Mou
LRM
NAI
68
4
0
19 Sep 2023
QXAI: Explainable AI Framework for Quantitative Analysis in Patient
  Monitoring Systems
QXAI: Explainable AI Framework for Quantitative Analysis in Patient Monitoring Systems
T. Shaik
Xiaohui Tao
Haoran Xie
Lin Li
Juan D. Velásquez
Niall Higgins
65
2
0
19 Sep 2023
OptiRoute: A Heuristic-assisted Deep Reinforcement Learning Framework
  for UAV-UGV Collaborative Route Planning
OptiRoute: A Heuristic-assisted Deep Reinforcement Learning Framework for UAV-UGV Collaborative Route Planning
Md Safwan Mondal
S. Ramasamy
Pranav A. Bhounsule
27
0
0
18 Sep 2023
FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework
FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework
Jianzong Wang
Xulong Zhang
Aolan Sun
Ning Cheng
Jing Xiao
49
1
0
16 Sep 2023
Replacing softmax with ReLU in Vision Transformers
Replacing softmax with ReLU in Vision Transformers
Mitchell Wortsman
Jaehoon Lee
Justin Gilmer
Simon Kornblith
ViT
43
33
0
15 Sep 2023
Chunked Attention-based Encoder-Decoder Model for Streaming Speech
  Recognition
Chunked Attention-based Encoder-Decoder Model for Streaming Speech Recognition
Mohammad Zeineldeen
Albert Zeyer
Ralf Schluter
Hermann Ney
AuLLM
46
4
0
15 Sep 2023
A Fast Optimization View: Reformulating Single Layer Attention in LLM
  Based on Tensor and SVM Trick, and Solving It in Matrix Multiplication Time
A Fast Optimization View: Reformulating Single Layer Attention in LLM Based on Tensor and SVM Trick, and Solving It in Matrix Multiplication Time
Yeqi Gao
Zhao Song
Weixin Wang
Junze Yin
37
26
0
14 Sep 2023
Deep Attentive Time Warping
Deep Attentive Time Warping
Shinnosuke Matsuo
Xiaomeng Wu
Gantugs Atarsaikhan
Akisato Kimura
K. Kashino
Brian Kenji Iwana
Seiichi Uchida
AI4TS
59
3
0
13 Sep 2023
The Relational Bottleneck as an Inductive Bias for Efficient Abstraction
The Relational Bottleneck as an Inductive Bias for Efficient Abstraction
Taylor Webb
Steven M. Frankland
Awni Altabaa
Simon N. Segert
Kamesh Krishnamurthy
...
Tyler Giallanza
Zack Dulberg
Randall O'Reilly
John Lafferty
Jonathan D. Cohen
52
27
0
12 Sep 2023
Robust-MBDL: A Robust Multi-branch Deep Learning Based Model for
  Remaining Useful Life Prediction and Operational Condition Identification of
  Rotating Machines
Robust-MBDL: A Robust Multi-branch Deep Learning Based Model for Remaining Useful Life Prediction and Operational Condition Identification of Rotating Machines
Khoa Tran
H. Vu
L. D. Pham
N. Boudaoud
18
0
0
12 Sep 2023
Uncovering mesa-optimization algorithms in Transformers
Uncovering mesa-optimization algorithms in Transformers
J. Oswald
Eyvind Niklasson
Maximilian Schlegel
Seijin Kobayashi
Nicolas Zucchet
...
Mark Sandler
Blaise Agüera y Arcas
Max Vladymyrov
Razvan Pascanu
João Sacramento
37
57
0
11 Sep 2023
Long-Range Transformer Architectures for Document Understanding
Long-Range Transformer Architectures for Document Understanding
Thibault Douzon
S. Duffner
Christophe Garcia
Jérémy Espinas
VLM
42
2
0
11 Sep 2023
LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for
  Self-supervised Representations of French Speech
LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech
Titouan Parcollet
H. Nguyen
Solène Evain
Marcely Zanon Boito
Adrien Pupier
...
François Portet
Solange Rossato
Fabien Ringeval
D. Schwab
Laurent Besacier
47
15
0
11 Sep 2023
Detecting Natural Language Biases with Prompt-based Learning
Detecting Natural Language Biases with Prompt-based Learning
Md Abdul Aowal
Maliha T Islam
P. Mammen
Sandesh Shetty
32
1
0
11 Sep 2023
The Effect of Alignment Objectives on Code-Switching Translation
The Effect of Alignment Objectives on Code-Switching Translation
Mohamed Anwar
29
1
0
10 Sep 2023
Towards Better Multi-modal Keyphrase Generation via Visual Entity
  Enhancement and Multi-granularity Image Noise Filtering
Towards Better Multi-modal Keyphrase Generation via Visual Entity Enhancement and Multi-granularity Image Noise Filtering
Yifan Dong
Suhang Wu
Fandong Meng
Jie Zhou
Xiaoli Wang
Jianxin Lin
Jinsong Su
48
3
0
09 Sep 2023
RST-style Discourse Parsing Guided by Document-level Content Structures
RST-style Discourse Parsing Guided by Document-level Content Structures
Ming Li
Ruihong Huang
24
1
0
08 Sep 2023
Meta predictive learning model of languages in neural circuits
Meta predictive learning model of languages in neural circuits
Chan Li
Junbin Qiu
Haiping Huang
MILM
46
1
0
08 Sep 2023
A deep Natural Language Inference predictor without language-specific
  training data
A deep Natural Language Inference predictor without language-specific training data
Lorenzo Corradi
Alessandro Manenti
Francesca Del Bonifro
Francesco Setti
D. Sorbo
21
0
0
06 Sep 2023
Rubric-Specific Approach to Automated Essay Scoring with Augmentation
  Training
Rubric-Specific Approach to Automated Essay Scoring with Augmentation Training
Brian Cho
Youngbin Jang
Jaewoong Yoon
38
1
0
06 Sep 2023
TFBEST: Dual-Aspect Transformer with Learnable Positional Encoding for
  Failure Prediction
TFBEST: Dual-Aspect Transformer with Learnable Positional Encoding for Failure Prediction
Rohan Mohapatra
Saptarshi Sengupta
30
3
0
06 Sep 2023
Epi-Curriculum: Episodic Curriculum Learning for Low-Resource Domain
  Adaptation in Neural Machine Translation
Epi-Curriculum: Episodic Curriculum Learning for Low-Resource Domain Adaptation in Neural Machine Translation
Keyu Chen
Zhuang Di
Mingchen Li
J. M. Chang
61
3
0
06 Sep 2023
MA-VAE: Multi-head Attention-based Variational Autoencoder Approach for
  Anomaly Detection in Multivariate Time-series Applied to Automotive Endurance
  Powertrain Testing
MA-VAE: Multi-head Attention-based Variational Autoencoder Approach for Anomaly Detection in Multivariate Time-series Applied to Automotive Endurance Powertrain Testing
Lucas Correia
Jan-Christoph Goos
Philipp Klein
Thomas Bäck
Anna V. Kononova
9
1
0
05 Sep 2023
Advancing Text-to-GLOSS Neural Translation Using a Novel Hyper-parameter
  Optimization Technique
Advancing Text-to-GLOSS Neural Translation Using a Novel Hyper-parameter Optimization Technique
Younes Ouargani
N. E. Khattabi
24
1
0
05 Sep 2023
Generalized Simplicial Attention Neural Networks
Generalized Simplicial Attention Neural Networks
Claudio Battiloro
Lucia Testa
Lorenzo Giusti
S. Sardellitti
P. Lorenzo
Sergio Barbarossa
43
20
0
05 Sep 2023
A survey on efficient vision transformers: algorithms, techniques, and
  performance benchmarking
A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking
Lorenzo Papa
Paolo Russo
Irene Amerini
Luping Zhou
46
43
0
05 Sep 2023
Gated recurrent neural networks discover attention
Gated recurrent neural networks discover attention
Nicolas Zucchet
Seijin Kobayashi
Yassir Akram
J. Oswald
Maxime Larcher
Angelika Steger
João Sacramento
36
8
0
04 Sep 2023
An Empirical Analysis for Zero-Shot Multi-Label Classification on
  COVID-19 CT Scans and Uncurated Reports
An Empirical Analysis for Zero-Shot Multi-Label Classification on COVID-19 CT Scans and Uncurated Reports
Ethan Dack
Lorenzo Brigato
Matthew McMurray
Matthias Fontanellaz
Thomas Frauenfelder
...
Thomas Geiser
M. Funke-Chambour
Andreas Christe
L. Ebner
Stavroula Mougiakakou
51
2
0
04 Sep 2023
LoRA-like Calibration for Multimodal Deception Detection using ATSFace
  Data
LoRA-like Calibration for Multimodal Deception Detection using ATSFace Data
Shun-Wen Hsiao
Chengbin Sun
CVBM
23
1
0
04 Sep 2023
A Visual Interpretation-Based Self-Improved Classification System Using
  Virtual Adversarial Training
A Visual Interpretation-Based Self-Improved Classification System Using Virtual Adversarial Training
Shuai Jiang
Sayaka Kamei
Chen Li
Shengzhe Hou
Yasuhiko Morimoto
SSL
18
1
0
03 Sep 2023
Multilingual Text Representation
Multilingual Text Representation
Fahim Faisal
32
0
0
02 Sep 2023
Evaluating Transformer's Ability to Learn Mildly Context-Sensitive
  Languages
Evaluating Transformer's Ability to Learn Mildly Context-Sensitive Languages
Shunjie Wang
Shane Steinert-Threlkeld
38
4
0
02 Sep 2023
Learning multi-modal generative models with permutation-invariant
  encoders and tighter variational bounds
Learning multi-modal generative models with permutation-invariant encoders and tighter variational bounds
Marcel Hirt
Domenico Campolo
Victoria Leong
Juan-Pablo Ortega
DRL
23
0
0
01 Sep 2023
Distraction-free Embeddings for Robust VQA
Distraction-free Embeddings for Robust VQA
Atharvan Dogra
Deeksha Varshney
Ashwin Kalyan
Ameet Deshpande
Neeraj Kumar
43
0
0
31 Aug 2023
Unsupervised Text Style Transfer with Deep Generative Models
Unsupervised Text Style Transfer with Deep Generative Models
Zhongtao Jiang
Yuanzhe Zhang
Yiming Ju
Kang Liu
51
0
0
31 Aug 2023
Enhancing Robot Learning through Learned Human-Attention Feature Maps
Enhancing Robot Learning through Learned Human-Attention Feature Maps
D. Scheuchenstuhl
Stefan Ulmer
Felix Resch
Luigi Berducci
Radu Grosu
42
0
0
29 Aug 2023
A Classification-Guided Approach for Adversarial Attacks against Neural
  Machine Translation
A Classification-Guided Approach for Adversarial Attacks against Neural Machine Translation
Sahar Sadrizadeh
Ljiljana Dolamic
P. Frossard
AAML
SILM
49
2
0
29 Aug 2023
CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for
  Multimodal Machine Translation
CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation
Devaansh Gupta
Siddhant Kharbanda
Jiawei Zhou
Wanhua Li
Hanspeter Pfister
D. Wei
VLM
44
10
0
29 Aug 2023
FIRE: Food Image to REcipe generation
FIRE: Food Image to REcipe generation
P. Chhikara
Dhiraj Chaurasia
Yifan Jiang
Omkar Masur
Filip Ilievski
53
23
0
28 Aug 2023
Construction Grammar and Language Models
Construction Grammar and Language Models
Harish Tayyar Madabushi
Laurence Romain
P. Milin
Dagmar Divjak
58
5
0
25 Aug 2023
Dense Text-to-Image Generation with Attention Modulation
Dense Text-to-Image Generation with Attention Modulation
Yunji Kim
Jiyoung Lee
Jin-Hwa Kim
Jung-Woo Ha
Jun-Yan Zhu
DiffM
66
135
0
24 Aug 2023
Evaluating the Vulnerabilities in ML systems in terms of adversarial
  attacks
Evaluating the Vulnerabilities in ML systems in terms of adversarial attacks
John Harshith
Mantej Singh Gill
Madhan Jothimani
AAML
30
1
0
24 Aug 2023
Easy attention: A simple attention mechanism for temporal predictions
  with transformers
Easy attention: A simple attention mechanism for temporal predictions with transformers
Marcial Sanchis-Agudo
Yuning Wang
Roger Arnau
L. Guastoni
Jasmin Lim
Karthik Duraisamy
Ricardo Vinuesa
AI4TS
32
0
0
24 Aug 2023
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
Changxu Cheng
Peng Wang
Cheng Da
Qi Zheng
Cong Yao
50
15
0
24 Aug 2023
Improving Translation Faithfulness of Large Language Models via
  Augmenting Instructions
Improving Translation Faithfulness of Large Language Models via Augmenting Instructions
Yijie Chen
Yanjun Liu
Fandong Meng
Jinan Xu
Jinan Xu
Jie Zhou
55
26
0
24 Aug 2023
Sign Language Translation with Iterative Prototype
Sign Language Translation with Iterative Prototype
Huijie Yao
Wen-gang Zhou
Hao Feng
Hezhen Hu
Hao Zhou
Houqiang Li
SLR
18
16
0
23 Aug 2023
Instruction Position Matters in Sequence Generation with Large Language
  Models
Instruction Position Matters in Sequence Generation with Large Language Models
Yanjun Liu
Xianfeng Zeng
Fandong Meng
Jie Zhou
LRM
67
8
0
23 Aug 2023
Coarse-to-Fine Multi-Scene Pose Regression with Transformers
Coarse-to-Fine Multi-Scene Pose Regression with Transformers
Yoli Shavit
Ron Ferens
Y. Keller
ViT
44
13
0
22 Aug 2023
An Effective Method using Phrase Mechanism in Neural Machine Translation
An Effective Method using Phrase Mechanism in Neural Machine Translation
Phuong Minh Nguyen
Le-Minh Nguyen
19
0
0
21 Aug 2023
Previous
123...192021...134135136
Next