1409.0473
Neural Machine Translation by Jointly Learning to Align and Translate
1 September 2014
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"
50 / 8,379 papers shown
Title
Differentiable architecture search with multi-dimensional attention for spiking neural networks
Yilei Man
Linhai Xie
Shushan Qiao
Yumei Zhou
Delong Shang
67
1
0
01 Nov 2024
Deep Convolutional Neural Networks on Multiclass Classification of Three-Dimensional Brain Images for Parkinson's Disease Stage Prediction
Guan-Hua Huang
Wan-Chen Lai
Tai-Been Chen
Chien-Chin Hsu
Huei-Yung Chen
Yi-Chen Wu
Li-Ren Yeh
MedIm
74
2
0
31 Oct 2024
A Monte Carlo Framework for Calibrated Uncertainty Estimation in Sequence Prediction
Qidong Yang
Weicheng Zhu
Joseph Keslin
L. Zanna
Tim G. J. Rudner
Carlos Fernandez-Granda
BDL
UQCV
AI4TS
83
0
0
30 Oct 2024
Emergence of meta-stable clustering in mean-field transformer models
Giuseppe Bruno
Federico Pasqualotto
Andrea Agazzi
131
9
0
30 Oct 2024
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks
Michael T. Matthews
Michael Beukman
Chris Xiaoxuan Lu
Jakob Foerster
OffRL
AI4CE
117
8
0
30 Oct 2024
A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents
Ankan Mullick
Sombit Bose
Abhilash Nandy
G. Chaitanya
Pawan Goyal
59
0
0
29 Oct 2024
Scalable Message Passing Neural Networks: No Need for Attention in Large Graph Representation Learning
Haitz Sáez de Ocáriz Borde
Artem Lukoianov
Anastasis Kratsios
Michael M. Bronstein
Xiaowen Dong
GNN
73
2
0
29 Oct 2024
Efficient Machine Translation with a BiLSTM-Attention Approach
Yuxu Wu
Yiren Xing
49
0
0
29 Oct 2024
Atrial Fibrillation Detection System via Acoustic Sensing for Mobile Phones
Xuanyu Liu
Jiao Li
Haoxian Liu
Zongqi Yang
Yi Huang
Jin Zhang
21
0
0
28 Oct 2024
A Static and Dynamic Attention Framework for Multi Turn Dialogue Generation
Weinan Zhang
Yiming Cui
Kaiyan Zhang
Yifa Wang
Qingfu Zhu
Lingzhi Li
Ting Liu
114
8
0
28 Oct 2024
Visualizing attention zones in machine reading comprehension models
Yiming Cui
Weinan Zhang
Ting Liu
30
0
0
28 Oct 2024
Referring Human Pose and Mask Estimation in the Wild
Bo Miao
Mingtao Feng
Zijie Wu
Mohammed Bennamoun
Yongsheng Gao
Ajmal Mian
79
0
0
27 Oct 2024
Think Carefully and Check Again! Meta-Generation Unlocking LLMs for Low-Resource Cross-Lingual Summarization
Zhecheng Li
Yijiao Wang
Bryan Hooi
Yujun Cai
Naifan Cheung
Nanyun Peng
Kai-Wei Chang
198
1
0
26 Oct 2024
Provable optimal transport with transformers: The essence of depth and prompt engineering
Hadi Daneshmand
OT
80
0
0
25 Oct 2024
Explainable News Summarization -- Analysis and mitigation of Disagreement Problem
Seema Aswani
Sujala D. Shetty
58
0
0
24 Oct 2024
Monolingual and Multilingual Misinformation Detection for Low-Resource Languages: A Comprehensive Survey
Xinyu Wang
Wenbo Zhang
Sarah Rajtmajer
87
3
0
24 Oct 2024
Improving Handwritten Text Recognition via 3D Attention and Multi-Scale Training
Zi-Rui Wang
66
0
0
24 Oct 2024
Melody Construction for Persian lyrics using LSTM recurrent neural networks
Farshad Jafari
Farzad Didehvar
Amin Gheibi
26
0
0
23 Oct 2024
Dynamic graph neural networks for enhanced volatility prediction in financial markets
Pulikandala Nithish Kumar
Nneka Umeorah
Alex Alochukwu
57
0
0
22 Oct 2024
Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination
Jerry Huang
Prasanna Parthasarathi
Mehdi Rezagholizadeh
Boxing Chen
Sarath Chandar
169
0
0
22 Oct 2024
A Fusion-Driven Approach of Attention-Based CNN-BiLSTM for Protein Family Classification -- ProFamNet
Bahar Ali
Anwar Shah
Malik Niaz
Musadaq Mansoord
Sami Ullah
Muhammad Adnan
3DV
59
0
0
21 Oct 2024
Deep Graph Attention Networks
Jun Kato
Airi Mita
Keita Gobara
Akihiro Inokuchi
GNN
45
0
0
21 Oct 2024
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation
Victor Junqiu Wei
Weicheng Wang
Di Jiang
Conghui Tan
Rongzhong Lian
MoMe
94
0
0
21 Oct 2024
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens
Zhepeng Cen
Yao Liu
Siliang Zeng
Pratik Chaudhar
Huzefa Rangwala
George Karypis
Rasool Fakoor
SyDa
AIFin
133
3
0
18 Oct 2024
Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs
Rui Pu
Chaozhuo Li
Rui Ha
Zejian Chen
Litian Zhang
Ziqiang Liu
Lirong Qiu
Xi Zhang
AAML
59
3
0
18 Oct 2024
ViConsFormer: Constituting Meaningful Phrases of Scene Texts using Transformer-based Method in Vietnamese Text-based Visual Question Answering
Nghia Hieu Nguyen
Tho Thanh Quan
Ngan Luu-Thuy Nguyen
75
0
0
18 Oct 2024
Analyzing Deep Transformer Models for Time Series Forecasting via Manifold Learning
Ilya Kaufman
Omri Azencot
AI4TS
67
3
0
17 Oct 2024
Quantity vs. Quality of Monolingual Source Data in Automatic Text Translation: Can It Be Too Little If It Is Too Good?
Idris Abdulmumin
B. Galadanci
G. Aliyu
Shamsuddeen Hassan Muhammad
73
1
0
17 Oct 2024
Transformer Guided Coevolution: Improved Team Selection in Multiagent Adversarial Team Games
Pranav Rajbhandari
Prithviraj Dasgupta
D. Sofge
112
0
0
17 Oct 2024
Reducing the Transformer Architecture to a Minimum
Bernhard Bermeitinger
T. Hrycej
Massimo Pavone
Julianus Kath
Siegfried Handschuh
26
0
0
17 Oct 2024
DiffImp: Efficient Diffusion Model for Probabilistic Time Series Imputation with Bidirectional Mamba Backbone
Hongfan Gao
Wangmeng Shen
Xiangfei Qiu
Ronghui Xu
Jilin Hu
Bin Yang
132
6
0
17 Oct 2024
Artificial Kuramoto Oscillatory Neurons
Takeru Miyato
Sindy Löwe
Andreas Geiger
Max Welling
AI4CE
204
10
0
17 Oct 2024
Recurrent Neural Goodness-of-Fit Test for Time Series
Aoran Zhang
Wenbin Zhou
Liyan Xie
Shixiang Zhu
93
1
0
17 Oct 2024
Unifying Economic and Language Models for Enhanced Sentiment Analysis of the Oil Market
Himmet Kaplan
R. Mundani
Heiko Rölke
A. Weichselbraun
Martin Tschudy
31
0
0
16 Oct 2024
How much do contextualized representations encode long-range context?
Simeng Sun
Cheng-Ping Hsieh
129
0
0
16 Oct 2024
Network Representation Learning for Biophysical Neural Network Analysis
Youngmok Ha
Yongjoo Kim
Hyun Jae Jang
Seungyeon Lee
Eunji Pak
60
0
0
15 Oct 2024
Survey and Evaluation of Converging Architecture in LLMs based on Footsteps of Operations
Seongho Kim
Jihyun Moon
Juntaek Oh
Insu Choi
Joon-Sung Yang
38
0
0
15 Oct 2024
Quadratic Gating Functions in Mixture of Experts: A Statistical Insight
Pedram Akbarian
Huy Le Nguyen
Xing Han
Nhat Ho
MoE
62
3
0
15 Oct 2024
Geometric Inductive Biases of Deep Networks: The Role of Data and Architecture
Sajad Movahedi
Antonio Orvieto
Seyed-Mohsen Moosavi-Dezfooli
AI4CE
AAML
583
0
0
15 Oct 2024
ChakmaNMT: A Low-resource Machine Translation On Chakma Language
Aunabil Chakma
Aditya Chakma
Soham Khisa
Chumui Tripura
Masum Hasan
Rifat Shahriyar
36
1
0
14 Oct 2024
A Framework to Enable Algorithmic Design Choice Exploration in DNNs
Timothy L. Cronin IV
Sanmukh Kuppannagari
89
0
0
10 Oct 2024
Self-Attention Mechanism in Multimodal Context for Banking Transaction Flow
Cyrile Delestre
Yoann Sola
34
0
0
10 Oct 2024
When and Where Did it Happen? An Encoder-Decoder Model to Identify Scenario Context
Enrique Noriega-Atala
Robert Vacareanu
Salena Torres Ashton
A. Pyarelal
Clayton T. Morrison
Mihai Surdeanu
48
0
0
10 Oct 2024
Tally: Non-Intrusive Performance Isolation for Concurrent Deep Learning Workloads
Wei Zhao
Anand Jayarajan
Gennady Pekhimenko
FedML
74
1
0
09 Oct 2024
Stochastic Sparse Sampling: A Framework for Variable-Length Medical Time Series Classification
Xavier Mootoo
Alan A. Díaz-Montiel
Milad Lankarany
Hina Tabassum
AI4TS
59
0
0
08 Oct 2024
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models
Muhammad Jehanzeb Mirza
Mengjie Zhao
Zhuoyuan Mao
Sivan Doveh
Wei Lin
...
Yuki Mitsufuji
Horst Possegger
Rogerio Feris
Leonid Karlinsky
James Glass
VLM
212
1
0
08 Oct 2024
CTC-GMM: CTC guided modality matching for fast and accurate streaming speech translation
Rui Zhao
Jinyu Li
Ruchao Fan
Matt Post
85
2
0
07 Oct 2024
Mechanistic?
Naomi Saphra
Sarah Wiegreffe
AI4CE
75
13
0
07 Oct 2024
DAPE V2: Process Attention Score as Feature Map for Length Extrapolation
Chuanyang Zheng
Yihang Gao
Han Shi
Jing Xiong
Jiankai Sun
...
Xiaozhe Ren
Michael Ng
Xin Jiang
Zhenguo Li
Yu Li
83
3
0
07 Oct 2024
SegINR: Segment-wise Implicit Neural Representation for Sequence Alignment in Neural Text-to-Speech
Minchan Kim
Myeonghun Jeong
Joun Yeop Lee
Nam Soo Kim
67
0
0
07 Oct 2024