v1v2v3v4v5v6v7 (latest)

Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 8,379 papers shown

Title
Training-free and Adaptive Sparse Attention for Efficient Long Video Generation Yifei Xia Suhan Ling Fangcheng Fu Yijiao Wang Huixia Li Xuefeng Xiao Tengjiao Wang VGen 147 11 0 28 Feb 2025
Learning to Substitute Components for Compositional Generalization Zechao Li Gangwei Jiang Chenwang Wu Ying Wei Defu Lian Enhong Chen 114 0 0 28 Feb 2025
Attend or Perish: Benchmarking Attention in Algorithmic Reasoning Michal Spiegel Michal Štefánik Marek Kadlcík Josef Kuchař 110 0 0 28 Feb 2025
Representing Signs as Signs: One-Shot ISLR to Facilitate Functional Sign Language Technologies Toon Vandendriessche Mathieu De Coster Annelies Lejon J. Dambre SLR 133 0 0 27 Feb 2025
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation Shaharukh Khan Ayush Tarun Ali Faraz Palash Kamble Vivek Dahiya Praveen Kumar Pokala Ashish Kulkarni Chandra Khatri Abhinav Ravi Shubham Agarwal 439 1 0 27 Feb 2025
Revisiting Kernel Attention with Correlated Gaussian Process Representation Long Minh Bui Tho Tran Huu Duy-Tung Dinh T. Nguyen Trong Nghia Hoang 127 2 0 27 Feb 2025
A HEART for the environment: Transformer-Based Spatiotemporal Modeling for Air Quality Prediction Norbert Bodendorfer 137 1 0 26 Feb 2025
Multiview graph dual-attention deep learning and contrastive learning for multi-criteria recommender systems Saman Forouzandeh P. Krivitsky Rohitash Chandra 89 0 0 26 Feb 2025
Integrating Biological and Machine Intelligence: Attention Mechanisms in Brain-Computer Interfaces Jing Wang Weishan Ye Jialin He Li Zhang G. Huang Zhuliang Yu Zhen Liang 108 0 0 26 Feb 2025
Application of Attention Mechanism with Bidirectional Long Short-Term Memory (BiLSTM) and CNN for Human Conflict Detection using Computer Vision Erick da Silva Farias Eduardo Palhares Junior 82 0 0 25 Feb 2025
Self-Adjust Softmax Chuanyang Zheng Yihang Gao Guoxuan Chen Han Shi Jing Xiong Xiaozhe Ren Chao Huang Xin Jiang Zhiyu Li Yu Li 81 1 0 25 Feb 2025
Recurrent Neural Networks for Dynamic VWAP Execution: Adaptive Trading Strategies with Temporal Kolmogorov-Arnold Networks Remi Genet 148 1 0 25 Feb 2025
TabulaTime: A Novel Multimodal Deep Learning Framework for Advancing Acute Coronary Syndrome Prediction through Environmental and Clinical Data Integration Xin Zhang Liangxiu Han Stephen White Saad Hassan Philip A Kalra James Ritchie Carl Diver Jennie Shorley 112 1 0 24 Feb 2025
GraphFM: Graph Factorization Machines for Feature Interaction Modeling Shu Wu Zekun Li Yunyue Su Zeyu Cui Xiaoyu Zhang Liang Wang 285 23 0 24 Feb 2025
Neural Attention: A Novel Mechanism for Enhanced Expressive Power in Transformer Models Andrew DiGiugno Ausif Mahmood 108 0 0 24 Feb 2025
AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms Feiyang Chen Yu Cheng Lei Wang Yuqing Xia Ziming Miao ... Fan Yang Jinbao Xue Zhi Yang M. Yang H. Chen 127 1 0 24 Feb 2025
Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences Yangshijie Zhang AAML 86 0 0 24 Feb 2025
SR-LLM: Rethinking the Structured Representation in Large Language Model Jiahuan Zhang Tianheng Wang Hanqing Wu Ziyi Huang Yulong Wu Dongbai Chen Linfeng Song Yue Zhang Guozheng Rao Kaicheng Yu 85 1 0 21 Feb 2025
Data Attribution for Text-to-Image Models by Unlearning Synthesized Images Sheng-Yu Wang Aaron Hertzmann Alexei A. Efros Jun-Yan Zhu Richard Zhang TDI 209 3 0 21 Feb 2025
Connecting the geometry and dynamics of many-body complex systems with message passing neural operators N. Gabriel N. Johnson George Em Karniadakis AI4CE 115 0 0 21 Feb 2025
A Survey of Model Architectures in Information Retrieval Zhichao Xu Fengran Mo Zhiqi Huang Crystina Zhang Puxuan Yu Bei Wang Jimmy J. Lin Vivek Srikumar KELM 3DV 182 2 0 21 Feb 2025
Quantum Recurrent Neural Networks with Encoder-Decoder for Time-Dependent Partial Differential Equations Yuan Chen Abdul Khaliq Khaled M. Furati AI4CE 205 0 0 20 Feb 2025
From Features to Graphs: Exploring Graph Structures and Pairwise Interactions via GNNs Phaphontee Yamchote Saw Nay Htet Win Chainarong Amornbunchornvej Thanapon Noraset FAtt 140 0 0 19 Feb 2025
Generalized Attention Flow: Feature Attribution for Transformer Models via Maximum Flow Behrooz Azarkhalili Maxwell Libbrecht 79 0 0 14 Feb 2025
Theoretical Benefit and Limitation of Diffusion Language Model Guhao Feng Yihan Geng Jian Guan Wei Wu Liwei Wang Di He DiffM 152 1 0 13 Feb 2025
A Deep Inverse-Mapping Model for a Flapping Robotic Wing Hadar Sharvit Raz Karl Tsevi Beatus 102 0 0 13 Feb 2025
Handwritten Text Recognition: A Survey Carlos Garrido-Munoz Antonio Ríos-Vila Jorge Calvo-Zaragoza 137 0 0 12 Feb 2025
Enhanced Load Forecasting with GAT-LSTM: Leveraging Grid and Temporal Features Ugochukwu Orji Çiçek Güven Dan Stowell AI4TS 64 0 0 12 Feb 2025
Beyond Literal Token Overlap: Token Alignability for Multilinguality Katharina Hämmerl Tomasz Limisiewicz Jindrich Libovický Alexander Fraser 73 0 0 10 Feb 2025
A Multimodal PDE Foundation Model for Prediction and Scientific Text Descriptions Elisa Negrini Yuxuan Liu Liu Yang Stanley Osher Hayden Schaeffer AI4CE 148 0 0 09 Feb 2025
Invizo: Arabic Handwritten Document Optical Character Recognition Solution Alhossien Waly Bassant Tarek Ali Feteha Rewan Yehia Gasser Amr Walid Gomaa Ahmed M. Fares 146 0 0 07 Feb 2025
Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers Adam Stooke Rohit Prabhavalkar K. Sim P. M. Mengibar 187 0 0 06 Feb 2025
Distribution Transformers: Fast Approximate Bayesian Inference With On-The-Fly Prior Adaptation George Whittle Juliusz Ziomek Jacob Rawling Michael A. Osborne 210 4 0 04 Feb 2025
Fine-tuning Language Models for Recipe Generation: A Comparative Analysis and Benchmark Study Anneketh Vij Changhao Liu Rahul Anil Nair Theo Ho Edward Shi Ayan Bhowmick 134 1 0 04 Feb 2025
A comparison of translation performance between DeepL and Supertext Alex Flückiger Chantal Amrhein Tim Graf Frédéric Odermatt Martin Pömsl Philippe Schläpfer Florian Schottmann Samuel Läubli ELM 128 0 0 04 Feb 2025
Efficient Language Modeling for Low-Resource Settings with Hybrid RNN-Transformer Architectures Gabriel Lindenmaier Sean Papay Sebastian Padó 141 0 0 02 Feb 2025
Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities Rebecca Mobbs Dimitrios Makris Vasileios Argyriou 67 0 0 02 Feb 2025
PolarQuant: Leveraging Polar Transformation for Efficient Key Cache Quantization and Decoding Acceleration Songhao Wu Ang Lv Xiao Feng Yanzhe Zhang Xun Zhang Guojun Yin Wei Lin Rui Yan MQ 91 1 0 01 Feb 2025
Abstractive Text Summarization for Bangla Language Using NLP and Machine Learning Approaches Asif Ahammad Miazee Tonmoy Roy Md Robiul Islam Yeamin Safat CVBM 41 0 0 28 Jan 2025
Efficient and Interpretable Neural Networks Using Complex Lehmer Transform M. Ataei Xiaogang Wang 92 0 0 28 Jan 2025
ZETA: Leveraging Z-order Curves for Efficient Top-k Attention Qiuhao Zeng Jerry Huang Peng Lu Gezheng Xu Boxing Chen Charles Ling Boyu Wang 195 3 0 24 Jan 2025
A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference Duc Hau Nguyen Duc Hau Nguyen Pascale Sébillot 128 5 0 23 Jan 2025
Infinite Time Turing Machines and their Applications Rukmal Weerawarana Maxwell Braun AI4CE 15 0 0 22 Jan 2025
Extend Adversarial Policy Against Neural Machine Translation via Unknown Token Wei Zou Shujian Huang Jiajun Chen AAML 115 0 0 21 Jan 2025
LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection Pengcheng Zhao Zhixian He Fuwei Zhang Shujin Lin Fan Zhou 136 2 0 18 Jan 2025
The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering Anupam Pandey Deepjyoti Bodo Arpan Phukan Asif Ekbal 150 0 0 13 Jan 2025
TFLAG:Towards Practical APT Detection via Deviation-Aware Learning on Temporal Provenance Graph Wenhan Jiang Tingting Chai Hongri Liu Kai Wang Hongke Zhang 83 0 0 13 Jan 2025
Iconicity in Large Language Models Anna Marklová Jiří Milička Leonid Ryvkin Ľudmila Lacková Bennet Libuše Kormaníková 89 0 0 10 Jan 2025
On Creating A Brain-To-Text Decoder Zenon Lamprou Yashar Moshfeghi 80 0 0 10 Jan 2025
Clinical Insights: A Comprehensive Review of Language Models in Medicine Nikita Neveditsin Pawan Lingras V. Mago LM&MA 117 5 0 08 Jan 2025