v1v2v3v4v5v6v7 (latest)

Neural Machine Translation by Jointly Learning to Align and Translate

1 September 2014

Papers citing "Neural Machine Translation by Jointly Learning to Align and Translate"

50 / 8,379 papers shown

Title
Differentiable architecture search with multi-dimensional attention for spiking neural networks Yilei Man Linhai Xie Shushan Qiao Yumei Zhou Delong Shang 67 1 0 01 Nov 2024
Deep Convolutional Neural Networks on Multiclass Classification of Three-Dimensional Brain Images for Parkinson's Disease Stage Prediction Guan-Hua Huang Wan-Chen Lai Tai-Been Chen Chien-Chin Hsu Huei-Yung Chen Yi-Chen Wu Li-Ren Yeh MedIm 74 2 0 31 Oct 2024
A Monte Carlo Framework for Calibrated Uncertainty Estimation in Sequence Prediction Qidong Yang Weicheng Zhu Joseph Keslin L. Zanna Tim G. J. Rudner Carlos Fernandez-Granda BDL UQCV AI4TS 83 0 0 30 Oct 2024
Emergence of meta-stable clustering in mean-field transformer models Giuseppe Bruno Federico Pasqualotto Andrea Agazzi 131 9 0 30 Oct 2024
Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks Michael T. Matthews Michael Beukman Chris Xiaoxuan Lu Jakob Foerster OffRL AI4CE 117 8 0 30 Oct 2024
A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents Ankan Mullick Sombit Bose Abhilash Nandy G. Chaitanya Pawan Goyal 59 0 0 29 Oct 2024
Scalable Message Passing Neural Networks: No Need for Attention in Large Graph Representation Learning Haitz Sáez de Ocáriz Borde Artem Lukoianov Anastasis Kratsios Michael M. Bronstein Xiaowen Dong GNN 73 2 0 29 Oct 2024
Efficient Machine Translation with a BiLSTM-Attention Approach Yuxu Wu Yiren Xing 49 0 0 29 Oct 2024
Atrial Fibrillation Detection System via Acoustic Sensing for Mobile Phones Xuanyu Liu Jiao Li Haoxian Liu Zongqi Yang Yi Huang Jin Zhang 21 0 0 28 Oct 2024
A Static and Dynamic Attention Framework for Multi Turn Dialogue Generation Weinan Zhang Yiming Cui Kaiyan Zhang Yifa Wang Qingfu Zhu Lingzhi Li Ting Liu 114 8 0 28 Oct 2024
Visualizing attention zones in machine reading comprehension models Yiming Cui Weinan Zhang Ting Liu 30 0 0 28 Oct 2024
Referring Human Pose and Mask Estimation in the Wild Bo Miao Mingtao Feng Zijie Wu Mohammed Bennamoun Yongsheng Gao Ajmal Mian 79 0 0 27 Oct 2024
Think Carefully and Check Again! Meta-Generation Unlocking LLMs for Low-Resource Cross-Lingual Summarization Zhecheng Li Yijiao Wang Bryan Hooi Yujun Cai Naifan Cheung Nanyun Peng Kai-Wei Chang 198 1 0 26 Oct 2024
Provable optimal transport with transformers: The essence of depth and prompt engineering Hadi Daneshmand OT 80 0 0 25 Oct 2024
Explainable News Summarization -- Analysis and mitigation of Disagreement Problem Seema Aswani Sujala D. Shetty 58 0 0 24 Oct 2024
Monolingual and Multilingual Misinformation Detection for Low-Resource Languages: A Comprehensive Survey Xinyu Wang Wenbo Zhang Sarah Rajtmajer 87 3 0 24 Oct 2024
Improving Handwritten Text Recognition via 3D Attention and Multi-Scale Training Zi-Rui Wang 66 0 0 24 Oct 2024
Melody Construction for Persian lyrics using LSTM recurrent neural networks Farshad Jafari Farzad Didehvar Amin Gheibi 26 0 0 23 Oct 2024
Dynamic graph neural networks for enhanced volatility prediction in financial markets Pulikandala Nithish Kumar Nneka Umeorah Alex Alochukwu 57 0 0 22 Oct 2024
Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination Jerry Huang Prasanna Parthasarathi Mehdi Rezagholizadeh Boxing Chen Sarath Chandar 169 0 0 22 Oct 2024
A Fusion-Driven Approach of Attention-Based CNN-BiLSTM for Protein Family Classification -- ProFamNet Bahar Ali Anwar Shah Malik Niaz Musadaq Mansoord Sami Ullah Muhammad Adnan 3DV 59 0 0 21 Oct 2024
Deep Graph Attention Networks Jun Kato Airi Mita Keita Gobara Akihiro Inokuchi GNN 45 0 0 21 Oct 2024
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation Victor Junqiu Wei Weicheng Wang Di Jiang Conghui Tan Rongzhong Lian MoMe 94 0 0 21 Oct 2024
Bridging the Training-Inference Gap in LLMs by Leveraging Self-Generated Tokens Zhepeng Cen Yao Liu Siliang Zeng Pratik Chaudhar Huzefa Rangwala George Karypis Rasool Fakoor SyDa AIFin 133 3 0 18 Oct 2024
Feint and Attack: Attention-Based Strategies for Jailbreaking and Protecting LLMs Rui Pu Chaozhuo Li Rui Ha Zejian Chen Litian Zhang Ziqiang Liu Lirong Qiu Xi Zhang AAML 59 3 0 18 Oct 2024
ViConsFormer: Constituting Meaningful Phrases of Scene Texts using Transformer-based Method in Vietnamese Text-based Visual Question Answering Nghia Hieu Nguyen Tho Thanh Quan Ngan Luu-Thuy Nguyen 75 0 0 18 Oct 2024
Analyzing Deep Transformer Models for Time Series Forecasting via Manifold Learning Ilya Kaufman Omri Azencot AI4TS 67 3 0 17 Oct 2024
Quantity vs. Quality of Monolingual Source Data in Automatic Text Translation: Can It Be Too Little If It Is Too Good? Idris Abdulmumin B. Galadanci G. Aliyu Shamsuddeen Hassan Muhammad 73 1 0 17 Oct 2024
Transformer Guided Coevolution: Improved Team Selection in Multiagent Adversarial Team Games Pranav Rajbhandari Prithviraj Dasgupta D. Sofge 112 0 0 17 Oct 2024
Reducing the Transformer Architecture to a Minimum Bernhard Bermeitinger T. Hrycej Massimo Pavone Julianus Kath Siegfried Handschuh 26 0 0 17 Oct 2024
DiffImp: Efficient Diffusion Model for Probabilistic Time Series Imputation with Bidirectional Mamba Backbone Hongfan Gao Wangmeng Shen Xiangfei Qiu Ronghui Xu Jilin Hu Bin Yang 132 6 0 17 Oct 2024
Artificial Kuramoto Oscillatory Neurons Takeru Miyato Sindy Löwe Andreas Geiger Max Welling AI4CE 204 10 0 17 Oct 2024
Recurrent Neural Goodness-of-Fit Test for Time Series Aoran Zhang Wenbin Zhou Liyan Xie Shixiang Zhu 93 1 0 17 Oct 2024
Unifying Economic and Language Models for Enhanced Sentiment Analysis of the Oil Market Himmet Kaplan R. Mundani Heiko Rölke A. Weichselbraun Martin Tschudy 31 0 0 16 Oct 2024
How much do contextualized representations encode long-range context? Simeng Sun Cheng-Ping Hsieh 129 0 0 16 Oct 2024
Network Representation Learning for Biophysical Neural Network Analysis Youngmok Ha Yongjoo Kim Hyun Jae Jang Seungyeon Lee Eunji Pak 60 0 0 15 Oct 2024
Survey and Evaluation of Converging Architecture in LLMs based on Footsteps of Operations Seongho Kim Jihyun Moon Juntaek Oh Insu Choi Joon-Sung Yang 38 0 0 15 Oct 2024
Quadratic Gating Functions in Mixture of Experts: A Statistical Insight Pedram Akbarian Huy Le Nguyen Xing Han Nhat Ho MoE 62 3 0 15 Oct 2024
Geometric Inductive Biases of Deep Networks: The Role of Data and Architecture Sajad Movahedi Antonio Orvieto Seyed-Mohsen Moosavi-Dezfooli AI4CE AAML 583 0 0 15 Oct 2024
ChakmaNMT: A Low-resource Machine Translation On Chakma Language Aunabil Chakma Aditya Chakma Soham Khisa Chumui Tripura Masum Hasan Rifat Shahriyar 36 1 0 14 Oct 2024
A Framework to Enable Algorithmic Design Choice Exploration in DNNs Timothy L. Cronin IV Sanmukh Kuppannagari 89 0 0 10 Oct 2024
Self-Attention Mechanism in Multimodal Context for Banking Transaction Flow Cyrile Delestre Yoann Sola 34 0 0 10 Oct 2024
When and Where Did it Happen? An Encoder-Decoder Model to Identify Scenario Context Enrique Noriega-Atala Robert Vacareanu Salena Torres Ashton A. Pyarelal Clayton T. Morrison Mihai Surdeanu 48 0 0 10 Oct 2024
Tally: Non-Intrusive Performance Isolation for Concurrent Deep Learning Workloads Wei Zhao Anand Jayarajan Gennady Pekhimenko FedML 74 1 0 09 Oct 2024
Stochastic Sparse Sampling: A Framework for Variable-Length Medical Time Series Classification Xavier Mootoo Alan A. Díaz-Montiel Milad Lankarany Hina Tabassum AI4TS 59 0 0 08 Oct 2024
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models Muhammad Jehanzeb Mirza Mengjie Zhao Zhuoyuan Mao Sivan Doveh Wei Lin ... Yuki Mitsufuji Horst Possegger Rogerio Feris Leonid Karlinsky James Glass VLM 212 1 0 08 Oct 2024
CTC-GMM: CTC guided modality matching for fast and accurate streaming speech translation Rui Zhao Jinyu Li Ruchao Fan Matt Post 85 2 0 07 Oct 2024
Mechanistic? Naomi Saphra Sarah Wiegreffe AI4CE 75 13 0 07 Oct 2024
DAPE V2: Process Attention Score as Feature Map for Length Extrapolation Chuanyang Zheng Yihang Gao Han Shi Jing Xiong Jiankai Sun ... Xiaozhe Ren Michael Ng Xin Jiang Zhenguo Li Yu Li 83 3 0 07 Oct 2024
SegINR: Segment-wise Implicit Neural Representation for Sequence Alignment in Neural Text-to-Speech Minchan Kim Myeonghun Jeong Joun Yeop Lee Nam Soo Kim 67 0 0 07 Oct 2024