On Using Very Large Target Vocabulary for Neural Machine Translation

5 December 2014

Papers citing "On Using Very Large Target Vocabulary for Neural Machine Translation"

50 / 384 papers shown

Title
Killing Two Birds with One Stone: Unifying Retrieval and Ranking with a Single Generative Recommendation Model Lefei Zhang Kenan Song Yi Quan Lee Wei Guo Hao Wang Yawen Li Huifeng Guo Yong-Jin Liu Defu Lian Enhong Chen 24 0 0 23 Apr 2025
Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs Anshumann Mohd Abbas Zaidi Akhil Kedia Jinwoo Ahn Taehwak Kwon Kangwook Lee Haejun Lee Joohyung Lee FedML 194 1 0 21 Mar 2025
GiGL: Large-Scale Graph Neural Networks at Snapchat Tong Zhao Yozen Liu Matthew Kolodner Kyle Montemayor Elham Ghazizadeh ... Serim Park Peicheng Yu Jun Yu Shubham Vij Neil Shah GNN 60 0 0 24 Feb 2025
Multi-Head Encoding for Extreme Label Classification Daojun Liang Haixia Zhang Dongfeng Yuan Minggao Zhang 73 0 0 13 Dec 2024
An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition Yi-Cheng Wang Li-Ting Pai Bi-Cheng Yan Hsin-Wei Wang Chi-Han Lin Berlin Chen 30 1 0 10 Sep 2024
DimeRec: A Unified Framework for Enhanced Sequential Recommendation via Generative Diffusion Models Wuchao Li Rui Huang Haijun Zhao Chi Liu Kai Zheng ... Defu Lian Yang Song Wentian Bao Enyun Yu Wenwu Ou DiffM 32 7 0 22 Aug 2024
Multi-word Term Embeddings Improve Lexical Product Retrieval Viktor Shcherbakov Fedor Krasnov 26 0 0 03 Jun 2024
Multi-Tower Multi-Interest Recommendation with User Representation Repel Tianyu Xiong Xiaohan Yu 30 0 0 08 Mar 2024
UGMAE: A Unified Framework for Graph Masked Autoencoders Yijun Tian Chuxu Zhang Ziyi Kou Zheyuan Liu Xiangliang Zhang Nitesh V. Chawla 24 1 0 12 Feb 2024
Expressivity and Approximation Properties of Deep Neural Networks with ReLU $^k$ Activation Juncai He Tong Mao Jinchao Xu 37 3 0 27 Dec 2023
Revisiting Recommendation Loss Functions through Contrastive Learning (Technical Report) Dong Li Ruoming Jin Bin Ren 25 4 0 13 Dec 2023
(Debiased) Contrastive Learning Loss for Recommendation (Technical Report) Ruoming Jin Dong Li 29 0 0 13 Dec 2023
Task-Adaptive Tokenization: Enhancing Long-Form Text Generation Efficacy in Mental Health and Beyond Siyang Liu Naihao Deng Sahand Sabour Yilin Jia Minlie Huang Rada Mihalcea 30 18 0 09 Oct 2023
TinyProp -- Adaptive Sparse Backpropagation for Efficient TinyML On-device Learning Marcus Rüb Daniel Maier Daniel Mueller-Gritschneder Axel Sikora 34 3 0 17 Aug 2023
gSASRec: Reducing Overconfidence in Sequential Recommendation Trained with Negative Sampling Aleksandr V. Petrov Craig Macdonald 29 33 0 14 Aug 2023
SelfSeg: A Self-supervised Sub-word Segmentation Method for Neural Machine Translation Haiyue Song Raj Dabre Chenhui Chu Sadao Kurohashi Eiichiro Sumita 16 3 0 31 Jul 2023
UniMatch: A Unified User-Item Matching Framework for the Multi-purpose Merchant Marketing Qifang Zhao Tianyu Li Meng Du Yu-lin Jiang Qinghui Sun Zhongyao Wang Hong Liu Huan Xu 24 1 0 19 Jul 2023
Tokenization and the Noiseless Channel Vilém Zouhar Clara Meister Juan Luis Gastaldi Li Du Mrinmaya Sachan Ryan Cotterell 30 31 0 29 Jun 2023
Lookaround Optimizer: $k$ steps around, 1 step average Jiangtao Zhang Shunyu Liu Mingli Song Tongtian Zhu Zhenxing Xu Mingli Song MoMe 34 6 0 13 Jun 2023
Neural Machine Translation for the Indigenous Languages of the Americas: An Introduction Manuel Mager Rajat Bhatnagar Graham Neubig Ngoc Thang Vu Katharina Kann 30 10 0 11 Jun 2023
Large-Scale Distributed Learning via Private On-Device Locality-Sensitive Hashing Tahseen Rabbani Marco Bornstein Fu-Hui Huang 11 2 0 05 Jun 2023
Assessing the Importance of Frequency versus Compositionality for Subword-based Tokenization in NMT Benoist Wolleb Romain Silvestri Giorgos Vernikos Ljiljana Dolamic Ljiljana Dolamic Andrei Popescu-Belis 16 4 0 02 Jun 2023
Abstractive Summarization as Augmentation for Document-Level Event Detection Janko Vidaković Filip Karlo Dosilovic Domagoj Pluscec 16 0 0 29 May 2023
Neural Machine Translation for Code Generation K. Dharma Clayton T. Morrison 32 4 0 22 May 2023
Effects of sub-word segmentation on performance of transformer language models Jue Hou Anisia Katinskaia Anh Vu R. Yangarber 13 4 0 09 May 2023
A Cookbook of Self-Supervised Learning Randall Balestriero Mark Ibrahim Vlad Sobal Ari S. Morcos Shashank Shekhar ... Pierre Fernandez Amir Bar Hamed Pirsiavash Yann LeCun Micah Goldblum SyDa FedML SSL 50 274 0 24 Apr 2023
Deep Stable Multi-Interest Learning for Out-of-distribution Sequential Recommendation Qiang Liu Zhaocheng Liu Zhen Zhu Shu Wu Liang Wang OOD OODD 40 3 0 12 Apr 2023
Towards energy-efficient Deep Learning: An overview of energy-efficient approaches along the Deep Learning Lifecycle Vanessa Mehlin Sigurd Schacht Carsten Lanquillon HAI MedIm 33 19 0 05 Feb 2023
BESS: Balanced Entity Sampling and Sharing for Large-Scale Knowledge Graph Completion A. Cattaneo Daniel Justus Harry Mellor Douglas Orr Jérôme Maloberti Ziqiang Liu Thorin Farnsworth Andrew Fitzgibbon Bla.zej Banaszewski Carlo Luschi 16 4 0 22 Nov 2022
Learning to Generate Image Embeddings with User-level Differential Privacy Zheng Xu Maxwell D. Collins Yuxiao Wang Liviu Panait Sewoong Oh S. Augenstein Ting Liu Florian Schroff H. B. McMahan FedML 30 29 0 20 Nov 2022
AutoTemplate: A Simple Recipe for Lexically Constrained Text Generation Hayate Iso 27 7 0 15 Nov 2022
Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding Jiadong Wang Wenkang Huang Qiuhui Shi Hongbin Wang Minghui Qiu Xiang Li Ming Gao KELM VLM 27 17 0 16 Oct 2022
The boundaries of meaning: a case study in neural machine translation Yuri Balashov 16 2 0 02 Oct 2022
Contrastive Corpus Attribution for Explaining Representations Christy Lin Hugh Chen Chanwoo Kim Su-In Lee SSL 19 8 0 30 Sep 2022
A Review of the Convergence of 5G/6G Architecture and Deep Learning O. Odeyomi Olubiyi O. Akintade T. Olowu G. Záruba AILaw 3DV AI4TS 23 1 0 16 Aug 2022
ProjB: An Improved Bilinear Biased ProjE model for Knowledge Graph Completion Mojtaba Moattari S. Vahdati F. Zulkernine 21 0 0 15 Aug 2022
How Effective is Byte Pair Encoding for Out-Of-Vocabulary Words in Neural Machine Translation? Ali Araabi Christof Monz Vlad Niculae 28 10 0 10 Aug 2022
Algorithms to estimate Shapley value feature attributions Hugh Chen Ian Covert Scott M. Lundberg Su-In Lee TDI FAtt 31 214 0 15 Jul 2022
Improving Multi-Interest Network with Stable Learning Zhaocheng Liu Yingtao Luo Di Zeng Qiang Liu Daqing Chang Dongying Kong Zhi Chen HAI 44 1 0 14 Jul 2022
Reduce Indonesian Vocabularies with an Indonesian Sub-word Separator Mukhlis Amien Chong Feng Heyan Huang 11 0 0 01 Jul 2022
MultiBiSage: A Web-Scale Recommendation System Using Multiple Bipartite Graphs at Pinterest Saket Gurukar Nikil Pancha Andrew Zhai Eric Kim Samson Hu Srinivas Parthasarathy Charles R. Rosenberg J. Leskovec 64 14 0 21 May 2022
The Devil is in the Details: On the Pitfalls of Vocabulary Selection in Neural Machine Translation Tobias Domhan Eva Hasler Ke M. Tran Sony Trenous Bill Byrne Felix Hieber 13 5 0 13 May 2022
A Neural Network Architecture for Program Understanding Inspired by Human Behaviors Renyu Zhu Lei Yuan Xiang Li Ming Gao Wenyuan Cai 29 8 0 10 May 2022
A Survey on Neural Abstractive Summarization Methods and Factual Consistency of Summarization Meng Cao 8 6 0 20 Apr 2022
Memory-Efficient Training of RNN-Transducer with Sampled Softmax Jaesong Lee Lukas Lee Shinji Watanabe 25 8 0 31 Mar 2022
Efficient Image Representation Learning with Federated Sampled Softmax Sagar M. Waghmare Qi Huizhong Chen Mikhail Sirotenko Tomer Meron FedML 13 2 0 09 Mar 2022
WSLRec: Weakly Supervised Learning for Neural Sequential Recommendation Models Jingwei Zhuo Binda Liu Xiang Li Ziru Xu Xiaoqiang Zhu 6 0 0 28 Feb 2022
NxtPost: User to Post Recommendations in Facebook Groups Kaushik Rangadurai Yiqun Liu Siddarth Malreddy Xiaoyi Liu P. Maheshwari Vishwanath Sangale Fedor Borisyuk 16 6 0 08 Feb 2022
DKPLM: Decomposable Knowledge-enhanced Pre-trained Language Model for Natural Language Understanding Taolin Zhang Chengyu Wang Nan Hu Minghui Qiu Chengguang Tang Xiaofeng He Jun Huang KELM VLM 19 30 0 02 Dec 2021
Attention based end to end Speech Recognition for Voice Search in Hindi and English Raviraj Joshi Venkateshan Kannan 20 6 0 15 Nov 2021