Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1412.2007
Cited By
On Using Very Large Target Vocabulary for Neural Machine Translation
5 December 2014
Sébastien Jean
Kyunghyun Cho
Roland Memisevic
Yoshua Bengio
Re-assign community
ArXiv
PDF
HTML
Papers citing
"On Using Very Large Target Vocabulary for Neural Machine Translation"
50 / 384 papers shown
Title
Killing Two Birds with One Stone: Unifying Retrieval and Ranking with a Single Generative Recommendation Model
Lefei Zhang
Kenan Song
Yi Quan Lee
Wei Guo
Hao Wang
Yawen Li
Huifeng Guo
Yong-Jin Liu
Defu Lian
Enhong Chen
24
0
0
23 Apr 2025
Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs
Anshumann
Mohd Abbas Zaidi
Akhil Kedia
Jinwoo Ahn
Taehwak Kwon
Kangwook Lee
Haejun Lee
Joohyung Lee
FedML
194
1
0
21 Mar 2025
GiGL: Large-Scale Graph Neural Networks at Snapchat
Tong Zhao
Yozen Liu
Matthew Kolodner
Kyle Montemayor
Elham Ghazizadeh
...
Serim Park
Peicheng Yu
Jun Yu
Shubham Vij
Neil Shah
GNN
60
0
0
24 Feb 2025
Multi-Head Encoding for Extreme Label Classification
Daojun Liang
Haixia Zhang
Dongfeng Yuan
Minggao Zhang
73
0
0
13 Dec 2024
An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition
Yi-Cheng Wang
Li-Ting Pai
Bi-Cheng Yan
Hsin-Wei Wang
Chi-Han Lin
Berlin Chen
30
1
0
10 Sep 2024
DimeRec: A Unified Framework for Enhanced Sequential Recommendation via Generative Diffusion Models
Wuchao Li
Rui Huang
Haijun Zhao
Chi Liu
Kai Zheng
...
Defu Lian
Yang Song
Wentian Bao
Enyun Yu
Wenwu Ou
DiffM
32
7
0
22 Aug 2024
Multi-word Term Embeddings Improve Lexical Product Retrieval
Viktor Shcherbakov
Fedor Krasnov
26
0
0
03 Jun 2024
Multi-Tower Multi-Interest Recommendation with User Representation Repel
Tianyu Xiong
Xiaohan Yu
30
0
0
08 Mar 2024
UGMAE: A Unified Framework for Graph Masked Autoencoders
Yijun Tian
Chuxu Zhang
Ziyi Kou
Zheyuan Liu
Xiangliang Zhang
Nitesh V. Chawla
24
1
0
12 Feb 2024
Expressivity and Approximation Properties of Deep Neural Networks with ReLU
k
^k
k
Activation
Juncai He
Tong Mao
Jinchao Xu
37
3
0
27 Dec 2023
Revisiting Recommendation Loss Functions through Contrastive Learning (Technical Report)
Dong Li
Ruoming Jin
Bin Ren
25
4
0
13 Dec 2023
(Debiased) Contrastive Learning Loss for Recommendation (Technical Report)
Ruoming Jin
Dong Li
29
0
0
13 Dec 2023
Task-Adaptive Tokenization: Enhancing Long-Form Text Generation Efficacy in Mental Health and Beyond
Siyang Liu
Naihao Deng
Sahand Sabour
Yilin Jia
Minlie Huang
Rada Mihalcea
30
18
0
09 Oct 2023
TinyProp -- Adaptive Sparse Backpropagation for Efficient TinyML On-device Learning
Marcus Rüb
Daniel Maier
Daniel Mueller-Gritschneder
Axel Sikora
34
3
0
17 Aug 2023
gSASRec: Reducing Overconfidence in Sequential Recommendation Trained with Negative Sampling
Aleksandr V. Petrov
Craig Macdonald
29
33
0
14 Aug 2023
SelfSeg: A Self-supervised Sub-word Segmentation Method for Neural Machine Translation
Haiyue Song
Raj Dabre
Chenhui Chu
Sadao Kurohashi
Eiichiro Sumita
16
3
0
31 Jul 2023
UniMatch: A Unified User-Item Matching Framework for the Multi-purpose Merchant Marketing
Qifang Zhao
Tianyu Li
Meng Du
Yu-lin Jiang
Qinghui Sun
Zhongyao Wang
Hong Liu
Huan Xu
24
1
0
19 Jul 2023
Tokenization and the Noiseless Channel
Vilém Zouhar
Clara Meister
Juan Luis Gastaldi
Li Du
Mrinmaya Sachan
Ryan Cotterell
30
31
0
29 Jun 2023
Lookaround Optimizer:
k
k
k
steps around, 1 step average
Jiangtao Zhang
Shunyu Liu
Mingli Song
Tongtian Zhu
Zhenxing Xu
Mingli Song
MoMe
34
6
0
13 Jun 2023
Neural Machine Translation for the Indigenous Languages of the Americas: An Introduction
Manuel Mager
Rajat Bhatnagar
Graham Neubig
Ngoc Thang Vu
Katharina Kann
30
10
0
11 Jun 2023
Large-Scale Distributed Learning via Private On-Device Locality-Sensitive Hashing
Tahseen Rabbani
Marco Bornstein
Fu-Hui Huang
11
2
0
05 Jun 2023
Assessing the Importance of Frequency versus Compositionality for Subword-based Tokenization in NMT
Benoist Wolleb
Romain Silvestri
Giorgos Vernikos
Ljiljana Dolamic
Ljiljana Dolamic Andrei Popescu-Belis
16
4
0
02 Jun 2023
Abstractive Summarization as Augmentation for Document-Level Event Detection
Janko Vidaković
Filip Karlo Dosilovic
Domagoj Pluscec
16
0
0
29 May 2023
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
32
4
0
22 May 2023
Effects of sub-word segmentation on performance of transformer language models
Jue Hou
Anisia Katinskaia
Anh Vu
R. Yangarber
13
4
0
09 May 2023
A Cookbook of Self-Supervised Learning
Randall Balestriero
Mark Ibrahim
Vlad Sobal
Ari S. Morcos
Shashank Shekhar
...
Pierre Fernandez
Amir Bar
Hamed Pirsiavash
Yann LeCun
Micah Goldblum
SyDa
FedML
SSL
50
274
0
24 Apr 2023
Deep Stable Multi-Interest Learning for Out-of-distribution Sequential Recommendation
Qiang Liu
Zhaocheng Liu
Zhen Zhu
Shu Wu
Liang Wang
OOD
OODD
40
3
0
12 Apr 2023
Towards energy-efficient Deep Learning: An overview of energy-efficient approaches along the Deep Learning Lifecycle
Vanessa Mehlin
Sigurd Schacht
Carsten Lanquillon
HAI
MedIm
33
19
0
05 Feb 2023
BESS: Balanced Entity Sampling and Sharing for Large-Scale Knowledge Graph Completion
A. Cattaneo
Daniel Justus
Harry Mellor
Douglas Orr
Jérôme Maloberti
Ziqiang Liu
Thorin Farnsworth
Andrew Fitzgibbon
Bla.zej Banaszewski
Carlo Luschi
16
4
0
22 Nov 2022
Learning to Generate Image Embeddings with User-level Differential Privacy
Zheng Xu
Maxwell D. Collins
Yuxiao Wang
Liviu Panait
Sewoong Oh
S. Augenstein
Ting Liu
Florian Schroff
H. B. McMahan
FedML
30
29
0
20 Nov 2022
AutoTemplate: A Simple Recipe for Lexically Constrained Text Generation
Hayate Iso
27
7
0
15 Nov 2022
Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding
Jiadong Wang
Wenkang Huang
Qiuhui Shi
Hongbin Wang
Minghui Qiu
Xiang Li
Ming Gao
KELM
VLM
27
17
0
16 Oct 2022
The boundaries of meaning: a case study in neural machine translation
Yuri Balashov
16
2
0
02 Oct 2022
Contrastive Corpus Attribution for Explaining Representations
Christy Lin
Hugh Chen
Chanwoo Kim
Su-In Lee
SSL
19
8
0
30 Sep 2022
A Review of the Convergence of 5G/6G Architecture and Deep Learning
O. Odeyomi
Olubiyi O. Akintade
T. Olowu
G. Záruba
AILaw
3DV
AI4TS
23
1
0
16 Aug 2022
ProjB: An Improved Bilinear Biased ProjE model for Knowledge Graph Completion
Mojtaba Moattari
S. Vahdati
F. Zulkernine
21
0
0
15 Aug 2022
How Effective is Byte Pair Encoding for Out-Of-Vocabulary Words in Neural Machine Translation?
Ali Araabi
Christof Monz
Vlad Niculae
28
10
0
10 Aug 2022
Algorithms to estimate Shapley value feature attributions
Hugh Chen
Ian Covert
Scott M. Lundberg
Su-In Lee
TDI
FAtt
31
214
0
15 Jul 2022
Improving Multi-Interest Network with Stable Learning
Zhaocheng Liu
Yingtao Luo
Di Zeng
Qiang Liu
Daqing Chang
Dongying Kong
Zhi Chen
HAI
44
1
0
14 Jul 2022
Reduce Indonesian Vocabularies with an Indonesian Sub-word Separator
Mukhlis Amien
Chong Feng
Heyan Huang
11
0
0
01 Jul 2022
MultiBiSage: A Web-Scale Recommendation System Using Multiple Bipartite Graphs at Pinterest
Saket Gurukar
Nikil Pancha
Andrew Zhai
Eric Kim
Samson Hu
Srinivas Parthasarathy
Charles R. Rosenberg
J. Leskovec
64
14
0
21 May 2022
The Devil is in the Details: On the Pitfalls of Vocabulary Selection in Neural Machine Translation
Tobias Domhan
Eva Hasler
Ke M. Tran
Sony Trenous
Bill Byrne
Felix Hieber
13
5
0
13 May 2022
A Neural Network Architecture for Program Understanding Inspired by Human Behaviors
Renyu Zhu
Lei Yuan
Xiang Li
Ming Gao
Wenyuan Cai
29
8
0
10 May 2022
A Survey on Neural Abstractive Summarization Methods and Factual Consistency of Summarization
Meng Cao
8
6
0
20 Apr 2022
Memory-Efficient Training of RNN-Transducer with Sampled Softmax
Jaesong Lee
Lukas Lee
Shinji Watanabe
25
8
0
31 Mar 2022
Efficient Image Representation Learning with Federated Sampled Softmax
Sagar M. Waghmare
Qi
Huizhong Chen
Mikhail Sirotenko
Tomer Meron
FedML
13
2
0
09 Mar 2022
WSLRec: Weakly Supervised Learning for Neural Sequential Recommendation Models
Jingwei Zhuo
Binda Liu
Xiang Li
Ziru Xu
Xiaoqiang Zhu
6
0
0
28 Feb 2022
NxtPost: User to Post Recommendations in Facebook Groups
Kaushik Rangadurai
Yiqun Liu
Siddarth Malreddy
Xiaoyi Liu
P. Maheshwari
Vishwanath Sangale
Fedor Borisyuk
16
6
0
08 Feb 2022
DKPLM: Decomposable Knowledge-enhanced Pre-trained Language Model for Natural Language Understanding
Taolin Zhang
Chengyu Wang
Nan Hu
Minghui Qiu
Chengguang Tang
Xiaofeng He
Jun Huang
KELM
VLM
19
30
0
02 Dec 2021
Attention based end to end Speech Recognition for Voice Search in Hindi and English
Raviraj Joshi
Venkateshan Kannan
20
6
0
15 Nov 2021
1
2
3
4
5
6
7
8
Next