
Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs

10 December 2018

Papers citing "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs"

27 / 27 papers shown

Title | Authors | Tags | Counts | Date
Deep Sparse Latent Feature Models for Knowledge Graph Completion | Haotian Li, Rui Zhang, Lingzhi Wang, Bin Yu, Yuanbo Wang, Yuliang Wei, Kai Wang, Richard Y. D. Xu, Bailing Wang | BDL | 134 0 0 | 24 Nov 2024
The Unreasonable Effectiveness of Random Target Embeddings for Continuous-Output Neural Machine Translation | Evgeniia Tokarchuk, Vlad Niculae | - | 60 2 0 | 31 Oct 2023
Unsupervised Discovery of Continuous Skills on a Sphere | Takahisa Imagawa, Takuya Hiraoka, Yoshimasa Tsuruoka | - | 76 0 0 | 21 May 2023
No Shifted Augmentations (NSA): compact distributions for robust self-supervised Anomaly Detection | Mohamed Yousef, Marcel R. Ackermann, Unmesh Kurup, Tom E. Bishop | OODD, OOD | 92 3 0 | 19 Mar 2022
Controlled Text Generation as Continuous Optimization with Multiple Constraints | Sachin Kumar, Eric Malmi, Aliaksei Severyn, Yulia Tsvetkov | BDL, AI4CE | 98 79 0 | 04 Aug 2021
Machine Translation into Low-resource Language Varieties | Sachin Kumar, Antonios Anastasopoulos, S. Wintner, Yulia Tsvetkov | - | 83 30 0 | 12 Jun 2021
BERTTune: Fine-Tuning Neural Machine Translation with BERTScore | Inigo Jauregi Unanue, Jacob Parnell, Massimo Piccardi | - | 56 34 0 | 04 Jun 2021
von Mises-Fisher Loss: An Exploration of Embedding Geometries for Supervised Learning | Tyler R. Scott, Andrew C. Gallagher, Michael C. Mozer | - | 82 42 0 | 29 Mar 2021
Civil Rephrases Of Toxic Texts With Self-Supervised Transformers | Leo Laugier, John Pavlopoulos, Jeffrey Scott Sorensen, Lucas Dixon | - | 87 48 0 | 01 Feb 2021
Neural Machine Translation: A Review of Methods, Resources, and Tools | Zhixing Tan, Shuo Wang, Zonghan Yang, Gang Chen, Xuancheng Huang, Maosong Sun, Yang Liu | 3DV, AI4TS | 87 110 0 | 31 Dec 2020
Hierarchical Metadata-Aware Document Categorization under Weak Supervision | Yu Zhang, Xiusi Chen, Yu Meng, Jiawei Han | - | 112 22 0 | 26 Oct 2020
Plug and Play Autoencoders for Conditional Text Generation | Florian Mai, Nikolaos Pappas, Ivan Montero, Noah A. Smith, U. Washington | - | 104 37 0 | 06 Oct 2020
Generating Dialogue Responses from a Semantic Latent Space | Wei-Jen Ko, Avik Ray, Yilin Shen, Hongxia Jin | VLM | 97 6 0 | 04 Oct 2020
Language Models as Knowledge Bases: On Entity Representations, Storage Capacity, and Paraphrased Queries | Benjamin Heinzerling, Kentaro Inui | KELM | 68 133 0 | 20 Aug 2020
Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding | Yu Meng, Yunyi Zhang, Jiaxin Huang, Yu Zhang, Chao Zhang, Jiawei Han | - | 92 68 0 | 18 Jul 2020
The Power Spherical distribution | Nicola De Cao, Wilker Aziz | - | 86 29 0 | 08 Jun 2020
Worse WER, but Better BLEU? Leveraging Word Embedding as Intermediate in Multitask End-to-End Speech Translation | Shun-Po Chuang, Tzu-Wei Sung, Alexander H. Liu, Hung-yi Lee | - | 76 20 0 | 21 May 2020
Generalized Entropy Regularization or: There's Nothing Special about Label Smoothing | Clara Meister, Elizabeth Salesky, Ryan Cotterell | UQCV | 51 61 0 | 02 May 2020
Directional Deep Embedding and Appearance Learning for Fast Video Object Segmentation | Yingjie Yin, De Xu, Xingang Wang, Lei Zhang | VOS | 51 16 0 | 17 Feb 2020
Spherical Text Embedding | Yu Meng, Jiaxin Huang, Guangyuan Wang, Chao Zhang, Honglei Zhuang, Lance M. Kaplan, Jiawei Han | RALM | 54 118 0 | 04 Nov 2019
Regressing Word and Sentence Embeddings for Regularization of Neural Machine Translation | Inigo Jauregi Unanue, E. Z. Borzeshi, Massimo Piccardi | AI4TS | 40 0 0 | 30 Sep 2019
Fast and Simple Natural-Gradient Variational Inference with Mixture of Exponential-family Approximations | Wu Lin, Mohammad Emtiyaz Khan, Mark Schmidt | BDL | 99 71 0 | 07 Jun 2019
Sparse Sequence-to-Sequence Models | Ben Peters, Vlad Niculae, André F. T. Martins | TPM | 200 214 0 | 14 May 2019
Multimodal Machine Translation with Embedding Prediction | Tosho Hirasawa, Hayahide Yamagishi, Yukio Matsumura, Mamoru Komachi | - | 32 16 0 | 01 Apr 2019
compare-mt: A Tool for Holistic Comparison of Language Generation Systems | Graham Neubig, Zi-Yi Dou, Junjie Hu, Paul Michel, Danish Pruthi, Xinyi Wang, John Wieting | ELM | 77 116 0 | 19 Mar 2019
An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models | Alexandra Chronopoulou, Christos Baziotis, Alexandros Potamianos | CLL | 70 132 0 | 27 Feb 2019
Billion-scale similarity search with GPUs | Jeff Johnson, Matthijs Douze, Hervé Jégou | - | 347 3,748 0 | 28 Feb 2017