1812.04616
Cited By
Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs
10 December 2018
Sachin Kumar
Yulia Tsvetkov
Papers citing
"Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs"
27 / 27 papers shown
Title
Deep Sparse Latent Feature Models for Knowledge Graph Completion
Haotian Li
Rui Zhang
Lingzhi Wang
Bin Yu
Yuanbo Wang
Yuliang Wei
Kai Wang
Richard Y. D. Xu
Bailing Wang
BDL
134
0
0
24 Nov 2024
The Unreasonable Effectiveness of Random Target Embeddings for Continuous-Output Neural Machine Translation
Evgeniia Tokarchuk
Vlad Niculae
60
2
0
31 Oct 2023
Unsupervised Discovery of Continuous Skills on a Sphere
Takahisa Imagawa
Takuya Hiraoka
Yoshimasa Tsuruoka
76
0
0
21 May 2023
No Shifted Augmentations (NSA): compact distributions for robust self-supervised Anomaly Detection
Mohamed Yousef
Marcel R. Ackermann
Unmesh Kurup
Tom E. Bishop
OODD
OOD
92
3
0
19 Mar 2022
Controlled Text Generation as Continuous Optimization with Multiple Constraints
Sachin Kumar
Eric Malmi
Aliaksei Severyn
Yulia Tsvetkov
BDL
AI4CE
98
79
0
04 Aug 2021
Machine Translation into Low-resource Language Varieties
Sachin Kumar
Antonios Anastasopoulos
S. Wintner
Yulia Tsvetkov
83
30
0
12 Jun 2021
BERTTune: Fine-Tuning Neural Machine Translation with BERTScore
Inigo Jauregi Unanue
Jacob Parnell
Massimo Piccardi
56
34
0
04 Jun 2021
von Mises-Fisher Loss: An Exploration of Embedding Geometries for Supervised Learning
Tyler R. Scott
Andrew C. Gallagher
Michael C. Mozer
82
42
0
29 Mar 2021
Civil Rephrases Of Toxic Texts With Self-Supervised Transformers
Leo Laugier
John Pavlopoulos
Jeffrey Scott Sorensen
Lucas Dixon
87
48
0
01 Feb 2021
Neural Machine Translation: A Review of Methods, Resources, and Tools
Zhixing Tan
Shuo Wang
Zonghan Yang
Gang Chen
Xuancheng Huang
Maosong Sun
Yang Liu
3DV
AI4TS
87
110
0
31 Dec 2020
Hierarchical Metadata-Aware Document Categorization under Weak Supervision
Yu Zhang
Xiusi Chen
Yu Meng
Jiawei Han
112
22
0
26 Oct 2020
Plug and Play Autoencoders for Conditional Text Generation
Florian Mai
Nikolaos Pappas
Ivan Montero
Noah A. Smith
U. Washington
104
37
0
06 Oct 2020
Generating Dialogue Responses from a Semantic Latent Space
Wei-Jen Ko
Avik Ray
Yilin Shen
Hongxia Jin
VLM
97
6
0
04 Oct 2020
Language Models as Knowledge Bases: On Entity Representations, Storage Capacity, and Paraphrased Queries
Benjamin Heinzerling
Kentaro Inui
KELM
68
133
0
20 Aug 2020
Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding
Yu Meng
Yunyi Zhang
Jiaxin Huang
Yu Zhang
Chao Zhang
Jiawei Han
92
68
0
18 Jul 2020
The Power Spherical distribution
Nicola De Cao
Wilker Aziz
86
29
0
08 Jun 2020
Worse WER, but Better BLEU? Leveraging Word Embedding as Intermediate in Multitask End-to-End Speech Translation
Shun-Po Chuang
Tzu-Wei Sung
Alexander H. Liu
Hung-yi Lee
76
20
0
21 May 2020
Generalized Entropy Regularization or: There's Nothing Special about Label Smoothing
Clara Meister
Elizabeth Salesky
Ryan Cotterell
UQCV
51
61
0
02 May 2020
Directional Deep Embedding and Appearance Learning for Fast Video Object Segmentation
Yingjie Yin
De Xu
Xingang Wang
Lei Zhang
VOS
51
16
0
17 Feb 2020
Spherical Text Embedding
Yu Meng
Jiaxin Huang
Guangyuan Wang
Chao Zhang
Honglei Zhuang
Lance M. Kaplan
Jiawei Han
RALM
54
118
0
04 Nov 2019
Regressing Word and Sentence Embeddings for Regularization of Neural Machine Translation
Inigo Jauregi Unanue
E. Z. Borzeshi
Massimo Piccardi
AI4TS
40
0
0
30 Sep 2019
Fast and Simple Natural-Gradient Variational Inference with Mixture of Exponential-family Approximations
Wu Lin
Mohammad Emtiyaz Khan
Mark Schmidt
BDL
99
71
0
07 Jun 2019
Sparse Sequence-to-Sequence Models
Ben Peters
Vlad Niculae
André F. T. Martins
TPM
200
214
0
14 May 2019
Multimodal Machine Translation with Embedding Prediction
Tosho Hirasawa
Hayahide Yamagishi
Yukio Matsumura
Mamoru Komachi
32
16
0
01 Apr 2019
compare-mt: A Tool for Holistic Comparison of Language Generation Systems
Graham Neubig
Zi-Yi Dou
Junjie Hu
Paul Michel
Danish Pruthi
Xinyi Wang
John Wieting
ELM
77
116
0
19 Mar 2019
An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models
Alexandra Chronopoulou
Christos Baziotis
Alexandros Potamianos
CLL
70
132
0
27 Feb 2019
Billion-scale similarity search with GPUs
Jeff Johnson
Matthijs Douze
Hervé Jégou
347
3,748
0
28 Feb 2017