Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1707.05612
Cited By
v1
v2
v3
v4 (latest)
VSE++: Improving Visual-Semantic Embeddings with Hard Negatives
18 July 2017
Fartash Faghri
David J. Fleet
J. Kiros
Sanja Fidler
VLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"
27 / 77 papers shown
Title
Learning semantic sentence representations from visually grounded language without lexical knowledge
Danny Merkx
S. Frank
SSL
26
13
0
27 Mar 2019
Show, Translate and Tell
D. Peri
Shagan Sah
R. Ptucha
19
5
0
14 Mar 2019
Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing
Xihui Liu
Zihao Wang
Jing Shao
Xiaogang Wang
Hongsheng Li
ObjD
101
186
0
03 Mar 2019
Learning Shared Semantic Space with Correlation Alignment for Cross-modal Event Retrieval
Zhenguo Yang
Zehang Lin
Peipei Kang
Jianming Lv
Qing Li
Wenyin Liu
3DPC
91
26
0
14 Jan 2019
Multi-task Learning of Hierarchical Vision-Language Representation
Duy-Kien Nguyen
Takayuki Okatani
105
52
0
03 Dec 2018
Towards Coherent and Cohesive Long-form Text Generation
W. Cho
Pengchuan Zhang
Yizhe Zhang
Xiujun Li
Michel Galley
Chris Brockett
Mengdi Wang
Jianfeng Gao
30
0
0
01 Nov 2018
Engaging Image Captioning Via Personality
Kurt Shuster
Samuel Humeau
Hexiang Hu
Antoine Bordes
Jason Weston
87
152
0
25 Oct 2018
Lessons learned in multilingual grounded language learning
Ákos Kádár
Desmond Elliott
Marc-Alexandre Côté
Grzegorz Chrupała
Afra Alishahi
VLM
112
24
0
20 Sep 2018
Dual Encoding for Zero-Example Video Retrieval
Jianfeng Dong
Xirong Li
Chaoxi Xu
S. Ji
Yuan He
Gang Yang
Xun Wang
135
271
0
17 Sep 2018
Evaluating Multimodal Representations on Sentence Similarity: vSTS, Visual Semantic Textual Similarity Dataset
Oier López de Lacalle
Aitor Soroa Etxabe
Eneko Agirre
19
3
0
11 Sep 2018
Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval
Niluthpol Chowdhury Mithun
Yikang Shen
Evangelos E. Papalexakis
Amit K. Roy-Chowdhury
75
77
0
23 Aug 2018
Talking Face Generation by Adversarially Disentangled Audio-Visual Representation
Hang Zhou
Yu Liu
Ziwei Liu
Ping Luo
Xiaogang Wang
CVBM
94
443
0
20 Jul 2018
Learning Visually-Grounded Semantics from Contrastive Adversarial Samples
Freda Shi
Jiayuan Mao
Tete Xiao
Yuning Jiang
Jian Sun
ObjD
96
52
0
27 Jun 2018
Large-Scale Visual Relationship Understanding
Ji Zhang
Yannis Kalantidis
Marcus Rohrbach
Manohar Paluri
Ahmed Elgammal
Mohamed Elhoseiny
67
169
0
27 Apr 2018
Dynamic Meta-Embeddings for Improved Sentence Representations
Douwe Kiela
Changhan Wang
Kyunghyun Cho
AI4TS
92
108
0
21 Apr 2018
Zero-Shot Object Detection
Ankan Bansal
Karan Sikka
Gaurav Sharma
Rama Chellappa
Ajay Divakaran
VLM
ObjD
109
361
0
12 Apr 2018
Imagine This! Scripts to Compositions to Videos
Tanmay Gupta
Dustin Schwenk
Ali Farhadi
Derek Hoiem
Aniruddha Kembhavi
CoGe
VGen
148
91
0
10 Apr 2018
Finding beans in burgers: Deep semantic-visual embedding with localization
Martin Engilberge
Louis Chevallier
P. Pérez
Matthieu Cord
81
96
0
05 Apr 2018
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
Xihui Liu
Hongsheng Li
Jing Shao
Dapeng Chen
Xiaogang Wang
93
133
0
22 Mar 2018
Stacked Cross Attention for Image-Text Matching
Kuang-Huei Lee
Xi Chen
G. Hua
Houdong Hu
Xiaodong He
122
1,163
0
21 Mar 2018
Discriminability objective for training descriptive captions
Ruotian Luo
Brian L. Price
Scott D. Cohen
Gregory Shakhnarovich
134
203
0
12 Mar 2018
VSE-ens: Visual-Semantic Embeddings with Efficient Negative Sampling
G. Guo
Songlin Zhai
Fajie Yuan
Yuan Liu
Xingwei Wang
VLM
48
11
0
05 Jan 2018
Learning Semantic Concepts and Order for Image and Sentence Matching
Yan Huang
Qi Wu
Liang Wang
VLM
85
305
0
06 Dec 2017
ADVISE: Symbolism and External Knowledge for Decoding Advertisements
Keren Ye
Adriana Kovashka
79
51
0
17 Nov 2017
Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models
Jiuxiang Gu
Jianfei Cai
Shafiq Joty
Li Niu
G. Wang
VLM
122
361
0
17 Nov 2017
Dual-Path Convolutional Image-Text Embeddings with Instance Loss
Zhedong Zheng
Liang Zheng
Michael Garrett
Yi Yang
Mingliang Xu
Yi-Dong Shen
181
483
0
15 Nov 2017
Learning Visually Grounded Sentence Representations
Douwe Kiela
Alexis Conneau
Allan Jabri
Maximilian Nickel
SSL
88
69
0
19 Jul 2017
Previous
1
2