ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.05612
  4. Cited By
VSE++: Improving Visual-Semantic Embeddings with Hard Negatives
v1v2v3v4 (latest)

VSE++: Improving Visual-Semantic Embeddings with Hard Negatives

18 July 2017
Fartash Faghri
David J. Fleet
J. Kiros
Sanja Fidler
    VLM
ArXiv (abs)PDFHTML

Papers citing "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"

27 / 77 papers shown
Title
Learning semantic sentence representations from visually grounded
  language without lexical knowledge
Learning semantic sentence representations from visually grounded language without lexical knowledge
Danny Merkx
S. Frank
SSL
26
13
0
27 Mar 2019
Show, Translate and Tell
Show, Translate and Tell
D. Peri
Shagan Sah
R. Ptucha
19
5
0
14 Mar 2019
Improving Referring Expression Grounding with Cross-modal
  Attention-guided Erasing
Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing
Xihui Liu
Zihao Wang
Jing Shao
Xiaogang Wang
Hongsheng Li
ObjD
101
186
0
03 Mar 2019
Learning Shared Semantic Space with Correlation Alignment for
  Cross-modal Event Retrieval
Learning Shared Semantic Space with Correlation Alignment for Cross-modal Event Retrieval
Zhenguo Yang
Zehang Lin
Peipei Kang
Jianming Lv
Qing Li
Wenyin Liu
3DPC
91
26
0
14 Jan 2019
Multi-task Learning of Hierarchical Vision-Language Representation
Multi-task Learning of Hierarchical Vision-Language Representation
Duy-Kien Nguyen
Takayuki Okatani
105
52
0
03 Dec 2018
Towards Coherent and Cohesive Long-form Text Generation
Towards Coherent and Cohesive Long-form Text Generation
W. Cho
Pengchuan Zhang
Yizhe Zhang
Xiujun Li
Michel Galley
Chris Brockett
Mengdi Wang
Jianfeng Gao
30
0
0
01 Nov 2018
Engaging Image Captioning Via Personality
Engaging Image Captioning Via Personality
Kurt Shuster
Samuel Humeau
Hexiang Hu
Antoine Bordes
Jason Weston
87
152
0
25 Oct 2018
Lessons learned in multilingual grounded language learning
Lessons learned in multilingual grounded language learning
Ákos Kádár
Desmond Elliott
Marc-Alexandre Côté
Grzegorz Chrupała
Afra Alishahi
VLM
112
24
0
20 Sep 2018
Dual Encoding for Zero-Example Video Retrieval
Dual Encoding for Zero-Example Video Retrieval
Jianfeng Dong
Xirong Li
Chaoxi Xu
S. Ji
Yuan He
Gang Yang
Xun Wang
135
271
0
17 Sep 2018
Evaluating Multimodal Representations on Sentence Similarity: vSTS,
  Visual Semantic Textual Similarity Dataset
Evaluating Multimodal Representations on Sentence Similarity: vSTS, Visual Semantic Textual Similarity Dataset
Oier López de Lacalle
Aitor Soroa Etxabe
Eneko Agirre
19
3
0
11 Sep 2018
Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval
Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval
Niluthpol Chowdhury Mithun
Yikang Shen
Evangelos E. Papalexakis
Amit K. Roy-Chowdhury
75
77
0
23 Aug 2018
Talking Face Generation by Adversarially Disentangled Audio-Visual
  Representation
Talking Face Generation by Adversarially Disentangled Audio-Visual Representation
Hang Zhou
Yu Liu
Ziwei Liu
Ping Luo
Xiaogang Wang
CVBM
94
443
0
20 Jul 2018
Learning Visually-Grounded Semantics from Contrastive Adversarial
  Samples
Learning Visually-Grounded Semantics from Contrastive Adversarial Samples
Freda Shi
Jiayuan Mao
Tete Xiao
Yuning Jiang
Jian Sun
ObjD
96
52
0
27 Jun 2018
Large-Scale Visual Relationship Understanding
Large-Scale Visual Relationship Understanding
Ji Zhang
Yannis Kalantidis
Marcus Rohrbach
Manohar Paluri
Ahmed Elgammal
Mohamed Elhoseiny
67
169
0
27 Apr 2018
Dynamic Meta-Embeddings for Improved Sentence Representations
Dynamic Meta-Embeddings for Improved Sentence Representations
Douwe Kiela
Changhan Wang
Kyunghyun Cho
AI4TS
92
108
0
21 Apr 2018
Zero-Shot Object Detection
Zero-Shot Object Detection
Ankan Bansal
Karan Sikka
Gaurav Sharma
Rama Chellappa
Ajay Divakaran
VLMObjD
109
361
0
12 Apr 2018
Imagine This! Scripts to Compositions to Videos
Imagine This! Scripts to Compositions to Videos
Tanmay Gupta
Dustin Schwenk
Ali Farhadi
Derek Hoiem
Aniruddha Kembhavi
CoGeVGen
148
91
0
10 Apr 2018
Finding beans in burgers: Deep semantic-visual embedding with
  localization
Finding beans in burgers: Deep semantic-visual embedding with localization
Martin Engilberge
Louis Chevallier
P. Pérez
Matthieu Cord
81
96
0
05 Apr 2018
Show, Tell and Discriminate: Image Captioning by Self-retrieval with
  Partially Labeled Data
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
Xihui Liu
Hongsheng Li
Jing Shao
Dapeng Chen
Xiaogang Wang
93
133
0
22 Mar 2018
Stacked Cross Attention for Image-Text Matching
Stacked Cross Attention for Image-Text Matching
Kuang-Huei Lee
Xi Chen
G. Hua
Houdong Hu
Xiaodong He
122
1,163
0
21 Mar 2018
Discriminability objective for training descriptive captions
Discriminability objective for training descriptive captions
Ruotian Luo
Brian L. Price
Scott D. Cohen
Gregory Shakhnarovich
134
203
0
12 Mar 2018
VSE-ens: Visual-Semantic Embeddings with Efficient Negative Sampling
VSE-ens: Visual-Semantic Embeddings with Efficient Negative Sampling
G. Guo
Songlin Zhai
Fajie Yuan
Yuan Liu
Xingwei Wang
VLM
48
11
0
05 Jan 2018
Learning Semantic Concepts and Order for Image and Sentence Matching
Learning Semantic Concepts and Order for Image and Sentence Matching
Yan Huang
Qi Wu
Liang Wang
VLM
85
305
0
06 Dec 2017
ADVISE: Symbolism and External Knowledge for Decoding Advertisements
ADVISE: Symbolism and External Knowledge for Decoding Advertisements
Keren Ye
Adriana Kovashka
79
51
0
17 Nov 2017
Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval
  with Generative Models
Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models
Jiuxiang Gu
Jianfei Cai
Shafiq Joty
Li Niu
G. Wang
VLM
122
361
0
17 Nov 2017
Dual-Path Convolutional Image-Text Embeddings with Instance Loss
Dual-Path Convolutional Image-Text Embeddings with Instance Loss
Zhedong Zheng
Liang Zheng
Michael Garrett
Yi Yang
Mingliang Xu
Yi-Dong Shen
181
483
0
15 Nov 2017
Learning Visually Grounded Sentence Representations
Learning Visually Grounded Sentence Representations
Douwe Kiela
Alexis Conneau
Allan Jabri
Maximilian Nickel
SSL
88
69
0
19 Jul 2017
Previous
12