Dual-Path Convolutional Image-Text Embeddings with Instance Loss

15 November 2017
Zhedong Zheng
Liang Zheng
Michael Garrett
Yi Yang
Mingliang Xu
Yi-Dong Shen
arXiv:1711.05535

Papers citing "Dual-Path Convolutional Image-Text Embeddings with Instance Loss"

39 / 139 papers shown
T-EMDE: Sketching-based global similarity for cross-modal retrieval
Barbara Rychalska
Mikolaj Wieczorek
Jacek Dąbrowski
59
0
0
10 May 2021
Person Search Challenges and Solutions: A Survey
Xiangtan Lin
Pengzhen Ren
Yun Xiao
Xiaojun Chang
Alexander G. Hauptmann
105
14
0
01 May 2021
Discrete-continuous Action Space Policy Gradient-based Attention for Image-Text Matching
Shiyang Yan
Li Yu
Yuan Xie
88
34
0
21 Apr 2021
Integrating Information Theory and Adversarial Learning for Cross-modal Retrieval
Wei Chen
Yu Liu
E. Bakker
M. Lew
GAN
41
27
0
11 Apr 2021
Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval
Gregor Geigle
Jonas Pfeiffer
Nils Reimers
Ivan Vulić
Iryna Gurevych
104
60
0
22 Mar 2021
An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-Level Structural Information
Zejun Li
Zhongyu Wei
Zhihao Fan
Haijun Shan
Xuanjing Huang
52
5
0
21 Mar 2021
AXM-Net: Implicit Cross-Modal Feature Alignment for Person Re-identification
Ammarah Farooq
Muhammad Awais
J. Kittler
S. S. Khalid
3DPC
156
90
0
19 Jan 2021
Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search
Chen Gao
Guanyu Cai
Xinyang Jiang
Feng Zheng
Jinchao Zhang
Yifei Gong
Pai Peng
Xiao-Wei Guo
Xing Sun
DiffM
140
96
0
08 Jan 2021
VinVL: Revisiting Visual Representations in Vision-Language Models
Pengchuan Zhang
Xiujun Li
Xiaowei Hu
Jianwei Yang
Lei Zhang
Lijuan Wang
Yejin Choi
Jianfeng Gao
ObjD, VLM
347
158
0
02 Jan 2021
Beyond the Deep Metric Learning: Enhance the Cross-Modal Matching with Adversarial Discriminative Domain Regularization
Li Ren
Keqin Li
Liqiang Wang
K. Hua
44
4
0
23 Oct 2020
Taking Modality-free Human Identification as Zero-shot Learning
Zhizhe Liu
Xingxing Zhang
Zhenfeng Zhu
Shuai Zheng
Yao Zhao
Jian Cheng
54
4
0
02 Oct 2020
Dual-path CNN with Max Gated block for Text-Based Person Re-identification
Tinghuai Ma
Mingming Yang
Huan Rong
Yurong Qian
Y. Tian
N. Al-Nabhan
72
19
0
20 Sep 2020
Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization
Tingyu Wang
Zhedong Zheng
C. Yan
Jiyong Zhang
Yaoqi Sun
Bolun Zheng
Yi Yang
59
171
0
26 Aug 2020
Weakly supervised cross-domain alignment with optimal transport
Siyang Yuan
Ke Bai
Liqun Chen
Yizhe Zhang
Chenyang Tao
Chunyuan Li
Guoyin Wang
Ricardo Henao
Lawrence Carin
OT
60
7
0
14 Aug 2020
Dual Convolutional Neural Networks for Breast Mass Segmentation and Diagnosis in Mammography
Heyi Li
Dongdong Chen
W. Nailon
Mike E. Davies
Dave Laurenson
75
54
0
07 Aug 2020
Deep Learning Techniques for Future Intelligent Cross-Media Retrieval
S. Rehman
M. Waqas
Shanshan Tu
Anis Koubaa
O. Rehman
Jawad Ahmad
Muhammad Hanif
Zhu Han
37
6
0
21 Jul 2020
Symbiotic Adversarial Learning for Attribute-based Person Search
Yu-Tong Cao
Jingya Wang
Dacheng Tao
GAN
48
28
0
19 Jul 2020
Graph Optimal Transport for Cross-Domain Alignment
Liqun Chen
Zhe Gan
Yu Cheng
Linjie Li
Lawrence Carin
Jingjing Liu
OT
115
152
0
26 Jun 2020
Compositional Learning of Image-Text Query for Image Retrieval
Muhammad Umer Anwaar
Egor Labintcev
M. Kleinsteuber
CoGe
109
96
0
19 Jun 2020
Parameter-Efficient Person Re-identification in the 3D Space
Zhedong Zheng
Nenggan Zheng
Yi Yang
3DPC
89
63
0
08 Jun 2020
VehicleNet: Learning Robust Visual Representation for Vehicle Re-identification
Zhedong Zheng
Tao Ruan
Yunchao Wei
Yi Yang
Tao Mei
91
152
0
14 Apr 2020
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Xiujun Li
Xi Yin
Chunyuan Li
Pengchuan Zhang
Xiaowei Hu
...
Houdong Hu
Li Dong
Furu Wei
Yejin Choi
Jianfeng Gao
VLM
204
1,953
0
13 Apr 2020
Graph Structured Network for Image-Text Matching
Chunxiao Liu
Zhendong Mao
Tianzhu Zhang
Hongtao Xie
Bin Wang
Yongdong Zhang
84
239
0
01 Apr 2020
IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval
Hui Chen
Guiguang Ding
Xudong Liu
Zijia Lin
Ji Liu
Jungong Han
78
326
0
08 Mar 2020
University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization
Zhedong Zheng
Yunchao Wei
Yi Yang
66
247
0
27 Feb 2020
Deep Multimodal Image-Text Embeddings for Automatic Cross-Media Retrieval
Hadi Abdi Khojasteh
Ebrahim Ansari
Parvin Razzaghi
Akbar Karimi
VLM
43
4
0
23 Feb 2020
A Convolutional Baseline for Person Re-Identification Using Vision and Language Descriptions
Ammarah Farooq
Muhammad Awais
F. Yan
J. Kittler
A. Akbari
S. S. Khalid
114
8
0
20 Feb 2020
Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching
Tianlang Chen
Jiebo Luo
67
69
0
20 Feb 2020
MHSAN: Multi-Head Self-Attention Network for Visual Semantic Embedding
Geondo Park
Chihye Han
Wonjun Yoon
Dae-Shik Kim
30
18
0
11 Jan 2020
Visual-Textual Association with Hardest and Semi-Hard Negative Pairs Mining for Person Search
Jing Ge
Guangyu Gao
Zhen Liu
87
18
0
06 Dec 2019
Attend to the Difference: Cross-Modality Person Re-identification via Contrastive Correlation
Shizhou Zhang
Yifei Yang
Peng Wang
Guoqiang Liang
Xiuwei Zhang
Yanning Zhang
47
52
0
25 Oct 2019
Target-Oriented Deformation of Visual-Semantic Embedding Space
Takashi Matsubara
55
7
0
15 Oct 2019
Adversarial Representation Learning for Text-to-Image Matching
N. Sarafianos
Xiang Xu
I. Kakadiaris
GAN
117
188
0
28 Aug 2019
Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training
Gen Li
Nan Duan
Yuejian Fang
Ming Gong
Daxin Jiang
Ming Zhou
SSL, VLM, MLLM
221
907
0
16 Aug 2019
Attract or Distract: Exploit the Margin of Open Set
Qianyu Feng
Guoliang Kang
Hehe Fan
Yezhou Yang
76
56
0
06 Aug 2019
Position Focused Attention Network for Image-Text Matching
Yaxiong Wang
Hao-Hsiang Yang
Xueming Qian
Lin Ma
Jing Lu
Biao Li
Xin Fan
47
172
0
23 Jul 2019
A New Benchmark and Approach for Fine-grained Cross-media Retrieval
Xiangteng He
Yuxin Peng
Liu Xie
VLM
104
65
0
10 Jul 2019
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments
K. Niu
Y. Huang
Wanli Ouyang
Liang Wang
58
143
0
23 Jun 2019
Exploring Uncertainty Measures for Image-Caption Embedding-and-Retrieval Task
Kenta Hama
Takashi Matsubara
K. Uehara
Jianfei Cai
BDL, UQCV
40
6
0
09 Apr 2019