1708.01988
Cited By
Identity-Aware Textual-Visual Matching with Latent Co-attention
7 August 2017
Shuang Li
Tong Xiao
Hongsheng Li
Wei Yang
Xiaogang Wang
Papers citing
"Identity-Aware Textual-Visual Matching with Latent Co-attention"
28 / 28 papers shown
Title
Multi-modal Reference Learning for Fine-grained Text-to-Image Retrieval
Zehong Ma
Hao Chen
Wei Zeng
Limin Su
Shiliang Zhang
AI4TS
110
0
0
10 Apr 2025
Enhancing Visual Representation for Text-based Person Searching
Wei Shen
Ming Fang
Yuxia Wang
Jiafeng Xiao
Diping Li
Ningyu Zhang
Ling Xu
Weinan Zhang
84
1
0
31 Dec 2024
From Data Deluge to Data Curation: A Filtering-WoRA Paradigm for Efficient Text-based Person Search
Jintao Sun
Zhedong Zheng
Gangyi Ding
83
8
0
16 Apr 2024
Person Search with Natural Language Description
Shuang Li
Tong Xiao
Hongsheng Li
Bolei Zhou
Dayu Yue
Xiaogang Wang
80
392
0
19 Feb 2017
Dual Attention Networks for Multimodal Reasoning and Matching
Hyeonseob Nam
Jung-Woo Ha
Jeonghee Kim
100
667
0
02 Nov 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
305
1,465
0
06 Jun 2016
Adversarial Feature Learning
Jeff Donahue
Philipp Krähenbühl
Trevor Darrell
GAN
111
1,611
0
31 May 2016
Learning Deep Representations of Fine-grained Visual Descriptions
Scott E. Reed
Zeynep Akata
Bernt Schiele
Honglak Lee
OCL
VLM
201
842
0
17 May 2016
Learning Deep Feature Representations with Domain Guided Dropout for Person Re-identification
Tong Xiao
Hongsheng Li
Wanli Ouyang
Xiaogang Wang
OOD
77
977
0
26 Apr 2016
Person Re-identification in the Wild
Liang Zheng
Hengheng Zhang
Shaoyan Sun
Manmohan Chandraker
Yi Yang
Q. Tian
64
684
0
09 Apr 2016
Joint Detection and Identification Feature Learning for Person Search
Tong Xiao
Shuang Li
Bochao Wang
Liang Lin
Xiaogang Wang
105
823
0
07 Apr 2016
Learning a Discriminative Null Space for Person Re-identification
Li Zhang
Tao Xiang
S. Gong
53
655
0
07 Mar 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
...
Yannis Kalantidis
Li-Jia Li
David A. Shamma
Michael S. Bernstein
Li Fei-Fei
217
5,747
0
23 Feb 2016
Simple Baseline for Visual Question Answering
Bolei Zhou
Yuandong Tian
Sainbayar Sukhbaatar
Arthur Szlam
Rob Fergus
FAtt
77
323
0
07 Dec 2015
Learning Deep Structure-Preserving Image-Text Embeddings
Liwei Wang
Yin Li
Svetlana Lazebnik
83
781
0
19 Nov 2015
Visual7W: Grounded Question Answering in Images
Yuke Zhu
Oliver Groth
Michael S. Bernstein
Li Fei-Fei
94
884
0
11 Nov 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
211
5,478
0
03 May 2015
FaceNet: A Unified Embedding for Face Recognition and Clustering
Florian Schroff
Dmitry Kalenichenko
James Philbin
3DH
376
13,145
0
12 Mar 2015
Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)
Junhua Mao
Wenyuan Xu
Yi Yang
Jiang Wang
Zhiheng Huang
Alan Yuille
VLM
171
1,240
0
20 Dec 2014
Deep Visual-Semantic Alignments for Generating Image Descriptions
A. Karpathy
Li Fei-Fei
129
5,585
0
07 Dec 2014
Show and Tell: A Neural Image Caption Generator
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
249
6,029
0
17 Nov 2014
Explain Images with Multimodal Recurrent Neural Networks
Junhua Mao
Wenyuan Xu
Yi Yang
Jiang Wang
Alan Yuille
VLM
GAN
103
385
0
04 Oct 2014
Evaluation of Output Embeddings for Fine-Grained Image Classification
Zeynep Akata
Scott E. Reed
D. Walter
Honglak Lee
Bernt Schiele
83
1,020
0
30 Sep 2014
Going Deeper with Convolutions
Christian Szegedy
Wei Liu
Yangqing Jia
P. Sermanet
Scott E. Reed
Dragomir Anguelov
D. Erhan
Vincent Vanhoucke
Andrew Rabinovich
477
43,658
0
17 Sep 2014
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
1.7K
100,386
0
04 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
564
27,311
0
01 Sep 2014
Microsoft COCO: Common Objects in Context
Tsung-Yi Lin
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
413
43,667
0
01 May 2014
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov
Ilya Sutskever
Kai Chen
G. Corrado
J. Dean
NAI
OCL
397
33,550
0
16 Oct 2013