Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.09084
Cited By
Contrastive Transformer Learning with Proximity Data Generation for Text-Based Person Search
15 November 2023
Hefeng Wu
Weifeng Chen
Zhibin Liu
Tianshui Chen
Zhiguang Chen
Liang Lin
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Contrastive Transformer Learning with Proximity Data Generation for Text-Based Person Search"
40 / 40 papers shown
Title
Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models
Weifeng Chen
Yatai Ji
Jie Wu
Hefeng Wu
Pan Xie
Jiashi Li
Xin Xia
Xuefeng Xiao
Liang Lin
VGen
136
92
0
23 May 2023
Null-text Inversion for Editing Real Images using Guided Diffusion Models
Ron Mokady
Amir Hertz
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
48
537
0
17 Nov 2022
DiffEdit: Diffusion-based semantic image editing with mask guidance
Guillaume Couairon
Jakob Verbeek
Holger Schwenk
Matthieu Cord
DiffM
103
490
0
20 Oct 2022
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Nataniel Ruiz
Yuanzhen Li
Varun Jampani
Yael Pritch
Michael Rubinstein
Kfir Aberman
172
2,789
0
25 Aug 2022
See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval
Xiujun Shu
Wei Wen
Haoqian Wu
Keyun Chen
Yi-Zhe Song
Ruizhi Qiao
Bohan Ren
Xiao Wang
50
92
0
18 Aug 2022
Prompt-to-Prompt Image Editing with Cross Attention Control
Amir Hertz
Ron Mokady
J. Tenenbaum
Kfir Aberman
Yael Pritch
Daniel Cohen-Or
DiffM
115
1,727
0
02 Aug 2022
Learning Granularity-Unified Representations for Text-to-Image Person Re-identification
Zhiyin Shao
Xinyu Zhang
Meng Fang
Zhi-hao Lin
Jian Wang
Changxing Ding
46
100
0
16 Jul 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
225
5,904
0
23 May 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLM
DiffM
259
6,768
0
13 Apr 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
Guosheng Lin
MLLM
BDL
VLM
CLIP
426
4,283
0
28 Jan 2022
RePaint: Inpainting using Denoising Diffusion Probabilistic Models
Andreas Lugmayr
Martin Danelljan
Andrés Romero
Feng Yu
Radu Timofte
Luc Van Gool
DiffM
296
1,385
0
24 Jan 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
199
15,081
0
20 Dec 2021
FILIP: Fine-grained Interactive Language-Image Pre-Training
Lewei Yao
Runhu Huang
Lu Hou
Guansong Lu
Minzhe Niu
Hang Xu
Xiaodan Liang
Zhenguo Li
Xin Jiang
Chunjing Xu
VLM
CLIP
51
627
0
09 Nov 2021
Text-Based Person Search with Limited Data
Xiaoping Han
Sen He
Li Zhang
Tao Xiang
39
89
0
20 Oct 2021
Semantically Self-Aligned Network for Text-to-Image Part-aware Person Re-identification
Z. Ding
Changxing Ding
Zhiyin Shao
Dacheng Tao
42
132
0
27 Jul 2021
Alias-Free Generative Adversarial Networks
Tero Karras
M. Aittala
S. Laine
Erik Härkönen
Janne Hellsten
J. Lehtinen
Timo Aila
GAN
137
1,582
0
23 Jun 2021
TIPCB: A Simple but Effective Part-based Convolutional Baseline for Text-based Person Search
Yuhao Chen
Guoqing Zhang
Yujiang Lu
Zhenxing Wang
Yuhui Zheng
Ruili Wang
39
121
0
25 May 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
548
28,659
0
26 Feb 2021
AXM-Net: Implicit Cross-Modal Feature Alignment for Person Re-identification
Ammarah Farooq
Muhammad Awais
J. Kittler
S. S. Khalid
3DPC
99
86
0
19 Jan 2021
Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search
Chen Gao
Guanyu Cai
Xinyang Jiang
Feng Zheng
Jinchao Zhang
Yifei Gong
Pai Peng
Xiao-Wei Guo
Xing Sun
DiffM
97
92
0
08 Jan 2021
Training data-efficient image transformers & distillation through attention
Hugo Touvron
Matthieu Cord
Matthijs Douze
Francisco Massa
Alexandre Sablayrolles
Hervé Jégou
ViT
245
6,657
0
23 Dec 2020
Understanding the Behaviour of Contrastive Loss
Feng Wang
Huaping Liu
SSL
65
679
0
15 Dec 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
187
40,217
0
22 Oct 2020
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
82
7,166
0
06 Oct 2020
Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition
Tianshui Chen
Liang Lin
Riquan Chen
X. Hui
Hefeng Wu
67
155
0
20 Sep 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
241
17,550
0
19 Jun 2020
ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language
Zhe Wang
Zhiyuan Fang
Jun Wang
Yezhou Yang
75
153
0
15 May 2020
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Xiujun Li
Xi Yin
Chunyuan Li
Pengchuan Zhang
Xiaowei Hu
...
Houdong Hu
Li Dong
Furu Wei
Yejin Choi
Jianfeng Gao
VLM
58
1,927
0
13 Apr 2020
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSL
VLM
178
3,659
0
06 Aug 2019
Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments
K. Niu
Y. Huang
Wanli Ouyang
Liang Wang
33
139
0
23 Jun 2019
Bag of Tricks and A Strong Baseline for Deep Person Re-identification
Hao Luo
Youzhi Gu
Xingyu Liao
Shenqi Lai
Wei Jiang
BDL
3DPC
128
1,170
0
17 Mar 2019
A Style-Based Generator Architecture for Generative Adversarial Networks
Tero Karras
S. Laine
Timo Aila
479
10,466
0
12 Dec 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
815
93,936
0
11 Oct 2018
Large Scale GAN Training for High Fidelity Natural Image Synthesis
Andrew Brock
Jeff Donahue
Karen Simonyan
196
5,363
0
28 Sep 2018
Pose-Guided Multi-Granularity Attention Network for Text-Based Person Search
Ya Jing
Chenyang Si
Junbo Wang
Wei Wang
Liang Wang
Tieniu Tan
3DH
82
135
0
22 Sep 2018
Representation Learning with Contrastive Predictive Coding
Aaron van den Oord
Yazhe Li
Oriol Vinyals
DRL
SSL
204
10,152
0
10 Jul 2018
Dual-Path Convolutional Image-Text Embeddings with Instance Loss
Zhedong Zheng
Liang Zheng
Michael Garrett
Yi Yang
Mingliang Xu
Yi-Dong Shen
68
473
0
15 Nov 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
324
129,831
0
12 Jun 2017
Person Search with Natural Language Description
Shuang Li
Tong Xiao
Hongsheng Li
Bolei Zhou
Dayu Yue
Xiaogang Wang
53
389
0
19 Feb 2017
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.1K
192,638
0
10 Dec 2015
1