ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1708.01311
  4. Cited By
Automatic Spatially-aware Fashion Concept Discovery

Automatic Spatially-aware Fashion Concept Discovery

3 August 2017
Xintong Han
Zuxuan Wu
Phoenix X. Huang
Xiao Zhang
Menglong Zhu
Yuan Li
Yang Zhao
L. Davis
ArXivPDFHTML

Papers citing "Automatic Spatially-aware Fashion Concept Discovery"

50 / 120 papers shown
Title
Seeing the Abstract: Translating the Abstract Language for Vision Language Models
Seeing the Abstract: Translating the Abstract Language for Vision Language Models
Davide Talon
Federico Girella
Ziyue Liu
Marco Cristani
Yiming Wang
VLM
Presented at ResearchTrend Connect | VLM on 21 May 2025
59
0
0
06 May 2025
MIEB: Massive Image Embedding Benchmark
MIEB: Massive Image Embedding Benchmark
Chenghao Xiao
Isaac Chung
Imene Kerboua
Jamie Stirling
Xin Zhang
Márton Kardos
Roman Solomatin
Noura Al Moubayed
Kenneth C. Enevoldsen
Niklas Muennighoff
VLM
42
0
0
14 Apr 2025
Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data
Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data
Yiqun Duan
Sameera Ramasinghe
Stephen Gould
Ajanthan Thalaiyasingam
43
0
0
01 Apr 2025
IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval
IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval
Bangwei Liu
Yicheng Bao
Shaohui Lin
Xuhong Wang
Xin Tan
Yansen Wang
Yuan Xie
Chaochao Lu
84
0
0
01 Apr 2025
FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval
FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval
Zixu Li
Zhiheng Fu
Yupeng Hu
Zhiwei Chen
Haokun Wen
Liqiang Nie
38
0
0
27 Mar 2025
Compositional Caching for Training-free Open-vocabulary Attribute Detection
Compositional Caching for Training-free Open-vocabulary Attribute Detection
Marco Garosi
Alessandro Conti
Gaowen Liu
Elisa Ricci
Massimiliano Mancini
ObjD
VLM
55
0
0
24 Mar 2025
Composed Multi-modal Retrieval: A Survey of Approaches and Applications
Kun Zhang
Jingyu Li
Zhiyu Li
Jingjing Zhang
38
0
0
03 Mar 2025
PinLanding: Content-First Keyword Landing Page Generation via Multi-Modal AI for Web-Scale Discovery
Faye Zhang
Jasmine Wan
Qianyu Cheng
Jinfeng Rao
44
0
0
01 Mar 2025
Joint Fusion and Encoding: Advancing Multimodal Retrieval from the Ground Up
Joint Fusion and Encoding: Advancing Multimodal Retrieval from the Ground Up
Lang Huang
Qiyu Wu
Zhongtao Miao
T. Yamasaki
168
0
0
27 Feb 2025
A Comprehensive Survey on Composed Image Retrieval
A Comprehensive Survey on Composed Image Retrieval
Xuemeng Song
Haoqiang Lin
Haokun Wen
Bohan Hou
Mingzhu Xu
Liqiang Nie
53
1
0
19 Feb 2025
Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions
Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions
Prajwal Gatti
Kshitij Parikh
Dhriti Prasanna Paul
Manish Gupta
Anand Mishra
118
2
0
12 Feb 2025
SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval
Bhavin Jawade
JOÃO-BRUNO Soares
K. Thadani
D. Mohan
Amir Erfan Eshratifar
Benjamin Culpepper
Paloma de Juan
S. Setlur
V. Govindaraju
43
0
0
12 Jan 2025
GME: Improving Universal Multimodal Retrieval by Multimodal LLMs
GME: Improving Universal Multimodal Retrieval by Multimodal LLMs
Xin Zhang
Yanzhao Zhang
Wen Xie
Mingxin Li
Ziqi Dai
Dingkun Long
Pengjun Xie
Meishan Zhang
Wenjie Li
Hao Fei
116
8
0
22 Dec 2024
Composed Image Retrieval for Training-Free Domain Conversion
Composed Image Retrieval for Training-Free Domain Conversion
Nikos Efthymiadis
Bill Psomas
Zakaria Laskar
Konstantinos Karantzalos
Yannis Avrithis
Ondřej Chum
Giorgos Tolias
76
0
0
04 Dec 2024
Advancing Myopia To Holism: Fully Contrastive Language-Image
  Pre-training
Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training
Haicheng Wang
Chen Ju
Weixiong Lin
Shuai Xiao
Mengting Chen
...
Mingshuai Yao
Jinsong Lan
Ying Chen
Qingwen Liu
Yanfeng Wang
VLM
CLIP
78
4
0
30 Nov 2024
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs
Sheng-Chieh Lin
Chankyu Lee
M. Shoeybi
Jimmy J. Lin
Bryan Catanzaro
Ming-Yu Liu
70
12
0
04 Nov 2024
Test-time Adaptation for Cross-modal Retrieval with Query Shift
Test-time Adaptation for Cross-modal Retrieval with Query Shift
Haobin Li
Peng Hu
Qianjun Zhang
Xi Peng
Xiting Liu
Mouxing Yang
TTA
33
0
0
21 Oct 2024
VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents
S. Yu
C. Tang
Bokai Xu
Junbo Cui
Junhao Ran
...
Zhenghao Liu
Shuo Wang
Xu Han
Zhiyuan Liu
Maosong Sun
VLM
39
23
0
14 Oct 2024
EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections
EUFCC-CIR: a Composed Image Retrieval Dataset for GLAM Collections
Francesc Net
Lluís Gómez
31
0
0
02 Oct 2024
Efficient and Discriminative Image Feature Extraction for Universal
  Image Retrieval
Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval
Morris Florek
David Tschirschwitz
Björn Barz
Volker Rodehorst
VLM
33
0
0
20 Sep 2024
AnyDesign: Versatile Area Fashion Editing via Mask-Free Diffusion
AnyDesign: Versatile Area Fashion Editing via Mask-Free Diffusion
Yunfang Niu
Lingxiang Wu
Dong Yi
Jie Peng
Ning Jiang
Haiying Wu
Jinqiao Wang
DiffM
35
1
0
21 Aug 2024
UniFashion: A Unified Vision-Language Model for Multimodal Fashion
  Retrieval and Generation
UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation
Xiangyu Zhao
Yuehan Zhang
Wenlong Zhang
X. Wu
41
4
0
21 Aug 2024
DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles
  Based on Open-Vocabulary Instructions
DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions
Ryosuke Korekata
Kanta Kaneda
Shunya Nagashima
Yuto Imai
Komei Sugiura
ObjD
LM&Ro
53
2
0
15 Aug 2024
EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video
  Retrieval
EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval
Thomas Hummel
Shyamgopal Karthik
Mariana-Iuliana Georgescu
Zeynep Akata
EgoV
34
4
0
23 Jul 2024
Assessing Brittleness of Image-Text Retrieval Benchmarks from
  Vision-Language Models Perspective
Assessing Brittleness of Image-Text Retrieval Benchmarks from Vision-Language Models Perspective
Mariya Hendriksen
Shuo Zhang
R. Reinanda
Mohamed Yahya
Edgar Meij
Maarten de Rijke
54
0
0
21 Jul 2024
Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller
  Embedding Dimensions
Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions
Jinsung Yoon
Raj Sinha
Sercan Ö. Arik
Tomas Pfister
24
1
0
17 Jul 2024
Zero-shot Composed Image Retrieval Considering Query-target Relationship
  Leveraging Masked Image-text Pairs
Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs
Huaying Zhang
Rintaro Yanagi
Ren Togo
Takahiro Ogawa
Miki Haseyama
32
5
0
27 Jun 2024
Reminding Multimodal Large Language Models of Object-aware Knowledge
  with Retrieved Tags
Reminding Multimodal Large Language Models of Object-aware Knowledge with Retrieved Tags
Daiqing Qi
Handong Zhao
Zijun Wei
Sheng Li
46
2
0
16 Jun 2024
Gentle-CLIP: Exploring Aligned Semantic In Low-Quality Multimodal Data
  With Soft Alignment
Gentle-CLIP: Exploring Aligned Semantic In Low-Quality Multimodal Data With Soft Alignment
Zijia Song
Z. Zang
Yelin Wang
Guozheng Yang
Jiangbin Zheng
Kaicheng Yu
Wanyu Chen
Stan Z. Li
36
0
0
09 Jun 2024
CaLa: Complementary Association Learning for Augmenting Composed Image
  Retrieval
CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval
Xintong Jiang
Yaxiong Wang
Mengjian Li
Yujiao Wu
Bingwen Hu
Xueming Qian
CoGe
40
4
0
29 May 2024
Composed Image Retrieval for Remote Sensing
Composed Image Retrieval for Remote Sensing
Bill Psomas
Ioannis Kakogeorgiou
Nikos Efthymiadis
Giorgos Tolias
Ondřej Chum
Yannis Avrithis
Konstantinos Karantzalos
48
4
0
24 May 2024
iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image
  Retrieval
iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval
Lorenzo Agnolucci
Alberto Baldrati
Marco Bertini
A. Bimbo
38
10
0
05 May 2024
Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed
  Image Retrieval
Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval
Young Kyun Jang
Dat Huynh
Ashish Shah
Wen-Kai Chen
Ser-Nam Lim
45
15
0
01 May 2024
Enhancing Interactive Image Retrieval With Query Rewriting Using Large
  Language Models and Vision Language Models
Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models
Hongyi Zhu
Jia-Hong Huang
S. Rudinac
Evangelos Kanoulas
44
7
0
29 Apr 2024
Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval
Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval
Ryoya Nara
Yu-Chieh Lin
Yuji Nozawa
Youyang Ng
Goh Itoh
Osamu Torii
Yusuke Matsui
HAI
29
2
0
25 Apr 2024
Leveraging Large Language Models for Multimodal Search
Leveraging Large Language Models for Multimodal Search
Oriol Barbany
Michael Huang
Xinliang Zhu
Arnab Dhua
31
9
0
24 Apr 2024
Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing
Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing
Alberto Baldrati
Davide Morelli
Marcella Cornia
Marco Bertini
Rita Cucchiara
DiffM
62
8
0
21 Mar 2024
Enhancing Conceptual Understanding in Multimodal Contrastive Learning
  through Hard Negative Samples
Enhancing Conceptual Understanding in Multimodal Contrastive Learning through Hard Negative Samples
Philipp J. Rösch
Norbert Oswald
Michaela Geierhos
Jindrich Libovický
42
3
0
05 Mar 2024
Interactive Garment Recommendation with User in the Loop
Interactive Garment Recommendation with User in the Loop
Federico Becattini
Xiaolin Chen
Andrea Puccia
Haokun Wen
Xuemeng Song
Liqiang Nie
A. Bimbo
30
0
0
18 Feb 2024
Instilling Multi-round Thinking to Text-guided Image Generation
Instilling Multi-round Thinking to Text-guided Image Generation
Lidong Zeng
Zhedong Zheng
Yinwei Wei
Tat-Seng Chua
34
5
0
16 Jan 2024
Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual
  Concept Understanding
Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding
Yatong Bai
Utsav Garg
Apaar Shanker
Haoming Zhang
Samyak Parajuli
...
Eugenia D Fomitcheva
E. Branson
Aerin Kim
Somayeh Sojoudi
Kyunghyun Cho
21
2
0
09 Jan 2024
Learning-To-Rank Approach for Identifying Everyday Objects Using a
  Physical-World Search Engine
Learning-To-Rank Approach for Identifying Everyday Objects Using a Physical-World Search Engine
Kanta Kaneda
Shunya Nagashima
Ryosuke Korekata
Motonari Kambara
Komei Sugiura
43
6
0
26 Dec 2023
Dynamic Weighted Combiner for Mixed-Modal Image Retrieval
Dynamic Weighted Combiner for Mixed-Modal Image Retrieval
Fuxiang Huang
Lei Zhang
Xiaowei Fu
Suqi Song
33
9
0
11 Dec 2023
FreestyleRet: Retrieving Images from Style-Diversified Queries
FreestyleRet: Retrieving Images from Style-Diversified Queries
Hao Li
Curise Jia
Peng Jin
Ze-Long Cheng
Kehan Li
Jialu Sui
Chang Liu
Li-ming Yuan
3DH
28
5
0
05 Dec 2023
UniIR: Training and Benchmarking Universal Multimodal Information
  Retrievers
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers
Cong Wei
Yang Chen
Haonan Chen
Hexiang Hu
Ge Zhang
Jie Fu
Alan Ritter
Wenhu Chen
47
51
0
28 Nov 2023
Benchmarking Robustness of Text-Image Composed Retrieval
Benchmarking Robustness of Text-Image Composed Retrieval
Shitong Sun
Jindong Gu
Shaogang Gong
CoGe
44
1
0
24 Nov 2023
Vision-by-Language for Training-Free Compositional Image Retrieval
Vision-by-Language for Training-Free Compositional Image Retrieval
Shyamgopal Karthik
Karsten Roth
Massimiliano Mancini
Zeynep Akata
CoGe
28
52
0
13 Oct 2023
Search-Adaptor: Embedding Customization for Information Retrieval
Search-Adaptor: Embedding Customization for Information Retrieval
Jinsung Yoon
Sercan Ö. Arik
Yanfei Chen
Tomas Pfister
25
2
0
12 Oct 2023
OpenFashionCLIP: Vision-and-Language Contrastive Learning with
  Open-Source Fashion Data
OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data
Giuseppe Cartella
Alberto Baldrati
Davide Morelli
Marcella Cornia
Marco Bertini
Rita Cucchiara
VLM
CLIP
29
7
0
11 Sep 2023
Composed Image Retrieval using Contrastive Learning and Task-oriented
  CLIP-based Features
Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features
Alberto Baldrati
Marco Bertini
Tiberio Uricchio
A. Bimbo
CLIP
CoGe
13
29
0
22 Aug 2023
123
Next