ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2101.01368
  4. Cited By
Similarity Reasoning and Filtration for Image-Text Matching

Similarity Reasoning and Filtration for Image-Text Matching

5 January 2021
Haiwen Diao
Ying Zhang
Lingyun Ma
Huchuan Lu
ArXivPDFHTML

Papers citing "Similarity Reasoning and Filtration for Image-Text Matching"

50 / 104 papers shown
Title
GeoMM: On Geodesic Perspective for Multi-modal Learning
GeoMM: On Geodesic Perspective for Multi-modal Learning
Shibin Mei
Hang Wang
Bingbing Ni
22
0
0
16 May 2025
Aligning Information Capacity Between Vision and Language via Dense-to-Sparse Feature Distillation for Image-Text Matching
Aligning Information Capacity Between Vision and Language via Dense-to-Sparse Feature Distillation for Image-Text Matching
Yang Liu
Wentao Feng
Zhuoyao Liu
Shudong Huang
Jiancheng Lv
DiffM
VLM
53
0
0
19 Mar 2025
ReCon: Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning
ReCon: Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning
Quanxing Zha
Xin Liu
Shu-Juan Peng
Y. Cheung
X. Xu
Nannan Wang
50
0
0
13 Mar 2025
NeighborRetr: Balancing Hub Centrality in Cross-Modal Retrieval
Zengrong Lin
Zheng Wang
Tianwen Qian
Pan Mu
Sixian Chan
Cong Bai
52
0
0
13 Mar 2025
Asymmetric Visual Semantic Embedding Framework for Efficient Vision-Language Alignment
Yang Liu
M. Liu
Shudong Huang
Jiancheng Lv
35
1
0
10 Mar 2025
ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval
ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval
Guanqi Zhan
Yuanpei Liu
Kai Han
Weidi Xie
Andrew Zisserman
VLM
171
0
0
21 Feb 2025
Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition
Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition
Tianyi Shang
Zhenyu Li
Pengjie Xu
Jinwei Qiao
Gang Chen
Zihan Ruan
Weijun Hu
59
0
0
20 Feb 2025
KARST: Multi-Kernel Kronecker Adaptation with Re-Scaling Transmission for Visual Classification
KARST: Multi-Kernel Kronecker Adaptation with Re-Scaling Transmission for Visual Classification
Yue Zhu
Haiwen Diao
Shang Gao
Long Chen
Huchuan Lu
89
0
0
10 Feb 2025
TSVC:Tripartite Learning with Semantic Variation Consistency for Robust Image-Text Retrieval
TSVC:Tripartite Learning with Semantic Variation Consistency for Robust Image-Text Retrieval
Shuai Lyu
Zijing Tian
Zhonghong Ou
Yifan Zhu
Xiao Zhang
Qiankun Ha
Haoran Luo
Meina Song
37
0
0
19 Jan 2025
Rebalanced Vision-Language Retrieval Considering Structure-Aware
  Distillation
Rebalanced Vision-Language Retrieval Considering Structure-Aware Distillation
Yang Yang
Wenjuan Xi
Luping Zhou
Jinhui Tang
77
0
0
14 Dec 2024
GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric
  Learning
GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning
Haiwen Diao
Ying Zhang
Shang Gao
Jiawen Zhu
Long Chen
Huchuan Lu
34
4
0
20 Oct 2024
One-step Noisy Label Mitigation
One-step Noisy Label Mitigation
Hao Li
Jiayang Gu
Jingkuan Song
An Zhang
Lianli Gao
NoLa
31
0
0
02 Oct 2024
ComAlign: Compositional Alignment in Vision-Language Models
ComAlign: Compositional Alignment in Vision-Language Models
Ali Abdollah
Amirmohammad Izadi
Armin Saghafian
Reza Vahidimajd
Mohammad Mozafari
Amirreza Mirzaei
Mohammadmahdi Samiei
M. Baghshah
CoGe
VLM
30
0
0
12 Sep 2024
Riemann-based Multi-scale Attention Reasoning Network for Text-3D
  Retrieval
Riemann-based Multi-scale Attention Reasoning Network for Text-3D Retrieval
Wenrui Li
Wei Han
Yandu Chen
Yeyu Chai
Yidan Lu
Xingtao Wang
Xiaopeng Fan
3DPC
21
1
0
25 Aug 2024
Towards Deconfounded Image-Text Matching with Causal Inference
Towards Deconfounded Image-Text Matching with Causal Inference
Wenhui Li
Xinqi Su
Dan Song
Lanjun Wang
Kun Zhang
An-An Liu
BDL
CML
50
10
0
22 Aug 2024
Disentangled Noisy Correspondence Learning
Disentangled Noisy Correspondence Learning
Zhuohang Dang
Minnan Luo
Jihong Wang
Chengyou Jia
Haochen Han
Herun Wan
Guang Dai
Xiaojun Chang
Jingdong Wang
34
0
0
10 Aug 2024
PC$^2$: Pseudo-Classification Based Pseudo-Captioning for Noisy
  Correspondence Learning in Cross-Modal Retrieval
PC2^22: Pseudo-Classification Based Pseudo-Captioning for Noisy Correspondence Learning in Cross-Modal Retrieval
Yue Duan
Zhangxuan Gu
ZhenZhe Ying
Wei Li
Yu Zhang
Zibin Zheng
26
2
0
02 Aug 2024
Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken
  Generation
Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation
Yongqi Li
Hongru Cai
Wenjie Wang
Leigang Qu
Yinwei Wei
Wenjie Li
Liqiang Nie
Tat-Seng Chua
DiffM
32
1
0
24 Jul 2024
Global-Local Similarity for Efficient Fine-Grained Image Recognition
  with Vision Transformers
Global-Local Similarity for Efficient Fine-Grained Image Recognition with Vision Transformers
Edwin Arkel Rios
Min-Chun Hu
Bo-Cheng Lai
ViT
32
2
0
17 Jul 2024
Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval
Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval
Naoya Sogi
Takashi Shibata
Makoto Terao
VLM
35
1
0
17 Jul 2024
SHERL: Synthesizing High Accuracy and Efficient Memory for
  Resource-Limited Transfer Learning
SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning
Haiwen Diao
Bo Wan
Xu Jia
Yunzhi Zhuge
Ying Zhang
Huchuan Lu
Long Chen
VLM
50
4
0
10 Jul 2024
Unveiling Encoder-Free Vision-Language Models
Unveiling Encoder-Free Vision-Language Models
Haiwen Diao
Yufeng Cui
Xiaotong Li
Yueze Wang
Huchuan Lu
Xinlong Wang
VLM
56
29
0
17 Jun 2024
Composing Object Relations and Attributes for Image-Text Matching
Composing Object Relations and Attributes for Image-Text Matching
Khoi Pham
Chuong Huynh
Ser-Nam Lim
Abhinav Shrivastava
CoGe
44
3
0
17 Jun 2024
Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for
  Image-Text Matching
Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for Image-Text Matching
Xuri Ge
Fuhai Chen
Songpei Xu
Fuxiang Tao
Jie Wang
Joemon M. Jose
34
0
0
05 Jun 2024
Mitigating Noisy Correspondence by Geometrical Structure Consistency
  Learning
Mitigating Noisy Correspondence by Geometrical Structure Consistency Learning
Zihua Zhao
Mengxi Chen
Tianjie Dai
Jiangchao Yao
Bo han
Ya-Qin Zhang
Yanfeng Wang
NoLa
44
3
0
27 May 2024
Active Learning for Finely-Categorized Image-Text Retrieval by Selecting
  Hard Negative Unpaired Samples
Active Learning for Finely-Categorized Image-Text Retrieval by Selecting Hard Negative Unpaired Samples
D. Jo
Kyuewang Lee
Jaeho Chung
Jin Young Choi
18
0
0
25 May 2024
Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text
  Matching
Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching
Haiwen Diao
Ying Zhang
Shang Gao
Xiang Ruan
Huchuan Lu
36
3
0
28 Apr 2024
3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial
  Self-Highlighting
3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial Self-Highlighting
Xuri Ge
Songpei Xu
Fuhai Chen
Jie Wang
Guoxin Wang
Shan An
Joemon M. Jose
3DPC
39
11
0
26 Apr 2024
Dynamic Self-adaptive Multiscale Distillation from Pre-trained
  Multimodal Large Model for Efficient Cross-modal Representation Learning
Dynamic Self-adaptive Multiscale Distillation from Pre-trained Multimodal Large Model for Efficient Cross-modal Representation Learning
Zhengyang Liang
Meiyu Liang
Wei Huang
Yawen Li
Zhe Xue
40
1
0
16 Apr 2024
PointCloud-Text Matching: Benchmark Datasets and a Baseline
PointCloud-Text Matching: Benchmark Datasets and a Baseline
Yanglin Feng
Yang Qin
Dezhong Peng
Hongyuan Zhu
Xi Peng
Peng Hu
50
1
0
28 Mar 2024
REPAIR: Rank Correlation and Noisy Pair Half-replacing with Memory for
  Noisy Correspondence
REPAIR: Rank Correlation and Noisy Pair Half-replacing with Memory for Noisy Correspondence
Ruochen Zheng
Jiahao Hong
Changxin Gao
Nong Sang
39
1
0
13 Mar 2024
Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval
Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval
Hailang Huang
Zhijie Nie
Ziqiao Wang
Ziyu Shang
35
10
0
08 Mar 2024
Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval
Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval
Haocheng Han
Qinghua Zheng
Guangwen Dai
Minnan Luo
Jingdong Wang
29
5
0
08 Mar 2024
Generative Cross-Modal Retrieval: Memorizing Images in Multimodal
  Language Models for Retrieval and Beyond
Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond
Yongqi Li
Wenjie Wang
Leigang Qu
Liqiang Nie
Wenjie Li
Tat-Seng Chua
21
17
0
16 Feb 2024
CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short
  Video Search Scenarios
CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios
Xiangshuo Qiao
Xianxin Li
Xiaozhe Qu
Jie M. Zhang
Yang Liu
Yu Luo
Cihang Jin
Jin Ma
VLM
33
0
0
19 Jan 2024
Enhancing medical vision-language contrastive learning via inter-matching relation modelling
Enhancing medical vision-language contrastive learning via inter-matching relation modelling
Mingjian Li
Mingyuan Meng
M. Fulham
David Dagan Feng
Lei Bi
Jinman Kim
VLM
42
1
0
19 Jan 2024
Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation
Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation
Zhuohang Dang
Minnan Luo
Chengyou Jia
Guangwen Dai
Xiao Chang
Jingdong Wang
24
6
0
27 Dec 2023
MAFA: Managing False Negatives for Vision-Language Pre-training
MAFA: Managing False Negatives for Vision-Language Pre-training
Jaeseok Byun
Dohoon Kim
Taesup Moon
VLM
13
4
0
11 Dec 2023
Negative Pre-aware for Noisy Cross-modal Matching
Negative Pre-aware for Noisy Cross-modal Matching
Xu-Yao Zhang
Hao Li
Mang Ye
30
7
0
10 Dec 2023
A New Fine-grained Alignment Method for Image-text Matching
A New Fine-grained Alignment Method for Image-text Matching
Yang Zhang
19
1
0
03 Nov 2023
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient
  image-text retrieval
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval
Youbo Lei
Feifei He
Chen Chen
Yingbin Mo
Sijia Li
Defeng Xie
H. Lu
VLM
57
0
0
30 Oct 2023
Cross-modal Active Complementary Learning with Self-refining
  Correspondence
Cross-modal Active Complementary Learning with Self-refining Correspondence
Yang Qin
Yuan Sun
Dezhong Peng
Qiufeng Wang
Xiaocui Peng
Peng Hu
26
18
0
26 Oct 2023
Prototype-based Aleatoric Uncertainty Quantification for Cross-modal
  Retrieval
Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval
Hao Li
Marie-Jeanne Lesot
Lianli Gao
Xiaosu Zhu
Christophe Marsala
EDL
16
11
0
29 Sep 2023
A Survey on Image-text Multimodal Models
A Survey on Image-text Multimodal Models
Ruifeng Guo
Jingxuan Wei
Linzhuang Sun
Khai Le-Duc
Guiyong Chang
Dawei Liu
Sibo Zhang
Zhengbing Yao
Mingjun Xu
Liping Bu
VLM
31
5
0
23 Sep 2023
Dynamic Visual Semantic Sub-Embeddings and Fast Re-Ranking
Dynamic Visual Semantic Sub-Embeddings and Fast Re-Ranking
Wenzhang Wei
Zhipeng Gui
Changguang Wu
Anqi Zhao
D. Peng
Huayi Wu
18
0
0
15 Sep 2023
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient
  Parameter and Memory
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory
Haiwen Diao
Bo Wan
Yuhang Zhang
Xuecong Jia
Huchuan Lu
Long Chen
VLM
33
18
0
28 Aug 2023
Noisy-Correspondence Learning for Text-to-Image Person Re-identification
Noisy-Correspondence Learning for Text-to-Image Person Re-identification
Yang Qin
Ying Chen
Dezhong Peng
Xiaocui Peng
Qiufeng Wang
Peng Hu
26
34
0
19 Aug 2023
Grounded Image Text Matching with Mismatched Relation Reasoning
Grounded Image Text Matching with Mismatched Relation Reasoning
Yu Wu
Yan-Tao Wei
Haozhe Jasper Wang
Yongfei Liu
Sibei Yang
Xuming He
34
6
0
02 Aug 2023
Hierarchical Matching and Reasoning for Multi-Query Image Retrieval
Hierarchical Matching and Reasoning for Multi-Query Image Retrieval
Zhong Ji
Zhihao Li
Yan Zhang
Haoran Wang
Yanwei Pang
Xuelong Li
24
11
0
26 Jun 2023
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal
  Contrastive Training
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training
Chong Liu
Yuqi Zhang
Hongsong Wang
Weihua Chen
F. Wang
Yan Huang
Yixing Shen
Liang Wang
19
25
0
15 Jun 2023
123
Next