ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.09144
  4. Cited By
Transformer Reasoning Network for Image-Text Matching and Retrieval

Transformer Reasoning Network for Image-Text Matching and Retrieval

20 April 2020
Nicola Messina
Fabrizio Falchi
Andrea Esuli
Giuseppe Amato
    ViT
ArXivPDFHTML

Papers citing "Transformer Reasoning Network for Image-Text Matching and Retrieval"

8 / 8 papers shown
Title
A Spatio-Temporal Attentive Network for Video-Based Crowd Counting
A Spatio-Temporal Attentive Network for Video-Based Crowd Counting
M. Avvenuti
Marco Bongiovanni
Luca Ciampi
Fabrizio Falchi
Claudio Gennaro
Nicola Messina
25
9
0
24 Aug 2022
ALADIN: Distilling Fine-grained Alignment Scores for Efficient
  Image-Text Matching and Retrieval
ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval
Nicola Messina
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
Fabrizio Falchi
Giuseppe Amato
Rita Cucchiara
VLM
16
21
0
29 Jul 2022
A review of machine learning approaches, challenges and prospects for
  computational tumor pathology
A review of machine learning approaches, challenges and prospects for computational tumor pathology
Liangrui Pan
Zhichao Feng
Shaoliang Peng
AI4CE
21
7
0
31 May 2022
Text2Pos: Text-to-Point-Cloud Cross-Modal Localization
Text2Pos: Text-to-Point-Cloud Cross-Modal Localization
Manuel Kolmet
Qunjie Zhou
Aljosa Osep
Laura Leal-Taixe
21
22
0
28 Mar 2022
Do Lessons from Metric Learning Generalize to Image-Caption Retrieval?
Do Lessons from Metric Learning Generalize to Image-Caption Retrieval?
Maurits J. R. Bleeker
Maarten de Rijke
SSL
DML
29
9
0
14 Feb 2022
Recurrent Vision Transformer for Solving Visual Reasoning Problems
Recurrent Vision Transformer for Solving Visual Reasoning Problems
Nicola Messina
Giuseppe Amato
F. Carrara
Claudio Gennaro
Fabrizio Falchi
ViT
LRM
22
11
0
29 Nov 2021
Combining EfficientNet and Vision Transformers for Video Deepfake
  Detection
Combining EfficientNet and Vision Transformers for Video Deepfake Detection
D. Coccomini
Nicola Messina
Claudio Gennaro
Fabrizio Falchi
ViT
32
169
0
06 Jul 2021
Cross-Modal Retrieval Augmentation for Multi-Modal Classification
Cross-Modal Retrieval Augmentation for Multi-Modal Classification
Shir Gur
Natalia Neverova
C. Stauffer
Ser-Nam Lim
Douwe Kiela
A. Reiter
14
26
0
16 Apr 2021
1