Probabilistic Compositional Embeddings for Multimodal Image Retrieval

12 April 2022

ArXiv (abs)PDF HTML Github (24★)

Papers citing "Probabilistic Compositional Embeddings for Multimodal Image Retrieval"

46 / 46 papers shown

Title
Uni-Retrieval: A Multi-Style Retrieval Framework for STEM's Education Yanhao Jia Xinyi Wu Hao Li Qinglin Zhang Yuxiao Hu Shuai Zhao Wenqi Fan 165 5 0 09 Feb 2025
Probabilistic Language-Image Pre-Training Sanghyuk Chun Wonjae Kim Song Park Sangdoo Yun MLLM VLM CLIP 474 6 2 24 Oct 2024
Attention Bottlenecks for Multimodal Fusion Arsha Nagrani Shan Yang Anurag Arnab A. Jansen Cordelia Schmid Chen Sun 108 573 0 30 Jun 2021
Distilling Audio-Visual Knowledge by Compositional Contrastive Learning Yanbei Chen Yongqin Xian A. Sophia Koepke Ying Shan Zeynep Akata 141 83 0 22 Apr 2021
Multi-Modal Fusion Transformer for End-to-End Autonomous Driving Aditya Prakash Kashyap Chitta Andreas Geiger ViT 110 533 0 19 Apr 2021
Perceiver: General Perception with Iterative Attention Andrew Jaegle Felix Gimeno Andrew Brock Andrew Zisserman Oriol Vinyals João Carreira VLM ViT MDE 212 1,026 0 04 Mar 2021
Learning Graph Embeddings for Compositional Zero-shot Learning Muhammad Ferjad Naeem Yongqin Xian Federico Tombari Zeynep Akata CoGe 60 140 0 03 Feb 2021
Open World Compositional Zero-Shot Learning Massimiliano Mancini Muhammad Ferjad Naeem Yongqin Xian Zeynep Akata CoGe 156 130 0 29 Jan 2021
Probabilistic Embeddings for Cross-Modal Retrieval Sanghyuk Chun Seong Joon Oh Rafael Sampaio de Rezende Yannis Kalantidis Diane Larlus UQCV 489 210 0 13 Jan 2021
GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields Michael Niemeyer Andreas Geiger OCL 166 963 0 24 Nov 2020
Visual Compositional Learning for Human-Object Interaction Detection Zhi Hou Xiaojiang Peng Yu Qiao Dacheng Tao VLM 96 184 0 24 Jul 2020
Multi-modal Transformer for Video Retrieval Valentin Gabeur Chen Sun Alahari Karteek Cordelia Schmid ViT 545 610 0 21 Jul 2020
RetrieveGAN: Image Synthesis via Differentiable Patch Retrieval Hung-Yu Tseng Hsin-Ying Lee Lu Jiang Ming-Hsuan Yang Weilong Yang DiffM 3DV 154 54 0 16 Jul 2020
A Metric Learning Reality Check Kevin Musgrave Serge J. Belongie Ser-Nam Lim 170 479 0 18 Mar 2020
A Simple Framework for Contrastive Learning of Visual Representations Ting-Li Chen Simon Kornblith Mohammad Norouzi Geoffrey E. Hinton SSL 402 18,913 0 13 Feb 2020
Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks Joanna Materzynska Tete Xiao Roei Herzig Huijuan Xu Xiaolong Wang Trevor Darrell CoGe 64 176 0 20 Dec 2019
Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation Arna Ghosh Richard Y. Zhang P. Dokania Oliver Wang Alexei A. Efros Philip Torr Eli Shechtman VLM DiffM 107 131 0 24 Sep 2019
Use What You Have: Video Retrieval Using Representations From Collaborative Experts Yang Liu Samuel Albanie Arsha Nagrani Andrew Zisserman 89 389 0 31 Jul 2019
Multimodal End-to-End Autonomous Driving Yi Xiao Felipe Codevilla A. Gurram O. Urfalioglu Antonio M. López 83 244 0 07 Jun 2019
Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback Hui Wu Yupeng Gao Xiaoxiao Guo Ziad Al-Halah Steven J. Rennie Kristen Grauman Rogerio Feris EgoV 139 67 0 30 May 2019
What Makes Training Multi-Modal Classification Networks Hard? Weiyao Wang Du Tran Matt Feiszli 158 453 0 29 May 2019
Task-Driven Modular Networks for Zero-Shot Compositional Learning Senthil Purushwalkam Maximilian Nickel Abhinav Gupta MarcÁurelio Ranzato 68 175 0 15 May 2019
VideoBERT: A Joint Model for Video and Language Representation Learning Chen Sun Austin Myers Carl Vondrick Kevin Patrick Murphy Cordelia Schmid VLM SSL 90 1,250 0 03 Apr 2019
Thinking Outside the Pool: Active Training Image Creation for Relative Attributes Aron Yu Kristen Grauman 46 23 0 08 Jan 2019
Learning Compositional Representations for Few-Shot Recognition P. Tokmakov Yu-Xiong Wang M. Hebert OCL 65 126 0 21 Dec 2018
Composing Text and Image for Image Retrieval - An Empirical Odyssey Nam S. Vo Lu Jiang Chen Sun Kevin Patrick Murphy Li Li Li Fei-Fei James Hays CoGe 68 368 0 18 Dec 2018
Modeling Uncertainty with Hedged Instance Embedding Seong Joon Oh Kevin Patrick Murphy Jiyan Pan Joseph Roth Florian Schroff Andrew C. Gallagher UQCV 500 70 0 30 Sep 2018
A Zero-Shot Framework for Sketch-based Image Retrieval Sasi Kiran Yelamarthi M. K. Reddy Ashish Mishra Anurag Mittal 147 187 0 31 Jul 2018
Dialog-based Interactive Image Retrieval Xiaoxiao Guo Hui Wu Yu Cheng Steven J. Rennie Gerald Tesauro Rogerio Feris 135 207 0 01 May 2018
ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing Chen-Hsuan Lin Ersin Yumer Oliver Wang Eli Shechtman Simon Lucey GAN 84 222 0 05 Mar 2018
Building machines that adapt and compute like brains Brenden M. Lake J. Tenenbaum AI4CE FedML NAI AILaw 327 887 0 11 Nov 2017
FiLM: Visual Reasoning with a General Conditioning Layer Ethan Perez Florian Strub H. D. Vries Vincent Dumoulin Aaron Courville FAtt AIMat OffRL AI4CE 375 2,239 0 22 Sep 2017
Improved Regularization of Convolutional Neural Networks with Cutout Terrance Devries Graham W. Taylor 139 3,775 0 15 Aug 2017
Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge Damien Teney Peter Anderson Xiaodong He Anton Van Den Hengel 115 383 0 09 Aug 2017
Automatic Spatially-aware Fashion Concept Discovery Xintong Han Zuxuan Wu Phoenix X. Huang Xiao Zhang Menglong Zhu Yuan Li Yang Zhao L. Davis 83 272 0 03 Aug 2017
A simple neural network module for relational reasoning Adam Santoro David Raposo David Barrett Mateusz Malinowski Razvan Pascanu Peter W. Battaglia Timothy Lillicrap GNN NAI 189 1,615 0 05 Jun 2017
A Structured Self-attentive Sentence Embedding Zhouhan Lin Minwei Feng Cicero Nogueira dos Santos Mo Yu Bing Xiang Bowen Zhou Yoshua Bengio 124 2,142 0 09 Mar 2017
Deep Image Harmonization Yi-Hsuan Tsai Xiaohui Shen Zhe Lin Kalyan Sunkavalli Xin Lu Ming-Hsuan Yang 98 267 0 28 Feb 2017
Layer Normalization Jimmy Lei Ba J. Kiros Geoffrey E. Hinton 437 10,548 0 21 Jul 2016
Multimodal Residual Learning for Visual QA Jin-Hwa Kim Sang-Woo Lee Donghyun Kwak Min-Oh Heo Jeonghee Kim Jung-Woo Ha Byoung-Tak Zhang 71 300 0 05 Jun 2016
Deep Image Retrieval: Learning global representations for image search Albert Gordo Jon Almazán Jérôme Revaud Diane Larlus 76 806 0 05 Apr 2016
Deep Residual Learning for Image Recognition Kaiming He Xinming Zhang Shaoqing Ren Jian Sun MedIm 2.3K 194,641 0 10 Dec 2015
VQA: Visual Question Answering Aishwarya Agrawal Jiasen Lu Stanislaw Antol Margaret Mitchell C. L. Zitnick Dhruv Batra Devi Parikh CoGe 243 5,512 0 03 May 2015
Adam: A Method for Stochastic Optimization Diederik P. Kingma Jimmy Ba ODL 2.1K 150,433 0 22 Dec 2014
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches Kyunghyun Cho B. V. Merrienboer Dzmitry Bahdanau Yoshua Bengio AI4CE AIMat 272 6,793 0 03 Sep 2014
Microsoft COCO: Common Objects in Context Nayeon Lee Michael Maire Serge J. Belongie Lubomir Bourdev Ross B. Girshick James Hays Pietro Perona Deva Ramanan C. L. Zitnick Piotr Dollár ObjD 442 43,875 0 01 May 2014