Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2204.05845
Cited By
Probabilistic Compositional Embeddings for Multimodal Image Retrieval
12 April 2022
Andrei Neculai
Yanbei Chen
Zeynep Akata
CoGe
Re-assign community
ArXiv (abs)
PDF
HTML
Github (24★)
Papers citing
"Probabilistic Compositional Embeddings for Multimodal Image Retrieval"
46 / 46 papers shown
Title
Uni-Retrieval: A Multi-Style Retrieval Framework for STEM's Education
Yanhao Jia
Xinyi Wu
Hao Li
Qinglin Zhang
Yuxiao Hu
Shuai Zhao
Wenqi Fan
165
5
0
09 Feb 2025
Probabilistic Language-Image Pre-Training
Sanghyuk Chun
Wonjae Kim
Song Park
Sangdoo Yun
MLLM
VLM
CLIP
474
6
2
24 Oct 2024
Attention Bottlenecks for Multimodal Fusion
Arsha Nagrani
Shan Yang
Anurag Arnab
A. Jansen
Cordelia Schmid
Chen Sun
108
573
0
30 Jun 2021
Distilling Audio-Visual Knowledge by Compositional Contrastive Learning
Yanbei Chen
Yongqin Xian
A. Sophia Koepke
Ying Shan
Zeynep Akata
141
83
0
22 Apr 2021
Multi-Modal Fusion Transformer for End-to-End Autonomous Driving
Aditya Prakash
Kashyap Chitta
Andreas Geiger
ViT
110
533
0
19 Apr 2021
Perceiver: General Perception with Iterative Attention
Andrew Jaegle
Felix Gimeno
Andrew Brock
Andrew Zisserman
Oriol Vinyals
João Carreira
VLM
ViT
MDE
212
1,026
0
04 Mar 2021
Learning Graph Embeddings for Compositional Zero-shot Learning
Muhammad Ferjad Naeem
Yongqin Xian
Federico Tombari
Zeynep Akata
CoGe
60
140
0
03 Feb 2021
Open World Compositional Zero-Shot Learning
Massimiliano Mancini
Muhammad Ferjad Naeem
Yongqin Xian
Zeynep Akata
CoGe
156
130
0
29 Jan 2021
Probabilistic Embeddings for Cross-Modal Retrieval
Sanghyuk Chun
Seong Joon Oh
Rafael Sampaio de Rezende
Yannis Kalantidis
Diane Larlus
UQCV
489
210
0
13 Jan 2021
GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields
Michael Niemeyer
Andreas Geiger
OCL
166
963
0
24 Nov 2020
Visual Compositional Learning for Human-Object Interaction Detection
Zhi Hou
Xiaojiang Peng
Yu Qiao
Dacheng Tao
VLM
96
184
0
24 Jul 2020
Multi-modal Transformer for Video Retrieval
Valentin Gabeur
Chen Sun
Alahari Karteek
Cordelia Schmid
ViT
545
610
0
21 Jul 2020
RetrieveGAN: Image Synthesis via Differentiable Patch Retrieval
Hung-Yu Tseng
Hsin-Ying Lee
Lu Jiang
Ming-Hsuan Yang
Weilong Yang
DiffM
3DV
154
54
0
16 Jul 2020
A Metric Learning Reality Check
Kevin Musgrave
Serge J. Belongie
Ser-Nam Lim
170
479
0
18 Mar 2020
A Simple Framework for Contrastive Learning of Visual Representations
Ting-Li Chen
Simon Kornblith
Mohammad Norouzi
Geoffrey E. Hinton
SSL
402
18,913
0
13 Feb 2020
Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks
Joanna Materzynska
Tete Xiao
Roei Herzig
Huijuan Xu
Xiaolong Wang
Trevor Darrell
CoGe
64
176
0
20 Dec 2019
Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation
Arna Ghosh
Richard Y. Zhang
P. Dokania
Oliver Wang
Alexei A. Efros
Philip Torr
Eli Shechtman
VLM
DiffM
107
131
0
24 Sep 2019
Use What You Have: Video Retrieval Using Representations From Collaborative Experts
Yang Liu
Samuel Albanie
Arsha Nagrani
Andrew Zisserman
89
389
0
31 Jul 2019
Multimodal End-to-End Autonomous Driving
Yi Xiao
Felipe Codevilla
A. Gurram
O. Urfalioglu
Antonio M. López
83
244
0
07 Jun 2019
Fashion IQ: A New Dataset Towards Retrieving Images by Natural Language Feedback
Hui Wu
Yupeng Gao
Xiaoxiao Guo
Ziad Al-Halah
Steven J. Rennie
Kristen Grauman
Rogerio Feris
EgoV
139
67
0
30 May 2019
What Makes Training Multi-Modal Classification Networks Hard?
Weiyao Wang
Du Tran
Matt Feiszli
158
453
0
29 May 2019
Task-Driven Modular Networks for Zero-Shot Compositional Learning
Senthil Purushwalkam
Maximilian Nickel
Abhinav Gupta
MarcÁurelio Ranzato
68
175
0
15 May 2019
VideoBERT: A Joint Model for Video and Language Representation Learning
Chen Sun
Austin Myers
Carl Vondrick
Kevin Patrick Murphy
Cordelia Schmid
VLM
SSL
90
1,250
0
03 Apr 2019
Thinking Outside the Pool: Active Training Image Creation for Relative Attributes
Aron Yu
Kristen Grauman
46
23
0
08 Jan 2019
Learning Compositional Representations for Few-Shot Recognition
P. Tokmakov
Yu-Xiong Wang
M. Hebert
OCL
65
126
0
21 Dec 2018
Composing Text and Image for Image Retrieval - An Empirical Odyssey
Nam S. Vo
Lu Jiang
Chen Sun
Kevin Patrick Murphy
Li Li
Li Fei-Fei
James Hays
CoGe
68
368
0
18 Dec 2018
Modeling Uncertainty with Hedged Instance Embedding
Seong Joon Oh
Kevin Patrick Murphy
Jiyan Pan
Joseph Roth
Florian Schroff
Andrew C. Gallagher
UQCV
500
70
0
30 Sep 2018
A Zero-Shot Framework for Sketch-based Image Retrieval
Sasi Kiran Yelamarthi
M. K. Reddy
Ashish Mishra
Anurag Mittal
147
187
0
31 Jul 2018
Dialog-based Interactive Image Retrieval
Xiaoxiao Guo
Hui Wu
Yu Cheng
Steven J. Rennie
Gerald Tesauro
Rogerio Feris
135
207
0
01 May 2018
ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing
Chen-Hsuan Lin
Ersin Yumer
Oliver Wang
Eli Shechtman
Simon Lucey
GAN
84
222
0
05 Mar 2018
Building machines that adapt and compute like brains
Brenden M. Lake
J. Tenenbaum
AI4CE
FedML
NAI
AILaw
327
887
0
11 Nov 2017
FiLM: Visual Reasoning with a General Conditioning Layer
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
375
2,239
0
22 Sep 2017
Improved Regularization of Convolutional Neural Networks with Cutout
Terrance Devries
Graham W. Taylor
139
3,775
0
15 Aug 2017
Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge
Damien Teney
Peter Anderson
Xiaodong He
Anton Van Den Hengel
115
383
0
09 Aug 2017
Automatic Spatially-aware Fashion Concept Discovery
Xintong Han
Zuxuan Wu
Phoenix X. Huang
Xiao Zhang
Menglong Zhu
Yuan Li
Yang Zhao
L. Davis
83
272
0
03 Aug 2017
A simple neural network module for relational reasoning
Adam Santoro
David Raposo
David Barrett
Mateusz Malinowski
Razvan Pascanu
Peter W. Battaglia
Timothy Lillicrap
GNN
NAI
189
1,615
0
05 Jun 2017
A Structured Self-attentive Sentence Embedding
Zhouhan Lin
Minwei Feng
Cicero Nogueira dos Santos
Mo Yu
Bing Xiang
Bowen Zhou
Yoshua Bengio
124
2,142
0
09 Mar 2017
Deep Image Harmonization
Yi-Hsuan Tsai
Xiaohui Shen
Zhe Lin
Kalyan Sunkavalli
Xin Lu
Ming-Hsuan Yang
98
267
0
28 Feb 2017
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
437
10,548
0
21 Jul 2016
Multimodal Residual Learning for Visual QA
Jin-Hwa Kim
Sang-Woo Lee
Donghyun Kwak
Min-Oh Heo
Jeonghee Kim
Jung-Woo Ha
Byoung-Tak Zhang
71
300
0
05 Jun 2016
Deep Image Retrieval: Learning global representations for image search
Albert Gordo
Jon Almazán
Jérôme Revaud
Diane Larlus
76
806
0
05 Apr 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.3K
194,641
0
10 Dec 2015
VQA: Visual Question Answering
Aishwarya Agrawal
Jiasen Lu
Stanislaw Antol
Margaret Mitchell
C. L. Zitnick
Dhruv Batra
Devi Parikh
CoGe
243
5,512
0
03 May 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.1K
150,433
0
22 Dec 2014
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches
Kyunghyun Cho
B. V. Merrienboer
Dzmitry Bahdanau
Yoshua Bengio
AI4CE
AIMat
272
6,793
0
03 Sep 2014
Microsoft COCO: Common Objects in Context
Nayeon Lee
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
442
43,875
0
01 May 2014
1