Learning Aligned Cross-Modal Representations from Weakly Aligned Data

25 July 2016

Carl Vondrick

Antonio Torralba

Papers citing "Learning Aligned Cross-Modal Representations from Weakly Aligned Data"

31 / 31 papers shown

Title
Domain Adaptation for Large-Vocabulary Object Detectors Kai Jiang Jiaxing Huang Weiying Xie Jie Lei Yunsong Li Ling Shao Shijian Lu ObjD VLM 40 2 0 13 Jan 2024
Information Theory-Guided Heuristic Progressive Multi-View Coding Jiangmeng Li Hang Gao Wenwen Qiang Changwen Zheng 22 2 0 21 Aug 2023
Lossless Adaptation of Pretrained Vision Models For Robotic Manipulation Mohit Sharma Claudio Fantacci Yuxiang Zhou Skanda Koppula N. Heess Jonathan Scholz Y. Aytar VLM 50 29 0 13 Apr 2023
Unifying Tracking and Image-Video Object Detection Peirong Liu Rui Wang Pengchuan Zhang Omid Poursaeed Yipin Zhou Xuefei Cao Sreya . Dutta Roy Ashish Shah Ser-Nam Lim 26 0 0 20 Nov 2022
Cross-Modal Alignment Learning of Vision-Language Conceptual Systems Taehyeong Kim H. Song Byoung-Tak Zhang 32 4 0 31 Jul 2022
OmniMAE: Single Model Masked Pretraining on Images and Videos Rohit Girdhar Alaaeldin El-Nouby Mannat Singh Kalyan Vasudev Alwala Armand Joulin Ishan Misra ViT 37 97 0 16 Jun 2022
Weakly-Supervised Action Detection Guided by Audio Narration Keren Ye Adriana Kovashka 38 0 0 12 May 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders Roman Bachmann David Mizrahi Andrei Atanov Amir Zamir 47 265 0 04 Apr 2022
Context Autoencoder for Self-Supervised Representation Learning Xiaokang Chen Mingyu Ding Xiaodi Wang Ying Xin Shentong Mo Yunhao Wang Shumin Han Ping Luo Gang Zeng Jingdong Wang SSL 45 386 0 07 Feb 2022
Sound and Visual Representation Learning with Multiple Pretraining Tasks A. Vasudevan Dengxin Dai Luc Van Gool SSL 33 6 0 04 Jan 2022
Machine Learning in Nuclear Physics A. Boehnlein M. Diefenthaler C. Fanelli M. Hjorth-Jensen T. Horn ... M. Schram A. Scheinker Michael S. Smith Xin-Nian Wang Veronique Ziegler AI4CE 37 41 0 04 Dec 2021
Explainability of deep vision-based autonomous driving systems: Review and challenges Éloi Zablocki H. Ben-younes P. Pérez Matthieu Cord XAI 48 170 0 13 Jan 2021
Deep Visual Domain Adaptation G. Csurka OOD 141 185 0 28 Dec 2020
SketchZooms: Deep multi-view descriptors for matching line drawings Pablo Navarro J. Orlando C. Delrieux Emmanuel Iarussi 3DPC 13 5 0 29 Nov 2019
PRNet: Self-Supervised Learning for Partial-to-Partial Registration Yue Wang Justin Solomon SSL 3DPC 25 379 0 27 Oct 2019
Deep Zero-Shot Learning for Scene Sketch Yao Xie Peng Xu Zhanyu Ma VLM 25 12 0 11 May 2019
Audio-Visual Model Distillation Using Acoustic Images Andrés F. Pérez Valentina Sanguineti Pietro Morerio Vittorio Murino VLM 15 27 0 16 Apr 2019
Scene Graph Reasoning with Prior Visual Relationship for Visual Question Answering Zhuoqian Yang Zengchang Qin Jing Yu Yue Hu GNN 25 16 0 23 Dec 2018
Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images Javier Marín Aritro Biswas Ferda Ofli Nick Hynes Amaia Salvador Y. Aytar Ingmar Weber Antonio Torralba 16 319 0 14 Oct 2018
Cross-Domain Weakly-Supervised Object Detection through Progressive Domain Adaptation Naoto Inoue Ryosuke Furuta T. Yamasaki Kiyoharu Aizawa ObjD 33 524 0 30 Mar 2018
Significance of Softmax-based Features in Comparison to Distance Metric Learning-based Features Shota Horiguchi Daiki Ikami Kiyoharu Aizawa 24 64 0 29 Dec 2017
Label Efficient Learning of Transferable Representations across Domains and Tasks Zelun Luo Yuliang Zou Judy Hoffman Li Fei-Fei 39 275 0 30 Nov 2017
Dual-Path Convolutional Image-Text Embeddings with Instance Loss Zhedong Zheng Liang Zheng Michael Garrett Yi Yang Mingliang Xu Yi-Dong Shen 27 470 0 15 Nov 2017
Cooperative Learning with Visual Attributes Tanmay Batra Devi Parikh 28 29 0 16 May 2017
Recent Advances in Transfer Learning for Cross-Dataset Visual Recognition: A Problem-Oriented Perspective Jing Zhang Wanqing Li P. Ogunbona Dong Xu OOD 27 46 0 11 May 2017
Domain Adaptation for Visual Applications: A Comprehensive Survey G. Csurka OOD 25 503 0 17 Feb 2017
Multi-source Transfer Learning with Convolutional Neural Networks for Lung Pattern Analysis Stergios Christodoulidis M. Anthimopoulos L. Ebner Andreas Christe Stavroula Mougiakakou 10 133 0 08 Dec 2016
Who is Mistaken? Benjamin Eysenbach Carl Vondrick Antonio Torralba 35 15 0 04 Dec 2016
The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation S. Jégou M. Drozdzal David Vazquez Adriana Romero Yoshua Bengio SSeg 43 1,573 0 28 Nov 2016
GuessWhat?! Visual object discovery through multi-modal dialogue H. D. Vries Florian Strub A. Chandar Olivier Pietquin Hugo Larochelle Aaron Courville VLM 32 426 0 23 Nov 2016
A Comprehensive Survey on Cross-modal Retrieval Kun Wang Qiyue Yin Wei Wang Shu Wu Liang Wang 42 294 0 21 Jul 2016