ResearchTrend.AI
Learning Aligned Cross-Modal Representations from Weakly Aligned Data

25 July 2016
Lluis Castrejon
Y. Aytar
Carl Vondrick
Hamed Pirsiavash
Antonio Torralba
    SSL
    DRL
    AI4TS

Papers citing "Learning Aligned Cross-Modal Representations from Weakly Aligned Data"

31 / 31 papers shown
Domain Adaptation for Large-Vocabulary Object Detectors
Kai Jiang
Jiaxing Huang
Weiying Xie
Jie Lei
Yunsong Li
Ling Shao
Shijian Lu
ObjD
VLM
40
2
0
13 Jan 2024
Information Theory-Guided Heuristic Progressive Multi-View Coding
Jiangmeng Li
Hang Gao
Wenwen Qiang
Changwen Zheng
22
2
0
21 Aug 2023
Lossless Adaptation of Pretrained Vision Models For Robotic Manipulation
Mohit Sharma
Claudio Fantacci
Yuxiang Zhou
Skanda Koppula
N. Heess
Jonathan Scholz
Y. Aytar
VLM
50
29
0
13 Apr 2023
Unifying Tracking and Image-Video Object Detection
Peirong Liu
Rui Wang
Pengchuan Zhang
Omid Poursaeed
Yipin Zhou
Xuefei Cao
Sreya Dutta Roy
Ashish Shah
Ser-Nam Lim
26
0
0
20 Nov 2022
Cross-Modal Alignment Learning of Vision-Language Conceptual Systems
Taehyeong Kim
H. Song
Byoung-Tak Zhang
32
4
0
31 Jul 2022
OmniMAE: Single Model Masked Pretraining on Images and Videos
Rohit Girdhar
Alaaeldin El-Nouby
Mannat Singh
Kalyan Vasudev Alwala
Armand Joulin
Ishan Misra
ViT
37
97
0
16 Jun 2022
Weakly-Supervised Action Detection Guided by Audio Narration
Keren Ye
Adriana Kovashka
38
0
0
12 May 2022
MultiMAE: Multi-modal Multi-task Masked Autoencoders
Roman Bachmann
David Mizrahi
Andrei Atanov
Amir Zamir
47
265
0
04 Apr 2022
Context Autoencoder for Self-Supervised Representation Learning
Xiaokang Chen
Mingyu Ding
Xiaodi Wang
Ying Xin
Shentong Mo
Yunhao Wang
Shumin Han
Ping Luo
Gang Zeng
Jingdong Wang
SSL
45
386
0
07 Feb 2022
Sound and Visual Representation Learning with Multiple Pretraining Tasks
A. Vasudevan
Dengxin Dai
Luc Van Gool
SSL
33
6
0
04 Jan 2022
Machine Learning in Nuclear Physics
A. Boehnlein
M. Diefenthaler
C. Fanelli
M. Hjorth-Jensen
T. Horn
...
M. Schram
A. Scheinker
Michael S. Smith
Xin-Nian Wang
Veronique Ziegler
AI4CE
37
41
0
04 Dec 2021
Explainability of deep vision-based autonomous driving systems: Review and challenges
Éloi Zablocki
H. Ben-younes
P. Pérez
Matthieu Cord
XAI
48
170
0
13 Jan 2021
Deep Visual Domain Adaptation
G. Csurka
OOD
141
185
0
28 Dec 2020
SketchZooms: Deep multi-view descriptors for matching line drawings
Pablo Navarro
J. Orlando
C. Delrieux
Emmanuel Iarussi
3DPC
13
5
0
29 Nov 2019
PRNet: Self-Supervised Learning for Partial-to-Partial Registration
Yue Wang
Justin Solomon
SSL
3DPC
25
379
0
27 Oct 2019
Deep Zero-Shot Learning for Scene Sketch
Yao Xie
Peng Xu
Zhanyu Ma
VLM
25
12
0
11 May 2019
Audio-Visual Model Distillation Using Acoustic Images
Andrés F. Pérez
Valentina Sanguineti
Pietro Morerio
Vittorio Murino
VLM
15
27
0
16 Apr 2019
Scene Graph Reasoning with Prior Visual Relationship for Visual Question Answering
Zhuoqian Yang
Zengchang Qin
Jing Yu
Yue Hu
GNN
25
16
0
23 Dec 2018
Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images
Javier Marín
Aritro Biswas
Ferda Ofli
Nick Hynes
Amaia Salvador
Y. Aytar
Ingmar Weber
Antonio Torralba
16
319
0
14 Oct 2018
Cross-Domain Weakly-Supervised Object Detection through Progressive Domain Adaptation
Naoto Inoue
Ryosuke Furuta
T. Yamasaki
Kiyoharu Aizawa
ObjD
33
524
0
30 Mar 2018
Significance of Softmax-based Features in Comparison to Distance Metric Learning-based Features
Shota Horiguchi
Daiki Ikami
Kiyoharu Aizawa
24
64
0
29 Dec 2017
Label Efficient Learning of Transferable Representations across Domains and Tasks
Zelun Luo
Yuliang Zou
Judy Hoffman
Li Fei-Fei
39
275
0
30 Nov 2017
Dual-Path Convolutional Image-Text Embeddings with Instance Loss
Zhedong Zheng
Liang Zheng
Michael Garrett
Yi Yang
Mingliang Xu
Yi-Dong Shen
27
470
0
15 Nov 2017
Cooperative Learning with Visual Attributes
Tanmay Batra
Devi Parikh
28
29
0
16 May 2017
Recent Advances in Transfer Learning for Cross-Dataset Visual Recognition: A Problem-Oriented Perspective
Jing Zhang
Wanqing Li
P. Ogunbona
Dong Xu
OOD
27
46
0
11 May 2017
Domain Adaptation for Visual Applications: A Comprehensive Survey
G. Csurka
OOD
25
503
0
17 Feb 2017
Multi-source Transfer Learning with Convolutional Neural Networks for Lung Pattern Analysis
Stergios Christodoulidis
M. Anthimopoulos
L. Ebner
Andreas Christe
Stavroula Mougiakakou
10
133
0
08 Dec 2016
Who is Mistaken?
Benjamin Eysenbach
Carl Vondrick
Antonio Torralba
35
15
0
04 Dec 2016
The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation
S. Jégou
M. Drozdzal
David Vazquez
Adriana Romero
Yoshua Bengio
SSeg
43
1,573
0
28 Nov 2016
GuessWhat?! Visual object discovery through multi-modal dialogue
H. D. Vries
Florian Strub
A. Chandar
Olivier Pietquin
Hugo Larochelle
Aaron Courville
VLM
32
426
0
23 Nov 2016
A Comprehensive Survey on Cross-modal Retrieval
Kun Wang
Qiyue Yin
Wei Wang
Shu Wu
Liang Wang
42
294
0
21 Jul 2016