ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1212.4522
  4. Cited By
A Multi-View Embedding Space for Modeling Internet Images, Tags, and
  their Semantics

A Multi-View Embedding Space for Modeling Internet Images, Tags, and their Semantics

18 December 2012
Yunchao Gong
Qifa Ke
Michael Isard
Svetlana Lazebnik
    3DV
ArXivPDFHTML

Papers citing "A Multi-View Embedding Space for Modeling Internet Images, Tags, and their Semantics"

50 / 135 papers shown
Title
Learning from Noisy Labels with Contrastive Co-Transformer
Yan Han
S. Roy
Mehrtash Harandi
L. Petersson
NoLa
74
0
0
04 Mar 2025
Deep Learning for Multi-Label Learning: A Comprehensive Survey
Deep Learning for Multi-Label Learning: A Comprehensive Survey
A. Tarekegn
M. Ullah
F. A. Cheikh
AI4TS
40
8
0
29 Jan 2024
Hypothesis Testing for Class-Conditional Noise Using Local Maximum
  Likelihood
Hypothesis Testing for Class-Conditional Noise Using Local Maximum Likelihood
Weisong Yang
Rafael Poyiadzi
Niall Twomey
Raul Santos Rodriguez
22
0
0
15 Dec 2023
ALEX: Towards Effective Graph Transfer Learning with Noisy Labels
ALEX: Towards Effective Graph Transfer Learning with Noisy Labels
Jingyang Yuan
Xiao Luo
Yifang Qin
Zhengyan Mao
Wei Ju
Ming Zhang
AAML
26
18
0
26 Sep 2023
Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval
Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval
Yi Bin
Haoxuan Li
Yahui Xu
Xing Xu
Yang Yang
Heng Tao Shen
VOS
24
18
0
08 Aug 2023
Multi-Modal Machine Learning for Assessing Gaming Skills in Online
  Streaming: A Case Study with CS:GO
Multi-Modal Machine Learning for Assessing Gaming Skills in Online Streaming: A Case Study with CS:GO
Longxiang Zhang
Wenping Wang
37
1
0
23 Jul 2023
CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive
  Learning
CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning
Yiting Cheng
Fangyun Wei
Jianmin Bao
Dong Chen
Wenqian Zhang
SLR
24
28
0
22 Mar 2023
Learning Visual Representations via Language-Guided Sampling
Learning Visual Representations via Language-Guided Sampling
Mohamed El Banani
Karan Desai
Justin Johnson
SSL
VLM
11
28
0
23 Feb 2023
Multilingual Multimodality: A Taxonomical Survey of Datasets,
  Techniques, Challenges and Opportunities
Multilingual Multimodality: A Taxonomical Survey of Datasets, Techniques, Challenges and Opportunities
Khyathi Raghavi Chandu
A. Geramifard
32
3
0
30 Oct 2022
Augmentation-Free Graph Contrastive Learning of Invariant-Discriminative
  Representations
Augmentation-Free Graph Contrastive Learning of Invariant-Discriminative Representations
Haifeng Li
Jun Cao
Jiawei Zhu
Qinyao Luo
Silu He
Xuying Wang
11
41
0
15 Oct 2022
LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long
  Livestream Videos
LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos
Jielin Qiu
Franck Dernoncourt
Trung Bui
Zhaowen Wang
Ding Zhao
Hailin Jin
AI4TS
14
5
0
12 Oct 2022
Can Brain Signals Reveal Inner Alignment with Human Languages?
Can Brain Signals Reveal Inner Alignment with Human Languages?
William Jongwon Han
Jielin Qiu
Jiacheng Zhu
Mengdi Xu
Douglas Weber
Bo-wen Li
Ding Zhao
11
12
0
10 Aug 2022
Temporal Alignment Networks for Long-term Video
Temporal Alignment Networks for Long-term Video
Tengda Han
Weidi Xie
Andrew Zisserman
AI4TS
20
82
0
06 Apr 2022
Look for the Change: Learning Object States and State-Modifying Actions
  from Untrimmed Web Videos
Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
Tomávs Souvcek
Jean-Baptiste Alayrac
Antoine Miech
Ivan Laptev
Josef Sivic
21
32
0
22 Mar 2022
Two-stream Hierarchical Similarity Reasoning for Image-text Matching
Two-stream Hierarchical Similarity Reasoning for Image-text Matching
Ran Chen
Hanli Wang
Lei Wang
Sam Kwong
13
9
0
10 Mar 2022
Contrastive Learning of Visual-Semantic Embeddings
Contrastive Learning of Visual-Semantic Embeddings
Anurag Jain
Yashaswi Verma
SSL
25
1
0
17 Oct 2021
Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual
  Softmax Loss
Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss
Xingyi Cheng
Hezheng Lin
Xiangyu Wu
Fan Yang
Dong Shen
6
148
0
09 Sep 2021
On Support Recovery with Sparse CCA: Information Theoretic and
  Computational Limits
On Support Recovery with Sparse CCA: Information Theoretic and Computational Limits
Nilanjana Laha
Rajarshi Mukherjee
28
4
0
14 Aug 2021
A Survey on Personal Image Retrieval Systems
A Survey on Personal Image Retrieval Systems
Amit Kumar Nath
Andy Wang
25
0
0
09 Jul 2021
From Canonical Correlation Analysis to Self-supervised Graph Neural
  Networks
From Canonical Correlation Analysis to Self-supervised Graph Neural Networks
Hengrui Zhang
Qitian Wu
Junchi Yan
David Wipf
Philip S. Yu
SSL
22
210
0
23 Jun 2021
Understanding Latent Correlation-Based Multiview Learning and
  Self-Supervision: An Identifiability Perspective
Understanding Latent Correlation-Based Multiview Learning and Self-Supervision: An Identifiability Perspective
Qinjie Lyu
Xiao Fu
Weiran Wang
Songtao Lu
SSL
15
29
0
14 Jun 2021
FDDH: Fast Discriminative Discrete Hashing for Large-Scale Cross-Modal
  Retrieval
FDDH: Fast Discriminative Discrete Hashing for Large-Scale Cross-Modal Retrieval
Xin Liu
Xingzhi Wang
Y. Cheung
16
41
0
15 May 2021
Towards General Purpose Vision Systems
Towards General Purpose Vision Systems
Tanmay Gupta
Amita Kamath
Aniruddha Kembhavi
Derek Hoiem
11
49
0
01 Apr 2021
Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with
  Transformers
Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers
Antoine Miech
Jean-Baptiste Alayrac
Ivan Laptev
Josef Sivic
Andrew Zisserman
ViT
20
136
0
30 Mar 2021
Decoupling the Role of Data, Attention, and Losses in Multimodal
  Transformers
Decoupling the Role of Data, Attention, and Losses in Multimodal Transformers
Lisa Anne Hendricks
John F. J. Mellor
R. Schneider
Jean-Baptiste Alayrac
Aida Nematzadeh
75
110
0
31 Jan 2021
Image-to-Image Retrieval by Learning Similarity between Scene Graphs
Image-to-Image Retrieval by Learning Similarity between Scene Graphs
Sangwoong Yoon
Woo-Young Kang
Sungwook Jeon
SeongEun Lee
C. Han
Jonghun Park
Eun-Sol Kim
3DH
29
39
0
29 Dec 2020
The Geometry of Distributed Representations for Better Alignment,
  Attenuated Bias, and Improved Interpretability
The Geometry of Distributed Representations for Better Alignment, Attenuated Bias, and Improved Interpretability
Sunipa Dev
27
1
0
25 Nov 2020
Suppressing Mislabeled Data via Grouping and Self-Attention
Suppressing Mislabeled Data via Grouping and Self-Attention
Xiaojiang Peng
Kai Wang
Zhaoyang Zeng
Qing Li
Jianfei Yang
Yu Qiao
11
32
0
29 Oct 2020
Learning to Represent Image and Text with Denotation Graph
Learning to Represent Image and Text with Denotation Graph
Bowen Zhang
Hexiang Hu
Vihan Jain
Eugene Ie
Fei Sha
6
21
0
06 Oct 2020
Cross-Modal Hierarchical Modelling for Fine-Grained Sketch Based Image
  Retrieval
Cross-Modal Hierarchical Modelling for Fine-Grained Sketch Based Image Retrieval
Aneeshan Sain
A. Bhunia
Yongxin Yang
Tao Xiang
Yi-Zhe Song
16
49
0
29 Jul 2020
Learning Video Representations from Textual Web Supervision
Learning Video Representations from Textual Web Supervision
Jonathan C. Stroud
Zhichao Lu
Chen Sun
Jia Deng
Rahul Sukthankar
Cordelia Schmid
David A. Ross
SSL
29
48
0
29 Jul 2020
Deep Learning Techniques for Future Intelligent Cross-Media Retrieval
Deep Learning Techniques for Future Intelligent Cross-Media Retrieval
S. Rehman
M. Waqas
Shanshan Tu
Anis Koubaa
O. Rehman
Jawad Ahmad
Muhammad Hanif
Zhu Han
11
7
0
21 Jul 2020
COBE: Contextualized Object Embeddings from Narrated Instructional Video
COBE: Contextualized Object Embeddings from Narrated Instructional Video
Gedas Bertasius
Lorenzo Torresani
8
24
0
14 Jul 2020
Embedded Deep Bilinear Interactive Information and Selective Fusion for
  Multi-view Learning
Embedded Deep Bilinear Interactive Information and Selective Fusion for Multi-view Learning
Jinglin Xu
Wenbin Li
J. Shen
Xinwang Liu
Peicheng Zhou
Xiangsen Zhang
Xiwen Yao
Junwei Han
16
1
0
13 Jul 2020
Self-Supervised MultiModal Versatile Networks
Self-Supervised MultiModal Versatile Networks
Jean-Baptiste Alayrac
Adrià Recasens
R. Schneider
Relja Arandjelović
Jason Ramapuram
J. Fauw
Lucas Smaira
Sander Dieleman
Andrew Zisserman
SSL
40
371
0
29 Jun 2020
Learning Multi-Modal Nonlinear Embeddings: Performance Bounds and an
  Algorithm
Learning Multi-Modal Nonlinear Embeddings: Performance Bounds and an Algorithm
Semih Kaya
Elif Vural
6
3
0
03 Jun 2020
COBRA: Contrastive Bi-Modal Representation Algorithm
COBRA: Contrastive Bi-Modal Representation Algorithm
Vishaal Udandarao
A. Maiti
Deepak Srivatsav
Suryatej Reddy Vyalla
Yifang Yin
R. Shah
17
21
0
07 May 2020
Zero-Shot Learning and its Applications from Autonomous Vehicles to
  COVID-19 Diagnosis: A Review
Zero-Shot Learning and its Applications from Autonomous Vehicles to COVID-19 Diagnosis: A Review
Mahdi Rezaei
Mahsa Shahidi
19
53
0
29 Apr 2020
Survey on Visual Sentiment Analysis
Survey on Visual Sentiment Analysis
A. Ortis
G. Farinella
S. Battiato
6
75
0
24 Apr 2020
Multiple Visual-Semantic Embedding for Video Retrieval from Query
  Sentence
Multiple Visual-Semantic Embedding for Video Retrieval from Query Sentence
Huy Manh Nguyen
Tomo Miyazaki
Yoshihiro Sugaya
S. Omachi
37
1
0
16 Apr 2020
MCEN: Bridging Cross-Modal Gap between Cooking Recipes and Dish Images
  with Latent Variable Model
MCEN: Bridging Cross-Modal Gap between Cooking Recipes and Dish Images with Latent Variable Model
Han Fu
R. Wu
Chenghao Liu
Jianling Sun
8
48
0
02 Apr 2020
Adversarial Learning for Personalized Tag Recommendation
Adversarial Learning for Personalized Tag Recommendation
Erik Quintanilla
Y. S. Rawat
Andrey Sakryukin
M. Shah
Mohan S. Kankanhalli
VLM
6
21
0
01 Apr 2020
Cops-Ref: A new Dataset and Task on Compositional Referring Expression
  Comprehension
Cops-Ref: A new Dataset and Task on Compositional Referring Expression Comprehension
Zhenfang Chen
Peng Wang
Lin Ma
Kwan-Yee Kenneth Wong
Qi Wu
ObjD
26
67
0
01 Mar 2020
End-to-End Learning of Visual Representations from Uncurated
  Instructional Videos
End-to-End Learning of Visual Representations from Uncurated Instructional Videos
Antoine Miech
Jean-Baptiste Alayrac
Lucas Smaira
Ivan Laptev
Josef Sivic
Andrew Zisserman
VGen
SSL
31
700
0
13 Dec 2019
Ladder Loss for Coherent Visual-Semantic Embedding
Ladder Loss for Coherent Visual-Semantic Embedding
Mo Zhou
Zhenxing Niu
Le Wang
Zhanning Gao
Qilin Zhang
G. Hua
15
39
0
18 Nov 2019
HUSE: Hierarchical Universal Semantic Embeddings
HUSE: Hierarchical Universal Semantic Embeddings
P. Narayana
Aniket Pednekar
A. Krishnamoorthy
Kazoo Sone
Sugato Basu
23
10
0
14 Nov 2019
Cross-Modal Subspace Learning with Scheduled Adaptive Margin Constraints
Cross-Modal Subspace Learning with Scheduled Adaptive Margin Constraints
David Semedo
João Magalhães
11
11
0
30 Sep 2019
Harmonized Multimodal Learning with Gaussian Process Latent Variable
  Models
Harmonized Multimodal Learning with Gaussian Process Latent Variable Models
Guoli Song
Shuhui Wang
Qingming Huang
Q. Tian
6
22
0
14 Aug 2019
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval
Yale Song
M. Soleymani
11
242
0
11 Jun 2019
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million
  Narrated Video Clips
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips
Antoine Miech
Dimitri Zhukov
Jean-Baptiste Alayrac
Makarand Tapaswi
Ivan Laptev
Josef Sivic
VGen
25
1,172
0
07 Jun 2019
123
Next