Multimodal Convolutional Neural Networks for Matching Image and Sentence

23 April 2015

Lifeng Shang

Papers citing "Multimodal Convolutional Neural Networks for Matching Image and Sentence"


PREMISE: Matching-based Prediction for Accurate Review Recommendation | Wei Han, Hui Chen, Soujanya Poria | 47 0 0 | 02 May 2025
Target-Augmented Shared Fusion-based Multimodal Sarcasm Explanation Generation | Palaash Goel, Dushyant Singh Chauhan, Md. Shad Akhtar | LRM | 50 0 0 | 11 Feb 2025
Large-Scale Traffic Congestion Prediction based on Multimodal Fusion and Representation Mapping | Bo Zhou, Jiahui Liu, Songyi Cui, Yaping Zhao | 18 5 0 | 23 Aug 2022
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval | Haoran Wang, Dongliang He, Wenhao Wu, Boyang Xia, Min Yang, Fu Li, Yunlong Yu, Zhong Ji, Errui Ding, Jingdong Wang | 27 22 0 | 21 Aug 2022
Two-stream Hierarchical Similarity Reasoning for Image-text Matching | Ran Chen, Hanli Wang, Lei Wang, Sam Kwong | 13 9 0 | 10 Mar 2022
Multi-Modal Knowledge Graph Construction and Application: A Survey | Xiangru Zhu, Zhixu Li, Xiaodan Wang, Xueyao Jiang, Penglei Sun, Xuwu Wang, Yanghua Xiao, N. Yuan | 28 154 0 | 11 Feb 2022
Step-Wise Hierarchical Alignment Network for Image-Text Matching | Zhong Ji, Kexin Chen, Haoran Wang | 22 93 0 | 11 Jun 2021
Graph Structured Network for Image-Text Matching | Chunxiao Liu, Zhendong Mao, Tianzhu Zhang, Hongtao Xie, Bin Wang, Yongdong Zhang | 17 232 0 | 01 Apr 2020
Deep Multimodal Image-Text Embeddings for Automatic Cross-Media Retrieval | Hadi Abdi Khojasteh, Ebrahim Ansari, Parvin Razzaghi, Akbar Karimi | VLM | 11 4 0 | 23 Feb 2020
Target-Oriented Deformation of Visual-Semantic Embedding Space | Takashi Matsubara | 18 7 0 | 15 Oct 2019
Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training | Gen Li, Nan Duan, Yuejian Fang, Ming Gong, Daxin Jiang, Ming Zhou | SSL VLM MLLM | 57 895 0 | 16 Aug 2019
Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain | Johanes Effendi, Andros Tjandra, S. Sakti, Satoshi Nakamura | 19 3 0 | 03 Jun 2019
Multi-modal gated recurrent units for image description | Xuelong Li, Aihong Yuan, Xiaoqiang Lu | GAN | 19 26 0 | 20 Apr 2019
Reconstruction Network for Video Captioning | Bairui Wang, Lin Ma, Wei Zhang, Wei Liu | 30 316 0 | 30 Mar 2018
Discriminability objective for training descriptive captions | Ruotian Luo, Brian L. Price, Scott D. Cohen, Gregory Shakhnarovich | 19 202 0 | 12 Mar 2018
A Hybrid Method for Traffic Flow Forecasting Using Multimodal Deep Learning | Shengdong Du, Tianrui Li, Xun Gong, S. Horng | AI4TS | 25 150 0 | 06 Mar 2018
Dual-Path Convolutional Image-Text Embeddings with Instance Loss | Zhedong Zheng, Liang Zheng, Michael Garrett, Yi Yang, Mingliang Xu, Yi-Dong Shen | 22 470 0 | 15 Nov 2017
MUTAN: Multimodal Tucker Fusion for Visual Question Answering | H. Ben-younes, Rémi Cadène, Matthieu Cord, Nicolas Thome | 44 578 0 | 18 May 2017
Supervised Learning of Universal Sentence Representations from Natural Language Inference Data | Alexis Conneau, Douwe Kiela, Holger Schwenk, Loïc Barrault, Antoine Bordes | AI4TS SSL | 16 2,093 0 | 05 May 2017
Learning Two-Branch Neural Networks for Image-Text Matching Tasks | Liwei Wang, Yin Li, Jing-ling Huang, Svetlana Lazebnik | VLM | 27 494 0 | 11 Apr 2017
AMC: Attention guided Multi-modal Correlation Learning for Image Search | Kan Chen, Trung Bui, Chen Fang, Zhaowen Wang, Ram Nevatia | 35 38 0 | 03 Apr 2017
Comprehension-guided referring expressions | Ruotian Luo, Gregory Shakhnarovich | ObjD | 27 171 0 | 12 Jan 2017
Instance-aware Image and Sentence Matching with Selective Multimodal LSTM | Yan Huang, Wei Wang, Liang Wang | 26 222 0 | 17 Nov 2016
Dual Attention Networks for Multimodal Reasoning and Matching | Hyeonseob Nam, Jung-Woo Ha, Jeonghee Kim | 34 664 0 | 02 Nov 2016
Linking Image and Text with 2-Way Nets | Aviv Eisenschtat, Lior Wolf | 19 176 0 | 29 Aug 2016
Learning to Answer Questions From Image Using Convolutional Neural Network | Lin Ma, Zhengdong Lu, Hang Li | 19 261 0 | 01 Jun 2015
Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images | Junhua Mao, Xu Wei, Yi Yang, Jiang Wang, Zhiheng Huang, Alan Yuille | 25 154 0 | 25 Apr 2015
Improving neural networks by preventing co-adaptation of feature detectors | Geoffrey E. Hinton, Nitish Srivastava, A. Krizhevsky, Ilya Sutskever, Ruslan Salakhutdinov | VLM | 266 7,636 0 | 03 Jul 2012