ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.11730
  4. Cited By
Learn to Combine Modalities in Multimodal Deep Learning

Learn to Combine Modalities in Multimodal Deep Learning

29 May 2018
Kuan Liu
Yanen Li
N. Xu
Premkumar Natarajan
ArXivPDFHTML

Papers citing "Learn to Combine Modalities in Multimodal Deep Learning"

25 / 25 papers shown
Title
TACFN: Transformer-based Adaptive Cross-modal Fusion Network for Multimodal Emotion Recognition
TACFN: Transformer-based Adaptive Cross-modal Fusion Network for Multimodal Emotion Recognition
Feng Liu
Ziwang Fu
Yansen Wang
Qijian Zheng
77
5
0
10 May 2025
Multimodal Deep Learning
Multimodal Deep Learning
Cem Akkus
Jiquan Ngiam
Vladana Djakovic
Steffen Jauch-Walser
A. Khosla
...
Jann Goschenhofer
Honglak Lee
A. Ng
Daniel Schalk
Matthias Aßenmacher
110
3,169
0
12 Jan 2023
EmotiCon: Context-Aware Multimodal Emotion Recognition using Frege's
  Principle
EmotiCon: Context-Aware Multimodal Emotion Recognition using Frege's Principle
Trisha Mittal
P. Guhan
Uttaran Bhattacharya
Rohan Chandra
Aniket Bera
Tianyi Zhou
102
133
0
14 Mar 2020
Multimodal Named Entity Recognition for Short Social Media Posts
Multimodal Named Entity Recognition for Short Social Media Posts
Seungwhan Moon
Leonardo Neves
Vitor R. Carvalho
59
154
0
22 Feb 2018
A Batch Learning Framework for Scalable Personalized Ranking
A Batch Learning Framework for Scalable Personalized Ranking
Kuan Liu
Premkumar Natarajan
FedML
15
2
0
10 Nov 2017
The Reversible Residual Network: Backpropagation Without Storing
  Activations
The Reversible Residual Network: Backpropagation Without Storing Activations
Aidan Gomez
Mengye Ren
R. Urtasun
Roger C. Grosse
74
548
0
14 Jul 2017
Multimedia Semantic Integrity Assessment Using Joint Embedding Of Images
  And Text
Multimedia Semantic Integrity Assessment Using Joint Embedding Of Images And Text
Ayush Jaiswal
Ekraam Sabir
Wael Abd-Almageed
Premkumar Natarajan
38
45
0
06 Jul 2017
Collaborative Layer-wise Discriminative Learning in Deep Neural Networks
Collaborative Layer-wise Discriminative Learning in Deep Neural Networks
Xiaojie Jin
Yunpeng Chen
Jian Dong
Jiashi Feng
Shuicheng Yan
57
21
0
19 Jul 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
193,814
0
10 Dec 2015
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for
  Visual Question Answering
Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering
Huijuan Xu
Kate Saenko
68
763
0
17 Nov 2015
Character-level Convolutional Networks for Text Classification
Character-level Convolutional Networks for Text Classification
Xiang Zhang
Jiaqi Zhao
Yann LeCun
262
6,107
0
04 Sep 2015
Listen, Attend and Spell
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
149
2,266
0
05 Aug 2015
Are You Talking to a Machine? Dataset and Methods for Multilingual Image
  Question Answering
Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Answering
Haoyuan Gao
Junhua Mao
Jie Zhou
Zhiheng Huang
Lei Wang
Wenyuan Xu
78
500
0
21 May 2015
Multi-scale recognition with DAG-CNNs
Multi-scale recognition with DAG-CNNs
Songfan Yang
Deva Ramanan
BDL
61
205
0
20 May 2015
Ask Your Neurons: A Neural-based Approach to Answering Questions about
  Images
Ask Your Neurons: A Neural-based Approach to Answering Questions about Images
Mateusz Malinowski
Marcus Rohrbach
Mario Fritz
106
600
0
05 May 2015
EmoNets: Multimodal deep learning approaches for emotion recognition in
  video
EmoNets: Multimodal deep learning approaches for emotion recognition in video
Samira Ebrahimi Kahou
Xavier Bouthillier
Pascal Lamblin
Çağlar Gülçehre
Vincent Michalski
...
Aaron Courville
Pascal Vincent
Roland Memisevic
C. Pal
Yoshua Bengio
178
401
0
05 Mar 2015
ModDrop: adaptive multi-modal gesture recognition
ModDrop: adaptive multi-modal gesture recognition
Natalia Neverova
Christian Wolf
Graham W. Taylor
Florian Nebout
87
295
0
31 Dec 2014
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.7K
150,006
0
22 Dec 2014
Hypercolumns for Object Segmentation and Fine-grained Localization
Hypercolumns for Object Segmentation and Fine-grained Localization
Bharath Hariharan
Pablo Arbeláez
Ross B. Girshick
Jitendra Malik
SSeg
128
1,595
0
21 Nov 2014
Show and Tell: A Neural Image Caption Generator
Show and Tell: A Neural Image Caption Generator
Oriol Vinyals
Alexander Toshev
Samy Bengio
D. Erhan
3DV
235
6,026
0
17 Nov 2014
Convolutional Neural Networks for Sentence Classification
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
615
13,420
0
25 Aug 2014
Recurrent Models of Visual Attention
Recurrent Models of Visual Attention
Volodymyr Mnih
N. Heess
Alex Graves
Koray Kavukcuoglu
VLM
152
3,656
0
24 Jun 2014
Majority Vote of Diverse Classifiers for Late Fusion
Majority Vote of Diverse Classifiers for Late Fusion
Emilie Morvant
Amaury Habrard
Stéphane Ayache
46
103
0
30 Apr 2014
Network In Network
Network In Network
Min Lin
Qiang Chen
Shuicheng Yan
291
6,274
0
16 Dec 2013
Speech Recognition with Deep Recurrent Neural Networks
Speech Recognition with Deep Recurrent Neural Networks
Alex Graves
Abdel-rahman Mohamed
Geoffrey E. Hinton
220
8,513
0
22 Mar 2013
1