ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.03977
  4. Cited By
Multimodal Intelligence: Representation Learning, Information Fusion,
  and Applications

Multimodal Intelligence: Representation Learning, Information Fusion, and Applications

10 November 2019
Chao Zhang
Zichao Yang
Xiaodong He
Li Deng
    HAI
    AI4TS
ArXivPDFHTML

Papers citing "Multimodal Intelligence: Representation Learning, Information Fusion, and Applications"

34 / 34 papers shown
Title
MM-GTUNets: Unified Multi-Modal Graph Deep Learning for Brain Disorders Prediction
MM-GTUNets: Unified Multi-Modal Graph Deep Learning for Brain Disorders Prediction
Luhui Cai
Weiming Zeng
Hongyu Chen
Hua Zhang
Yueyang Li
Hongjie Yan
Lingbin Bian
Lingbin Bian
Wai Ting Siok
Nizhuan Wang
MedIm
55
3
0
20 Jun 2024
Solving the Inverse Problem of Electrocardiography for Cardiac Digital
  Twins: A Survey
Solving the Inverse Problem of Electrocardiography for Cardiac Digital Twins: A Survey
Lei Li
J. Camps
Blanca Rodriguez
Vicente Grau
SyDa
38
2
0
17 Jun 2024
Automatic Fused Multimodal Deep Learning for Plant Identification
Automatic Fused Multimodal Deep Learning for Plant Identification
Alfreds Lapkovskis
Natalia Nefedova
Ali Beikmohammadi
52
0
0
03 Jun 2024
MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual
  Grounding
MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual Grounding
Chun-Peng Chang
Shaoxiang Wang
A. Pagani
Didier Stricker
43
7
0
05 Mar 2024
SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition
SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition
Tianlin Li
Zong-Yao Wu
Yao Rong
Lin Zhu
Bowei Jiang
Jin Tang
Yonghong Tian
ViT
74
17
0
08 Aug 2023
Multi-Modal Deep Learning for Credit Rating Prediction Using Text and
  Numerical Data Streams
Multi-Modal Deep Learning for Credit Rating Prediction Using Text and Numerical Data Streams
M. Tavakoli
Rohitash Chandra
Fengrui Tian
Cristián Bravo
26
8
0
21 Apr 2023
The Contribution of Knowledge in Visiolinguistic Learning: A Survey on
  Tasks and Challenges
The Contribution of Knowledge in Visiolinguistic Learning: A Survey on Tasks and Challenges
Maria Lymperaiou
Giorgos Stamou
VLM
32
4
0
04 Mar 2023
Cross-modal Audio-visual Co-learning for Text-independent Speaker
  Verification
Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification
Meng Liu
Kong Aik Lee
Longbiao Wang
Hanyi Zhang
Chang Zeng
J. Dang
23
10
0
22 Feb 2023
Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey
Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey
Kunlin Wang
Zi Wang
Zhang Li
Ang Su
Xichao Teng
Minhao Liu
Qifeng Yu
Qifeng Yu
ObjD
89
9
0
21 Feb 2023
A Self-Adjusting Fusion Representation Learning Model for Unaligned
  Text-Audio Sequences
A Self-Adjusting Fusion Representation Learning Model for Unaligned Text-Audio Sequences
Kaicheng Yang
Ruxuan Zhang
Hua Xu
Kai Gao
23
3
0
12 Nov 2022
Distribution-based Emotion Recognition in Conversation
Distribution-based Emotion Recognition in Conversation
Wen Wu
C. Zhang
P. Woodland
24
4
0
09 Nov 2022
Multimodal learning with graphs
Multimodal learning with graphs
Yasha Ektefaie
George Dasoulas
Ayush Noori
Maha Farhat
Marinka Zitnik
51
82
0
07 Sep 2022
Multimodal Learning with Transformers: A Survey
Multimodal Learning with Transformers: A Survey
P. Xu
Xiatian Zhu
David A. Clifton
ViT
66
527
0
13 Jun 2022
COMPASS: Contrastive Multimodal Pretraining for Autonomous Systems
COMPASS: Contrastive Multimodal Pretraining for Autonomous Systems
Shuang Ma
Sai H. Vemprala
Wenshan Wang
Jayesh K. Gupta
Yale Song
Daniel J. McDuff
Ashish Kapoor
SSL
37
9
0
20 Feb 2022
Improving the fusion of acoustic and text representations in RNN-T
Improving the fusion of acoustic and text representations in RNN-T
Chao Zhang
Bo-wen Li
Zhiyun Lu
Tara N. Sainath
Shuo-yiin Chang
AI4CE
43
12
0
25 Jan 2022
How to find a good image-text embedding for remote sensing visual
  question answering?
How to find a good image-text embedding for remote sensing visual question answering?
Christel Chappuis
Sylvain Lobry
B. Kellenberger
Bertrand Le Saux
D. Tuia
40
20
0
24 Sep 2021
N24News: A New Dataset for Multimodal News Classification
N24News: A New Dataset for Multimodal News Classification
Zhen Wang
Xu Shan
Xiangxie Zhang
Jie Yang
VLM
20
33
0
30 Aug 2021
Imbalanced Big Data Oversampling: Taxonomy, Algorithms, Software,
  Guidelines and Future Directions
Imbalanced Big Data Oversampling: Taxonomy, Algorithms, Software, Guidelines and Future Directions
W. Sleeman
R. Kapoor
AI4TS
17
71
0
24 Jul 2021
A Review on Explainability in Multimodal Deep Neural Nets
A Review on Explainability in Multimodal Deep Neural Nets
Gargi Joshi
Rahee Walambe
K. Kotecha
29
139
0
17 May 2021
Literature review on vulnerability detection using NLP technology
Literature review on vulnerability detection using NLP technology
Jiajie Wu
39
14
0
23 Apr 2021
What is Multimodality?
What is Multimodality?
Letitia Parcalabescu
Nils Trost
Anette Frank
21
0
0
10 Mar 2021
A Boundary Regression Model for Nested Named Entity Recognition
A Boundary Regression Model for Nested Named Entity Recognition
Yanping Chen
Lefei Wu
Q. Zheng
Ruizhang Huang
Xiaozhong Liu
Liyuan Deng
Junhui Yu
Yongbin Qing
B. Dong
Ping Chen
13
16
0
29 Nov 2020
Combination of Deep Speaker Embeddings for Diarisation
Combination of Deep Speaker Embeddings for Diarisation
Guangzhi Sun
Chao Zhang
P. Woodland
17
20
0
22 Oct 2020
Relating by Contrasting: A Data-efficient Framework for Multimodal
  Generative Models
Relating by Contrasting: A Data-efficient Framework for Multimodal Generative Models
Yuge Shi
Brooks Paige
Philip Torr
N. Siddharth
VLM
23
36
0
02 Jul 2020
VoxCeleb2: Deep Speaker Recognition
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung
Arsha Nagrani
Andrew Zisserman
245
2,233
0
14 Jun 2018
Transfer Learning from Speaker Verification to Multispeaker
  Text-To-Speech Synthesis
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis
Ye Jia
Yu Zhang
Ron J. Weiss
Quan Wang
Jonathan Shen
...
Z. Chen
Patrick Nguyen
Ruoming Pang
Ignacio López Moreno
Yonghui Wu
207
820
0
12 Jun 2018
Dialog-based Interactive Image Retrieval
Dialog-based Interactive Image Retrieval
Xiaoxiao Guo
Hui Wu
Yu Cheng
Steven J. Rennie
Gerald Tesauro
Rogerio Feris
52
204
0
01 May 2018
Image Generation from Scene Graphs
Image Generation from Scene Graphs
Justin Johnson
Agrim Gupta
Li Fei-Fei
GNN
223
815
0
04 Apr 2018
Neural Architecture Search with Reinforcement Learning
Neural Architecture Search with Reinforcement Learning
Barret Zoph
Quoc V. Le
271
5,329
0
05 Nov 2016
Conditional Image Synthesis With Auxiliary Classifier GANs
Conditional Image Synthesis With Auxiliary Classifier GANs
Augustus Odena
C. Olah
Jonathon Shlens
GAN
250
3,190
0
30 Oct 2016
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,746
0
26 Sep 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and
  Visual Grounding
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
158
1,464
0
06 Jun 2016
Learning Deep Representations of Fine-grained Visual Descriptions
Learning Deep Representations of Fine-grained Visual Descriptions
Scott E. Reed
Zeynep Akata
Bernt Schiele
Honglak Lee
OCL
VLM
170
840
0
17 May 2016
Effective Approaches to Attention-based Neural Machine Translation
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
218
7,926
0
17 Aug 2015
1