ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2011.10916
  4. Cited By

Hierachical Delta-Attention Method for Multimodal Fusion

22 November 2020
Kunjal Panchal
ArXivPDFHTML

Papers citing "Hierachical Delta-Attention Method for Multimodal Fusion"

19 / 19 papers shown
Title
Audio-Visual Event Localization via Recursive Fusion by Joint
  Co-Attention
Audio-Visual Event Localization via Recursive Fusion by Joint Co-Attention
Bin Duan
Hao Tang
Wei Wang
Ziliang Zong
Guowei Yang
Yan Yan
55
59
0
14 Aug 2020
Low Rank Fusion based Transformers for Multimodal Sequences
Low Rank Fusion based Transformers for Multimodal Sequences
Saurav Sahay
Eda Okur
Shachi H. Kumar
L. Nachman
ViT
36
64
0
04 Jul 2020
A Transformer-based joint-encoding for Emotion Recognition and Sentiment
  Analysis
A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis
Jean-Benoit Delbrouck
Noé Tits
Mathilde Brousmiche
Stéphane Dupont
32
112
0
29 Jun 2020
Hierarchical Transformer Network for Utterance-level Emotion Recognition
Hierarchical Transformer Network for Utterance-level Emotion Recognition
Qingbiao Li
Chunhua Wu
K. Zheng
Zhe Wang
26
23
0
18 Feb 2020
Bimodal Speech Emotion Recognition Using Pre-Trained Language Models
Bimodal Speech Emotion Recognition Using Pre-Trained Language Models
Verena Heusser
Niklas Freymuth
Stefan Constantin
A. Waibel
47
26
0
29 Nov 2019
Watch, Listen and Tell: Multi-modal Weakly Supervised Dense Event
  Captioning
Watch, Listen and Tell: Multi-modal Weakly Supervised Dense Event Captioning
Tanzila Rahman
Bicheng Xu
Leonid Sigal
47
79
0
22 Sep 2019
Learning Alignment for Multimodal Emotion Recognition from Speech
Learning Alignment for Multimodal Emotion Recognition from Speech
Haiyang Xu
Hui Zhang
Kun Han
Yun Wang
Yiping Peng
Xiangang Li
21
121
0
06 Sep 2019
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action
  Recognition
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition
Evangelos Kazakos
Arsha Nagrani
Andrew Zisserman
Dima Damen
EgoV
43
332
0
22 Aug 2019
Multimodal Emotion Recognition Using Deep Canonical Correlation Analysis
Multimodal Emotion Recognition Using Deep Canonical Correlation Analysis
Wei Liu
Jielin Qiu
Wei-Long Zheng
Bao-Liang Lu
23
67
0
13 Aug 2019
Attending to Emotional Narratives
Attending to Emotional Narratives
Zhengxuan Wu
Xiyu Zhang
Zhi-Xuan Tan
Jamil Zaki
Desmond C. Ong
AI4TS
38
18
0
08 Jul 2019
Multimodal and Multi-view Models for Emotion Recognition
Multimodal and Multi-view Models for Emotion Recognition
Gustavo Aguilar
Viktor Rozgic
Weiran Wang
Chao Wang
28
29
0
24 Jun 2019
Multimodal Transformer for Unaligned Multimodal Language Sequences
Multimodal Transformer for Unaligned Multimodal Language Sequences
Yao-Hung Hubert Tsai
Shaojie Bai
Paul Pu Liang
J. Zico Kolter
Louis-Philippe Morency
Ruslan Salakhutdinov
57
1,280
0
01 Jun 2019
Found in Translation: Learning Robust Joint Representations by Cyclic
  Translations Between Modalities
Found in Translation: Learning Robust Joint Representations by Cyclic Translations Between Modalities
Hai Pham
Paul Pu Liang
Thomas Manzini
Louis-Philippe Morency
Barnabás Póczós
49
407
0
19 Dec 2018
Words Can Shift: Dynamically Adjusting Word Representations Using
  Nonverbal Behaviors
Words Can Shift: Dynamically Adjusting Word Representations Using Nonverbal Behaviors
Yansen Wang
Ying Shen
Zhun Liu
Paul Pu Liang
Amir Zadeh
Louis-Philippe Morency
50
398
0
23 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
882
93,936
0
11 Oct 2018
Memory Fusion Network for Multi-view Sequential Learning
Memory Fusion Network for Multi-view Sequential Learning
Amir Zadeh
Paul Pu Liang
Navonil Mazumder
Soujanya Poria
Min Zhang
Louis-Philippe Morency
61
699
0
03 Feb 2018
Multi-attention Recurrent Network for Human Communication Comprehension
Multi-attention Recurrent Network for Human Communication Comprehension
Amir Zadeh
Paul Pu Liang
Soujanya Poria
Prateek Vij
Min Zhang
Louis-Philippe Morency
56
481
0
03 Feb 2018
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
430
129,831
0
12 Jun 2017
Multimodal Machine Learning: A Survey and Taxonomy
Multimodal Machine Learning: A Survey and Taxonomy
T. Baltrušaitis
Chaitanya Ahuja
Louis-Philippe Morency
64
2,890
0
26 May 2017
1