Multi-Head Attention with Diversity for Learning Grounded Multilingual Multimodal Representations

30 September 2019
Po-Yao (Bernie) Huang
Xiaojun Chang
Alexander G. Hauptmann
arXiv: 1910.00058

Papers citing "Multi-Head Attention with Diversity for Learning Grounded Multilingual Multimodal Representations"

11 / 11 papers shown
LookHere: Vision Transformers with Directed Attention Generalize and Extrapolate
A. Fuller
Daniel G. Kyrollos
Yousef Yassin
James R. Green
112
3
0
22 May 2024
Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination
Hao Fei
Qianfeng Liu
Meishan Zhang
Min Zhang
Tat-Seng Chua
LRM
142
49
0
20 May 2023
Manifestations of Xenophobia in AI Systems
Nenad Tomašev
J. L. Maynard
Iason Gabriel
102
9
0
15 Dec 2022
Multilingual Multimodality: A Taxonomical Survey of Datasets, Techniques, Challenges and Opportunities
Khyathi Chandu
A. Geramifard
76
3
0
30 Oct 2022
IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages
Emanuele Bugliarello
Fangyu Liu
Jonas Pfeiffer
Siva Reddy
Desmond Elliott
Edoardo Ponti
Ivan Vulić
MLLM, VLM, ELM
121
64
0
27 Jan 2022
Towards Zero-shot Cross-lingual Image Retrieval and Tagging
Pranav Aggarwal
Ritiz Tambi
Ajinkya Kale
VLM
89
6
0
15 Sep 2021
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models
Po-Yao (Bernie) Huang
Mandela Patrick
Junjie Hu
Graham Neubig
Florian Metze
Alexander G. Hauptmann
MLLM, VLM
111
57
0
16 Mar 2021
Towards Zero-shot Cross-lingual Image Retrieval
Pranav Aggarwal
Ajinkya Kale
VLM
93
25
0
24 Nov 2020
Support-set bottlenecks for video-text representation learning
Mandela Patrick
Po-Yao (Bernie) Huang
Yuki M. Asano
Florian Metze
Alexander G. Hauptmann
João Henriques
Andrea Vedaldi
112
249
0
06 Oct 2020
DSC IIT-ISM at SemEval-2020 Task 8: Bi-Fusion Techniques for Deep Meme Emotion Analysis
Pradyumna Gupta
Himanshu Gupta
Aman Sinha
62
8
0
28 Jul 2020
Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting
Po-Yao (Bernie) Huang
Junjie Hu
Xiaojun Chang
Alexander G. Hauptmann
103
52
0
06 May 2020