Multi-Head Attention with Diversity for Learning Grounded Multilingual
Multimodal Representations

Multi-Head Attention with Diversity for Learning Grounded Multilingual Multimodal Representations

30 September 2019

Po-Yao (Bernie) Huang

Xiaojun Chang

Alexander G. Hauptmann

ArXiv (abs)PDF HTML

Papers citing "Multi-Head Attention with Diversity for Learning Grounded Multilingual Multimodal Representations"

11 / 11 papers shown

Title
LookHere: Vision Transformers with Directed Attention Generalize and Extrapolate A. Fuller Daniel G. Kyrollos Yousef Yassin James R. Green 112 3 0 22 May 2024
Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination Hao Fei Qianfeng Liu Meishan Zhang Hao Fei Tat-Seng Chua LRM 142 49 0 20 May 2023
Manifestations of Xenophobia in AI Systems Nenad Tomašev J. L. Maynard Iason Gabriel 102 9 0 15 Dec 2022
Multilingual Multimodality: A Taxonomical Survey of Datasets, Techniques, Challenges and Opportunities Khyathi Chandu A. Geramifard 76 3 0 30 Oct 2022
IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages Emanuele Bugliarello Fangyu Liu Jonas Pfeiffer Siva Reddy Desmond Elliott Edoardo Ponti Ivan Vulić MLLM VLM ELM 121 64 0 27 Jan 2022
Towards Zero-shot Cross-lingual Image Retrieval and Tagging Pranav Aggarwal Ritiz Tambi Ajinkya Kale VLM 89 6 0 15 Sep 2021
Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models Po-Yao (Bernie) Huang Mandela Patrick Junjie Hu Graham Neubig Florian Metze Alexander G. Hauptmann MLLM VLM 111 57 0 16 Mar 2021
Towards Zero-shot Cross-lingual Image Retrieval Pranav Aggarwal Ajinkya Kale VLM 93 25 0 24 Nov 2020
Support-set bottlenecks for video-text representation learning Mandela Patrick Po-Yao (Bernie) Huang Yuki M. Asano Florian Metze Alexander G. Hauptmann João Henriques Andrea Vedaldi 112 249 0 06 Oct 2020
DSC IIT-ISM at SemEval-2020 Task 8: Bi-Fusion Techniques for Deep Meme Emotion Analysis Pradyumna Gupta Himanshu Gupta Aman Sinha 62 8 0 28 Jul 2020
Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting Po-Yao (Bernie) Huang Junjie Hu Xiaojun Chang Alexander G. Hauptmann 103 52 0 06 May 2020