ResearchTrend.AI
Integrating Multimodal Information in Large Pretrained Transformers


15 August 2019
Wasifur Rahman
M. Hasan
Sangwu Lee
Amir Zadeh
Chengfeng Mao
Louis-Philippe Morency
Ehsan Hoque

Papers citing "Integrating Multimodal Information in Large Pretrained Transformers"

7 / 7 papers shown
M6-UFC: Unifying Multi-Modal Controls for Conditional Image Synthesis via Non-Autoregressive Generative Transformers
Zhu Zhang
Jianxin Ma
Chang Zhou
Rui Men
Zhikang Li
Ming Ding
Jie Tang
Jingren Zhou
Hongxia Yang
25
46
0
29 May 2021
Complaint Identification in Social Media with Transformer Networks
Mali Jin
Nikolaos Aletras
12
16
0
21 Oct 2020
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers
Jaemin Cho
Jiasen Lu
Dustin Schwenk
Hannaneh Hajishirzi
Aniruddha Kembhavi
VLM
MLLM
30
102
0
23 Sep 2020
SemEval-2020 Task 8: Memotion Analysis -- The Visuo-Lingual Metaphor!
Chhavi Sharma
Deepesh Bhageria
W. Scott
Srinivas Pykl
A. Das
Tanmoy Chakraborty
Viswanath Pulabaigari
Björn Gambäck
20
166
0
09 Aug 2020
Cross-media Structured Common Space for Multimedia Event Extraction
Manling Li
Alireza Zareian
Qi Zeng
Spencer Whitehead
Di Lu
Heng Ji
Shih-Fu Chang
10
103
0
05 May 2020
Span-based Localizing Network for Natural Language Video Localization
Hao Zhang
Aixin Sun
Wei Jing
Qiufeng Wang
32
311
0
29 Apr 2020
Modality-based Factorization for Multimodal Fusion
Elham J. Barezi
Peyman Momeni
Pascale Fung
43
36
0
30 Nov 2018