ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2312.03025
  4. Cited By
Training on Synthetic Data Beats Real Data in Multimodal Relation
  Extraction

Training on Synthetic Data Beats Real Data in Multimodal Relation Extraction

5 December 2023
Zilin Du
Haoxin Li
Xu Guo
Boyang Li
ArXiv (abs)PDFHTML

Papers citing "Training on Synthetic Data Beats Real Data in Multimodal Relation Extraction"

50 / 73 papers shown
Title
Prompt Me Up: Unleashing the Power of Alignments for Multimodal Entity
  and Relation Extraction
Prompt Me Up: Unleashing the Power of Alignments for Multimodal Entity and Relation Extraction
Xuming Hu
Junzhe Chen
Aiwei Liu
Shiao Meng
Lijie Wen
Philip S. Yu
79
17
0
25 Oct 2023
I2SRM: Intra- and Inter-Sample Relationship Modeling for Multimodal
  Information Extraction
I2SRM: Intra- and Inter-Sample Relationship Modeling for Multimodal Information Extraction
Yusheng Huang
Zhouhan Lin
57
5
0
10 Oct 2023
Improved Baselines with Visual Instruction Tuning
Improved Baselines with Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Yuheng Li
Yong Jae Lee
VLMMLLM
165
2,825
0
05 Oct 2023
Emu: Enhancing Image Generation Models Using Photogenic Needles in a
  Haystack
Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack
Xiaoliang Dai
Ji Hou
Chih-Yao Ma
Sam S. Tsai
Jialiang Wang
...
Roshan Sumbaly
Vignesh Ramanathan
Zijian He
Peter Vajda
Devi Parikh
VLM
87
214
0
27 Sep 2023
SYNAuG: Exploiting Synthetic Data for Data Imbalance Problems
SYNAuG: Exploiting Synthetic Data for Data Imbalance Problems
Moon Ye-Bin
Nam Hyeon-Woo
Wonseok Choi
Nayeong Kim
Suha Kwak
Tae-Hyun Oh
DiffM
42
6
0
02 Aug 2023
Dual-Gated Fusion with Prefix-Tuning for Multi-Modal Relation Extraction
Dual-Gated Fusion with Prefix-Tuning for Multi-Modal Relation Extraction
Qian Li
Shu Guo
Cheng Ji
Xutan Peng
Shiyao Cui
Jianxin Li
85
13
0
19 Jun 2023
Training Multimedia Event Extraction With Generated Images and Captions
Training Multimedia Event Extraction With Generated Images and Captions
Zilin Du
Yunxin Li
Xu Guo
Yidan Sun
Boyang Albert Li
DiffM
73
8
0
15 Jun 2023
A Comprehensive Survey on Relation Extraction: Recent Advances and New
  Frontiers
A Comprehensive Survey on Relation Extraction: Recent Advances and New Frontiers
Xiaoyan Zhao
Yang Deng
Min Yang
Lingzhi Wang
Rui Zhang
Hong Cheng
W. Lam
Ying Shen
Ruifeng Xu
KELM
62
30
0
03 Jun 2023
Learning to Imagine: Visually-Augmented Natural Language Generation
Learning to Imagine: Visually-Augmented Natural Language Generation
Tianyi Tang
Yushuo Chen
Yifan Du
Junyi Li
Wayne Xin Zhao
Ji-Rong Wen
DiffM
64
9
0
26 May 2023
Multimodal Relation Extraction with Cross-Modal Retrieval and Synthesis
Multimodal Relation Extraction with Cross-Modal Retrieval and Synthesis
Xuming Hu
Zhijiang Guo
Zhiyang Teng
I. King
Philip S. Yu
81
18
0
25 May 2023
Information Screening whilst Exploiting! Multimodal Relation Extraction
  with Feature Denoising and Multimodal Topic Modeling
Information Screening whilst Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling
Shengqiong Wu
Hao Fei
Yixin Cao
Lidong Bing
Tat-Seng Chua
80
34
0
19 May 2023
Visual Chain of Thought: Bridging Logical Gaps with Multimodal
  Infillings
Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings
Daniel Philip Rose
Vaishnavi Himakunthala
Andy Ouyang
Ryan He
Alex Mei
Yujie Lu
Michael Stephen Saxon
Chinmay Sonar
Diba Mirza
William Yang Wang
LRM
118
47
0
03 May 2023
Multimodal Procedural Planning via Dual Text-Image Prompting
Multimodal Procedural Planning via Dual Text-Image Prompting
Yujie Lu
Pan Lu
Zhiyu Zoey Chen
Wanrong Zhu
Xinze Wang
William Yang Wang
LM&Ro
107
45
0
02 May 2023
Synthetic Data from Diffusion Models Improves ImageNet Classification
Synthetic Data from Diffusion Models Improves ImageNet Classification
Shekoofeh Azizi
Simon Kornblith
Chitwan Saharia
Mohammad Norouzi
David J. Fleet
VLMDiffM
103
315
0
17 Apr 2023
Enhancing Multimodal Entity and Relation Extraction with Variational
  Information Bottleneck
Enhancing Multimodal Entity and Relation Extraction with Variational Information Bottleneck
Shiyao Cui
Jiangxia Cao
Xin Cong
Shuaiyi Nie
Quangang Li
Tingwen Liu
Jinqiao Shi
62
25
0
05 Apr 2023
Adding Conditional Control to Text-to-Image Diffusion Models
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
182
4,175
1
10 Feb 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image
  Encoders and Large Language Models
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLMMLLM
429
4,656
0
30 Jan 2023
Self-Instruct: Aligning Language Models with Self-Generated Instructions
Self-Instruct: Aligning Language Models with Self-Generated Instructions
Yizhong Wang
Yeganeh Kordi
Swaroop Mishra
Alisa Liu
Noah A. Smith
Daniel Khashabi
Hannaneh Hajishirzi
ALMSyDaLRM
151
2,253
0
20 Dec 2022
Unnatural Instructions: Tuning Language Models with (Almost) No Human
  Labor
Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor
Or Honovich
Thomas Scialom
Omer Levy
Timo Schick
ALM
126
375
0
19 Dec 2022
Multi-Concept Customization of Text-to-Image Diffusion
Multi-Concept Customization of Text-to-Image Diffusion
Nupur Kumari
Bin Zhang
Richard Y. Zhang
Eli Shechtman
Jun-Yan Zhu
165
875
0
08 Dec 2022
Named Entity and Relation Extraction with Multi-Modal Retrieval
Named Entity and Relation Extraction with Multi-Modal Retrieval
Xinyu Wang
Jiong Cai
Yong Jiang
Pengjun Xie
Kewei Tu
Wei Lu
77
52
0
03 Dec 2022
Joint Multimodal Entity-Relation Extraction Based on Edge-enhanced Graph
  Alignment Network and Word-pair Relation Tagging
Joint Multimodal Entity-Relation Extraction Based on Edge-enhanced Graph Alignment Network and Word-pair Relation Tagging
Li Yuan
Yi Cai
Jiangming Wang
Qing Li
55
53
0
28 Nov 2022
On Analyzing the Role of Image for Visual-enhanced Relation Extraction
On Analyzing the Role of Image for Visual-enhanced Relation Extraction
Lei Li
Xiang Chen
Shuofei Qiao
Feiyu Xiong
Huajun Chen
Ningyu Zhang
ViT
34
14
0
14 Nov 2022
Tuning Language Models as Training Data Generators for
  Augmentation-Enhanced Few-Shot Learning
Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning
Yu Meng
Martin Michalski
Jiaxin Huang
Yu Zhang
Tarek Abdelzaher
Jiawei Han
VLM
118
49
0
06 Nov 2022
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert
  Denoisers
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
Yogesh Balaji
Seungjun Nah
Xun Huang
Arash Vahdat
Jiaming Song
...
Timo Aila
S. Laine
Bryan Catanzaro
Tero Karras
Xuan Li
VLMMoE
177
828
0
02 Nov 2022
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination
Yue Yang
Wenlin Yao
Hongming Zhang
Xiaoyang Wang
Dong Yu
Jianshu Chen
VLM
67
22
0
21 Oct 2022
Visualize Before You Write: Imagination-Guided Open-Ended Text
  Generation
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation
Wanrong Zhu
An Yan
Yujie Lu
Wenda Xu
Xinze Wang
Miguel P. Eckstein
William Yang Wang
114
36
0
07 Oct 2022
Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning
Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning
Jiahui Gao
Renjie Pi
Yong Lin
Hang Xu
Jiacheng Ye
Zhiyong Wu
Weizhong Zhang
Xiaodan Liang
Zhenguo Li
Lingpeng Kong
SyDaVLM
144
49
0
25 May 2022
Photorealistic Text-to-Image Diffusion Models with Deep Language
  Understanding
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Chitwan Saharia
William Chan
Saurabh Saxena
Lala Li
Jay Whang
...
Raphael Gontijo-Lopes
Tim Salimans
Jonathan Ho
David J Fleet
Mohammad Norouzi
VLM
466
6,077
0
23 May 2022
Flamingo: a Visual Language Model for Few-Shot Learning
Flamingo: a Visual Language Model for Few-Shot Learning
Jean-Baptiste Alayrac
Jeff Donahue
Pauline Luc
Antoine Miech
Iain Barr
...
Mikolaj Binkowski
Ricardo Barreira
Oriol Vinyals
Andrew Zisserman
Karen Simonyan
MLLMVLM
418
3,607
0
29 Apr 2022
Imagination-Augmented Natural Language Understanding
Imagination-Augmented Natural Language Understanding
Yujie Lu
Wanrong Zhu
Xinze Wang
Miguel P. Eckstein
William Yang Wang
48
24
0
18 Apr 2022
Hierarchical Text-Conditional Image Generation with CLIP Latents
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya A. Ramesh
Prafulla Dhariwal
Alex Nichol
Casey Chu
Mark Chen
VLMDiffM
413
6,916
0
13 Apr 2022
ZeroGen: Efficient Zero-shot Learning via Dataset Generation
ZeroGen: Efficient Zero-shot Learning via Dataset Generation
Jiacheng Ye
Jiahui Gao
Qintong Li
Hang Xu
Jiangtao Feng
Zhiyong Wu
Tao Yu
Lingpeng Kong
SyDa
105
220
0
16 Feb 2022
Generating Training Data with Language Models: Towards Zero-Shot
  Language Understanding
Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
Yu Meng
Jiaxin Huang
Yu Zhang
Jiawei Han
SyDa
75
235
0
09 Feb 2022
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple
  Sequence-to-Sequence Learning Framework
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Peng Wang
An Yang
Rui Men
Junyang Lin
Shuai Bai
Zhikang Li
Jianxin Ma
Chang Zhou
Jingren Zhou
Hongxia Yang
MLLMObjD
157
880
0
07 Feb 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified
  Vision-Language Understanding and Generation
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
Guosheng Lin
MLLMBDLVLMCLIP
555
4,413
0
28 Jan 2022
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
496
15,768
0
20 Dec 2021
FLAVA: A Foundational Language And Vision Alignment Model
FLAVA: A Foundational Language And Vision Alignment Model
Amanpreet Singh
Ronghang Hu
Vedanuj Goswami
Guillaume Couairon
Wojciech Galuba
Marcus Rohrbach
Douwe Kiela
CLIPVLM
106
719
0
08 Dec 2021
On the Frequency Bias of Generative Models
On the Frequency Bias of Generative Models
Katja Schwarz
Yiyi Liao
Andreas Geiger
83
78
0
03 Nov 2021
Symbolic Knowledge Distillation: from General Language Models to
  Commonsense Models
Symbolic Knowledge Distillation: from General Language Models to Commonsense Models
Peter West
Chandrasekhar Bhagavatula
Jack Hessel
Jena D. Hwang
Liwei Jiang
Ronan Le Bras
Ximing Lu
Sean Welleck
Yejin Choi
SyDa
109
333
0
14 Oct 2021
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
Zirui Wang
Jiahui Yu
Adams Wei Yu
Zihang Dai
Yulia Tsvetkov
Yuan Cao
VLMMLLM
136
799
0
24 Aug 2021
Neural Network Classifier as Mutual Information Evaluator
Neural Network Classifier as Mutual Information Evaluator
Zhenyue Qin
Dongwoo Kim
Tom Gedeon
32
2
0
19 Jun 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIPVLM
978
29,871
0
26 Feb 2021
Vokenization: Improving Language Understanding with Contextualized,
  Visual-Grounded Supervision
Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision
Hao Tan
Joey Tianyi Zhou
CLIP
66
121
0
14 Oct 2020
Generative Imagination Elevates Machine Translation
Generative Imagination Elevates Machine Translation
Quanyu Long
Mingxuan Wang
Lei Li
60
37
0
21 Sep 2020
Denoising Diffusion Probabilistic Models
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
724
18,364
0
19 Jun 2020
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
Xiujun Li
Xi Yin
Chunyuan Li
Pengchuan Zhang
Xiaowei Hu
...
Houdong Hu
Li Dong
Furu Wei
Yejin Choi
Jianfeng Gao
VLM
140
1,947
0
13 Apr 2020
A unifying mutual information view of metric learning: cross-entropy vs.
  pairwise losses
A unifying mutual information view of metric learning: cross-entropy vs. pairwise losses
Malik Boudiaf
Jérôme Rony
Imtiaz Masud Ziko
Eric Granger
M. Pedersoli
Pablo Piantanida
Ismail Ben Ayed
SSL
89
160
0
19 Mar 2020
DivideMix: Learning with Noisy Labels as Semi-supervised Learning
DivideMix: Learning with Noisy Labels as Semi-supervised Learning
Junnan Li
R. Socher
Guosheng Lin
NoLa
107
1,034
0
18 Feb 2020
CopyMTL: Copy Mechanism for Joint Extraction of Entities and Relations
  with Multi-Task Learning
CopyMTL: Copy Mechanism for Joint Extraction of Entities and Relations with Multi-Task Learning
Daojian Zeng
Haoran Zhang
Qianying Liu
56
190
0
24 Nov 2019
12
Next