ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.11362
  4. Cited By
Improving Cross-modal Alignment for Text-Guided Image Inpainting

Improving Cross-modal Alignment for Text-Guided Image Inpainting

26 January 2023
Yucheng Zhou
Guodong Long
ArXivPDFHTML

Papers citing "Improving Cross-modal Alignment for Text-Guided Image Inpainting"

18 / 18 papers shown
Title
High-Fidelity Pseudo-label Generation by Large Language Models for Training Robust Radiology Report Classifiers
High-Fidelity Pseudo-label Generation by Large Language Models for Training Robust Radiology Report Classifiers
Brian Wong
Kaito Tanaka
37
0
0
03 May 2025
Elevating Visual Question Answering through Implicitly Learned Reasoning Pathways in LVLMs
Elevating Visual Question Answering through Implicitly Learned Reasoning Pathways in LVLMs
Liu Jing
Amirul Rahman
ReLM
LRM
71
0
0
18 Mar 2025
A Generative Framework for Bidirectional Image-Report Understanding in Chest Radiography
A Generative Framework for Bidirectional Image-Report Understanding in Chest Radiography
Nicholas Evans
Stephen Baker
Miles Reed
LM&MA
MedIm
67
0
0
09 Feb 2025
Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation
Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation
Philip Hughes
Larry Burns
Luke Adams
VLM
39
0
0
27 Jan 2025
Vision-Driven Prompt Optimization for Large Language Models in Multimodal Generative Tasks
Vision-Driven Prompt Optimization for Large Language Models in Multimodal Generative Tasks
Leo Franklin
Apiradee Boonmee
Kritsada Wongsuwan
MLLM
VLM
43
0
0
05 Jan 2025
Hierarchical Vision-Language Alignment for Text-to-Image Generation via Diffusion Models
Emily Johnson
Noah Wilson
VLM
62
0
0
03 Jan 2025
Bridging Vision and Language: Modeling Causality and Temporality in
  Video Narratives
Bridging Vision and Language: Modeling Causality and Temporality in Video Narratives
Ji-jun Park
Soo-joon Choi
VGen
99
0
0
14 Dec 2024
Exploring Large Vision-Language Models for Robust and Efficient
  Industrial Anomaly Detection
Exploring Large Vision-Language Models for Robust and Efficient Industrial Anomaly Detection
Kun Qian
Tianyu Sun
Wenhong Wang
71
0
0
01 Dec 2024
Enhancing AI-Driven Psychological Consultation: Layered Prompts with
  Large Language Models
Enhancing AI-Driven Psychological Consultation: Layered Prompts with Large Language Models
Rafael Souza
Jia-Hao Lim
Alexander Davis
LM&MA
AI4MH
33
0
0
29 Aug 2024
DAFT-GAN: Dual Affine Transformation Generative Adversarial Network for
  Text-Guided Image Inpainting
DAFT-GAN: Dual Affine Transformation Generative Adversarial Network for Text-Guided Image Inpainting
Jihoon Lee
Yunhong Min
Hwidong Kim
Sangtae Ahn
32
0
0
09 Aug 2024
LLMs for Enhanced Agricultural Meteorological Recommendations
LLMs for Enhanced Agricultural Meteorological Recommendations
Ji-jun Park
Soo-joon Choi
34
1
0
30 Jul 2024
Enhancing Agricultural Machinery Management through Advanced LLM
  Integration
Enhancing Agricultural Machinery Management through Advanced LLM Integration
Emily Johnson
Noah Wilson
46
0
0
30 Jul 2024
Educational Personalized Learning Path Planning with Large Language
  Models
Educational Personalized Learning Path Planning with Large Language Models
Chee Ng
Yuen Fung
AI4Ed
34
3
0
16 Jul 2024
Exploiting Diffusion Prior for Out-of-Distribution Detection
Exploiting Diffusion Prior for Out-of-Distribution Detection
Armando Zhu
Jiabei Liu
Keqin Li
Shuying Dai
Bo Hong
Peng Zhao
Changsong Wei
46
7
0
16 Jun 2024
Dog Heart Rate and Blood Oxygen Metaverse Interaction System
Dog Heart Rate and Blood Oxygen Metaverse Interaction System
Yanhui Jiang
Jin Cao
Chang Yu
48
2
0
06 Jun 2024
Bridging the Gap between Synthetic and Authentic Images for Multimodal
  Machine Translation
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation
Wenyu Guo
Qingkai Fang
Dong Yu
Yang Feng
22
6
0
20 Oct 2023
EventBERT: A Pre-Trained Model for Event Correlation Reasoning
EventBERT: A Pre-Trained Model for Event Correlation Reasoning
Yucheng Zhou
Xiubo Geng
Tao Shen
Guodong Long
Daxin Jiang
42
46
0
13 Oct 2021
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,781
0
24 Feb 2021
1