SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning

21 September 2020
Tsu-jui Fu
Qing Guo
Scott T. Grafton
M. Eckstein
William Yang Wang

Papers citing "SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning"

30 / 30 papers shown
InstructRL4Pix: Training Diffusion for Image Editing by Reinforcement Learning
Tiancheng Li
Jinxiu Liu
Huajun Chen
Qi Liu
EGVM
37
0
0
14 Jun 2024
TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation
Junhao Cheng
Baiqiao Yin
Kaixin Cai
Minbin Huang
Hanhui Li
...
Yue Li
Yifei Li
Yuhao Cheng
Yiqiang Yan
Xiaodan Liang
DiffM
MLLM
32
12
0
29 Apr 2024
Identity-aware Dual-constraint Network for Cloth-Changing Person Re-identification
Peini Guo
Mengyuan Liu
Hong Liu
Ruijia Fan
Guoquan Wang
Bin He
21
0
0
13 Mar 2024
DivCon: Divide and Conquer for Progressive Text-to-Image Generation
Yuhao Jia
Wenhan Tan
DiffM
57
1
0
11 Mar 2024
Repositioning the Subject within Image
Yikai Wang
Chenjie Cao
Ke Fan
Qiaole Dong
Yifan Li
Xiangyang Xue
Yanwei Fu
DiffM
42
1
0
30 Jan 2024
Learning to Follow Object-Centric Image Editing Instructions Faithfully
Tuhin Chakrabarty
Kanishk Singh
Arkadiy Saakyan
Smaranda Muresan
DiffM
25
6
0
29 Oct 2023
Guiding Instruction-based Image Editing via Multimodal Large Language Models
Tsu-jui Fu
Wenze Hu
Xianzhi Du
William Yang Wang
Yinfei Yang
Zhe Gan
40
88
0
29 Sep 2023
Iterative Multi-granular Image Editing using Diffusion Models
K. J. Joseph
Prateksha Udhayanan
Tripti Shukla
Aishwarya Agarwal
Srikrishna Karanam
Koustava Goswami
Balaji Vasan Srinivasan
DiffM
30
16
0
01 Sep 2023
Diffusion idea exploration for art generation
N. Verma
DiffM
30
1
0
11 Jul 2023
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing
Kai Zhang
Lingbo Mo
Wenhu Chen
Huan Sun
Yu-Chuan Su
EGVM
111
237
0
16 Jun 2023
Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA
A. Vosoughi
Shijian Deng
Songyang Zhang
Yapeng Tian
Chenliang Xu
Jiebo Luo
CML
53
3
0
31 May 2023
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
Weixi Feng
Wanrong Zhu
Tsu-jui Fu
Varun Jampani
Arjun Reddy Akula
Xuehai He
Sugato Basu
Qing Guo
William Yang Wang
MLLM
27
162
0
24 May 2023
Text-guided 3D Human Generation from 2D Collections
Tsu-jui Fu
Wenhan Xiong
Yixin Nie
Jingyu Liu
Barlas Oğuz
William Yang Wang
39
1
0
23 May 2023
Video Generation Beyond a Single Clip
Hsin-Ping Huang
Yu-Chuan Su
Ming Yang
VLM
DiffM
VGen
22
3
0
15 Apr 2023
A Diffusion-based Method for Multi-turn Compositional Image Generation
Chao Wang
DiffM
33
3
0
05 Apr 2023
Instruction Clarification Requests in Multimodal Collaborative Dialogue Games: Tasks, and an Analysis of the CoDraw Dataset
Brielen Madureira
David Schlangen
46
10
0
28 Feb 2023
Entity-Level Text-Guided Image Manipulation
Yikai Wang
Jianan Wang
Guansong Lu
Hang Xu
Zhenguo Li
Wei Zhang
Yanwei Fu
VGen
31
3
0
22 Feb 2023
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Weixi Feng
Xuehai He
Tsu-jui Fu
Varun Jampani
Arjun Reddy Akula
P. Narayana
Sugato Basu
Qing Guo
William Yang Wang
CoGe
30
299
0
09 Dec 2022
Learning Action-Effect Dynamics for Hypothetical Vision-Language Reasoning Task
Shailaja Keyur Sampat
Pratyay Banerjee
Yezhou Yang
Chitta Baral
21
2
0
07 Dec 2022
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation
Tsu-jui Fu
Licheng Yu
Ning Zhang
Cheng-Yang Fu
Jong-Chyi Su
William Yang Wang
Sean Bell
VGen
56
37
0
23 Nov 2022
ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation
Jianan Wang
Guansong Lu
Hang Xu
Zhenguo Li
Chunjing Xu
Yanwei Fu
25
17
0
09 Apr 2022
CAISE: Conversational Agent for Image Search and Editing
Hyounghun Kim
Doo Soon Kim
Seunghyun Yoon
Franck Dernoncourt
Trung Bui
Joey Tianyi Zhou
19
6
0
24 Feb 2022
LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training
Qi Shi
Qian Liu
Bei Chen
Yu Zhang
Ting Liu
Jian-Guang Lou
32
9
0
20 Jan 2022
LatteGAN: Visually Guided Language Attention for Multi-Turn Text-Conditioned Image Manipulation
Shoya Matsumori
Yukikoko Abe
Kosuke Shingyouchi
K. Sugiura
M. Imai
34
9
0
28 Dec 2021
Talk-to-Edit: Fine-Grained Facial Editing via Dialog
Yuming Jiang
Ziqi Huang
Xingang Pan
Chen Change Loy
Ziwei Liu
DiffM
107
126
0
09 Sep 2021
Unified Questioner Transformer for Descriptive Question Generation in Goal-Oriented Visual Dialogue
Shoya Matsumori
Kosuke Shingyouchi
Yukikoko Abe
Yosuke Fukuchi
K. Sugiura
M. Imai
36
16
0
29 Jun 2021
Element Intervention for Open Relation Extraction
Liu Fangchao
Lingyong Yan
Hongyu Lin
Xianpei Han
Le Sun
LRM
19
20
0
17 Jun 2021
Language-Driven Image Style Transfer
Tsu-jui Fu
Qing Guo
William Yang Wang
CLIP
VLM
21
46
0
01 Jun 2021
M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers
Tsu-jui Fu
Qing Guo
Scott T. Grafton
M. Eckstein
Luu Anh Tuan
24
9
0
02 Apr 2021
Counterfactual VQA: A Cause-Effect Look at Language Bias
Yulei Niu
Kaihua Tang
Hanwang Zhang
Zhiwu Lu
Xiansheng Hua
Ji-Rong Wen
CML
36
394
0
08 Jun 2020