ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1811.09845
  4. Cited By
Tell, Draw, and Repeat: Generating and Modifying Images Based on
  Continual Linguistic Instruction

Tell, Draw, and Repeat: Generating and Modifying Images Based on Continual Linguistic Instruction

24 November 2018
Alaaeldin El-Nouby
Shikhar Sharma
Hannes Schulz
Devon Hjelm
Layla El Asri
Samira Ebrahimi Kahou
Yoshua Bengio
Graham W.Taylor
    VLM
ArXivPDFHTML

Papers citing "Tell, Draw, and Repeat: Generating and Modifying Images Based on Continual Linguistic Instruction"

34 / 34 papers shown
Title
Are We Done with Object-Centric Learning?
Are We Done with Object-Centric Learning?
Alexander Rubinstein
Ameya Prabhu
Matthias Bethge
Seong Joon Oh
OCL
716
0
0
09 Apr 2025
InstructRL4Pix: Training Diffusion for Image Editing by Reinforcement
  Learning
InstructRL4Pix: Training Diffusion for Image Editing by Reinforcement Learning
Tiancheng Li
Yu Lei
Huajun Chen
Nan Zhuang
EGVM
40
0
0
14 Jun 2024
TheaterGen: Character Management with LLM for Consistent Multi-turn
  Image Generation
TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation
Junhao Cheng
Baiqiao Yin
Kaixin Cai
Minbin Huang
Hanhui Li
...
Yue Li
Yifei Li
Yuhao Cheng
Yiqiang Yan
Xiaodan Liang
DiffM
MLLM
45
12
0
29 Apr 2024
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Navve Wasserman
Noam Rotstein
Roy Ganz
Ron Kimmel
DiffM
39
15
0
28 Apr 2024
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation
Minbin Huang
Yanxin Long
Xinchi Deng
Ruihang Chu
Jiangfeng Xiong
Xiaodan Liang
Hong Cheng
Qinglin Lu
Wei Liu
MLLM
EGVM
65
8
0
13 Mar 2024
ZONE: Zero-Shot Instruction-Guided Local Editing
ZONE: Zero-Shot Instruction-Guided Local Editing
Shanglin Li
Bo-Wen Zeng
Yutang Feng
Sicheng Gao
Xuhui Liu
...
Li Lin
Xu Tang
Yao Hu
Jianzhuang Liu
Baochang Zhang
DiffM
36
30
0
28 Dec 2023
A Diffusion-based Method for Multi-turn Compositional Image Generation
A Diffusion-based Method for Multi-turn Compositional Image Generation
Chao Wang
DiffM
38
3
0
05 Apr 2023
CHATEDIT: Towards Multi-turn Interactive Facial Image Editing via
  Dialogue
CHATEDIT: Towards Multi-turn Interactive Facial Image Editing via Dialogue
Xing Cui
Zekun Li
Peipei Li
Yibo Hu
Hailin Shi
Zhaofeng He
36
7
0
20 Mar 2023
Knowledge-Based Counterfactual Queries for Visual Question Answering
Knowledge-Based Counterfactual Queries for Visual Question Answering
Theodoti Stoikou
Maria Lymperaiou
Giorgos Stamou
AAML
34
1
0
05 Mar 2023
The Contribution of Knowledge in Visiolinguistic Learning: A Survey on
  Tasks and Challenges
The Contribution of Knowledge in Visiolinguistic Learning: A Survey on Tasks and Challenges
Maria Lymperaiou
Giorgos Stamou
VLM
32
4
0
04 Mar 2023
Instruction Clarification Requests in Multimodal Collaborative Dialogue
  Games: Tasks, and an Analysis of the CoDraw Dataset
Instruction Clarification Requests in Multimodal Collaborative Dialogue Games: Tasks, and an Analysis of the CoDraw Dataset
Brielen Madureira
David Schlangen
51
10
0
28 Feb 2023
Fine-grained Cross-modal Fusion based Refinement for Text-to-Image
  Synthesis
Fine-grained Cross-modal Fusion based Refinement for Text-to-Image Synthesis
Haoran Sun
Yang Wang
Haipeng Liu
Biao Qian
26
8
0
17 Feb 2023
Training-Free Structured Diffusion Guidance for Compositional
  Text-to-Image Synthesis
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Weixi Feng
Xuehai He
Tsu-Jui Fu
Varun Jampani
Arjun Reddy Akula
P. Narayana
Sugato Basu
Xinze Wang
William Yang Wang
CoGe
51
300
0
09 Dec 2022
Target-Free Text-guided Image Manipulation
Target-Free Text-guided Image Manipulation
Wanshu Fan
Cheng Yang
Chiao-An Yang
Yu-Chiang Frank Wang
DiffM
31
2
0
26 Nov 2022
Tell Me What Happened: Unifying Text-guided Video Completion via
  Multimodal Masked Video Generation
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation
Tsu-Jui Fu
Licheng Yu
Ning Zhang
Cheng-Yang Fu
Jong-Chyi Su
William Yang Wang
Sean Bell
VGen
61
37
0
23 Nov 2022
DrawMon: A Distributed System for Detection of Atypical Sketch Content
  in Concurrent Pictionary Games
DrawMon: A Distributed System for Detection of Atypical Sketch Content in Concurrent Pictionary Games
Nikhil Bansal
Kartiki Gupta
Kiruthika Kannan
Sivani Pentapati
Ravi Kiran Sarvadevabhatla
38
0
0
10 Nov 2022
Robust Sound-Guided Image Manipulation
Robust Sound-Guided Image Manipulation
Seung Hyun Lee
Gyeongrok Oh
Wonmin Byeon
Sang Ho Yoon
Jinkyu Kim
Sangpil Kim
DiffM
26
7
0
30 Aug 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
134
1,072
0
22 Jun 2022
Improved Vector Quantized Diffusion Models
Improved Vector Quantized Diffusion Models
Zhicong Tang
Shuyang Gu
Jianmin Bao
Dong Chen
Fang Wen
DiffM
187
63
0
31 May 2022
EnvEdit: Environment Editing for Vision-and-Language Navigation
EnvEdit: Environment Editing for Vision-and-Language Navigation
Jialu Li
Hao Tan
Joey Tianyi Zhou
36
80
0
29 Mar 2022
CAISE: Conversational Agent for Image Search and Editing
CAISE: Conversational Agent for Image Search and Editing
Hyounghun Kim
Doo Soon Kim
Seunghyun Yoon
Franck Dernoncourt
Trung Bui
Joey Tianyi Zhou
27
6
0
24 Feb 2022
LatteGAN: Visually Guided Language Attention for Multi-Turn
  Text-Conditioned Image Manipulation
LatteGAN: Visually Guided Language Attention for Multi-Turn Text-Conditioned Image Manipulation
Shoya Matsumori
Yukikoko Abe
Kosuke Shingyouchi
K. Sugiura
M. Imai
34
9
0
28 Dec 2021
Sound-Guided Semantic Image Manipulation
Sound-Guided Semantic Image Manipulation
Seung Hyun Lee
Wonseok Roh
Wonmin Byeon
Sang Ho Yoon
Chanyoung Kim
Jinkyu Kim
Sangpil Kim
DiffM
35
43
0
30 Nov 2021
Vector Quantized Diffusion Model for Text-to-Image Synthesis
Vector Quantized Diffusion Model for Text-to-Image Synthesis
Shuyang Gu
Dong Chen
Jianmin Bao
Fang Wen
Bo Zhang
Dongdong Chen
Lu Yuan
B. Guo
DiffM
74
765
0
29 Nov 2021
Improving Generation and Evaluation of Visual Stories via Semantic
  Consistency
Improving Generation and Evaluation of Visual Stories via Semantic Consistency
A. Maharana
Darryl Hannan
Joey Tianyi Zhou
EGVM
24
61
0
20 May 2021
Adversarial Text-to-Image Synthesis: A Review
Adversarial Text-to-Image Synthesis: A Review
Stanislav Frolov
Tobias Hinz
Federico Raue
Jörn Hees
Andreas Dengel
EGVM
27
175
0
25 Jan 2021
Text-to-Image Generation Grounded by Fine-Grained User Attention
Text-to-Image Generation Grounded by Fine-Grained User Attention
Jing Yu Koh
Jason Baldridge
Honglak Lee
Yinfei Yang
DiffM
27
59
0
07 Nov 2020
SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning
SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning
Tsu-Jui Fu
Xinze Wang
Scott T. Grafton
Miguel P. Eckstein
William Yang Wang
36
40
0
21 Sep 2020
Describe What to Change: A Text-guided Unsupervised Image-to-Image
  Translation Approach
Describe What to Change: A Text-guided Unsupervised Image-to-Image Translation Approach
Yahui Liu
Marco De Nadai
Deng Cai
Huayang Li
Xavier Alameda-Pineda
N. Sebe
Bruno Lepri
38
59
0
10 Aug 2020
History for Visual Dialog: Do we really need it?
History for Visual Dialog: Do we really need it?
Shubham Agarwal
Trung Bui
Joon-Young Lee
Ioannis Konstas
Verena Rieser
VLM
19
69
0
08 May 2020
A Review on Generative Adversarial Networks: Algorithms, Theory, and
  Applications
A Review on Generative Adversarial Networks: Algorithms, Theory, and Applications
Jie Gui
Zhenan Sun
Yonggang Wen
Dacheng Tao
Jieping Ye
EGVM
33
821
0
20 Jan 2020
ManiGAN: Text-Guided Image Manipulation
ManiGAN: Text-Guided Image Manipulation
Bowen Li
Xiaojuan Qi
Thomas Lukasiewicz
Philip Torr
EGVM
61
284
0
12 Dec 2019
Semantic Object Accuracy for Generative Text-to-Image Synthesis
Semantic Object Accuracy for Generative Text-to-Image Synthesis
Tobias Hinz
Stefan Heinrich
S. Wermter
EGVM
29
158
0
29 Oct 2019
Conditional Image Synthesis With Auxiliary Classifier GANs
Conditional Image Synthesis With Auxiliary Classifier GANs
Augustus Odena
C. Olah
Jonathon Shlens
GAN
250
3,192
0
30 Oct 2016
1