ResearchTrend.AI
Underspecification in Scene Description-to-Depiction Tasks

11 October 2022
Ben Hutchinson
Jason Baldridge
Vinodkumar Prabhakaran
    DiffM

Papers citing "Underspecification in Scene Description-to-Depiction Tasks"

29 / 29 papers shown
Att-Adapter: A Robust and Precise Domain-Specific Multi-Attributes T2I Diffusion Adapter via Conditional Variational Autoencoder
Wonwoong Cho
Yan-Ying Chen
M. Klenk
David I. Inouye
Yanxia Zhang
DiffM
168
0
0
15 Mar 2025
GRADE: Quantifying Sample Diversity in Text-to-Image Models
Royi Rassin
Aviv Slobodkin
Shauli Ravfogel
Yanai Elazar
Yoav Goldberg
91
1
0
29 Oct 2024
Beyond Aesthetics: Cultural Competence in Text-to-Image Models
Nithish Kannen
Arif Ahmad
Marco Andreetto
Vinodkumar Prabhakaran
Utsav Prabhu
Adji Bousso Dieng
Pushpak Bhattacharyya
Shachi Dave
56
16
0
09 Jul 2024
DOCCI: Descriptions of Connected and Contrasting Images
Yasumasa Onoe
Sunayana Rane
Zachary Berger
Yonatan Bitton
Jaemin Cho
...
Zarana Parekh
Jordi Pont-Tuset
Garrett Tanzer
Su Wang
Jason Baldridge
41
48
0
30 Apr 2024
Modeling the Sacred: Considerations when Using Religious Texts in Natural Language Processing
Ben Hutchinson
91
0
0
23 Apr 2024
An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance
Simran Khanuja
Sathyanarayanan Ramamoorthy
Yueqi Song
Graham Neubig
DiffM
22
11
0
01 Apr 2024
Do Pre-Trained Language Models Detect and Understand Semantic Underspecification? Ask the DUST!
Frank Wildenburg
Michael Hanna
Sandro Pezzelle
31
3
0
19 Feb 2024
Examining Gender and Racial Bias in Large Vision-Language Models Using a Novel Dataset of Parallel Images
Kathleen C. Fraser
S. Kiritchenko
46
33
0
08 Feb 2024
Prompt Expansion for Adaptive Text-to-Image Generation
Siddhartha Datta
Alexander Ku
Deepak Ramachandran
Peter Anderson
DiffM
39
9
0
27 Dec 2023
Semantic and Expressive Variation in Image Captions Across Languages
Andre Ye
Sebastin Santy
Jena D. Hwang
Amy X. Zhang
Ranjay Krishna
VLM
58
3
0
22 Oct 2023
Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic Task
Maya Okawa
Ekdeep Singh Lubana
Robert P. Dick
Hidenori Tanaka
CoGe
DiffM
37
44
0
13 Oct 2023
ITI-GEN: Inclusive Text-to-Image Generation
Cheng Zhang
Xuanbai Chen
Siqi Chai
Chen Henry Wu
Dmitry Lagun
Thabo Beeler
Fernando de la Torre
VLM
32
52
0
11 Sep 2023
Manipulating Embeddings of Stable Diffusion Prompts
Niklas Deckers
Julia Peters
Martin Potthast
DiffM
40
9
0
23 Aug 2023
The Bias Amplification Paradox in Text-to-Image Generation
P. Seshadri
Sameer Singh
Yanai Elazar
DiffM
24
39
0
01 Aug 2023
Dealing with Semantic Underspecification in Multimodal NLP
Sandro Pezzelle
19
9
0
08 Jun 2023
Multilingual Conceptual Coverage in Text-to-Image Models
Michael Stephen Saxon
William Yang Wang
EGVM
24
8
0
02 Jun 2023
Generative AI for Product Design: Getting the Right Design and the Design Right
Matthew K. Hong
Shabnam Hakimi
Yan-Ying Chen
Heishiro Toyoda
Charlene C. Wu
M. Klenk
AI4CE
19
16
0
02 Jun 2023
PaLI-X: On Scaling up a Multilingual Vision and Language Model
Xi Chen
Josip Djolonga
Piotr Padlewski
Basil Mustafa
Soravit Changpinyo
...
Mojtaba Seyedhosseini
A. Angelova
Xiaohua Zhai
N. Houlsby
Radu Soricut
VLM
62
187
0
29 May 2023
I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors
Tuhin Chakrabarty
Arkadiy Saakyan
Olivia Winn
Artemis Panagopoulou
Yue Yang
Marianna Apidianaki
Smaranda Muresan
DiffM
33
41
0
24 May 2023
AI's Regimes of Representation: A Community-centered Study of Text-to-Image Models in South Asia
Rida Qadri
Renee Shelby
Cynthia L. Bennett
Emily Denton
26
67
0
19 May 2023
Inspecting the Geographical Representativeness of Images from Text-to-Image Models
Aparna Basu
R. Venkatesh Babu
Danish Pruthi
DiffM
31
39
0
18 May 2023
CoBIT: A Contrastive Bi-directional Image-Text Generation Model
Haoxuan You
Mandy Guo
Zhecan Wang
Kai-Wei Chang
Jason Baldridge
Jiahui Yu
DiffM
49
12
0
23 Mar 2023
A Friendly Face: Do Text-to-Image Systems Rely on Stereotypes when the Input is Under-Specified?
Kathleen C. Fraser
S. Kiritchenko
I. Nejadgholi
DiffM
35
36
0
14 Feb 2023
DALLE-2 is Seeing Double: Flaws in Word-to-Concept Mapping in Text2Image Models
Royi Rassin
Shauli Ravfogel
Yoav Goldberg
21
60
0
19 Oct 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Jiahui Yu
Yuanzhong Xu
Jing Yu Koh
Thang Luong
Gunjan Baid
...
Zarana Parekh
Xin Li
Han Zhang
Jason Baldridge
Yonghui Wu
EGVM
107
1,062
0
22 Jun 2022
WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning
Krishna Srinivasan
K. Raman
Jiecao Chen
Michael Bendersky
Marc Najork
VLM
208
310
0
02 Mar 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,781
0
24 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
313
3,708
0
11 Feb 2021
Diversity and Inclusion Metrics in Subset Selection
Margaret Mitchell
Dylan K. Baker
Nyalleng Moorosi
Emily L. Denton
Ben Hutchinson
A. Hanna
Timnit Gebru
Jamie Morgenstern
FaML
150
85
0
09 Feb 2020