Underspecification in Scene Description-to-Depiction Tasks

11 October 2022

Ben Hutchinson

Jason Baldridge

Vinodkumar Prabhakaran

DiffM

ArXiv PDF HTML

Papers citing "Underspecification in Scene Description-to-Depiction Tasks"

29 / 29 papers shown

Title
Att-Adapter: A Robust and Precise Domain-Specific Multi-Attributes T2I Diffusion Adapter via Conditional Variational Autoencoder Wonwoong Cho Yan-Ying Chen M. Klenk David I. Inouye Yanxia Zhang DiffM 168 0 0 15 Mar 2025
GRADE: Quantifying Sample Diversity in Text-to-Image Models Royi Rassin Aviv Slobodkin Shauli Ravfogel Yanai Elazar Yoav Goldberg 91 1 0 29 Oct 2024
Beyond Aesthetics: Cultural Competence in Text-to-Image Models Nithish Kannen Arif Ahmad Marco Andreetto Vinodkumar Prabhakaran Utsav Prabhu Adji Bousso Dieng Pushpak Bhattacharyya Shachi Dave 56 16 0 09 Jul 2024
DOCCI: Descriptions of Connected and Contrasting Images Yasumasa Onoe Sunayana Rane Zachary Berger Yonatan Bitton Jaemin Cho ... Zarana Parekh Jordi Pont-Tuset Garrett Tanzer Su Wang Jason Baldridge 41 48 0 30 Apr 2024
Modeling the Sacred: Considerations when Using Religious Texts in Natural Language Processing Ben Hutchinson 91 0 0 23 Apr 2024
An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance Simran Khanuja Sathyanarayanan Ramamoorthy Yueqi Song Graham Neubig DiffM 22 11 0 01 Apr 2024
Do Pre-Trained Language Models Detect and Understand Semantic Underspecification? Ask the DUST! Frank Wildenburg Michael Hanna Sandro Pezzelle 31 3 0 19 Feb 2024
Examining Gender and Racial Bias in Large Vision-Language Models Using a Novel Dataset of Parallel Images Kathleen C. Fraser S. Kiritchenko 46 33 0 08 Feb 2024
Prompt Expansion for Adaptive Text-to-Image Generation Siddhartha Datta Alexander Ku Deepak Ramachandran Peter Anderson DiffM 39 9 0 27 Dec 2023
Semantic and Expressive Variation in Image Captions Across Languages Andre Ye Sebastin Santy Jena D. Hwang Amy X. Zhang Ranjay Krishna VLM 58 3 0 22 Oct 2023
Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic Task Maya Okawa Ekdeep Singh Lubana Robert P. Dick Hidenori Tanaka CoGe DiffM 37 44 0 13 Oct 2023
ITI-GEN: Inclusive Text-to-Image Generation Cheng Zhang Xuanbai Chen Siqi Chai Chen Henry Wu Dmitry Lagun Thabo Beeler Fernando de la Torre VLM 32 52 0 11 Sep 2023
Manipulating Embeddings of Stable Diffusion Prompts Niklas Deckers Julia Peters Martin Potthast DiffM 40 9 0 23 Aug 2023
The Bias Amplification Paradox in Text-to-Image Generation P. Seshadri Sameer Singh Yanai Elazar DiffM 24 39 0 01 Aug 2023
Dealing with Semantic Underspecification in Multimodal NLP Sandro Pezzelle 19 9 0 08 Jun 2023
Multilingual Conceptual Coverage in Text-to-Image Models Michael Stephen Saxon William Yang Wang EGVM 24 8 0 02 Jun 2023
Generative AI for Product Design: Getting the Right Design and the Design Right Matthew K. Hong Shabnam Hakimi Yan-Ying Chen Heishiro Toyoda Charlene C. Wu M. Klenk AI4CE 19 16 0 02 Jun 2023
PaLI-X: On Scaling up a Multilingual Vision and Language Model Xi Chen Josip Djolonga Piotr Padlewski Basil Mustafa Soravit Changpinyo ... Mojtaba Seyedhosseini A. Angelova Xiaohua Zhai N. Houlsby Radu Soricut VLM 62 187 0 29 May 2023
I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors Tuhin Chakrabarty Arkadiy Saakyan Olivia Winn Artemis Panagopoulou Yue Yang Marianna Apidianaki Smaranda Muresan DiffM 33 41 0 24 May 2023
AI's Regimes of Representation: A Community-centered Study of Text-to-Image Models in South Asia Rida Qadri Renee Shelby Cynthia L. Bennett Emily Denton 26 67 0 19 May 2023
Inspecting the Geographical Representativeness of Images from Text-to-Image Models Aparna Basu R. Venkatesh Babu Danish Pruthi DiffM 31 39 0 18 May 2023
CoBIT: A Contrastive Bi-directional Image-Text Generation Model Haoxuan You Mandy Guo Zhecan Wang Kai-Wei Chang Jason Baldridge Jiahui Yu DiffM 49 12 0 23 Mar 2023
A Friendly Face: Do Text-to-Image Systems Rely on Stereotypes when the Input is Under-Specified? Kathleen C. Fraser S. Kiritchenko I. Nejadgholi DiffM 35 36 0 14 Feb 2023
DALLE-2 is Seeing Double: Flaws in Word-to-Concept Mapping in Text2Image Models Royi Rassin Shauli Ravfogel Yoav Goldberg 21 60 0 19 Oct 2022
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation Jiahui Yu Yuanzhong Xu Jing Yu Koh Thang Luong Gunjan Baid ... Zarana Parekh Xin Li Han Zhang Jason Baldridge Yonghui Wu EGVM 107 1,062 0 22 Jun 2022
WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning Krishna Srinivasan K. Raman Jiecao Chen Michael Bendersky Marc Najork VLM 208 310 0 02 Mar 2021
Zero-Shot Text-to-Image Generation Aditya A. Ramesh Mikhail Pavlov Gabriel Goh Scott Gray Chelsea Voss Alec Radford Mark Chen Ilya Sutskever VLM 255 4,781 0 24 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision Chao Jia Yinfei Yang Ye Xia Yi-Ting Chen Zarana Parekh Hieu H. Pham Quoc V. Le Yun-hsuan Sung Zhen Li Tom Duerig VLM CLIP 313 3,708 0 11 Feb 2021
Diversity and Inclusion Metrics in Subset Selection Margaret Mitchell Dylan K. Baker Nyalleng Moorosi Emily L. Denton Ben Hutchinson A. Hanna Timnit Gebru Jamie Morgenstern FaML 150 85 0 09 Feb 2020