Describing Differences in Image Sets with Natural Language

Computer Vision and Pattern Recognition (CVPR), 2023
5 December 2023
Lisa Dunlap
Yuhui Zhang
Xiaohan Wang
Ruiqi Zhong
Trevor Darrell
Jacob Steinhardt
Joseph E. Gonzalez
Serena Yeung-Levy
CoGe, VLM

Papers citing "Describing Differences in Image Sets with Natural Language"

27 / 27 papers shown
ConceptScope: Characterizing Dataset Bias via Disentangled Visual Concepts
Jinho Choi
Hyesu Lim
Steffen Schneider
Jaegul Choo
52
0
0
30 Oct 2025
LoRAverse: A Submodular Framework to Retrieve Diverse Adapters for Diffusion Models
Mert Sonmezer
Matthew Zheng
Pinar Yanardag
DiffM, MoMe
165
0
0
16 Oct 2025
From Perception Logs to Failure Modes: Language-Driven Semantic Clustering of Failures for Robot Safety
Aryaman Gupta
Yusuf Umut Ciftci
Somil Bansal
100
1
0
06 Jun 2025
EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models
AAAI Conference on Artificial Intelligence (AAAI), 2025
G. MEng
Sunan He
Jinpeng Wang
Tao Dai
Letian Zhang
Jieming Zhu
Qing Li
Gang Wang
Rui Zhang
Yong Jiang
VLM
393
3
0
24 May 2025
HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?
Yusen Zhang
Wenliang Zheng
Aashrith Madasu
Peng Shi
Ryo Kamoi
...
Ranran Haoran Zhang
Avitej Iyer
Renze Lou
Wenpeng Yin
Rui Zhang
478
0
0
25 Apr 2025
Visual Chronicles: Using Multimodal LLMs to Analyze Massive Collections of Images
Boyang Deng
Songyou Peng
Kyle Genova
Gordon Wetzstein
Noah Snavely
Leonidas Guibas
Thomas Funkhouser
HAI
907
2
0
11 Apr 2025
On Large Multimodal Models as Open-World Image Classifiers
Alessandro Conti
Goran Frehse
Enrico Fini
Yiming Wang
Paolo Rota
Elisa Ricci
VLM
394
2
0
27 Mar 2025
ImageSet2Text: Describing Sets of Images through Text
Piera Riccio
F. Galati
Kajetan Schweighofer
Noa Garcia
Nuria Oliver
VLM, CoGe
375
1
0
25 Mar 2025
Video Action Differencing
International Conference on Learning Representations (ICLR), 2025
James Burgess
Xiaohan Wang
Yuhui Zhang
Anita Rau
Alejandro Lozano
Lisa Dunlap
Trevor Darrell
Serena Yeung-Levy
VGen
254
6
0
10 Mar 2025
Dataset Featurization: Uncovering Natural Language Features through Unsupervised Data Reconstruction
Michal Bravansky
Vaclav Kubon
Suhas Hariharan
Robert Kirk
287
2
0
24 Feb 2025
LaVCa: LLM-assisted Visual Cortex Captioning
Takuya Matsuyama
Shinji Nishimoto
Yu Takagi
248
3
0
20 Feb 2025
Idiosyncrasies in Large Language Models
Mingjie Sun
Yida Yin
Zhiqiu Xu
J. Zico Kolter
Zhuang Liu
291
17
0
17 Feb 2025
Progress-Aware Video Frame Captioning
Computer Vision and Pattern Recognition (CVPR), 2024
Zihui Xue
Joungbin An
Xitong Yang
Kristen Grauman
512
5
0
03 Dec 2024
Interpretable Next-token Prediction via the Generalized Induction Head
Eunji Kim
Sriya Mantena
Weiwei Yang
Chandan Singh
Sungroh Yoon
Jianfeng Gao
281
1
0
31 Oct 2024
Bayesian Concept Bottleneck Models with LLM Priors
Jean Feng
Avni Kothari
Luke Zier
Chandan Singh
Yan Shuo Tan
282
7
0
21 Oct 2024
Preference Optimization with Multi-Sample Comparisons
Chaoqi Wang
Zhuokai Zhao
Chen Zhu
Karthik Abinav Sankararaman
Michal Valko
...
Zhaorun Chen
Madian Khabsa
Yuxin Chen
Hao Ma
Sinong Wang
265
13
0
16 Oct 2024
Organizing Unstructured Image Collections using Natural Language
Mingxuan Liu
Zhun Zhong
Jun Li
Gianni Franchi
Subhankar Roy
Elisa Ricci
VLM
635
10
0
07 Oct 2024
CAST: Cross-modal Alignment Similarity Test for Vision Language Models
International Conference on Computational Linguistics (COLING), 2024
Gautier Dagan
Olga Loginova
Anil Batra
CoGe
209
1
0
17 Sep 2024
Explaining Datasets in Words: Statistical Models with Natural Language Parameters
Neural Information Processing Systems (NeurIPS), 2024
Ruiqi Zhong
Heng Wang
Dan Klein
Jacob Steinhardt
187
10
0
13 Sep 2024
No Detail Left Behind: Revisiting Self-Retrieval for Fine-Grained Image Captioning
Manu Gaur
Darshan Singh
Makarand Tapaswi
795
2
0
04 Sep 2024
LLM-assisted Concept Discovery: Automatically Identifying and Explaining Neuron Functions
N. Hoang-Xuan
Minh Nhat Vu
My T. Thai
158
5
0
12 Jun 2024
Inverse Constitutional AI: Compressing Preferences into Principles
Arduin Findeis
Timo Kaufmann
Eyke Hüllermeier
Samuel Albanie
Robert Mullins
SyDa
216
21
0
02 Jun 2024
Policy Learning with a Language Bottleneck
Megha Srivastava
Cédric Colas
Dorsa Sadigh
Jacob Andreas
248
3
0
07 May 2024
Discover and Mitigate Multiple Biased Subgroups in Image Classifiers
Zeliang Zhang
Mingqian Feng
Zhiheng Li
Chenliang Xu
245
12
0
19 Mar 2024
Predicting Text Preference Via Structured Comparative Reasoning
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Jing Nathan Yan
Tianqi Liu
Celine Lee
Jiaming Shen
Zhen Qin
...
Charumathi Lakshmanan
Y. Kurzion
Alexander M. Rush
Jialu Liu
Michael Bendersky
LRM
156
9
0
14 Nov 2023
A Unified Approach to Interpreting Model Predictions
Scott M. Lundberg
Su-In Lee
FAtt
2.6K
27,989
0
22 May 2017
"Why Should I Trust You?": Explaining the Predictions of Any Classifier
Marco Tulio Ribeiro
Sameer Singh
Carlos Guestrin
FAtt, FaML
1.8K
19,133
0
16 Feb 2016