Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest

arXiv: 2209.06293

13 September 2022
Jack Hessel
Ana Marasović
Jena D. Hwang
Lillian Lee
Jeff Da
Rowan Zellers
Robert Mankoff
Yejin Choi
    VLM

Papers citing "Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest"

50 / 64 papers shown
Probing and Inducing Combinational Creativity in Vision-Language Models
Yongqian Peng
Yuxi Ma
Mengmeng Wang
Yuxuan Wang
Yizhou Wang
C. Zhang
Yixin Zhu
Zilong Zheng
MLLM
CoGe
87
0
0
17 Apr 2025
Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models
Yuxiang Lin
Jingdong Sun
Zhi-Qi Cheng
Jue Wang
Haomin Liang
Zebang Cheng
Yifei Dong
Jun-Yan He
Xiaojiang Peng
Xian-Sheng Hua
47
0
0
10 Apr 2025
Bypassing Safety Guardrails in LLMs Using Humor
Pedro Cisneros-Velarde
29
0
0
09 Apr 2025
Hummus: A Dataset of Humorous Multimodal Metaphor Use
Xiaoyu Tong
Zhi Zhang
Martha Lewis
Ekaterina Shutova
29
0
0
03 Apr 2025
When 'YES' Meets 'BUT': Can Large Models Comprehend Contradictory Humor Through Comparative Reasoning?
Tuo Liang
Zhe Hu
Jing Li
Hao Zhang
Yiren Lu
...
Yiran Qiao
Disheng Liu
Jeirui Peng
Jing Ma
Yu Yin
47
0
0
29 Mar 2025
How Well Can Vison-Language Models Understand Humans' Intention? An Open-ended Theory of Mind Question Evaluation Benchmark
Ximing Wen
Mallika Mainali
Anik Sen
37
0
0
28 Mar 2025
Gemma 3 Technical Report
Gemma Team
Aishwarya B Kamath
Johan Ferret
Shreya Pathak
Nino Vieillard
...
Harshal Tushar Lehri
Hussein Hazimeh
Ian Ballantyne
Idan Szpektor
Ivan Nardini
VLM
85
30
0
25 Mar 2025
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles
Rui Zhao
Weijia Mao
Mike Zheng Shou
66
0
0
05 Mar 2025
Bridging the Creativity Understanding Gap: Small-Scale Human Alignment Enables Expert-Level Humor Ranking in LLMs
Kuan Lok Zhou
Jiayi Chen
Siddharth Suresh
Reuben Narad
Timothy Rogers
Lalit K Jain
R. Nowak
Bob Mankoff
Jifan Zhang
52
0
0
27 Feb 2025
BIG-Bench Extra Hard
Mehran Kazemi
Bahare Fatemi
Hritik Bansal
John Palowitch
Chrysovalantis Anastasiou
...
Kate Olszewska
Yi Tay
Vinh Q. Tran
Quoc V. Le
Orhan Firat
ELM
LRM
117
5
0
26 Feb 2025
BottleHumor: Self-Informed Humor Explanation using the Information Bottleneck Principle
EunJeong Hwang
Peter West
Vered Shwartz
39
1
0
22 Feb 2025
The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding
Mo Yu
Lemao Liu
J. Wu
Tsz Ting Chung
Shunchi Zhang
JiangNan Li
Dit-Yan Yeung
Jie Zhou
85
1
0
13 Feb 2025
Neuro-Symbolic AI in 2024: A Systematic Review
Brandon C. Colelough
William Regli
NAI
65
9
0
09 Jan 2025
Chumor 2.0: Towards Benchmarking Chinese Humor Understanding
Ruiqi He
Yushu He
Longju Bai
Jiarui Liu
Zhenjie Sun
Zenghao Tang
He Wang
Hanchen Xia
Rada Mihalcea
Naihao Deng
78
1
0
23 Dec 2024
PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension
Kun Ouyang
Yuanxin Liu
Shicheng Li
Yi Liu
Hao Zhou
Fandong Meng
Jie Zhou
Xu Sun
104
1
0
16 Dec 2024
Can Generic LLMs Help Analyze Child-adult Interactions Involving Children with Autism in Clinical Observation?
Tiantian Feng
Anfeng Xu
Rimita Lahiri
Helen Tager-Flusberg
So Hyun Kim
Somer Bishop
C. Lord
Shrikanth Narayanan
LM&MA
36
1
0
16 Nov 2024
EMOCPD: Efficient Attention-based Models for Computational Protein Design Using Amino Acid Microenvironment
Xiaoqi Ling
Cheng Cai
Demin Kong
Zhisheng Wei
Jing Wu
Lei Wang
Zhaohong Deng
3DV
30
0
0
28 Oct 2024
Can MLLMs Understand the Deep Implication Behind Chinese Images?
Chenhao Zhang
Xi Feng
Yuelin Bai
Xinrun Du
Jinchang Hou
...
Min Yang
Wenhao Huang
Chenghua Lin
Ge Zhang
Shiwen Ni
ELM
VLM
38
3
0
17 Oct 2024
Self-Comparison for Dataset-Level Membership Inference in Large (Vision-)Language Models
J. Ren
Kangrui Chen
Chen Chen
Vikash Sehwag
Yue Xing
Jiliang Tang
Lingjuan Lyu
24
1
0
16 Oct 2024
Can We Predict Performance of Large Models across Vision-Language Tasks?
Qinyu Zhao
Ming Xu
Kartik Gupta
Akshay Asthana
Liang Zheng
Stephen Gould
39
0
0
14 Oct 2024
Enhancing Advanced Visual Reasoning Ability of Large Language Models
Zhiyuan Li
Dongnan Liu
Chaoyi Zhang
Heng Wang
Tengfei Xue
Weidong Cai
VLM
LRM
43
6
0
21 Sep 2024
YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models
Abhilash Nandy
Yash Agarwal
Ashish Patwa
Millon Madhur Das
Aman Bansal
Ankit Raj
Pawan Goyal
Niloy Ganguly
30
0
0
20 Sep 2024
NYK-MS: A Well-annotated Multi-modal Metaphor and Sarcasm Understanding Benchmark on Cartoon-Caption Dataset
Ke Chang
Hao Li
Junzhao Zhang
Yunfang Wu
23
0
0
02 Sep 2024
Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models
Nitzan Bitton-Guetta
Aviv Slobodkin
Aviya Maimon
Eliya Habba
Royi Rassin
Yonatan Bitton
Idan Szpektor
Amir Globerson
Yuval Elovici
ReLM
VLM
LRM
34
5
0
28 Jul 2024
Predicting Winning Captions for Weekly New Yorker Comics
Stanley Cao
Sonny Young
ViT
VLM
37
1
0
12 Jul 2024
HEMM: Holistic Evaluation of Multimodal Foundation Models
Paul Pu Liang
Akshay Goindani
Talha Chafekar
Leena Mathur
Haofei Yu
Ruslan Salakhutdinov
Louis-Philippe Morency
36
10
0
03 Jul 2024
VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values
Zhe Hu
Yixiao Ren
Jing Li
Yu Yin
VLM
31
4
0
03 Jul 2024
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?
Runqi Qiao
Qiuna Tan
Guanting Dong
Minhui Wu
Chong Sun
...
Yida Xu
Muxi Diao
Zhimin Bao
Chen Li
Honggang Zhang
VLM
LRM
39
31
0
01 Jul 2024
Is AI fun? HumorDB: a curated dataset and benchmark to investigate graphical humor
Veedant Jain
Felipe dos Santos Alves Feitosa
Gabriel Kreiman
VLM
42
2
0
19 Jun 2024
Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning
Jifan Zhang
Lalit P. Jain
Yang Guo
Jiayi Chen
Kuan Lok Zhou
...
Scott Sievert
Timothy Rogers
Kevin Jamieson
Robert Mankoff
Robert Nowak
31
5
0
15 Jun 2024
Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding
Tuo Zhang
Tiantian Feng
Yibin Ni
Mengqin Cao
Ruying Liu
Katharine Butler
Yanjun Weng
Mi Zhang
Shrikanth S. Narayanan
Salman Avestimehr
49
1
0
14 Jun 2024
DiffuSyn Bench: Evaluating Vision-Language Models on Real-World Complexities with Diffusion-Generated Synthetic Benchmarks
Haokun Zhou
Yipeng Hong
VLM
EGVM
26
1
0
06 Jun 2024
Culturally Aware and Adapted NLP: A Taxonomy and a Survey of the State of the Art
Chen Cecilia Liu
Iryna Gurevych
Anna Korhonen
33
5
0
06 Jun 2024
Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions
Zhe Hu
Tuo Liang
Jing Li
Yiren Lu
Yunlai Zhou
Yiran Qiao
Jing Ma
Yu Yin
44
4
0
29 May 2024
DuanzAI: Slang-Enhanced LLM with Prompt for Humor Understanding
Yesian Rohn
16
0
0
23 May 2024
Humor Mechanics: Advancing Humor Generation with Multistep Reasoning
Alexey Tikhonov
Pavel Shtykovskiy
LRM
ReLM
26
1
0
12 May 2024
Do Large Language Models Understand Conversational Implicature -- A case study with a chinese sitcom
Shisen Yue
Siyuan Song
Xinyuan Cheng
Hai Hu
50
2
0
30 Apr 2024
"A good pun is its own reword": Can Large Language Models Understand Puns?
Zhijun Xu
Siyu Yuan
Lingjie Chen
Deqing Yang
LRM
40
8
0
21 Apr 2024
Language Models Still Struggle to Zero-shot Reason about Time Series
Mike A. Merrill
Mingtian Tan
Vinayak Gupta
Tom Hartvigsen
Tim Althoff
AI4TS
LRM
40
27
0
17 Apr 2024
What Are We Measuring When We Evaluate Large Vision-Language Models? An Analysis of Latent Factors and Biases
A. M. H. Tiong
Junqi Zhao
Boyang Albert Li
Junnan Li
S. Hoi
Caiming Xiong
40
8
0
03 Apr 2024
Getting Serious about Humor: Crafting Humor Datasets with Unfunny Large Language Models
Zachary Horvitz
Jingru Chen
Rahul Aditya
Harshvardhan Srivastava
Robert West
Zhou Yu
Kathleen McKeown
22
1
0
23 Feb 2024
Shallow Synthesis of Knowledge in GPT-Generated Texts: A Case Study in Automatic Related Work Composition
Anna Martin-Boyle
Aahan Tyagi
Marti A. Hearst
Dongyeop Kang
26
8
0
19 Feb 2024
Can Large Multimodal Models Uncover Deep Semantics Behind Images?
Yixin Yang
Zheng Li
Qingxiu Dong
Heming Xia
Zhifang Sui
VLM
27
9
0
17 Feb 2024
When LLMs Meet Cunning Texts: A Fallacy Understanding Benchmark for Large Language Models
Yinghui Li
Qingyu Zhou
Yuanzhen Luo
Shirong Ma
Yangning Li
Hai-Tao Zheng
Xuming Hu
Philip S. Yu
LRM
39
13
0
16 Feb 2024
SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models
Lee Hyun
Kim Sung-Bin
Seungju Han
Youngjae Yu
Tae-Hyun Oh
25
13
0
15 Dec 2023
Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation
Shan Zhong
Zhongzhan Huang
Shanghua Gao
Wushao Wen
Liang Lin
Marinka Zitnik
Pan Zhou
LLMAG
LRM
19
35
0
05 Dec 2023
MacGyver: Are Large Language Models Creative Problem Solvers?
Yufei Tian
Abhilasha Ravichander
Lianhui Qin
Ronan Le Bras
Raja Marjieh
Nanyun Peng
Yejin Choi
Thomas L. Griffiths
Faeze Brahman
AI4CE
LLMAG
15
11
0
16 Nov 2023
Social Meme-ing: Measuring Linguistic Variation in Memes
Naitian Zhou
David Jurgens
David Bamman
18
1
0
15 Nov 2023
Are You Sure? Challenging LLMs Leads to Performance Drops in The FlipFlop Experiment
Philippe Laban
Lidiya Murakhovs'ka
Caiming Xiong
Chien-Sheng Wu
LRM
26
19
0
14 Nov 2023
Chain of Images for Intuitively Reasoning
Fanxu Meng
Haotong Yang
Yiding Wang
Muhan Zhang
LRM
28
6
0
09 Nov 2023