ResearchTrend.AI

Decompose and Compare Consistency: Measuring VLMs' Answer Reliability via Task-Decomposition Consistency Comparison

10 July 2024
Qian Yang
Weixiang Yan
Aishwarya Agrawal
    CoGe

Papers citing "Decompose and Compare Consistency: Measuring VLMs' Answer Reliability via Task-Decomposition Consistency Comparison"

5 / 5 papers shown
Title
Lost in OCR Translation? Vision-Based Approaches to Robust Document Retrieval
Alexander Buschmann Most
Joseph Winjum
Ayan Biswas
Shawn Jones
Nishath Rajiv Ranasinghe
Dan O’Malley
Manish Bhattarai
26
0
0
08 May 2025
Few-Shot Recalibration of Language Models
Xiang Lisa Li
Urvashi Khandelwal
Kelvin Guu
44
5
0
27 Mar 2024
Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts
Jian Xie
Kai Zhang
Jiangjie Chen
Renze Lou
Yu-Chuan Su
RALM
211
155
0
22 May 2023
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
314
3,248
0
21 Mar 2022
Reducing conversational agents' overconfidence through linguistic calibration
Sabrina J. Mielke
Arthur Szlam
Emily Dinan
Y-Lan Boureau
209
154
0
30 Dec 2020