ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2404.10198
  4. Cited By
ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence

ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence

16 April 2024
Kevin Wu
Eric Wu
James Zou
    AAML
ArXivPDFHTML

Papers citing "ClashEval: Quantifying the tug-of-war between an LLM's internal prior and external evidence"

32 / 32 papers shown
Title
Do Large Language Models Know Conflict? Investigating Parametric vs. Non-Parametric Knowledge of LLMs for Conflict Forecasting
Do Large Language Models Know Conflict? Investigating Parametric vs. Non-Parametric Knowledge of LLMs for Conflict Forecasting
Apollinaire Poli Nemkova
Sarath Chandra Lingareddy
Sagnik Ray Choudhury
Mark V. Albert
21
0
0
14 May 2025
ConSens: Assessing context grounding in open-book question answering
ConSens: Assessing context grounding in open-book question answering
Ivan Vankov
Matyo Ivanov
Adriana Correia
Victor Botev
ELM
67
0
0
30 Apr 2025
RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models
RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models
Bang An
Shiyue Zhang
Mark Dredze
61
0
0
25 Apr 2025
Transparentize the Internal and External Knowledge Utilization in LLMs with Trustworthy Citation
Transparentize the Internal and External Knowledge Utilization in LLMs with Trustworthy Citation
Jiajun Shen
Tong Zhou
Yubo Chen
Delai Qiu
Shengping Liu
Kang-Jun Liu
Jun Zhao
HILM
RALM
86
0
0
21 Apr 2025
Medical large language models are easily distracted
Medical large language models are easily distracted
Krithik Vishwanath
Anton Alyakin
Daniel Alber
Jin Vivian Lee
Douglas Kondziolka
E. Oermann
31
0
0
01 Apr 2025
Don't lie to your friends: Learning what you know from collaborative self-play
Don't lie to your friends: Learning what you know from collaborative self-play
Jacob Eisenstein
Reza Aghajani
Adam Fisch
Dheeru Dua
Fantine Huot
Mirella Lapata
Vicky Zayats
Jonathan Berant
72
0
0
18 Mar 2025
Quantifying the Robustness of Retrieval-Augmented Language Models Against Spurious Features in Grounding Data
Shiping Yang
Jie Wu
Wenbiao Ding
Ning Wu
Shining Liang
Ming Gong
Hengyuan Zhang
Dongmei Zhang
AAML
66
1
0
07 Mar 2025
Words or Vision: Do Vision-Language Models Have Blind Faith in Text?
Ailin Deng
Tri Cao
Zhirui Chen
Bryan Hooi
VLM
98
2
0
04 Mar 2025
MEBench: Benchmarking Large Language Models for Cross-Document Multi-Entity Question Answering
MEBench: Benchmarking Large Language Models for Cross-Document Multi-Entity Question Answering
Teng Lin
RALM
68
2
0
26 Feb 2025
Enhancing Retrieval-Augmented Generation: A Study of Best Practices
Enhancing Retrieval-Augmented Generation: A Study of Best Practices
Siran Li
Linus Stenzel
Carsten Eickhoff
Seyed Ali Bahrainian
RALM
3DV
64
4
0
13 Jan 2025
Decoding Knowledge in Large Language Models: A Framework for Categorization and Comprehension
Yanbo Fang
Ruixiang Tang
ELM
38
0
0
03 Jan 2025
Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating
  RAG Systems
Know Your RAG: Dataset Taxonomy and Generation Strategies for Evaluating RAG Systems
Rafael Teixeira de Lima
Shubham Gupta
Cesar Berrospi
Lokesh Mishra
Michele Dolfi
Peter W. J. Staar
Panagiotis Vagenas
82
1
0
29 Nov 2024
Exploring Knowledge Boundaries in Large Language Models for Retrieval
  Judgment
Exploring Knowledge Boundaries in Large Language Models for Retrieval Judgment
Zhen Zhang
Xinyu Wang
Yong-feng Jiang
Zhuo Chen
Feiteng Mu
Mengting Hu
Pengjun Xie
Fei Huang
KELM
59
2
0
09 Nov 2024
TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context
  Support for Network
TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network
Nouf Alabbasi
Omar Erak
Omar Alhussein
Ismail Lotfi
Sami Muhaidat
Merouane Debbah
RALM
154
0
0
04 Nov 2024
Rationale-Guided Retrieval Augmented Generation for Medical Question
  Answering
Rationale-Guided Retrieval Augmented Generation for Medical Question Answering
Jiwoong Sohn
Yein Park
Chanwoong Yoon
Sihyeon Park
Hyeon Hwang
Mujeen Sung
Hyunjae Kim
Jaewoo Kang
RALM
67
6
0
01 Nov 2024
Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial
  Applications
Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications
Monica Riedler
Stefan Langer
VLM
41
12
0
29 Oct 2024
Teaching Models to Balance Resisting and Accepting Persuasion
Teaching Models to Balance Resisting and Accepting Persuasion
Elias Stengel-Eskin
Peter Hase
Joey Tianyi Zhou
MU
31
4
0
18 Oct 2024
RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards
RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards
Xinze Li
Sen Mei
Zhenghao Liu
Yukun Yan
Shuo Wang
...
H. Chen
Ge Yu
Zhiyuan Liu
Maosong Sun
Chenyan Xiong
50
7
0
17 Oct 2024
Parenting: Optimizing Knowledge Selection of Retrieval-Augmented
  Language Models with Parameter Decoupling and Tailored Tuning
Parenting: Optimizing Knowledge Selection of Retrieval-Augmented Language Models with Parameter Decoupling and Tailored Tuning
Yongxin Xu
Ruizhe Zhang
Xinke Jiang
Yujie Feng
Yuzhen Xiao
Xinyu Ma
Runchuan Zhu
Xu Chu
Junfeng Zhao
Yasha Wang
KELM
22
4
0
14 Oct 2024
Deciphering the Interplay of Parametric and Non-parametric Memory in
  Retrieval-augmented Language Models
Deciphering the Interplay of Parametric and Non-parametric Memory in Retrieval-augmented Language Models
M. Farahani
Richard Johansson
RALM
33
2
0
07 Oct 2024
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"
Yifei Ming
Senthil Purushwalkam
Shrey Pandit
Zixuan Ke
Xuan-Phi Nguyen
Caiming Xiong
Shafiq R. Joty
HILM
112
16
0
30 Sep 2024
Questioning Internal Knowledge Structure of Large Language Models
  Through the Lens of the Olympic Games
Questioning Internal Knowledge Structure of Large Language Models Through the Lens of the Olympic Games
Juhwan Choi
Youngbin Kim
46
0
0
10 Sep 2024
Hierarchical Retrieval-Augmented Generation Model with Rethink for
  Multi-hop Question Answering
Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question Answering
Xiaoming Zhang
Ming Wang
Xiaocui Yang
Daling Wang
Shi Feng
Yifei Zhang
RALM
32
5
0
20 Aug 2024
Value Alignment from Unstructured Text
Value Alignment from Unstructured Text
Inkit Padhi
K. Ramamurthy
P. Sattigeri
Manish Nagireddy
Pierre L. Dognin
Kush R. Varshney
32
0
0
19 Aug 2024
KnowPO: Knowledge-aware Preference Optimization for Controllable
  Knowledge Selection in Retrieval-Augmented Language Models
KnowPO: Knowledge-aware Preference Optimization for Controllable Knowledge Selection in Retrieval-Augmented Language Models
Ruizhe Zhang
Yongxin Xu
Yuzhen Xiao
Runchuan Zhu
Xinke Jiang
Xu Chu
Junfeng Zhao
Yasha Wang
37
2
0
06 Aug 2024
Grounding and Evaluation for Large Language Models: Practical Challenges
  and Lessons Learned (Survey)
Grounding and Evaluation for Large Language Models: Practical Challenges and Lessons Learned (Survey)
K. Kenthapadi
M. Sameki
Ankur Taly
HILM
ELM
AILaw
39
12
0
10 Jul 2024
A Tale of Trust and Accuracy: Base vs. Instruct LLMs in RAG Systems
A Tale of Trust and Accuracy: Base vs. Instruct LLMs in RAG Systems
Florin Cuconasu
Giovanni Trappolini
Nicola Tonellotto
Fabrizio Silvestri
53
2
0
21 Jun 2024
On the Intrinsic Self-Correction Capability of LLMs: Uncertainty and
  Latent Concept
On the Intrinsic Self-Correction Capability of LLMs: Uncertainty and Latent Concept
Guangliang Liu
Haitao Mao
Bochuan Cao
Zhiyu Xue
K. Johnson
Jiliang Tang
Rongrong Wang
LRM
34
9
0
04 Jun 2024
Compressing Long Context for Enhancing RAG with AMR-based Concept
  Distillation
Compressing Long Context for Enhancing RAG with AMR-based Concept Distillation
Kaize Shi
Xueyao Sun
Qing Li
Guandong Xu
48
13
0
06 May 2024
Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large
  Language Models in Knowledge Conflicts
Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts
Jian Xie
Kai Zhang
Jiangjie Chen
Renze Lou
Yu-Chuan Su
RALM
211
155
0
22 May 2023
Evaluation of GPT-3.5 and GPT-4 for supporting real-world information
  needs in healthcare delivery
Evaluation of GPT-3.5 and GPT-4 for supporting real-world information needs in healthcare delivery
Debadutta Dash
Rahul Thapa
Juan M. Banda
Akshay Swaminathan
Morgan Cheatham
...
Garret K. Morris
H. Magon
M. Lungren
Eric Horvitz
N. Shah
ELM
LM&MA
AI4MH
68
51
0
26 Apr 2023
Entity-Based Knowledge Conflicts in Question Answering
Entity-Based Knowledge Conflicts in Question Answering
Shayne Longpre
Kartik Perisetla
Anthony Chen
Nikhil Ramesh
Chris DuBois
Sameer Singh
HILM
245
237
0
10 Sep 2021
1