ResearchTrend.AI
Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models

1 January 2021
Tongshuang Wu
Marco Tulio Ribeiro
Jeffrey Heer
Daniel S. Weld
arXiv: 2101.00288

Papers citing "Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models"

50 / 182 papers shown
Title
SCISSOR: Mitigating Semantic Bias through Cluster-Aware Siamese Networks for Robust Classification
Shuo Yang
Bardh Prenkaj
Gjergji Kasneci
40
0
0
17 Jun 2025
UGCE: User-Guided Incremental Counterfactual Exploration
Christos Fragkathoulas
E. Pitoura
32
0
0
27 May 2025
Through a Compressed Lens: Investigating the Impact of Quantization on LLM Explainability and Interpretability
Qianli Wang
Mingyang Wang
Nils Feldhus
Simon Ostermann
Yuan Cao
Hinrich Schütze
Sebastian Möller
Vera Schmitt
MQ
65
1
0
20 May 2025
Truth or Twist? Optimal Model Selection for Reliable Label Flipping Evaluation in LLM-based Counterfactuals
Qianli Wang
Van Bach Nguyen
Nils Feldhus
Luis Felipe Villa-Arenas
Christin Seifert
Sebastian Möller
Vera Schmitt
64
0
0
20 May 2025
Reasoning-Grounded Natural Language Explanations for Language Models
Vojtech Cahlik
Rodrigo Alves
Pavel Kordík
LRM
103
2
0
14 Mar 2025
Biases in Large Language Model-Elicited Text: A Case Study in Natural Language Inference
Grace Proebsting
Adam Poliak
101
0
0
06 Mar 2025
Guiding LLMs to Generate High-Fidelity and High-Quality Counterfactual Explanations for Text Classification
Van Bach Nguyen
C. Seifert
Jorg Schlotterer
BDL
129
0
0
06 Mar 2025
Interactive Debugging and Steering of Multi-Agent AI Systems
Will Epperson
Gagan Bansal
Victor C. Dibia
Adam Fourney
Jack Gerrits
Erkang Zhu
Saleema Amershi
119
7
0
03 Mar 2025
Conceptual Contrastive Edits in Textual and Vision-Language Retrieval
Maria Lymperaiou
Giorgos Stamou
VLM
82
0
0
01 Mar 2025
Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant
Gaole He
Nilay Aishwarya
U. Gadiraju
97
10
0
29 Jan 2025
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
Tongshuang Wu
Haiyi Zhu
Maya Albayrak
Alexis Axon
Amanda Bertsch
...
Ying-Jui Tseng
Patricia Vaidos
Zhijin Wu
Wei Wu
Chenyang Yang
182
34
0
10 Jan 2025
FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation
Qianli Wang
Nils Feldhus
Simon Ostermann
Luis Felipe Villa-Arenas
Sebastian Möller
Vera Schmitt
AAML
131
1
0
01 Jan 2025
The Evolution of LLM Adoption in Industry Data Curation Practices
Crystal Qian
Michael Xieyang Liu
Emily Reif
Grady Simon
Nada Hussein
Nathan Clement
James Wexler
Carrie J. Cai
Michael Terry
Minsuk Kahng
AILaw ELM
112
5
0
20 Dec 2024
Interpreting Language Reward Models via Contrastive Explanations
Junqi Jiang
Tom Bewley
Saumitra Mishra
Freddy Lecue
Manuela Veloso
175
2
0
25 Nov 2024
Gumbel Counterfactual Generation From Language Models
Shauli Ravfogel
Anej Svete
Vésteinn Snæbjarnarson
Ryan Cotterell
LRM CML
105
1
0
11 Nov 2024
A Comparative Analysis of Counterfactual Explanation Methods for Text Classifiers
Stephen McAleese
Mark Keane
73
0
0
04 Nov 2024
Generating Diverse Negations from Affirmative Sentences
Darian Rodriguez Vasquez
Afroditi Papadaki
89
0
0
30 Oct 2024
PromptExp: Multi-granularity Prompt Explanation of Large Language Models
Ximing Dong
Shaowei Wang
Dayi Lin
Gopi Krishnan Rajbahadur
Boquan Zhou
Shichao Liu
Ahmed E. Hassan
AAML LRM
88
1
0
16 Oct 2024
Reasoning Elicitation in Language Models via Counterfactual Feedback
Alihan Hüyük
Xinnuo Xu
Jacqueline R. M. A. Maasch
Aditya V. Nori
Javier González
ReLM LRM
449
3
0
02 Oct 2024
Exploring Empty Spaces: Human-in-the-Loop Data Augmentation
Catherine Yeh
Donghao Ren
Yannick Assogba
Dominik Moritz
Fred Hohman
105
0
0
01 Oct 2024
Supporting Co-Adaptive Machine Teaching through Human Concept Learning and Cognitive Theories
Simret Araya Gebreegziabher
Yukun Yang
Elena L. Glassman
Tao Li
88
5
0
25 Sep 2024
CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Causal Significance and Consistency
Kangsheng Wang
Xiao Zhang
Zizheng Guo
Tianyu Hu
Huimin Ma
LRM
159
7
0
20 Sep 2024
Counterfactuals As a Means for Evaluating Faithfulness of Attribution Methods in Autoregressive Language Models
Sepehr Kamahi
Yadollah Yaghoobzadeh
166
0
0
21 Aug 2024
Case-based Explainability for Random Forest: Prototypes, Critics, Counter-factuals and Semi-factuals
Gregory Yampolsky
Dhruv Desai
Mingshu Li
Stefano Pasquali
Dhagash Mehta
64
4
0
13 Aug 2024
SCENE: Evaluating Explainable AI Techniques Using Soft Counterfactuals
Haoran Zheng
Utku Pamuksuz
112
0
0
08 Aug 2024
Leveraging Variation Theory in Counterfactual Data Augmentation for Optimized Active Learning
Simret Araya Gebreegziabher
Kuangshi Ai
Zheng Zhang
Elena L. Glassman
Tao Li
47
4
0
07 Aug 2024
Optimal and efficient text counterfactuals using Graph Neural Networks
Dimitris Lymperopoulos
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
73
1
0
04 Aug 2024
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs
Nitay Calderon
Roi Reichart
136
16
0
27 Jul 2024
FairFlow: An Automated Approach to Model-based Counterfactual Data Augmentation For NLP
E. Tokpo
T. Calders
62
1
0
23 Jul 2024
XAI meets LLMs: A Survey of the Relation between Explainable AI and Large Language Models
Min Zhang
Lorenzo Malandri
Fabio Mercorio
Navid Nobani
Andrea Seveso
121
15
0
21 Jul 2024
A Survey on Natural Language Counterfactual Generation
Yongjie Wang
Xiaoqi Qiu
Yu Yue
Xu Guo
Zhiwei Zeng
Yuhong Feng
Zhiqi Shen
88
9
0
04 Jul 2024
Is Your Large Language Model Knowledgeable or a Choices-Only Cheater?
Nishant Balepur
Rachel Rudinger
94
8
0
02 Jul 2024
The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning
Shaobo Cui
Zhijing Jin
Bernhard Schölkopf
Boi Faltings
CML LRM
96
4
0
27 Jun 2024
Automated Adversarial Discovery for Safety Classifiers
Yash Kumar Lal
Preethi Lahoti
Aradhana Sinha
Yao Qin
Ananth Balashankar
129
0
0
24 Jun 2024
CELL your Model: Contrastive Explanations for Large Language Models
Ronny Luss
Erik Miehling
Amit Dhurandhar
143
0
0
17 Jun 2024
Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation
Yi Liu
Xiangyu Liu
Xiangrong Zhu
Wei Hu
75
2
0
30 May 2024
PertEval: Unveiling Real Knowledge Capacity of LLMs with Knowledge-Invariant Perturbations
Jiatong Li
Renjun Hu
Kunzhe Huang
Zhuang Yan
Qi Liu
Mengxiao Zhu
Xing Shi
Wei Lin
KELM
110
8
0
30 May 2024
Beyond Agreement: Diagnosing the Rationale Alignment of Automated Essay Scoring Methods based on Linguistically-informed Counterfactuals
Yupei Wang
Renfen Hu
Zhe Zhao
91
3
0
29 May 2024
Low-rank finetuning for LLMs: A fairness perspective
Saswat Das
Marco Romanelli
Cuong Tran
Zarreen Reza
B. Kailkhura
Ferdinando Fioretto
72
2
0
28 May 2024
Overlap Number of Balls Model-Agnostic CounterFactuals (ONB-MACF): A Data-Morphology-based Counterfactual Generation Method for Trustworthy Artificial Intelligence
José Daniel Pascual-Triana
Alberto Fernández
Javier Del Ser
Francisco Herrera
82
1
0
20 May 2024
Data Science Principles for Interpretable and Explainable AI
Kris Sankaran
FaML
116
1
0
17 May 2024
Mitigating Text Toxicity with Counterfactual Generation
Milan Bhan
Jean-Noel Vittaut
Nina Achache
Victor Legrand
Nicolas Chesneau
A. Blangero
Juliette Murris
Marie-Jeanne Lesot
MedIm
215
0
0
16 May 2024
Challenges and Opportunities in Text Generation Explainability
Kenza Amara
Rita Sevastjanova
Mennatallah El-Assady
SILM
81
3
0
14 May 2024
Zero-shot LLM-guided Counterfactual Generation for Text
Amrita Bhattacharjee
Raha Moraffah
Joshua Garland
Huan Liu
94
7
0
08 May 2024
CEval: A Benchmark for Evaluating Counterfactual Text Generation
Van Bach Nguyen
Jorg Schlotterer
Christin Seifert
107
7
0
26 Apr 2024
LLMs for Generating and Evaluating Counterfactuals: A Comprehensive Study
Van Bach Nguyen
Paul Youssef
Jorg Schlotterer
Christin Seifert
87
18
0
26 Apr 2024
Does It Make Sense to Explain a Black Box With Another Black Box?
J. Delaunay
Luis Galárraga
Christine Largouet
AAML
66
1
0
23 Apr 2024
Utilizing Adversarial Examples for Bias Mitigation and Accuracy Enhancement
Pushkar Shukla
Dhruv Srikanth
Lee Cohen
Matthew Turk
AAML
73
0
0
18 Apr 2024
Explainable Generative AI (GenXAI): A Survey, Conceptualization, and Research Agenda
Johannes Schneider
142
35
0
15 Apr 2024
A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation
Jifan Yu
Xiaohan Zhang
Yifan Xu
Xuanyu Lei
Zijun Yao
Jing Zhang
Lei Hou
Juanzi Li
HILM
115
2
0
04 Apr 2024