ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2102.02201
  4. Cited By
When Can Models Learn From Explanations? A Formal Framework for
  Understanding the Roles of Explanation Data

When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data

3 February 2021
Peter Hase
Joey Tianyi Zhou
    XAI
ArXivPDFHTML

Papers citing "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"

50 / 69 papers shown
Title
AI-Slop to AI-Polish? Aligning Language Models through Edit-Based Writing Rewards and Test-time Computation
AI-Slop to AI-Polish? Aligning Language Models through Edit-Based Writing Rewards and Test-time Computation
Tuhin Chakrabarty
Philippe Laban
C. Wu
32
1
0
10 Apr 2025
Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards
Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards
Xinyi Yang
Liang Zeng
Heng Dong
Chao Yu
X. Wu
H. Yang
Yu Wang
Milind Tambe
Tonghan Wang
76
2
0
18 Feb 2025
Who Taught You That? Tracing Teachers in Model Distillation
Who Taught You That? Tracing Teachers in Model Distillation
Somin Wadhwa
Chantal Shaib
Silvio Amir
Byron C. Wallace
72
1
0
10 Feb 2025
Chain-of-Translation Prompting (CoTR): A Novel Prompting Technique for Low Resource Languages
Chain-of-Translation Prompting (CoTR): A Novel Prompting Technique for Low Resource Languages
Tejas Deshpande
Nidhi Kowtal
Raviraj Joshi
LRM
55
1
0
31 Dec 2024
Enhancing SLM via ChatGPT and Dataset Augmentation
Enhancing SLM via ChatGPT and Dataset Augmentation
Tom Pieper
Mohamad Ballout
U. Krumnack
Gunther Heidemann
Kai-Uwe Kühnberger
33
0
0
19 Sep 2024
Efficient Knowledge Distillation: Empowering Small Language Models with
  Teacher Model Insights
Efficient Knowledge Distillation: Empowering Small Language Models with Teacher Model Insights
Mohamad Ballout
U. Krumnack
Gunther Heidemann
Kai-Uwe Kühnberger
35
2
0
19 Sep 2024
Explanation Regularisation through the Lens of Attributions
Explanation Regularisation through the Lens of Attributions
Pedro Ferreira
Wilker Aziz
Ivan Titov
43
1
0
23 Jul 2024
Data-Centric Human Preference Optimization with Rationales
Data-Centric Human Preference Optimization with Rationales
H. Just
Ming Jin
Anit Kumar Sahu
Huy Phan
Ruoxi Jia
49
3
0
19 Jul 2024
Evaluating Human Alignment and Model Faithfulness of LLM Rationale
Evaluating Human Alignment and Model Faithfulness of LLM Rationale
Mohsen Fayyaz
Fan Yin
Jiao Sun
Nanyun Peng
59
3
0
28 Jun 2024
A look under the hood of the Interactive Deep Learning Enterprise
  (No-IDLE)
A look under the hood of the Interactive Deep Learning Enterprise (No-IDLE)
Daniel Sonntag
Michael Barz
Thiago S. Gouvêa
VLM
49
4
0
27 Jun 2024
Investigating Mysteries of CoT-Augmented Distillation
Investigating Mysteries of CoT-Augmented Distillation
Somin Wadhwa
Silvio Amir
Byron C. Wallace
ReLM
LRM
29
8
0
20 Jun 2024
LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability
  of Large Language Models
LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models
Mihir Parmar
Nisarg Patel
Neeraj Varshney
Mutsumi Nakamura
Man Luo
Santosh Mashetty
Arindam Mitra
Chitta Baral
LRM
ReLM
ELM
38
23
0
23 Apr 2024
A survey on Concept-based Approaches For Model Improvement
A survey on Concept-based Approaches For Model Improvement
Avani Gupta
P. J. Narayanan
LRM
32
5
0
21 Mar 2024
Show Me How It's Done: The Role of Explanations in Fine-Tuning Language
  Models
Show Me How It's Done: The Role of Explanations in Fine-Tuning Language Models
Mohamad Ballout
U. Krumnack
Gunther Heidemann
Kai-Uwe Kuehnberger
LRM
24
3
0
12 Feb 2024
Towards Faithful Explanations for Text Classification with Robustness
  Improvement and Explanation Guided Training
Towards Faithful Explanations for Text Classification with Robustness Improvement and Explanation Guided Training
Dongfang Li
Baotian Hu
Qingcai Chen
Shan He
31
4
0
29 Dec 2023
ALMANACS: A Simulatability Benchmark for Language Model Explainability
ALMANACS: A Simulatability Benchmark for Language Model Explainability
Edmund Mills
Shiye Su
Stuart J. Russell
Scott Emmons
51
7
0
20 Dec 2023
Multi-Defendant Legal Judgment Prediction via Hierarchical Reasoning
Multi-Defendant Legal Judgment Prediction via Hierarchical Reasoning
Yougang Lyu
Jitai Hao
Zihan Wang
Kai Zhao
Shen Gao
Pengjie Ren
Zhumin Chen
Fang Wang
Zhaochun Ren
AILaw
19
9
0
10 Dec 2023
Zero-shot Conversational Summarization Evaluations with small Large
  Language Models
Zero-shot Conversational Summarization Evaluations with small Large Language Models
R. Manuvinakurike
Saurav Sahay
Sangeeta Manepalli
L. Nachman
ELM
LM&MA
22
0
0
29 Nov 2023
Concept Distillation: Leveraging Human-Centered Explanations for Model
  Improvement
Concept Distillation: Leveraging Human-Centered Explanations for Model Improvement
Avani Gupta
Saurabh Saini
P. J. Narayanan
25
6
0
26 Nov 2023
Meta Prompting for AI Systems
Meta Prompting for AI Systems
Yifan Zhang
Yang Yuan
Andrew Chi-Chih Yao
LLMAG
LRM
21
5
0
20 Nov 2023
Explain-then-Translate: An Analysis on Improving Program Translation
  with Self-generated Explanations
Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations
Zilu Tang
Mayank Agarwal
Alex Shypula
Bailin Wang
Derry Wijaya
Jie Chen
Yoon Kim
LRM
37
15
0
13 Nov 2023
MCC-KD: Multi-CoT Consistent Knowledge Distillation
MCC-KD: Multi-CoT Consistent Knowledge Distillation
Hongzhan Chen
Siyue Wu
Xiaojun Quan
Rui Wang
Ming Yan
Ji Zhang
LRM
19
17
0
23 Oct 2023
REFER: An End-to-end Rationale Extraction Framework for Explanation
  Regularization
REFER: An End-to-end Rationale Extraction Framework for Explanation Regularization
Mohammad Reza Ghasemi Madani
Pasquale Minervini
32
4
0
22 Oct 2023
XAL: EXplainable Active Learning Makes Classifiers Better Low-resource
  Learners
XAL: EXplainable Active Learning Makes Classifiers Better Low-resource Learners
Yun Luo
Zhen Yang
Fandong Meng
Yingjie Li
Fang Guo
Qinglin Qi
Jie Zhou
Yue Zhang
24
1
0
09 Oct 2023
Cumulative Reasoning with Large Language Models
Cumulative Reasoning with Large Language Models
Yifan Zhang
Jingqin Yang
Yang Yuan
Andrew Chi-Chih Yao
ReLM
ELM
LRM
AI4CE
36
69
0
08 Aug 2023
Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think"
  Step-by-Step
Symbolic Chain-of-Thought Distillation: Small Models Can Also "Think" Step-by-Step
Liunian Harold Li
Jack Hessel
Youngjae Yu
Xiang Ren
Kai-Wei Chang
Yejin Choi
LRM
AI4CE
ReLM
22
129
0
24 Jun 2023
Passive learning of active causal strategies in agents and language
  models
Passive learning of active causal strategies in agents and language models
Andrew Kyle Lampinen
Stephanie C. Y. Chan
Ishita Dasgupta
A. Nam
Jane X. Wang
29
15
0
25 May 2023
Language Models Implement Simple Word2Vec-style Vector Arithmetic
Language Models Implement Simple Word2Vec-style Vector Arithmetic
Jack Merullo
Carsten Eickhoff
Ellie Pavlick
KELM
31
52
0
25 May 2023
EDM3: Event Detection as Multi-task Text Generation
EDM3: Event Detection as Multi-task Text Generation
Ujjwala Anantheswaran
Himanshu Gupta
Mihir Parmar
Kuntal Kumar Pal
Chitta Baral
27
5
0
25 May 2023
Beyond Labels: Empowering Human Annotators with Natural Language
  Explanations through a Novel Active-Learning Architecture
Beyond Labels: Empowering Human Annotators with Natural Language Explanations through a Novel Active-Learning Architecture
Bingsheng Yao
Ishan Jindal
Lucian Popa
Yannis Katsis
Sayan Ghosh
...
Yuxuan Lu
Shashank Srivastava
Yunyao Li
James A. Hendler
Dakuo Wang
34
10
0
22 May 2023
Are Human Explanations Always Helpful? Towards Objective Evaluation of
  Human Natural Language Explanations
Are Human Explanations Always Helpful? Towards Objective Evaluation of Human Natural Language Explanations
Bingsheng Yao
Prithviraj Sen
Lucian Popa
James A. Hendler
Dakuo Wang
XAI
ELM
FAtt
23
10
0
04 May 2023
Distilling Step-by-Step! Outperforming Larger Language Models with Less
  Training Data and Smaller Model Sizes
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes
Lokesh Nagalapatti
Chun-Liang Li
Chih-Kuan Yeh
Hootan Nakhost
Yasuhisa Fujii
Alexander Ratner
Ranjay Krishna
Chen-Yu Lee
Tomas Pfister
ALM
220
502
0
03 May 2023
Understanding and Predicting Human Label Variation in Natural Language
  Inference through Explanation
Understanding and Predicting Human Label Variation in Natural Language Inference through Explanation
Nan-Jiang Jiang
Chenhao Tan
M. Marneffe
27
2
0
24 Apr 2023
Training Language Models with Language Feedback at Scale
Training Language Models with Language Feedback at Scale
Jérémy Scheurer
Jon Ander Campos
Tomasz Korbak
Jun Shern Chan
Angelica Chen
Kyunghyun Cho
Ethan Perez
ALM
39
103
0
28 Mar 2023
Improving Code Generation by Training with Natural Language Feedback
Improving Code Generation by Training with Natural Language Feedback
Angelica Chen
Jérémy Scheurer
Tomasz Korbak
Jon Ander Campos
Jun Shern Chan
Samuel R. Bowman
Kyunghyun Cho
Ethan Perez
SyDa
ALM
AI4CE
31
76
0
28 Mar 2023
InstructABSA: Instruction Learning for Aspect Based Sentiment Analysis
InstructABSA: Instruction Learning for Aspect Based Sentiment Analysis
Kevin Scaria
Himanshu Gupta
Siddharth Goyal
Saurabh Arjun Sawant
Swaroop Mishra
Chitta Baral
26
25
0
16 Feb 2023
Streamlining models with explanations in the learning loop
Streamlining models with explanations in the learning loop
Francesco Lomuscio
P. Bajardi
Alan Perotti
E. Amparore
FAtt
29
0
0
15 Feb 2023
"Why is this misleading?": Detecting News Headline Hallucinations with
  Explanations
"Why is this misleading?": Detecting News Headline Hallucinations with Explanations
Jiaming Shen
Jialu Liu
Daniel Finnie
N. Rahmati
Michael Bendersky
Marc Najork
30
19
0
12 Feb 2023
KNIFE: Distilling Reasoning Knowledge From Free-Text Rationales
KNIFE: Distilling Reasoning Knowledge From Free-Text Rationales
Aaron Chan
Zhiyuan Zeng
Wyatt Lake
Brihi Joshi
Hanjie Chen
Xiang Ren
ReLM
LRM
31
1
0
19 Dec 2022
Calibration Meets Explanation: A Simple and Effective Approach for Model
  Confidence Estimates
Calibration Meets Explanation: A Simple and Effective Approach for Model Confidence Estimates
Dongfang Li
Baotian Hu
Qingcai Chen
13
8
0
06 Nov 2022
Does Self-Rationalization Improve Robustness to Spurious Correlations?
Does Self-Rationalization Improve Robustness to Spurious Correlations?
Alexis Ross
Matthew E. Peters
Ana Marasović
LRM
24
11
0
24 Oct 2022
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
Mirac Suzgun
Nathan Scales
Nathanael Scharli
Sebastian Gehrmann
Yi Tay
...
Aakanksha Chowdhery
Quoc V. Le
Ed H. Chi
Denny Zhou
Jason W. Wei
ALM
ELM
LRM
ReLM
83
997
0
17 Oct 2022
Learning to Reason With Relational Abstractions
Learning to Reason With Relational Abstractions
A. Nam
Mengye Ren
Chelsea Finn
James L. McClelland
ReLM
LRM
37
4
0
06 Oct 2022
Towards Faithful Model Explanation in NLP: A Survey
Towards Faithful Model Explanation in NLP: A Survey
Qing Lyu
Marianna Apidianaki
Chris Callison-Burch
XAI
109
107
0
22 Sep 2022
Leveraging Explanations in Interactive Machine Learning: An Overview
Leveraging Explanations in Interactive Machine Learning: An Overview
Stefano Teso
Öznur Alkan
Wolfgang Stammer
Elizabeth M. Daly
XAI
FAtt
LRM
26
62
0
29 Jul 2022
Mediators: Conversational Agents Explaining NLP Model Behavior
Mediators: Conversational Agents Explaining NLP Model Behavior
Nils Feldhus
A. Ravichandran
Sebastian Möller
32
16
0
13 Jun 2022
Investigating the Benefits of Free-Form Rationales
Investigating the Benefits of Free-Form Rationales
Jiao Sun
Swabha Swayamdipta
Jonathan May
Xuezhe Ma
21
14
0
25 May 2022
ER-Test: Evaluating Explanation Regularization Methods for Language
  Models
ER-Test: Evaluating Explanation Regularization Methods for Language Models
Brihi Joshi
Aaron Chan
Ziyi Liu
Shaoliang Nie
Maziar Sanjabi
Hamed Firooz
Xiang Ren
AAML
38
6
0
25 May 2022
Learning to Ignore Adversarial Attacks
Learning to Ignore Adversarial Attacks
Yiming Zhang
Yan Zhou
Samuel Carton
Chenhao Tan
48
2
0
23 May 2022
Training Language Models with Language Feedback
Training Language Models with Language Feedback
Jérémy Scheurer
Jon Ander Campos
Jun Shern Chan
Angelica Chen
Kyunghyun Cho
Ethan Perez
ALM
38
47
0
29 Apr 2022
12
Next