ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.01007
  4. Cited By
Effective Human-AI Teams via Learned Natural Language Rules and
  Onboarding
v1v2 (latest)

Effective Human-AI Teams via Learned Natural Language Rules and Onboarding

2 November 2023
Hussein Mozannar
Jimin J Lee
Dennis L. Wei
P. Sattigeri
Subhro Das
David Sontag
ArXiv (abs)PDFHTML

Papers citing "Effective Human-AI Teams via Learned Natural Language Rules and Onboarding"

35 / 35 papers shown
Title
The Value of Information in Human-AI Decision-making
The Value of Information in Human-AI Decision-making
Ziyang Guo
Yifan Wu
Jason D. Hartline
Jessica Hullman
FAtt
217
0
0
10 Feb 2025
Contrastive Explanations That Anticipate Human Misconceptions Can Improve Human Decision-Making Skills
Contrastive Explanations That Anticipate Human Misconceptions Can Improve Human Decision-Making Skills
Zana Buçinca
S. Swaroop
Amanda E. Paluch
Finale Doshi-Velez
Krzysztof Z. Gajos
102
3
0
05 Oct 2024
Are Machine Rationales (Not) Useful to Humans? Measuring and Improving
  Human Utility of Free-Text Rationales
Are Machine Rationales (Not) Useful to Humans? Measuring and Improving Human Utility of Free-Text Rationales
Brihi Joshi
Ziyi Liu
Sahana Ramnath
Aaron Chan
Zhewei Tong
Shaoliang Nie
Qifan Wang
Yejin Choi
Xiang Ren
HAILRM
75
35
0
11 May 2023
Learning Personalized Decision Support Policies
Learning Personalized Decision Support Policies
Umang Bhatt
Valerie Chen
Katherine M. Collins
Parameswaran Kamalaruban
Emma Kallina
Adrian Weller
Ameet Talwalkar
OffRL
213
11
0
13 Apr 2023
Explanations Can Reduce Overreliance on AI Systems During
  Decision-Making
Explanations Can Reduce Overreliance on AI Systems During Decision-Making
Helena Vasconcelos
Matthew Jörke
Madeleine Grunde-McLaughlin
Tobias Gerstenberg
Michael S. Bernstein
Ranjay Krishna
72
176
0
13 Dec 2022
Adaptive Testing of Computer Vision Models
Adaptive Testing of Computer Vision Models
Irena Gao
Gabriel Ilharco
Scott M. Lundberg
Marco Tulio Ribeiro
VLM
73
43
0
06 Dec 2022
Measuring Progress on Scalable Oversight for Large Language Models
Measuring Progress on Scalable Oversight for Large Language Models
Sam Bowman
Jeeyoon Hyun
Ethan Perez
Edwin Chen
Craig Pettit
...
Tristan Hume
Yuntao Bai
Zac Hatfield-Dodds
Benjamin Mann
Jared Kaplan
ALMELM
81
132
0
04 Nov 2022
Scaling Instruction-Finetuned Language Models
Scaling Instruction-Finetuned Language Models
Hyung Won Chung
Le Hou
Shayne Longpre
Barret Zoph
Yi Tay
...
Jacob Devlin
Adam Roberts
Denny Zhou
Quoc V. Le
Jason W. Wei
ReLMLRM
238
3,165
0
20 Oct 2022
SEAL : Interactive Tool for Systematic Error Analysis and Labeling
SEAL : Interactive Tool for Systematic Error Analysis and Labeling
Nazneen Rajani
Weixin Liang
Lingjiao Chen
Margaret Mitchell
James Zou
96
16
0
11 Oct 2022
Sample Efficient Learning of Predictors that Complement Humans
Sample Efficient Learning of Predictors that Complement Humans
Mohammad-Amin Charusaie
Hussein Mozannar
David Sontag
Samira Samadi
71
34
0
19 Jul 2022
GitHub Copilot AI pair programmer: Asset or Liability?
GitHub Copilot AI pair programmer: Asset or Liability?
Arghavan Moradi Dakhel
Vahid Majdinasab
Amin Nikanjam
Foutse Khomh
Michel C. Desmarais
Zhen Ming
Z. Jiang
95
358
0
30 Jun 2022
Distilling Model Failures as Directions in Latent Space
Distilling Model Failures as Directions in Latent Space
Saachi Jain
Hannah Lawrence
Ankur Moitra
Aleksander Madry
97
90
0
29 Jun 2022
Human-AI Collaboration via Conditional Delegation: A Case Study of
  Content Moderation
Human-AI Collaboration via Conditional Delegation: A Case Study of Content Moderation
Vivian Lai
Samuel Carton
Rajat Bhatnagar
Vera Liao
Yunfeng Zhang
Chenhao Tan
85
134
0
25 Apr 2022
Domino: Discovering Systematic Errors with Cross-Modal Embeddings
Domino: Discovering Systematic Errors with Cross-Modal Embeddings
Sabri Eyuboglu
M. Varma
Khaled Kamal Saab
Jean-Benoit Delbrouck
Christopher Lee-Messer
Jared A. Dunnmon
James Zou
Christopher Ré
100
148
0
24 Mar 2022
Describing Differences between Text Distributions with Natural Language
Describing Differences between Text Distributions with Natural Language
Ruiqi Zhong
Charles Burton Snell
Dan Klein
Jacob Steinhardt
VLM
189
44
0
28 Jan 2022
Towards a Science of Human-AI Decision Making: A Survey of Empirical
  Studies
Towards a Science of Human-AI Decision Making: A Survey of Empirical Studies
Vivian Lai
Chacha Chen
Q. V. Liao
Alison Smith-Renner
Chenhao Tan
111
189
0
21 Dec 2021
Teaching Humans When To Defer to a Classifier via Exemplars
Teaching Humans When To Defer to a Classifier via Exemplars
Hussein Mozannar
Arvindmani Satyanarayan
David Sontag
82
43
0
22 Nov 2021
Auditing AI models for Verified Deployment under Semantic Specifications
Auditing AI models for Verified Deployment under Semantic Specifications
Homanga Bharadhwaj
De-An Huang
Chaowei Xiao
Anima Anandkumar
Animesh Garg
MLAU
86
6
0
25 Sep 2021
Wordcraft: a Human-AI Collaborative Editor for Story Writing
Wordcraft: a Human-AI Collaborative Editor for Story Writing
Andy Coenen
Luke Davis
Daphne Ippolito
Emily Reif
Ann Yuan
LLMAG
110
73
0
15 Jul 2021
The Spotlight: A General Method for Discovering Systematic Errors in
  Deep Learning Models
The Spotlight: A General Method for Discovering Systematic Errors in Deep Learning Models
G. dÉon
Jason dÉon
J. R. Wright
Kevin Leyton-Brown
79
75
0
01 Jul 2021
Intuitively Assessing ML Model Reliability through Example-Based
  Explanations and Editing Model Inputs
Intuitively Assessing ML Model Reliability through Example-Based Explanations and Editing Model Inputs
Harini Suresh
Kathleen M. Lewis
John Guttag
Arvind Satyanarayan
FAtt
88
26
0
17 Feb 2021
Human Evaluation of Spoken vs. Visual Explanations for Open-Domain QA
Human Evaluation of Spoken vs. Visual Explanations for Open-Domain QA
Ana Valeria González
Gagan Bansal
Angela Fan
Robin Jia
Yashar Mehdad
Srini Iyer
AAML
85
24
0
30 Dec 2020
TweetEval: Unified Benchmark and Comparative Evaluation for Tweet
  Classification
TweetEval: Unified Benchmark and Comparative Evaluation for Tweet Classification
Francesco Barbieri
Jose Camacho-Collados
Leonardo Neves
Luis Espinosa-Anke
VLM
93
728
0
23 Oct 2020
Does the Whole Exceed its Parts? The Effect of AI Explanations on
  Complementary Team Performance
Does the Whole Exceed its Parts? The Effect of AI Explanations on Complementary Team Performance
Gagan Bansal
Tongshuang Wu
Joyce Zhou
Raymond Fok
Besmira Nushi
Ece Kamar
Marco Tulio Ribeiro
Daniel S. Weld
104
602
0
26 Jun 2020
Consistent Estimators for Learning to Defer to an Expert
Consistent Estimators for Learning to Defer to an Expert
Hussein Mozannar
David Sontag
71
205
0
02 Jun 2020
Misplaced Trust: Measuring the Interference of Machine Learning in Human
  Decision-Making
Misplaced Trust: Measuring the Interference of Machine Learning in Human Decision-Making
Harini Suresh
Natalie Lao
Ilaria Liccardi
36
49
0
22 May 2020
Evaluating Explainable AI: Which Algorithmic Explanations Help Users
  Predict Model Behavior?
Evaluating Explainable AI: Which Algorithmic Explanations Help Users Predict Model Behavior?
Peter Hase
Joey Tianyi Zhou
FAtt
79
304
0
04 May 2020
"Why is 'Chicago' deceptive?" Towards Building Model-Driven Tutorials
  for Humans
"Why is 'Chicago' deceptive?" Towards Building Model-Driven Tutorials for Humans
Vivian Lai
Han Liu
Chenhao Tan
84
143
0
14 Jan 2020
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Nils Reimers
Iryna Gurevych
1.3K
12,332
0
27 Aug 2019
The Algorithmic Automation Problem: Prediction, Triage, and Human Effort
The Algorithmic Automation Problem: Prediction, Triage, and Human Effort
M. Raghu
Katy Blumer
G. Corrado
Jon M. Kleinberg
Ziad Obermeyer
S. Mullainathan
174
140
0
28 Mar 2019
On Human Predictions with Explanations and Predictions of Machine
  Learning Models: A Case Study on Deception Detection
On Human Predictions with Explanations and Predictions of Machine Learning Models: A Case Study on Deception Detection
Vivian Lai
Chenhao Tan
78
380
0
19 Nov 2018
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning
Feng Yu
Haofeng Chen
Xin Wang
Wenqi Xian
Yingying Chen
Fangchen Liu
Vashisht Madhavan
Trevor Darrell
VLM
348
2,164
0
12 May 2018
Predict Responsibly: Improving Fairness and Accuracy by Learning to
  Defer
Predict Responsibly: Improving Fairness and Accuracy by Learning to Defer
David Madras
T. Pitassi
R. Zemel
FaML
180
221
0
17 Nov 2017
End-to-end Learning of Driving Models from Large-scale Video Datasets
End-to-end Learning of Driving Models from Large-scale Video Datasets
Huazhe Xu
Yang Gao
Feng Yu
Trevor Darrell
163
827
0
04 Dec 2016
"Why Should I Trust You?": Explaining the Predictions of Any Classifier
"Why Should I Trust You?": Explaining the Predictions of Any Classifier
Marco Tulio Ribeiro
Sameer Singh
Carlos Guestrin
FAttFaML
1.2K
17,092
0
16 Feb 2016
1