ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.07984
  4. Cited By
SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative
  Examples

SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative Examples

13 May 2023
Deqing Fu
Ameya Godbole
Robin Jia
ArXivPDFHTML

Papers citing "SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative Examples"

14 / 14 papers shown
Title
Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions
Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions
Wang Zhu
Tianqi Chen
Ching Ying Lin
Jade Law
Mazen Jizzini
Jorge J. Nieva
Ruishan Liu
Robin Jia
34
0
0
15 Apr 2025
FactGuard: Leveraging Multi-Agent Systems to Generate Answerable and Unanswerable Questions for Enhanced Long-Context LLM Extraction
FactGuard: Leveraging Multi-Agent Systems to Generate Answerable and Unanswerable Questions for Enhanced Long-Context LLM Extraction
Qian Zhang
Fang Li
Jie Wang
Lingfeng Qiao
Yifei Yu
Di Yin
X. Sun
RALM
65
0
0
08 Apr 2025
TLDR: Token-Level Detective Reward Model for Large Vision Language Models
TLDR: Token-Level Detective Reward Model for Large Vision Language Models
Deqing Fu
Tong Xiao
Rui Wang
Wang Zhu
Pengchuan Zhang
Guan Pang
Robin Jia
Lawrence Chen
60
5
0
07 Oct 2024
Towards Robust Extractive Question Answering Models: Rethinking the
  Training Methodology
Towards Robust Extractive Question Answering Models: Rethinking the Training Methodology
Son Quoc Tran
Matt Kretchmar
OOD
19
0
0
29 Sep 2024
A Survey on Natural Language Counterfactual Generation
A Survey on Natural Language Counterfactual Generation
Yongjie Wang
Xiaoqi Qiu
Yu Yue
Xu Guo
Zhiwei Zeng
Yuhong Feng
Zhiqi Shen
36
5
0
04 Jul 2024
ACCORD: Closing the Commonsense Measurability Gap
ACCORD: Closing the Commonsense Measurability Gap
François Roewer-Després
Jinyue Feng
Zining Zhu
Frank Rudzicz
LRM
48
0
0
04 Jun 2024
DreamSync: Aligning Text-to-Image Generation with Image Understanding
  Feedback
DreamSync: Aligning Text-to-Image Generation with Image Understanding Feedback
Jiao Sun
Deqing Fu
Yushi Hu
Su Wang
Royi Rassin
...
Dana Alon
Charles Herrmann
Sjoerd van Steenkiste
Ranjay Krishna
Cyrus Rashtchian
EGVM
30
39
0
29 Nov 2023
NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer
  Data Augmentation
NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data Augmentation
Phillip Howard
Gadi Singer
Vasudev Lal
Yejin Choi
Swabha Swayamdipta
CML
55
25
0
22 Oct 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
313
11,915
0
04 Mar 2022
Tailor: Generating and Perturbing Text with Semantic Controls
Tailor: Generating and Perturbing Text with Semantic Controls
Alexis Ross
Tongshuang Wu
Hao Peng
Matthew E. Peters
Matt Gardner
136
77
0
15 Jul 2021
Competency Problems: On Finding and Removing Artifacts in Language Data
Competency Problems: On Finding and Removing Artifacts in Language Data
Matt Gardner
William Merrill
Jesse Dodge
Matthew E. Peters
Alexis Ross
Sameer Singh
Noah A. Smith
166
107
0
17 Apr 2021
Which Linguist Invented the Lightbulb? Presupposition Verification for
  Question-Answering
Which Linguist Invented the Lightbulb? Presupposition Verification for Question-Answering
Najoung Kim
Ellie Pavlick
Burcu Karagol Ayan
Deepak Ramachandran
70
43
0
02 Jan 2021
Linguistically-Informed Transformations (LIT): A Method for
  Automatically Generating Contrast Sets
Linguistically-Informed Transformations (LIT): A Method for Automatically Generating Contrast Sets
Chuanrong Li
Lin Shengshuo
Leo Z. Liu
Xinyi Wu
Xuhui Zhou
Shane Steinert-Threlkeld
VLM
130
38
0
16 Oct 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,956
0
20 Apr 2018
1