2310.12418
Cited By
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions
19 October 2023
Siru Ouyang
Shuohang Wang
Yang Liu
Ming Zhong
Yizhu Jiao
Dan Iter
Reid Pryzant
Chenguang Zhu
Heng Ji
Jiawei Han
Papers citing
"The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions"
25 / 25 papers shown
Benchmarking Retrieval-Augmented Generation for Chemistry
Xianrui Zhong
Bowen Jin
Siru Ouyang
Yanzhen Shen
Qiao Jin
Yin Fang
Zhiyong Lu
Jiawei Han
3DV
31
0
0
12 May 2025
Out of Style: RAG's Fragility to Linguistic Variation
Tianyu Cao
Neel Bhandari
Akhila Yerukola
Akari Asai
Maarten Sap
27
0
0
11 Apr 2025
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees
Zhiyuan Zeng
Yizhong Wang
Hannaneh Hajishirzi
Pang Wei Koh
ELM
55
5
0
11 Mar 2025
Better Aligned with Survey Respondents or Training Data? Unveiling Political Leanings of LLMs on U.S. Supreme Court Cases
Shanshan Xu
T. Y. S. S. Santosh
Yanai Elazar
Quirin Vogel
Barbara Plank
Matthias Grabmair
AILaw
83
0
0
25 Feb 2025
Presumed Cultural Identity: How Names Shape LLM Responses
Siddhesh Pawar
Arnav Arora
Lucie-Aimée Kaffee
Isabelle Augenstein
58
0
0
17 Feb 2025
Fostering Appropriate Reliance on Large Language Models: The Role of Explanations, Sources, and Inconsistencies
Sunnie S. Y. Kim
J. Vaughan
Q. V. Liao
Tania Lombrozo
Olga Russakovsky
105
5
0
12 Feb 2025
Clio: Privacy-Preserving Insights into Real-World AI Use
Alex Tamkin
Miles McCain
Kunal Handa
Esin Durmus
Liane Lovitt
...
Wes Mitchell
Shan Carter
Jack Clark
Jared Kaplan
Deep Ganguli
85
14
0
18 Dec 2024
CiMaTe: Citation Count Prediction Effectively Leveraging the Main Text
Jun Hirako
Ryohei Sasano
Koichi Takeda
32
2
0
06 Oct 2024
RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph
Siru Ouyang
W. Yu
Kaixin Ma
Zilin Xiao
Z. Zhang
Mengzhao Jia
J. Han
H. Zhang
Dong Yu
57
12
0
03 Oct 2024
Gender, Race, and Intersectional Bias in Resume Screening via Language Model Retrieval
Kyra Wilson
Aylin Caliskan
41
17
0
29 Jul 2024
WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries
Wenting Zhao
Tanya Goyal
Yu Ying Chiu
Liwei Jiang
Benjamin Newman
...
Khyathi Raghavi Chandu
Ronan Le Bras
Claire Cardie
Yuntian Deng
Yejin Choi
HILM
38
7
0
24 Jul 2024
Exploring Human-LLM Conversations: Mental Models and the Originator of Toxicity
Johannes Schneider
Arianna Casanova Flores
Anne-Catherine Kranz
50
2
0
08 Jul 2024
WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild
Bill Yuchen Lin
Yuntian Deng
Khyathi Raghavi Chandu
Faeze Brahman
Abhilasha Ravichander
Valentina Pyatkin
Nouha Dziri
Ronan Le Bras
Yejin Choi
42
69
0
07 Jun 2024
Quriosity: Analyzing Human Questioning Behavior and Causal Inquiry through Curiosity-Driven Queries
Roberto Ceraolo
Dmitrii Kharlapenko
Amélie Reymond
Rada Mihalcea
Mrinmaya Sachan
Bernhard Schölkopf
Zhijing Jin
CML
37
2
0
30 May 2024
Facilitating Opinion Diversity through Hybrid NLP Approaches
Michiel van der Meer
47
0
0
15 May 2024
DOLOMITES: Domain-Specific Long-Form Methodical Tasks
Chaitanya Malaviya
Priyanka Agrawal
Kuzman Ganchev
Pranesh Srinivasan
Fantine Huot
Jonathan Berant
Mark Yatskar
Dipanjan Das
Mirella Lapata
Chris Alberti
37
6
0
09 May 2024
SafetyPrompts: a Systematic Review of Open Datasets for Evaluating and Improving Large Language Model Safety
Paul Röttger
Fabio Pernisi
Bertie Vidgen
Dirk Hovy
ELM
KELM
58
31
0
08 Apr 2024
Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models
Haoran Sun
Lixin Liu
Junjie Li
Fengyu Wang
Baohua Dong
Ran Lin
Ruohui Huang
27
14
0
03 Apr 2024
KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions
Fangyuan Xu
Kyle Lo
Luca Soldaini
Bailey Kuehl
Eunsol Choi
David Wadden
37
6
0
06 Mar 2024
Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ
Carolin Holtermann
Paul Röttger
Timm Dill
Anne Lauscher
ELM
LRM
37
22
0
06 Mar 2024
Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models
Paul Röttger
Valentin Hofmann
Valentina Pyatkin
Musashi Hinck
Hannah Rose Kirk
Hinrich Schütze
Dirk Hovy
ELM
23
53
0
26 Feb 2024
NLP for Maternal Healthcare: Perspectives and Guiding Principles in the Age of LLMs
Maria Antoniak
Aakanksha Naik
Carla S. Alvarado
Lucy Lu Wang
Irene Y. Chen
AILaw
29
15
0
19 Dec 2023
Ontology Enrichment for Effective Fine-grained Entity Typing
Si-yuan Ouyang
Jiaxin Huang
Pranav Pillai
Yunyi Zhang
Yu Zhang
Jiawei Han
107
4
0
11 Oct 2023
Measuring Coding Challenge Competence With APPS
Dan Hendrycks
Steven Basart
Saurav Kadavath
Mantas Mazeika
Akul Arora
...
Collin Burns
Samir Puranik
Horace He
D. Song
Jacob Steinhardt
ELM
AIMat
ALM
208
624
0
20 May 2021
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
245
31,267
0
16 Jan 2013