ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.10645
  4. Cited By
AmbigQA: Answering Ambiguous Open-domain Questions

AmbigQA: Answering Ambiguous Open-domain Questions

22 April 2020
Sewon Min
Julian Michael
Hannaneh Hajishirzi
Luke Zettlemoyer
ArXivPDFHTML

Papers citing "AmbigQA: Answering Ambiguous Open-domain Questions"

50 / 214 papers shown
Title
The Art of Saying No: Contextual Noncompliance in Language Models
The Art of Saying No: Contextual Noncompliance in Language Models
Faeze Brahman
Sachin Kumar
Vidhisha Balachandran
Pradeep Dasigi
Valentina Pyatkin
...
Jack Hessel
Yulia Tsvetkov
Noah A. Smith
Yejin Choi
Hannaneh Hajishirzi
75
21
0
02 Jul 2024
AMBROSIA: A Benchmark for Parsing Ambiguous Questions into Database
  Queries
AMBROSIA: A Benchmark for Parsing Ambiguous Questions into Database Queries
Irina Saparina
Mirella Lapata
57
11
0
27 Jun 2024
Assessing "Implicit" Retrieval Robustness of Large Language Models
Assessing "Implicit" Retrieval Robustness of Large Language Models
Xiaoyu Shen
Rexhina Blloshmi
Dawei Zhu
Jiahuan Pei
Wei Zhang
RALM
KELM
53
0
0
26 Jun 2024
DEXTER: A Benchmark for open-domain Complex Question Answering using
  LLMs
DEXTER: A Benchmark for open-domain Complex Question Answering using LLMs
Venktesh V. Deepali Prabhu
Avishek Anand
RALM
CoGe
39
3
0
24 Jun 2024
Large Language Models Are Cross-Lingual Knowledge-Free Reasoners
Large Language Models Are Cross-Lingual Knowledge-Free Reasoners
Peng Hu
Sizhe Liu
Changjiang Gao
Xin Huang
Xue Han
Junlan Feng
Chao Deng
Shujian Huang
LRM
46
1
0
24 Jun 2024
ALiiCE: Evaluating Positional Fine-grained Citation Generation
ALiiCE: Evaluating Positional Fine-grained Citation Generation
Yilong Xu
Jinhua Gao
Xiaoming Yu
Baolong Bi
Huawei Shen
Xueqi Cheng
HILM
31
5
0
19 Jun 2024
Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and
  Metrics for Open Domain Question Answering in the Era of Large Language
  Models
Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models
Akchay Srivastava
Atif Memon
ELM
48
1
0
19 Jun 2024
Learning to Generate Answers with Citations via Factual Consistency
  Models
Learning to Generate Answers with Citations via Factual Consistency Models
Rami Aly
Zhiqiang Tang
Samson Tan
George Karypis
HILM
42
5
0
19 Jun 2024
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?
Seungbin Yang
chaeHun Park
Taehee Kim
Jaegul Choo
46
2
0
18 Jun 2024
MDCR: A Dataset for Multi-Document Conditional Reasoning
MDCR: A Dataset for Multi-Document Conditional Reasoning
Peter Baile Chen
Yi Zhang
Chunwei Liu
Sejal Gupta
Yoon Kim
Michael Cafarella
47
2
0
17 Jun 2024
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Seungone Kim
Juyoung Suk
Ji Yong Cho
Shayne Longpre
Chaeeun Kim
...
Sean Welleck
Graham Neubig
Moontae Lee
Kyungjae Lee
Minjoon Seo
ELM
ALM
LM&MA
105
31
0
09 Jun 2024
CaLM: Contrasting Large and Small Language Models to Verify Grounded
  Generation
CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation
I-Hung Hsu
Zifeng Wang
Long T. Le
Lesly Miculicich
Nanyun Peng
Chen-Yu Lee
Tomas Pfister
HILM
37
4
0
08 Jun 2024
To Believe or Not to Believe Your LLM
To Believe or Not to Believe Your LLM
Yasin Abbasi-Yadkori
Ilja Kuzborskij
András György
Csaba Szepesvári
UQCV
59
41
0
04 Jun 2024
Can Large Language Models Faithfully Express Their Intrinsic Uncertainty
  in Words?
Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words?
G. Yona
Roee Aharoni
Mor Geva
HILM
49
17
0
27 May 2024
FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research
FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research
Jiajie Jin
Yutao Zhu
Xinyu Yang
Chenghao Zhang
Zhicheng Dou
Chenghao Zhang
Tong Zhao
Zhao Yang
Zhicheng Dou
Ji-Rong Wen
VLM
85
49
0
22 May 2024
CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information
  Needs in Large Language Models
CLAMBER: A Benchmark of Identifying and Clarifying Ambiguous Information Needs in Large Language Models
Tong Zhang
Peixin Qin
Yang Deng
Chen Huang
Wenqiang Lei
Junhong Liu
Dingnan Jin
Hongru Liang
Tat-Seng Chua
35
8
0
20 May 2024
A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language
  Models
A Survey on RAG Meeting LLMs: Towards Retrieval-Augmented Large Language Models
Wenqi Fan
Yujuan Ding
Liang-bo Ning
Shijie Wang
Hengyun Li
Dawei Yin
Tat-Seng Chua
Qing Li
RALM
3DV
40
191
0
10 May 2024
ERAGent: Enhancing Retrieval-Augmented Language Models with Improved
  Accuracy, Efficiency, and Personalization
ERAGent: Enhancing Retrieval-Augmented Language Models with Improved Accuracy, Efficiency, and Personalization
Yunxiao Shi
Xing Zi
Zijing Shi
Haimin Zhang
Qiang Wu
Min Xu
RALM
38
16
0
06 May 2024
Beyond Relevance: Evaluate and Improve Retrievers on Perspective
  Awareness
Beyond Relevance: Evaluate and Improve Retrievers on Perspective Awareness
Xinran Zhao
Tong Chen
Sihao Chen
Hongming Zhang
Tongshuang Wu
34
7
0
04 May 2024
AmbigDocs: Reasoning across Documents on Different Entities under the
  Same Name
AmbigDocs: Reasoning across Documents on Different Entities under the Same Name
Yoonsang Lee
Xi Ye
Eunsol Choi
46
5
0
18 Apr 2024
Aligning Language Models to Explicitly Handle Ambiguity
Aligning Language Models to Explicitly Handle Ambiguity
Hyuhng Joon Kim
Youna Kim
Cheonbok Park
Junyeob Kim
Choonghyun Park
Kang Min Yoo
Sang-goo Lee
Taeuk Kim
36
14
0
18 Apr 2024
Confidence Calibration and Rationalization for LLMs via Multi-Agent
  Deliberation
Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation
Ruixin Yang
Dheeraj Rajagopal
S. Hayati
Bin Hu
Dongyeop Kang
LLMAG
43
5
0
14 Apr 2024
Learning to Plan and Generate Text with Citations
Learning to Plan and Generate Text with Citations
Constanza Fierro
Reinald Kim Amplayo
Fantine Huot
Nicola De Cao
Joshua Maynez
Shashi Narayan
Mirella Lapata
32
18
0
04 Apr 2024
CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions
  for RAG systems
CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems
Sara Rosenthal
Avirup Sil
Radu Florian
Salim Roukos
56
11
0
02 Apr 2024
Context Quality Matters in Training Fusion-in-Decoder for Extractive
  Open-Domain Question Answering
Context Quality Matters in Training Fusion-in-Decoder for Extractive Open-Domain Question Answering
Kosuke Akimoto
Kunihiro Takeoka
Masafumi Oyamada
38
1
0
21 Mar 2024
Dynamic Contexts for Generating Suggestion Questions in RAG Based
  Conversational Systems
Dynamic Contexts for Generating Suggestion Questions in RAG Based Conversational Systems
Anuja Tayal
Aman Tyagi
69
7
0
18 Mar 2024
Evaluating Biases in Context-Dependent Health Questions
Evaluating Biases in Context-Dependent Health Questions
Sharon Levy
T. Karver
William D. Adler
Michelle R. Kaufman
Mark Dredze
33
3
0
07 Mar 2024
AmbigNLG: Addressing Task Ambiguity in Instruction for NLG
AmbigNLG: Addressing Task Ambiguity in Instruction for NLG
Ayana Niwa
Hayate Iso
36
4
0
27 Feb 2024
PAQA: Toward ProActive Open-Retrieval Question Answering
PAQA: Toward ProActive Open-Retrieval Question Answering
Pierre Erbacher
Jian-Yun Nie
P. Preux
Laure Soulier
RALM
21
2
0
26 Feb 2024
Interpreting Predictive Probabilities: Model Confidence or Human Label
  Variation?
Interpreting Predictive Probabilities: Model Confidence or Human Label Variation?
Joris Baan
Raquel Fernández
Barbara Plank
Wilker Aziz
51
10
0
25 Feb 2024
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
Zicheng Lin
Zhibin Gou
Tian Liang
Ruilin Luo
Haowei Liu
Yujiu Yang
LRM
42
43
0
22 Feb 2024
Does the Generator Mind its Contexts? An Analysis of Generative Model
  Faithfulness under Context Transfer
Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer
Xinshuo Hu
Baotian Hu
Dongfang Li
Xiaoguang Li
Lifeng Shang
HILM
27
1
0
22 Feb 2024
What Evidence Do Language Models Find Convincing?
What Evidence Do Language Models Find Convincing?
Alexander Wan
Eric Wallace
Dan Klein
210
28
0
19 Feb 2024
Imagining a Future of Designing with AI: Dynamic Grounding, Constructive
  Negotiation, and Sustainable Motivation
Imagining a Future of Designing with AI: Dynamic Grounding, Constructive Negotiation, and Sustainable Motivation
Priyan Vaithilingam
Ian Arawjo
Elena L. Glassman
35
21
0
12 Feb 2024
Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature
  of Aggregated Factual Claims in Long-Form Generations
Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations
Cheng-Han Chiang
Hung-yi Lee
HILM
75
8
0
08 Feb 2024
A Roadmap to Pluralistic Alignment
A Roadmap to Pluralistic Alignment
Taylor Sorensen
Jared Moore
Jillian R. Fisher
Mitchell L. Gordon
Niloofar Mireshghallah
...
Liwei Jiang
Ximing Lu
Nouha Dziri
Tim Althoff
Yejin Choi
65
80
0
07 Feb 2024
Different Tastes of Entities: Investigating Human Label Variation in
  Named Entity Annotations
Different Tastes of Entities: Investigating Human Label Variation in Named Entity Annotations
Siyao Peng
Zihang Sun
Sebastian Loftus
Barbara Plank
30
3
0
02 Feb 2024
Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM
  Collaboration
Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration
Shangbin Feng
Weijia Shi
Yike Wang
Wenxuan Ding
Vidhisha Balachandran
Yulia Tsvetkov
29
78
0
01 Feb 2024
The Power of Noise: Redefining Retrieval for RAG Systems
The Power of Noise: Redefining Retrieval for RAG Systems
Florin Cuconasu
Giovanni Trappolini
F. Siciliano
Simone Filice
Cesare Campagnano
Y. Maarek
Nicola Tonellotto
Fabrizio Silvestri
RALM
50
145
0
26 Jan 2024
Genie: Achieving Human Parity in Content-Grounded Datasets Generation
Genie: Achieving Human Parity in Content-Grounded Datasets Generation
Asaf Yehudai
Boaz Carmeli
Y. Mass
Ofir Arviv
Nathaniel Mills
Assaf Toledo
Eyal Shnarch
Leshem Choshen
45
22
0
25 Jan 2024
How the Advent of Ubiquitous Large Language Models both Stymie and
  Turbocharge Dynamic Adversarial Question Generation
How the Advent of Ubiquitous Large Language Models both Stymie and Turbocharge Dynamic Adversarial Question Generation
Yoo Yeon Sung
Ishani Mondal
Jordan L. Boyd-Graber
30
0
0
20 Jan 2024
SAPT: A Shared Attention Framework for Parameter-Efficient Continual
  Learning of Large Language Models
SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language Models
Weixiang Zhao
Shilong Wang
Yulin Hu
Yanyan Zhao
Bing Qin
Xuanyu Zhang
Qing Yang
Dongliang Xu
Wanxiang Che
KELM
CLL
34
11
0
16 Jan 2024
Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering
  with Multi-Granularity Answers
Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers
G. Yona
Roee Aharoni
Mor Geva
ELM
44
11
0
09 Jan 2024
From Text to Multimodal: A Comprehensive Survey of Adversarial Example
  Generation in Question Answering Systems
From Text to Multimodal: A Comprehensive Survey of Adversarial Example Generation in Question Answering Systems
Gulsum Yigit
M. Amasyalı
AAML
25
0
0
26 Dec 2023
LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?
LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?
Fuheng Zhao
Lawrence Lim
Ishtiyaque Ahmad
D. Agrawal
A. El Abbadi
Amr El Abbadi
65
9
0
16 Dec 2023
Alignment for Honesty
Alignment for Honesty
Yuqing Yang
Ethan Chern
Xipeng Qiu
Graham Neubig
Pengfei Liu
44
30
0
12 Dec 2023
Dense X Retrieval: What Retrieval Granularity Should We Use?
Dense X Retrieval: What Retrieval Granularity Should We Use?
Tong Chen
Hongwei Wang
Sihao Chen
Wenhao Yu
Kaixin Ma
Xinran Zhao
Hongming Zhang
Dong Yu
27
30
0
11 Dec 2023
Towards leveraging LLMs for Conditional QA
Towards leveraging LLMs for Conditional QA
Syed-Amad Hussain
Parag Dakle
SaiKrishna Rallabandi
Preethi Raghavan
ELM
21
2
0
02 Dec 2023
Boot and Switch: Alternating Distillation for Zero-Shot Dense Retrieval
Boot and Switch: Alternating Distillation for Zero-Shot Dense Retrieval
Fan Jiang
Qiongkai Xu
Tom Drummond
Trevor Cohn
21
2
0
27 Nov 2023
Clarify When Necessary: Resolving Ambiguity Through Interaction with LMs
Clarify When Necessary: Resolving Ambiguity Through Interaction with LMs
Michael J.Q. Zhang
Eunsol Choi
34
30
0
16 Nov 2023
Previous
12345
Next