Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1806.03822
Cited By
Know What You Don't Know: Unanswerable Questions for SQuAD
11 June 2018
Pranav Rajpurkar
Robin Jia
Percy Liang
RALM
ELM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Know What You Don't Know: Unanswerable Questions for SQuAD"
50 / 586 papers shown
Title
TLDR: Token-Level Detective Reward Model for Large Vision Language Models
Deqing Fu
Tong Xiao
Rui Wang
Wang Zhu
Pengchuan Zhang
Guan Pang
Robin Jia
Lawrence Chen
68
5
0
07 Oct 2024
Integrative Decoding: Improve Factuality via Implicit Self-consistency
Yi Cheng
Xiao Liang
Yeyun Gong
Wen Xiao
Song Wang
...
Wenjie Li
Jian Jiao
Qi Chen
Peng Cheng
Wayne Xiong
HILM
59
1
0
02 Oct 2024
T3: A Novel Zero-shot Transfer Learning Framework Iteratively Training on an Assistant Task for a Target Task
Xindi Tong
Yujin Zhu
Shijian Fan
Liang Xu
64
1
0
26 Sep 2024
Contextual Breach: Assessing the Robustness of Transformer-based QA Models
Asir Saadat
Nahian Ibn Asad
Md Farhan Ishmam
AAML
46
0
0
17 Sep 2024
KodeXv0.1: A Family of State-of-the-Art Financial Large Language Models
Neel Rajani
Lilli Kiessling
Aleksandr Ogaltsov
Claus Lang
ALM
33
0
0
13 Sep 2024
SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists
Raoyuan Zhao
Abdullatif Köksal
Yihong Liu
Leonie Weissweiler
Anna Korhonen
Hinrich Schütze
SyDa
41
1
0
30 Aug 2024
Concise Thoughts: Impact of Output Length on LLM Reasoning and Cost
Sania Nayab
Giulio Rossolini
Giorgio Buttazzo
Nicolamaria Manes
F. Giacomelli
Nicolamaria Manes
Fabrizio Giacomelli
LRM
62
25
0
29 Jul 2024
NV-Retriever: Improving text embedding models with effective hard-negative mining
Gabriel de Souza P. Moreira
Radek Osmulski
Mengyao Xu
Ronay Ak
Benedikt Schifferer
Even Oldridge
RALM
49
31
0
22 Jul 2024
INDIC QA BENCHMARK: A Multilingual Benchmark to Evaluate Question Answering capability of LLMs for Indic Languages
A. Singh
Rudra Murthy
Vishwajeet Kumar
Jaydeep Sen
Ashish Mittal
Ganesh Ramakrishnan
45
6
0
18 Jul 2024
BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark
Nikita Chernyadev
Nicholas Backshall
Xiao Ma
Yunfan Lu
Younggyo Seo
Stephen James
22
11
0
10 Jul 2024
Prompting Techniques for Secure Code Generation: A Systematic Investigation
Catherine Tony
Nicolás E. Díaz Ferreyra
Markus Mutas
Salem Dhiff
Riccardo Scandariato
SILM
79
9
0
09 Jul 2024
An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models
Nandini Mundra
Aditya Nanda Kishore
Raj Dabre
Ratish Puduppully
Anoop Kunchukuttan
Mitesh Khapra
30
3
0
08 Jul 2024
A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations
Md Tahmid Rahman Laskar
Sawsan Alqahtani
M Saiful Bari
Mizanur Rahman
Mohammad Abdullah Matin Khan
...
Chee Wei Tan
Md. Rizwan Parvez
Enamul Hoque
Chenyu You
Jimmy Huang
ELM
ALM
31
28
0
04 Jul 2024
LLM Internal States Reveal Hallucination Risk Faced With a Query
Ziwei Ji
Delong Chen
Etsuko Ishii
Samuel Cahyawijaya
Yejin Bang
Bryan Wilie
Pascale Fung
HILM
LRM
39
21
0
03 Jul 2024
Preserving Multilingual Quality While Tuning Query Encoder on English Only
Oleg V. Vasilyev
Randy Sawaya
John Bohannon
35
1
0
01 Jul 2024
Paraphrase and Aggregate with Large Language Models for Minimizing Intent Classification Errors
Vikas Yadav
Zheng Tang
Vijay Srinivasan
34
8
0
24 Jun 2024
Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation
Yuchen Yang
Yingdong Shi
Cheems Wang
Xiantong Zhen
Yuxuan Shi
Jun Xu
40
1
0
24 Jun 2024
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs
Jannik Kossen
Jiatong Han
Muhammed Razzak
Lisa Schut
Shreshth A. Malik
Yarin Gal
HILM
60
35
0
22 Jun 2024
DeciMamba: Exploring the Length Extrapolation Potential of Mamba
Assaf Ben-Kish
Itamar Zimerman
Shady Abu Hussein
Nadav Cohen
Amir Globerson
Lior Wolf
Raja Giryes
Mamba
77
13
0
20 Jun 2024
Datasets for Multilingual Answer Sentence Selection
Matteo Gabburo
S. Campese
Federico Agostini
Alessandro Moschitti
46
0
0
14 Jun 2024
An Empirical Study of Mamba-based Language Models
R. Waleffe
Wonmin Byeon
Duncan Riach
Brandon Norick
V. Korthikanti
...
Vartika Singh
Jared Casper
Jan Kautz
M. Shoeybi
Bryan Catanzaro
63
65
0
12 Jun 2024
Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL
Zijin Hong
Zheng Yuan
Qinggang Zhang
Hao Chen
Junnan Dong
Feiran Huang
Xiao Huang
77
51
0
12 Jun 2024
Paraphrasing in Affirmative Terms Improves Negation Understanding
MohammadHossein Rezaei
Eduardo Blanco
44
1
0
11 Jun 2024
Symmetric Dot-Product Attention for Efficient Training of BERT Language Models
Martin Courtois
Malte Ostendorff
Leonhard Hennig
Georg Rehm
39
2
0
10 Jun 2024
Improved Out-of-Scope Intent Classification with Dual Encoding and Threshold-based Re-Classification
Hossam Zawbaa
Wael Rashwan
Sourav Dutta
H. Assem
OODD
46
0
0
30 May 2024
Evaluating the External and Parametric Knowledge Fusion of Large Language Models
Hao Zhang
Yuyang Zhang
Xiaoguang Li
Wenxuan Shi
Haonan Xu
...
Yasheng Wang
Lifeng Shang
Qun Liu
Yong-jin Liu
Ruiming Tang
KELM
45
4
0
29 May 2024
T-curator: a trust based curation tool for LOD logs
Dihia Lanasri
23
0
0
11 May 2024
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes
Damin Zhang
Yi Zhang
Geetanjali Bihani
Julia Taylor Rayz
53
2
0
06 May 2024
Explainability for Transparent Conversational Information-Seeking
Weronika Lajewska
Damiano Spina
Johanne Trippas
K. Balog
42
7
0
06 May 2024
Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning
Qizhou Chen
Taolin Zhang
Xiaofeng He
Dongyang Li
Chengyu Wang
Longtao Huang
Hui Xue
CLL
KELM
51
10
0
06 May 2024
Towards Unbiased Evaluation of Detecting Unanswerable Questions in EHRSQL
Yongjin Yang
Sihyeon Kim
Sangmook Kim
Gyubok Lee
Se-Young Yun
Edward Choi
41
2
0
29 Apr 2024
From Matching to Generation: A Survey on Generative Information Retrieval
Xiaoxi Li
Jiajie Jin
Yujia Zhou
Yuyao Zhang
Peitian Zhang
Yutao Zhu
Zhicheng Dou
3DV
84
46
0
23 Apr 2024
MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and Modalities
Kunxi Li
Tianyu Zhan
Kairui Fu
Shengyu Zhang
Kun Kuang
Jiwei Li
Zhou Zhao
Fei Wu
MoMe
24
0
0
20 Apr 2024
LoRA Dropout as a Sparsity Regularizer for Overfitting Control
Yang Lin
Xinyu Ma
Xu Chu
Yujie Jin
Zhibang Yang
Yasha Wang
Hong-yan Mei
52
19
0
15 Apr 2024
Unveiling LLM Evaluation Focused on Metrics: Challenges and Solutions
Taojun Hu
Xiao-Hua Zhou
ELM
41
12
0
14 Apr 2024
Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector
Andi Zhang
Tim Z. Xiao
Weiyang Liu
Robert Bamler
Damon J. Wischik
OODD
51
4
0
07 Apr 2024
Evaluating Large Language Models Using Contrast Sets: An Experimental Approach
Manish Sanwal
30
5
0
02 Apr 2024
CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models
Xuechen Liang
Meiling Tao
Yinghui Xia
Yiting Xie
Jun Wang
JingSong Yang
LLMAG
33
12
0
02 Apr 2024
ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages
Bhawna Piryani
Jamshid Mozafari
Adam Jatowt
RALM
48
8
0
26 Mar 2024
Universal Model in Online Customer Service
S. Pi
Cheng-Ping Hsieh
Qun Liu
Yuying Zhu
27
4
0
24 Feb 2024
ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
Yanan Wu
Jie Liu
Xingyuan Bu
Jiaheng Liu
Zhanhui Zhou
...
Haibin Chen
Tiezheng Ge
Wanli Ouyang
Wenbo Su
Bo Zheng
LRM
29
6
0
22 Feb 2024
Novi jezički modeli za srpski jezik
Mihailo vSkorić
23
0
0
22 Feb 2024
Qsnail: A Questionnaire Dataset for Sequential Question Generation
Yan Lei
Liang Pang
Yuanzhuo Wang
Huawei Shen
Xueqi Cheng
35
0
0
22 Feb 2024
S
e
2
Se^2
S
e
2
: Sequential Example Selection for In-Context Learning
Haoyu Liu
Jianfeng Liu
Shaohan Huang
Yuefeng Zhan
Hao Sun
Weiwei Deng
Furu Wei
Qi Zhang
33
3
0
21 Feb 2024
Contrastive Instruction Tuning
Tianyi Yan
Fei Wang
James Y. Huang
Wenxuan Zhou
Fan Yin
Aram Galstyan
Wenpeng Yin
Muhao Chen
ALM
27
5
0
17 Feb 2024
A Dataset of Open-Domain Question Answering with Multiple-Span Answers
Zhiyi Luo
Yingying Zhang
Shuyun Luo
Ying Zhao
Wentao Lyu
RALM
21
0
0
15 Feb 2024
Desiderata for the Context Use of Question Answering Systems
Sagi Shaier
Lawrence E Hunter
K. Wense
28
4
0
31 Jan 2024
Contextual Feature Extraction Hierarchies Converge in Large Language Models and the Brain
Gavin Mischler
Yinghao Aaron Li
Stephan Bickel
A. Mehta
N. Mesgarani
30
23
0
31 Jan 2024
How the Advent of Ubiquitous Large Language Models both Stymie and Turbocharge Dynamic Adversarial Question Generation
Yoo Yeon Sung
Ishani Mondal
Jordan L. Boyd-Graber
30
0
0
20 Jan 2024
Power in Numbers: Robust reading comprehension by finetuning with four adversarial sentences per example
Ariel Marcus
AAML
27
0
0
18 Jan 2024
Previous
1
2
3
4
5
...
10
11
12
Next