Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1811.00937
Cited By
CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge
2 November 2018
Alon Talmor
Jonathan Herzig
Nicholas Lourie
Jonathan Berant
RALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"CommonsenseQA: A Question Answering Challenge Targeting Commonsense Knowledge"
50 / 401 papers shown
Title
HyperCLOVA X Technical Report
Kang Min Yoo
Jaegeun Han
Sookyo In
Heewon Jeon
Jisu Jeong
...
Hyunkyung Noh
Se-Eun Choi
Sang-Woo Lee
Jung Hwa Lim
Nako Sung
VLM
42
8
0
02 Apr 2024
Meta-Cognitive Analysis: Evaluating Declarative and Procedural Knowledge in Datasets and Large Language Models
Zhuoqun Li
Hongyu Lin
Yaojie Lu
Hao Xiang
Xianpei Han
Le Sun
41
1
0
14 Mar 2024
Yi: Open Foundation Models by 01.AI
01. AI
Alex Young
01.AI Alex Young
Bei Chen
Chao Li
...
Yue Wang
Yuxuan Cai
Zhenyu Gu
Zhiyuan Liu
Zonghong Dai
OSLM
LRM
150
511
0
07 Mar 2024
Learning to Maximize Mutual Information for Chain-of-Thought Distillation
Xin Chen
Hanxian Huang
Yanjun Gao
Yi Wang
Jishen Zhao
Ke Ding
45
12
0
05 Mar 2024
Right for Right Reasons: Large Language Models for Verifiable Commonsense Knowledge Graph Question Answering
Armin Toroghi
Willis Guo
Mohammad Mahdi Torabi pour
Scott Sanner
LRM
31
8
0
03 Mar 2024
Prediction-Powered Ranking of Large Language Models
Ivi Chatzi
Eleni Straitouri
Suhas Thejaswi
Manuel Gomez Rodriguez
ALM
29
5
0
27 Feb 2024
Automating Dataset Updates Towards Reliable and Timely Evaluation of Large Language Models
Jiahao Ying
Yixin Cao
Yushi Bai
Qianru Sun
Bo Wang
Wei Tang
Zhaojun Ding
Yizhe Yang
Xuanjing Huang
Shuicheng Yan
KELM
29
7
0
19 Feb 2024
Uncertainty quantification in fine-tuned LLMs using LoRA ensembles
Oleksandr Balabanov
Hampus Linander
UQCV
41
15
0
19 Feb 2024
LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation
Ikuya Yamada
Ryokan Ri
KELM
25
0
0
18 Feb 2024
Puzzle Solving using Reasoning of Large Language Models: A Survey
Panagiotis Giadikiaroglou
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
ELM
ReLM
LRM
26
27
0
17 Feb 2024
Navigating the Dual Facets: A Comprehensive Evaluation of Sequential Memory Editing in Large Language Models
Zihao Lin
Mohammad Beigi
Hongxuan Li
Yufan Zhou
Yuxiang Zhang
Qifan Wang
Wenpeng Yin
Lifu Huang
KELM
29
8
0
16 Feb 2024
Higher Layers Need More LoRA Experts
Chongyang Gao
Kezhen Chen
Jinmeng Rao
Baochen Sun
Ruibo Liu
Daiyi Peng
Yawen Zhang
Xiaoyuan Guo
Jie Yang
V. Subrahmanian
MoE
26
37
0
13 Feb 2024
EntGPT: Entity Linking with Generative Large Language Models
Yifan Ding
Amrit Poudel
Qingkai Zeng
Tim Weninger
Balaji Veeramani
Sanmitra Bhattacharya
ReLM
KELM
LRM
46
4
0
09 Feb 2024
Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning
Tinghui Zhu
Kai Zhang
Jian Xie
Yu-Chuan Su
LRM
28
15
0
31 Jan 2024
Navigating the OverKill in Large Language Models
Chenyu Shi
Xiao Wang
Qiming Ge
Songyang Gao
Xianjun Yang
Tao Gui
Qi Zhang
Xuanjing Huang
Xun Zhao
Dahua Lin
27
11
0
31 Jan 2024
Large Malaysian Language Model Based on Mistral for Enhanced Local Language Understanding
Husein Zolkepli
Aisyah Razak
Kamarul Adha
Ariff Nazhan
MoE
29
3
0
24 Jan 2024
Temporal Blind Spots in Large Language Models
Jonas Wallat
Adam Jatowt
Avishek Anand
41
3
0
22 Jan 2024
In-context Learning with Retrieved Demonstrations for Language Models: A Survey
an Luo
Xin Xu
Yue Liu
Panupong Pasupat
Mehran Kazemi
RALM
34
55
0
21 Jan 2024
SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language Models
Weixiang Zhao
Shilong Wang
Yulin Hu
Yanyan Zhao
Bing Qin
Xuanyu Zhang
Qing Yang
Dongliang Xu
Wanxiang Che
KELM
CLL
39
12
0
16 Jan 2024
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
Nikhil Sardana
Jacob P. Portes
Sasha Doubov
Jonathan Frankle
LRM
251
73
0
31 Dec 2023
How to Prune Your Language Model: Recovering Accuracy on the "Sparsity May Cry'' Benchmark
Eldar Kurtic
Torsten Hoefler
Dan Alistarh
42
3
0
21 Dec 2023
Zebra: Extending Context Window with Layerwise Grouped Local-Global Attention
Kaiqiang Song
Xiaoyang Wang
Sangwoo Cho
Xiaoman Pan
Dong Yu
36
7
0
14 Dec 2023
NovaCOMET: Open Commonsense Foundation Models with Symbolic Knowledge Distillation
Peter West
Ronan Le Bras
Taylor Sorensen
Bill Yuchen Lin
Liwei Jiang
...
Khyathi Raghavi Chandu
Jack Hessel
Ashutosh Baheti
Chandra Bhagavatula
Yejin Choi
VLM
31
10
0
10 Dec 2023
Multi-Defendant Legal Judgment Prediction via Hierarchical Reasoning
Yougang Lyu
Jitai Hao
Zihan Wang
Kai Zhao
Shen Gao
Pengjie Ren
Zhumin Chen
Fang Wang
Zhaochun Ren
AILaw
27
9
0
10 Dec 2023
AlignedCoT: Prompting Large Language Models via Native-Speaking Demonstrations
Zhicheng YANG
Yinya Huang
Jing Xiong
Liang Feng
Xiaodan Liang
Yiwei Wang
Jing Tang
LRM
33
1
0
22 Nov 2023
Do Smaller Language Models Answer Contextualised Questions Through Memorisation Or Generalisation?
Tim Hartill
Joshua Bensemann
Michael Witbrock
Patricia Riddle
KELM
30
0
0
21 Nov 2023
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
ZhuoSheng Zhang
Yao Yao
Aston Zhang
Xiangru Tang
Xinbei Ma
...
Yiming Wang
Mark B. Gerstein
Rui Wang
Gongshen Liu
Hai Zhao
LLMAG
LM&Ro
LRM
42
53
0
20 Nov 2023
More Samples or More Prompts? Exploring Effective In-Context Sampling for LLM Few-Shot Prompt Engineering
Bingsheng Yao
Guiming Hardy Chen
Ruishi Zou
Yuxuan Lu
Jiachen Li
Shao Zhang
Yisi Sang
Sijia Liu
James A. Hendler
Dakuo Wang
45
13
0
16 Nov 2023
Self-Contradictory Reasoning Evaluation and Detection
Ziyi Liu
Isabelle Lee
Yongkang Du
Soumya Sanyal
Jieyu Zhao
LRM
32
2
0
16 Nov 2023
ToolTalk: Evaluating Tool-Usage in a Conversational Setting
Nicholas Farn
Richard Shin
LLMAG
ELM
40
14
0
15 Nov 2023
Towards A Unified View of Answer Calibration for Multi-Step Reasoning
Shumin Deng
Ningyu Zhang
Nay Oo
Bryan Hooi
LRM
50
2
0
15 Nov 2023
Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation
Vaishnavi Shrivastava
Percy Liang
Ananya Kumar
28
28
0
15 Nov 2023
Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection Method
Yukun Zhao
Lingyong Yan
Weiwei Sun
Guoliang Xing
Chong Meng
Shuaiqiang Wang
Zhicong Cheng
Zhaochun Ren
Dawei Yin
33
37
0
27 Oct 2023
Emulating the Human Mind: A Neural-symbolic Link Prediction Model with Fast and Slow Reasoning and Filtered Rules
Mohammad Hossein Khojasteh
Najmeh Torabian
Ali Farjami
Saeid Hosseini
B. Minaei-Bidgoli
LRM
30
0
0
21 Oct 2023
KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models
Yuyang Bai
Shangbin Feng
Vidhisha Balachandran
Zhaoxuan Tan
Shiqi Lou
Tianxing He
Yulia Tsvetkov
ELM
42
2
0
15 Oct 2023
Instruction Tuning with Human Curriculum
Bruce W. Lee
Hyunsoo Cho
Kang Min Yoo
47
3
0
14 Oct 2023
Concise and Organized Perception Facilitates Reasoning in Large Language Models
Junjie Liu
Shaotian Yan
Chen Shen
Zhengdong Xiao
Wenxiao Wang
Jieping Ye
Jieping Ye
LRM
26
1
0
05 Oct 2023
Think before you speak: Training Language Models With Pause Tokens
Sachin Goyal
Ziwei Ji
A. S. Rawat
A. Menon
Sanjiv Kumar
Vaishnavh Nagarajan
LRM
26
97
0
03 Oct 2023
Large Language Models Cannot Self-Correct Reasoning Yet
Jie Huang
Xinyun Chen
Swaroop Mishra
Huaixiu Steven Zheng
Adams Wei Yu
Xinying Song
Denny Zhou
ReLM
LRM
38
424
0
03 Oct 2023
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration
Qiushi Sun
Zhangyue Yin
Xiang Li
Zhiyong Wu
Xipeng Qiu
Lingpeng Kong
LRM
LLMAG
28
44
0
30 Sep 2023
Don't throw away your value model! Generating more preferable text with Value-Guided Monte-Carlo Tree Search decoding
Jiacheng Liu
Andrew Cohen
Ramakanth Pasunuru
Yejin Choi
Hannaneh Hajishirzi
Asli Celikyilmaz
24
24
0
26 Sep 2023
Knowledgeable In-Context Tuning: Exploring and Exploiting Factual Knowledge for In-Context Learning
Rongxiang Weng
Chengyu Wang
Chuanqi Tan
Jun Huang
Ming Gao
KELM
34
4
0
26 Sep 2023
Experience and Prediction: A Metric of Hardness for a Novel Litmus Test
Nicos Isaak
Loizos Michael
41
3
0
05 Sep 2023
Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions
Pouya Pezeshkpour
Estevam R. Hruschka
LRM
22
131
0
22 Aug 2023
Teaching Smaller Language Models To Generalise To Unseen Compositional Questions
Tim Hartill
N. Tan
Michael Witbrock
Patricia J. Riddle
ReLM
KELM
LRM
34
2
0
02 Aug 2023
A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis
Izzeddin Gur
Hiroki Furuta
Austin Huang
Mustafa Safdari
Yutaka Matsuo
Douglas Eck
Aleksandra Faust
LM&Ro
LLMAG
39
201
0
24 Jul 2023
Thrust: Adaptively Propels Large Language Models with External Knowledge
Xinran Zhao
Hongming Zhang
Xiaoman Pan
Wenlin Yao
Dong Yu
Jianshu Chen
KELM
60
5
0
19 Jul 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH
ALM
132
11,144
0
18 Jul 2023
Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attribute Manipulation
Letian Peng
Yuwei Zhang
Jingbo Shang
LRM
29
7
0
14 Jul 2023
Chain of Thought Prompting Elicits Knowledge Augmentation
Di Wu
Jing Zhang
Xinmei Huang
LRM
30
31
0
04 Jul 2023
Previous
1
2
3
4
5
6
7
8
9
Next