Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2212.08410
Cited By
Teaching Small Language Models to Reason
16 December 2022
Lucie Charlotte Magister
Jonathan Mallinson
Jakub Adamek
Eric Malmi
Aliaksei Severyn
LRM
AI4CE
ReLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Teaching Small Language Models to Reason"
50 / 191 papers shown
Title
Path-Consistency: Prefix Enhancement for Efficient Inference in LLM
Jiace Zhu
Yingtao Shen
Jie Zhao
An Zou
LLMAG
LRM
27
4
0
25 Aug 2024
Fine-tuning Smaller Language Models for Question Answering over Financial Documents
Karmvir Singh Phogat
Sai Akhil Puranam
Sridhar Dasaratha
Chetan Harsha
Shashishekar Ramakrishna
LRM
31
2
0
22 Aug 2024
Using Advanced LLMs to Enhance Smaller LLMs: An Interpretable Knowledge Distillation Approach
Tong Wang
K. Sudhir
Dat Hong
36
1
0
13 Aug 2024
Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations
Leo Donisch
Sigurd Schacht
Carsten Lanquillon
30
2
0
06 Aug 2024
Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
Tianduo Wang
Shichen Li
Wei Lu
LRM
AI4CE
45
14
1
25 Jul 2024
Data-Centric Human Preference Optimization with Rationales
H. Just
Ming Jin
Anit Kumar Sahu
Huy Phan
Ruoxi Jia
52
3
0
19 Jul 2024
LAPIS: Language Model-Augmented Police Investigation System
Heedou Kim
Dain Kim
Jiwoo Lee
Chanwoong Yoon
Donghee Choi
Mogan Gim
Jaewoo Kang
RALM
28
1
0
19 Jul 2024
Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
Jinrui Zhang
Teng Wang
Haigang Zhang
Ping Lu
Feng Zheng
MLLM
LRM
VLM
34
3
0
16 Jul 2024
Key-Point-Driven Mathematical Reasoning Distillation of Large Language Model
Xunyu Zhu
Jian Li
Can Ma
Weiping Wang
LRM
41
0
0
14 Jul 2024
Retrieved In-Context Principles from Previous Mistakes
Hao Sun
Yong-jia Jiang
Bo Wang
Yingyan Hou
Yan Zhang
Pengjun Xie
Fei Huang
60
1
0
08 Jul 2024
RVISA: Reasoning and Verification for Implicit Sentiment Analysis
Wenna Lai
H. Xie
Guandong Xu
Qing Li
LRM
39
1
0
02 Jul 2024
Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application
Chuanpeng Yang
Wang Lu
Yao Zhu
Yidong Wang
Qian Chen
Chenlong Gao
Bingjie Yan
Yiqiang Chen
ALM
KELM
44
22
0
02 Jul 2024
Abstraction-of-Thought Makes Language Models Better Reasoners
Ruixin Hong
Hongming Zhang
Xiaoman Pan
Dong Yu
Changshui Zhang
LRM
53
4
0
18 Jun 2024
Through the Thicket: A Study of Number-Oriented LLMs derived from Random Forest Models
M. Romaszewski
Przemysław Sekuła
P. Głomb
M. Cholewa
Katarzyna Kołodziej
37
0
0
07 Jun 2024
mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models
Huiyuan Lai
Malvina Nissim
LRM
41
14
0
04 Jun 2024
SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM
Quandong Wang
Yuxuan Yuan
Xiaoyu Yang
Ruike Zhang
Kang Zhao
Wei Liu
Jian Luan
Daniel Povey
Bin Wang
49
0
0
03 Jun 2024
Improve Student's Reasoning Generalizability through Cascading Decomposed CoTs Distillation
Chengwei Dai
Kun Li
Wei Zhou
Song Hu
LRM
49
3
0
30 May 2024
Beyond Imitation: Learning Key Reasoning Steps from Dual Chain-of-Thoughts in Reasoning Distillation
Chengwei Dai
Kun Li
Wei Zhou
Song Hu
LRM
43
5
0
30 May 2024
MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions
Zhenwen Liang
Dian Yu
Wenhao Yu
Wenlin Yao
Zhihan Zhang
Xiangliang Zhang
Dong Yu
LRM
45
9
0
29 May 2024
Keypoint-based Progressive Chain-of-Thought Distillation for LLMs
Kaituo Feng
Changsheng Li
Xiaolu Zhang
Jun Zhou
Ye Yuan
Guoren Wang
LRM
47
2
0
25 May 2024
Image-of-Thought Prompting for Visual Reasoning Refinement in Multimodal Large Language Models
Qiji Zhou
Ruochen Zhou
Zike Hu
Panzhong Lu
Siyang Gao
Yue Zhang
LRM
46
13
0
22 May 2024
Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling
Guangmin Zheng
Jin Wang
Xiaobing Zhou
Xuejie Zhang
LRM
38
2
0
16 May 2024
QCRD: Quality-guided Contrastive Rationale Distillation for Large Language Models
Wei Wang
Zhaowei Li
Qi Xu
Yiqing Cai
Hang Song
Qi Qi
Ran Zhou
Zhida Huang
Tao Wang
Li Xiao
ALM
35
1
0
14 May 2024
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning
Shuo Yin
Weihao You
Zhilong Ji
Guoqiang Zhong
Jinfeng Bai
LRM
SyDa
37
9
0
13 May 2024
Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models
Leonardo Ranaldi
André Freitas
LRM
ReLM
32
8
0
01 May 2024
From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models
Qi He
Jie Zeng
Qianxi He
Jiaqing Liang
Yanghua Xiao
32
10
0
24 Apr 2024
A Survey on Efficient Inference for Large Language Models
Zixuan Zhou
Xuefei Ning
Ke Hong
Tianyu Fu
Jiaming Xu
...
Shengen Yan
Guohao Dai
Xiao-Ping Zhang
Yuhan Dong
Yu-Xiang Wang
46
83
0
22 Apr 2024
PARAMANU-GANITA: Can Small Math Language Models Rival with Large Language Models on Mathematical Reasoning?
Mitodru Niyogi
Arnab Bhattacharya
LRM
ReLM
44
0
0
22 Apr 2024
Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?
Chengwei Qin
Wenhan Xia
Tan Wang
Fangkai Jiao
Yuchen Hu
Bosheng Ding
Ruirui Chen
Shafiq R. Joty
LRM
37
3
0
19 Apr 2024
Distilling Reasoning Ability from Large Language Models with Adaptive Thinking
Xiao Chen
Sihang Zhou
K. Liang
Xinwang Liu
ReLM
LRM
37
2
0
14 Apr 2024
Can only LLMs do Reasoning?: Potential of Small Language Models in Task Planning
Gawon Choi
Hyemin Ahn
LM&Ro
LRM
34
1
0
05 Apr 2024
SAAS: Solving Ability Amplification Strategy for Enhanced Mathematical Reasoning in Large Language Models
Hyeonwoo Kim
Gyoungjin Gim
Yungi Kim
Jihoo Kim
Byungju Kim
Wonseok Lee
Chanjun Park
ReLM
LRM
34
1
0
05 Apr 2024
Emergent Abilities in Reduced-Scale Generative Language Models
Sherin Muckatira
Vijeta Deshpande
Vladislav Lialin
Anna Rumshisky
ReLM
ELM
LRM
38
4
0
02 Apr 2024
Enhancing Reasoning Capacity of SLM using Cognitive Enhancement
Jonathan Pan
Swee Liang Wong
Xin Wei Chia
Yidi Yuan
LRM
37
0
0
01 Apr 2024
Mitigating Misleading Chain-of-Thought Reasoning with Selective Filtering
Yexin Wu
Zhuosheng Zhang
Hai Zhao
LRM
27
3
0
28 Mar 2024
HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models
H. Nghiem
Hal Daumé
39
1
0
18 Mar 2024
Self-Consistency Boosts Calibration for Math Reasoning
Ante Wang
Linfeng Song
Ye Tian
Baolin Peng
Lifeng Jin
Haitao Mi
Jinsong Su
Dong Yu
LRM
16
5
0
14 Mar 2024
Can Small Language Models be Good Reasoners for Sequential Recommendation?
Yuling Wang
Changxin Tian
Binbin Hu
Yanhua Yu
Ziqi Liu
Qing Cui
Jun Zhou
Liang Pang
Xiao Wang
LRM
48
24
0
07 Mar 2024
Learning to Maximize Mutual Information for Chain-of-Thought Distillation
Xin Chen
Hanxian Huang
Yanjun Gao
Yi Wang
Jishen Zhao
Ke Ding
35
11
0
05 Mar 2024
Self-Consistent Reasoning-based Aspect-Sentiment Quad Prediction with Extract-Then-Assign Strategy
Jieyong Kim
Ryang Heo
Yongsik Seo
SeongKu Kang
Jinyoung Yeo
Dongha Lee
ReLM
LRM
36
4
0
01 Mar 2024
Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding
Benjamin Bergner
Andrii Skliar
Amelie Royer
Tijmen Blankevoort
Yuki Markus Asano
B. Bejnordi
58
5
0
26 Feb 2024
Large Language Models for Data Annotation: A Survey
Zhen Tan
Dawei Li
Song Wang
Alimohammad Beigi
Bohan Jiang
Amrita Bhattacharjee
Mansooreh Karami
Wenlin Yao
Lu Cheng
Huan Liu
SyDa
56
50
0
21 Feb 2024
A Survey on Knowledge Distillation of Large Language Models
Xiaohan Xu
Ming Li
Chongyang Tao
Tao Shen
Reynold Cheng
Jinyang Li
Can Xu
Dacheng Tao
Dinesh Manocha
KELM
VLM
44
101
0
20 Feb 2024
ELAD: Explanation-Guided Large Language Models Active Distillation
Yifei Zhang
Bo Pan
Chen Ling
Yuntong Hu
Liang Zhao
46
5
0
20 Feb 2024
Distilling Large Language Models for Text-Attributed Graph Learning
Bo Pan
Zhengwu Zhang
Yifei Zhang
Yuntong Hu
Liang Zhao
38
13
0
19 Feb 2024
I Learn Better If You Speak My Language: Understanding the Superior Performance of Fine-Tuning Large Language Models with LLM-Generated Responses
Xuan Ren
Biao Wu
Lingqiao Liu
33
5
0
17 Feb 2024
Enhancing Numerical Reasoning with the Guidance of Reliable Reasoning Processes
Dingzirui Wang
Longxu Dou
Xuanliang Zhang
Qingfu Zhu
Wanxiang Che
LRM
39
0
0
16 Feb 2024
Model Compression and Efficient Inference for Large Language Models: A Survey
Wenxiao Wang
Wei Chen
Yicong Luo
Yongliu Long
Zhengkai Lin
Liye Zhang
Binbin Lin
Deng Cai
Xiaofei He
MQ
41
48
0
15 Feb 2024
Show Me How It's Done: The Role of Explanations in Fine-Tuning Language Models
Mohamad Ballout
U. Krumnack
Gunther Heidemann
Kai-Uwe Kuehnberger
LRM
26
3
0
12 Feb 2024
A Survey on Transformer Compression
Yehui Tang
Yunhe Wang
Jianyuan Guo
Zhijun Tu
Kai Han
Hailin Hu
Dacheng Tao
37
28
0
05 Feb 2024
Previous
1
2
3
4
Next