Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.11291
Cited By
Puzzle Solving using Reasoning of Large Language Models: A Survey
17 February 2024
Panagiotis Giadikiaroglou
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
ELM
ReLM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Puzzle Solving using Reasoning of Large Language Models: A Survey"
28 / 28 papers shown
Title
A Short Survey on Small Reasoning Models: Training, Inference, Applications and Research Directions
Chengyu Wang
Taolin Zhang
Richang Hong
Jun Huang
ReLM
LRM
39
1
0
12 Apr 2025
AILS-NTUA at SemEval-2025 Task 4: Parameter-Efficient Unlearning for Large Language Models using Data Chunking
Iraklis Premptis
Maria Lymperaiou
Giorgos Filandrianos
Orfeas Menis-Mastromichalakis
Athanasios Voulodimos
Giorgos Stamou
MU
48
0
0
04 Mar 2025
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving
Guizhen Chen
Weiwen Xu
Hao Zhang
Hou Pong Chan
Chaoqun Liu
Lidong Bing
Deli Zhao
Anh Tuan Luu
Yu Rong
ReLM
LRM
61
3
0
27 Feb 2025
Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization
Ru Wang
Wei Huang
Selena Song
Haoyu Zhang
Yusuke Iwasawa
Y. Matsuo
Jiaxian Guo
OODD
LRM
69
2
0
25 Feb 2025
Zero-shot Model-based Reinforcement Learning using Large Language Models
Abdelhakim Benechehab
Youssef Attia El Hili
Ambroise Odonnat
Oussama Zekri
Albert Thomas
Giuseppe Paolo
Maurizio Filippone
I. Redko
Balázs Kégl
OffRL
62
1
0
17 Feb 2025
QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs
Mohammad Aflah Khan
Neemesh Yadav
Sarah Masud
Md. Shad Akhtar
74
0
0
16 Dec 2024
On Memorization of Large Language Models in Logical Reasoning
Chulin Xie
Yangsibo Huang
Chiyuan Zhang
Da Yu
Xinyun Chen
Bill Yuchen Lin
Bo Li
Badih Ghazi
Ravi Kumar
LRM
53
20
0
30 Oct 2024
RISCORE: Enhancing In-Context Riddle Solving in Language Models through Context-Reconstructed Example Augmentation
Ioannis Panagiotopoulos
Giorgos Filandrianos
Maria Lymperaiou
Giorgos Stamou
LRM
ReLM
33
0
0
24 Sep 2024
Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles
Kulin Shah
Nishanth Dikkala
Xin Wang
Rina Panigrahy
ELM
ReLM
LRM
34
9
0
16 Sep 2024
Enhancing adversarial robustness in Natural Language Inference using explanations
Alexandros Koulakos
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
SILM
AAML
37
0
0
11 Sep 2024
Harmonic Reasoning in Large Language Models
Anna Kruspe
LRM
24
0
0
09 Sep 2024
Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses
Gabriele Sarti
Tommaso Caselli
Malvina Nissim
Arianna Bisazza
ReLM
LRM
31
1
0
01 Aug 2024
Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter?
Nemika Tyagi
Mihir Parmar
Mohith Kulkarni
Aswin Rrv
Nisarg Patel
Mutsumi Nakamura
Arindam Mitra
Chitta Baral
LRM
35
6
0
20 Jul 2024
Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models
Philipp Mondorf
Barbara Plank
HILM
LRM
39
4
0
18 Jun 2024
A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners
Bowen Jiang
Yangxinyu Xie
Zhuoqun Hao
Xiaomeng Wang
Tanwi Mallick
Weijie J. Su
Camillo J. Taylor
Dan Roth
LRM
41
40
0
16 Jun 2024
Easy Problems That LLMs Get Wrong
Sean Williams
James Huckle
LRM
24
10
0
30 May 2024
AILS-NTUA at SemEval-2024 Task 9: Cracking Brain Teasers: Transformer Models for Lateral Thinking Puzzles
Ioannis Panagiotopoulos
Giorgos Filandrianos
Maria Lymperaiou
Giorgos Stamou
28
1
0
01 Apr 2024
Reinforcement Learning from LLM Feedback to Counteract Goal Misgeneralization
Houda Nait El Barj
Théophile Sautory
25
2
0
14 Jan 2024
Towards Better Chain-of-Thought Prompting Strategies: A Survey
Zihan Yu
Liang He
Zhen Wu
Xinyu Dai
Jiajun Chen
LRM
126
44
0
08 Oct 2023
Large Language Models (GPT) Struggle to Answer Multiple-Choice Questions about Code
Jaromír Šavelka
Arav Agarwal
Chris Bogart
M. Sakr
ELM
60
50
0
09 Mar 2023
Complexity-Based Prompting for Multi-Step Reasoning
Yao Fu
Hao-Chun Peng
Ashish Sabharwal
Peter Clark
Tushar Khot
ReLM
LRM
162
412
0
03 Oct 2022
CC-Riddle: A Question Answering Dataset of Chinese Character Riddles
Fan Xu
Yunxiang Zhang
Xiao-Yi Wan
28
1
0
28 Jun 2022
Large Language Models are Zero-Shot Reasoners
Takeshi Kojima
S. Gu
Machel Reid
Yutaka Matsuo
Yusuke Iwasawa
ReLM
LRM
310
4,077
0
24 May 2022
Down and Across: Introducing Crossword-Solving as a New NLP Benchmark
Saurabh Kulshreshtha
Olga Kovaleva
Namrata Shivagunde
Anna Rumshisky
ELM
LRM
26
4
0
20 May 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
314
3,237
0
21 Mar 2022
BiRdQA: A Bilingual Dataset for Question Answering on Tricky Riddles
Yunxiang Zhang
Xiaojun Wan
34
12
0
23 Sep 2021
Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies
Mor Geva
Daniel Khashabi
Elad Segal
Tushar Khot
Dan Roth
Jonathan Berant
RALM
250
673
0
06 Jan 2021
RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense Knowledge
Bill Yuchen Lin
Ziyi Wu
Yichi Yang
Dong-Ho Lee
Xiang Ren
ReLM
LRM
236
64
0
02 Jan 2021
1