ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.09261
  4. Cited By
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

17 October 2022
Mirac Suzgun
Nathan Scales
Nathanael Scharli
Sebastian Gehrmann
Yi Tay
Hyung Won Chung
Aakanksha Chowdhery
Quoc V. Le
Ed H. Chi
Denny Zhou
Jason W. Wei
    ALM
    ELM
    LRM
    ReLM
ArXivPDFHTML

Papers citing "Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them"

50 / 797 papers shown
Title
LLM-in-the-loop: Leveraging Large Language Model for Thematic Analysis
LLM-in-the-loop: Leveraging Large Language Model for Thematic Analysis
Shih-Chieh Dai
Aiping Xiong
Lun-Wei Ku
27
64
0
23 Oct 2023
Reasoning about Ambiguous Definite Descriptions
Reasoning about Ambiguous Definite Descriptions
Stefan F. Schouten
Peter Bloem
Ilia Markov
Piek Vossen
LRM
UQLM
21
0
0
23 Oct 2023
Language Model Unalignment: Parametric Red-Teaming to Expose Hidden
  Harms and Biases
Language Model Unalignment: Parametric Red-Teaming to Expose Hidden Harms and Biases
Rishabh Bhardwaj
Soujanya Poria
ALM
57
15
0
22 Oct 2023
Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author
  Prompt Editing
Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing
Xinyu Hu
Pengfei Tang
Simiao Zuo
Zihan Wang
Bowen Song
Qiang Lou
Jian Jiao
Denis Xavier Charles
LRM
40
7
0
20 Oct 2023
Teaching Language Models to Self-Improve through Interactive
  Demonstrations
Teaching Language Models to Self-Improve through Interactive Demonstrations
Xiao Yu
Baolin Peng
Michel Galley
Jianfeng Gao
Zhou Yu
LRM
ReLM
35
20
0
20 Oct 2023
Auto-Instruct: Automatic Instruction Generation and Ranking for
  Black-Box Language Models
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
Zhihan Zhang
Shuohang Wang
W. Yu
Yichong Xu
Dan Iter
Qingkai Zeng
Yang Liu
Chenguang Zhu
Meng Jiang
SyDa
ALM
24
22
0
19 Oct 2023
Multi-stage Large Language Model Correction for Speech Recognition
Multi-stage Large Language Model Correction for Speech Recognition
Jie Pu
Thai-Son Nguyen
Sebastian Stüker
LRM
32
6
0
17 Oct 2023
AdaLomo: Low-memory Optimization with Adaptive Learning Rate
AdaLomo: Low-memory Optimization with Adaptive Learning Rate
Kai Lv
Hang Yan
Qipeng Guo
Haijun Lv
Xipeng Qiu
ODL
27
20
0
16 Oct 2023
GLoRE: Evaluating Logical Reasoning of Large Language Models
GLoRE: Evaluating Logical Reasoning of Large Language Models
Hanmeng Liu
Zhiyang Teng
Ruoxi Ning
Jian Liu
Qiji Zhou
Yuexin Zhang
Yue Zhang
ReLM
ELM
LRM
70
8
0
13 Oct 2023
DistillSpec: Improving Speculative Decoding via Knowledge Distillation
DistillSpec: Improving Speculative Decoding via Knowledge Distillation
Yongchao Zhou
Kaifeng Lyu
A. S. Rawat
A. Menon
Afshin Rostamizadeh
Sanjiv Kumar
Jean-François Kagy
Rishabh Agarwal
47
82
0
12 Oct 2023
Prompting Large Language Models with Chain-of-Thought for Few-Shot
  Knowledge Base Question Generation
Prompting Large Language Models with Chain-of-Thought for Few-Shot Knowledge Base Question Generation
Yuanyuan Liang
Jianing Wang
Hanlun Zhu
Lei Wang
Weining Qian
Yunshi Lan
LRM
ReLM
21
36
0
12 Oct 2023
Found in the Middle: Permutation Self-Consistency Improves Listwise
  Ranking in Large Language Models
Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models
Raphael Tang
Xinyu Crystina Zhang
Xueguang Ma
Jimmy Lin
Ferhan Ture
LRM
36
15
0
11 Oct 2023
KwaiYiiMath: Technical Report
KwaiYiiMath: Technical Report
Jia-Yi Fu
Lei Lin
Xiaoyang Gao
Pengli Liu
Zhengzong Chen
...
Zijia Lin
Fuzheng Zhang
Zhongyuan Wang
Di Zhang
Kun Gai
LRM
ReLM
RALM
51
2
0
11 Oct 2023
Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained
  Decoding
Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained Decoding
Kexun Zhang
Hongqiao Chen
Lei Li
Luu Anh Tuan
50
4
0
10 Oct 2023
Lemur: Harmonizing Natural Language and Code for Language Agents
Lemur: Harmonizing Natural Language and Code for Language Agents
Yiheng Xu
Hongjin Su
Chen Xing
Boyu Mi
Qian Liu
...
Siheng Zhao
Lingpeng Kong
Bailin Wang
Caiming Xiong
Tao Yu
32
68
0
10 Oct 2023
Mistral 7B
Mistral 7B
Albert Q. Jiang
Alexandre Sablayrolles
A. Mensch
Chris Bamford
Devendra Singh Chaplot
...
Teven Le Scao
Thibaut Lavril
Thomas Wang
Timothée Lacroix
William El Sayed
MoE
LRM
23
1,999
0
10 Oct 2023
TRACE: A Comprehensive Benchmark for Continual Learning in Large
  Language Models
TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models
Xiao Wang
Yuan Zhang
Tianze Chen
Songyang Gao
Senjie Jin
...
Rui Zheng
Yicheng Zou
Tao Gui
Qi Zhang
Xuanjing Huang
ALM
LRM
CLL
60
18
0
10 Oct 2023
Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with
  Large Language Models
Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
Anni Zou
Zhuosheng Zhang
Hai Zhao
Xiangru Tang
LRM
ReLM
42
3
0
10 Oct 2023
SALMON: Self-Alignment with Instructable Reward Models
SALMON: Self-Alignment with Instructable Reward Models
Zhiqing Sun
Songlin Yang
Hongxin Zhang
Qinhong Zhou
Zhenfang Chen
David D. Cox
Yiming Yang
Chuang Gan
ALM
SyDa
38
35
0
09 Oct 2023
LLMLingua: Compressing Prompts for Accelerated Inference of Large
  Language Models
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models
Huiqiang Jiang
Qianhui Wu
Chin-Yew Lin
Yuqing Yang
Lili Qiu
34
101
0
09 Oct 2023
Guiding Language Model Math Reasoning with Planning Tokens
Guiding Language Model Math Reasoning with Planning Tokens
Xinyi Wang
Lucas Caccia
O. Ostapenko
Xingdi Yuan
William Yang Wang
Alessandro Sordoni
LRM
33
20
0
09 Oct 2023
Compresso: Structured Pruning with Collaborative Prompting Learns
  Compact Large Language Models
Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models
Song Guo
Jiahang Xu
Li Zhang
Mao Yang
25
14
0
08 Oct 2023
Self-Knowledge Guided Retrieval Augmentation for Large Language Models
Self-Knowledge Guided Retrieval Augmentation for Large Language Models
Yile Wang
Peng Li
Maosong Sun
Yang Liu
RALM
KELM
29
42
0
08 Oct 2023
Towards Better Chain-of-Thought Prompting Strategies: A Survey
Towards Better Chain-of-Thought Prompting Strategies: A Survey
Zihan Yu
Liang He
Zhen Wu
Xinyu Dai
Jiajun Chen
LRM
129
45
0
08 Oct 2023
Resprompt: Residual Connection Prompting Advances Multi-Step Reasoning
  in Large Language Models
Resprompt: Residual Connection Prompting Advances Multi-Step Reasoning in Large Language Models
Song Jiang
Zahra Shakeri
Aaron Chan
Maziar Sanjabi
Hamed Firooz
...
Bugra Akyildiz
Yizhou Sun
Jinchao Li
Qifan Wang
Asli Celikyilmaz
LRM
ReLM
26
8
0
07 Oct 2023
Adapting Large Language Models for Content Moderation: Pitfalls in Data
  Engineering and Supervised Fine-tuning
Adapting Large Language Models for Content Moderation: Pitfalls in Data Engineering and Supervised Fine-tuning
Huan Ma
Changqing Zhang
Huazhu Fu
Peilin Zhao
Bing Wu
OffRL
AI4MH
29
17
0
05 Oct 2023
JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning
JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning
Chang Gao
Wenxuan Zhang
Guizhen Chen
Wai Lam
55
5
0
04 Oct 2023
Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models
Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models
Xianjun Yang
Xiao Wang
Qi Zhang
Linda R. Petzold
William Y. Wang
Xun Zhao
Dahua Lin
26
163
0
04 Oct 2023
Think before you speak: Training Language Models With Pause Tokens
Think before you speak: Training Language Models With Pause Tokens
Sachin Goyal
Ziwei Ji
A. S. Rawat
A. Menon
Sanjiv Kumar
Vaishnavh Nagarajan
LRM
24
95
0
03 Oct 2023
Instances Need More Care: Rewriting Prompts for Instances with LLMs in
  the Loop Yields Better Zero-Shot Performance
Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance
Saurabh Srivastava
Chengyue Huang
Weiguo Fan
Ziyu Yao
LLMAG
28
5
0
03 Oct 2023
Large Language Models as Analogical Reasoners
Large Language Models as Analogical Reasoners
Michihiro Yasunaga
Xinyun Chen
Yujia Li
Panupong Pasupat
J. Leskovec
Percy Liang
Ed H. Chi
Denny Zhou
ReLM
LRM
26
77
0
03 Oct 2023
Probing the Multi-turn Planning Capabilities of LLMs via 20 Question
  Games
Probing the Multi-turn Planning Capabilities of LLMs via 20 Question Games
Yizhe Zhang
Jiarui Lu
Navdeep Jaitly
LRM
ELM
24
9
0
02 Oct 2023
UPAR: A Kantian-Inspired Prompting Framework for Enhancing Large
  Language Model Capabilities
UPAR: A Kantian-Inspired Prompting Framework for Enhancing Large Language Model Capabilities
Hejia Geng
Boxun Xu
Peng Li
ELM
LRM
ReLM
41
1
0
30 Sep 2023
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model
  Collaboration
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration
Qiushi Sun
Zhangyue Yin
Xiang Li
Zhiyong Wu
Xipeng Qiu
Lingpeng Kong
LRM
LLMAG
28
44
0
30 Sep 2023
"I'd Like to Have an Argument, Please": Argumentative Reasoning in Large
  Language Models
"I'd Like to Have an Argument, Please": Argumentative Reasoning in Large Language Models
Sizhe Wei
Yifan Lu
LRM
48
4
0
29 Sep 2023
Qwen Technical Report
Qwen Technical Report
Jinze Bai
Shuai Bai
Yunfei Chu
Zeyu Cui
Kai Dang
...
Zhenru Zhang
Chang Zhou
Jingren Zhou
Xiaohuan Zhou
Tianhang Zhu
OSLM
29
1,577
0
28 Sep 2023
GPT-Fathom: Benchmarking Large Language Models to Decipher the
  Evolutionary Path towards GPT-4 and Beyond
GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond
Timothée Darcet
Yuyu Zhang
Yijie Zhu
Chenguang Xi
Pengyang Gao
Piotr Bojanowski
Kevin Chen-Chuan Chang
ELM
32
24
0
28 Sep 2023
A Benchmark for Learning to Translate a New Language from One Grammar
  Book
A Benchmark for Learning to Translate a New Language from One Grammar Book
Garrett Tanzer
Mirac Suzgun
Chenguang Xi
Dan Jurafsky
Luke Melas-Kyriazi
24
51
0
28 Sep 2023
Language Models as a Service: Overview of a New Paradigm and its
  Challenges
Language Models as a Service: Overview of a New Paradigm and its Challenges
Emanuele La Malfa
Aleksandar Petrov
Simon Frieder
Christoph Weinhuber
Ryan Burnell
Raza Nazar
Anthony Cohn
Nigel Shadbolt
Michael Wooldridge
ALM
ELM
35
3
0
28 Sep 2023
LawBench: Benchmarking Legal Knowledge of Large Language Models
LawBench: Benchmarking Legal Knowledge of Large Language Models
Zhiwei Fei
Xiaoyu Shen
D. Zhu
Fengzhe Zhou
Zhuo Han
Songyang Zhang
Kai-xiang Chen
Zongwen Shen
Jidong Ge
ELM
AILaw
34
34
0
28 Sep 2023
ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with
  Modular Quantizers
ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers
Junjie Yin
Jiahao Dong
Yingheng Wang
Christopher De Sa
Volodymyr Kuleshov
MQ
28
4
0
28 Sep 2023
NLPBench: Evaluating Large Language Models on Solving NLP Problems
NLPBench: Evaluating Large Language Models on Solving NLP Problems
Linxin Song
Jieyu Zhang
Lechao Cheng
Pengyuan Zhou
Dinesh Manocha
Irene Z Li
ELM
LM&MA
LRM
36
10
0
27 Sep 2023
Navigate through Enigmatic Labyrinth A Survey of Chain of Thought
  Reasoning: Advances, Frontiers and Future
Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future
Zheng Chu
Jingchang Chen
Qianglong Chen
Weijiang Yu
Tao He
Haotian Wang
Weihua Peng
Ming-Yu Liu
Bing Qin
Ting Liu
LRM
AI4CE
31
153
0
27 Sep 2023
HANS, are you clever? Clever Hans Effect Analysis of Neural Systems
Leonardo Ranaldi
Fabio Massimo Zanzotto
34
2
0
21 Sep 2023
Natural Language Embedded Programs for Hybrid Language Symbolic
  Reasoning
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
Tianhua Zhang
Jiaxin Ge
Hongyin Luo
Yung-Sung Chuang
Mingye Gao
Yuan Gong
Xixin Wu
Yoon Kim
Helen M. Meng
James R. Glass
LRM
ReLM
90
15
0
19 Sep 2023
OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model
  Pre-trained from Scratch
OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch
Juntao Li
Zecheng Tang
Yuyang Ding
Pinzheng Wang
Pei Guo
...
Wenliang Chen
Guohong Fu
Qiaoming Zhu
Guodong Zhou
Hao Fei
45
5
0
19 Sep 2023
Baichuan 2: Open Large-scale Language Models
Baichuan 2: Open Large-scale Language Models
Ai Ming Yang
Bin Xiao
Bingning Wang
Borong Zhang
Ce Bian
...
Youxin Jiang
Yuchen Gao
Yupeng Zhang
Zenan Zhou
Zhiying Wu
ELM
LRM
66
703
0
19 Sep 2023
Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language
  Models that Follow Instructions
Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions
Federico Bianchi
Mirac Suzgun
Giuseppe Attanasio
Paul Röttger
Dan Jurafsky
Tatsunori Hashimoto
James Zou
ALM
LM&MA
LRM
34
178
0
14 Sep 2023
Re-Reading Improves Reasoning in Large Language Models
Re-Reading Improves Reasoning in Large Language Models
Xiaohan Xu
Chongyang Tao
Tao Shen
Can Xu
Hongbo Xu
Guodong Long
Jian-Guang Lou
ReLM
LRM
KELM
34
22
0
12 Sep 2023
MAmmoTH: Building Math Generalist Models through Hybrid Instruction
  Tuning
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
Xiang Yue
Xingwei Qu
Ge Zhang
Yao Fu
Wenhao Huang
Huan Sun
Yu-Chuan Su
Wenhu Chen
AIMat
LRM
62
361
0
11 Sep 2023
Previous
123...1213141516
Next