Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

17 October 2022
Mirac Suzgun
Nathan Scales
Nathanael Scharli
Sebastian Gehrmann
Yi Tay
Hyung Won Chung
Aakanksha Chowdhery
Quoc V. Le
Ed H. Chi
Denny Zhou
Jason W. Wei
    ALM
    ELM
    LRM
    ReLM

Papers citing "Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them"

50 / 797 papers shown
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent
Haoran Liao
Qinyi Du
Shaohua Hu
Hao He
Yanyan Xu
Jidong Tian
Yaohui Jin
LRM
AI4CE
32
1
0
14 Dec 2023
VILA: On Pre-training for Visual Language Models
Ji Lin
Hongxu Yin
Ming-Yu Liu
Yao Lu
Pavlo Molchanov
Andrew Tao
Huizi Mao
Jan Kautz
M. Shoeybi
Song Han
MLLM
VLM
35
356
0
12 Dec 2023
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Avi Singh
John D. Co-Reyes
Rishabh Agarwal
Ankesh Anand
Piyush Patil
...
Yamini Bansal
Ethan Dyer
Behnam Neyshabur
Jascha Narain Sohl-Dickstein
Noah Fiedel
ALM
LRM
ReLM
SyDa
157
146
0
11 Dec 2023
Large Scale Foundation Models for Intelligent Manufacturing Applications: A Survey
Haotian Zhang
S. D. Semujju
Zhicheng Wang
Xianwei Lv
Kang Xu
...
Jing Wu
Zhuo Long
Wensheng Liang
Xiaoguang Ma
Ruiyan Zhuang
UQCV
AI4TS
AI4CE
29
4
0
11 Dec 2023
Latent Skill Discovery for Chain-of-Thought Reasoning
Zifan Xu
Haozhu Wang
Dmitriy Bespalov
Peter Stone
Yanjun Qi
ReLM
LRM
59
2
0
07 Dec 2023
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
Chengshu Li
Jacky Liang
Andy Zeng
Xinyun Chen
Karol Hausman
Dorsa Sadigh
Sergey Levine
Fei-Fei Li
Fei Xia
Brian Ichter
LLMAG
LRM
36
72
0
07 Dec 2023
Prompt Optimization via Adversarial In-Context Learning
Do Xuan Long
Yiran Zhao
Hannah Brown
Yuxi Xie
James Xu Zhao
Nancy F. Chen
Kenji Kawaguchi
Michael Qizhe Xie
Junxian He
72
11
0
05 Dec 2023
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
Renze Lou
Kai Zhang
Jian Xie
Yuxuan Sun
Janice Ahn
Hanzi Xu
Yu Su
Wenpeng Yin
37
26
0
05 Dec 2023
Recursive Visual Programming
Jiaxin Ge
Sanjay Subramanian
Baifeng Shi
Roei Herzig
Trevor Darrell
37
4
0
04 Dec 2023
Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication
Zhangyue Yin
Qiushi Sun
Cheng Chang
Qipeng Guo
Junqi Dai
Xuanjing Huang
Xipeng Qiu
LRM
56
49
0
04 Dec 2023
CLAMP: Contrastive LAnguage Model Prompt-tuning
Piotr Teterwak
Ximeng Sun
Bryan A. Plummer
Kate Saenko
Ser-Nam Lim
MLLM
VLM
40
1
0
04 Dec 2023
Hyperparameter Optimization for Large Language Model Instruction-Tuning
C. Tribes
Sacha Benarroch-Lelong
Peng Lu
I. Kobyzev
29
12
0
01 Dec 2023
Instruction-tuning Aligns LLMs to the Human Brain
Khai Loong Aw
Syrielle Montariol
Badr AlKhamissi
Martin Schrimpf
Antoine Bosselut
33
18
0
01 Dec 2023
CoLLiE: Collaborative Training of Large Language Models in an Efficient Way
Kai Lv
Shuo Zhang
Tianle Gu
Shuhao Xing
Jiawei Hong
...
Tengxiao Liu
Yu Sun
Penousal Machado
Hang Yan
Xipeng Qiu
38
7
0
01 Dec 2023
CLOMO: Counterfactual Logical Modification with Large Language Models
Yinya Huang
Ruixin Hong
Hongming Zhang
Wei Shao
Zhicheng YANG
Dong Yu
Changshui Zhang
Xiaodan Liang
Linqi Song
LRM
34
7
0
29 Nov 2023
Training Chain-of-Thought via Latent-Variable Inference
Du Phan
Matthew D. Hoffman
David Dohan
Sholto Douglas
Tuan Anh Le
Aaron T Parisi
Pavel Sountsov
Charles Sutton
Sharad Vikram
Rif A. Saurous
BDL
ReLM
LRM
25
23
0
28 Nov 2023
AlignedCoT: Prompting Large Language Models via Native-Speaking Demonstrations
Zhicheng YANG
Yinya Huang
Jing Xiong
Liang Feng
Xiaodan Liang
Yiwei Wang
Jing Tang
LRM
31
1
0
22 Nov 2023
ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization
Prateek Yadav
Leshem Choshen
Colin Raffel
Mohit Bansal
32
13
0
22 Nov 2023
Conditions for Length Generalization in Learning Reasoning Skills
Changnan Xiao
Bing Liu
LRM
40
7
0
22 Nov 2023
Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks
Rahul Ramesh
Ekdeep Singh Lubana
Mikail Khona
Robert P. Dick
Hidenori Tanaka
CoGe
39
7
0
21 Nov 2023
Data Diversity Matters for Robust Instruction Tuning
Alexander Bukharin
Tuo Zhao
81
36
0
21 Nov 2023
Can We Utilize Pre-trained Language Models within Causal Discovery Algorithms?
Chanhui Lee
Juhyeon Kim
Yongjun Jeong
Juhyun Lyu
Junghee Kim
...
Hyeokjun Choe
Soyeon Park
Woohyung Lim
Sungbin Lim
Snu Astronomy Research Center
28
0
0
19 Nov 2023
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
Hamish Ivison
Yizhong Wang
Valentina Pyatkin
Nathan Lambert
Matthew E. Peters
...
Joel Jang
David Wadden
Noah A. Smith
Iz Beltagy
Hanna Hajishirzi
ALM
ELM
32
180
0
17 Nov 2023
OVM, Outcome-supervised Value Models for Planning in Mathematical Reasoning
Fei Yu
Anningzhe Gao
Benyou Wang
OffRL
LRM
17
42
0
16 Nov 2023
Whispers of Doubt Amidst Echoes of Triumph in NLP Robustness
Ashim Gupta
Rishanth Rajendhran
Nathan Stringham
Vivek Srikumar
Ana Marasović
AAML
31
3
0
16 Nov 2023
Automatic Engineering of Long Prompts
Cho-Jui Hsieh
Si Si
Felix X. Yu
Inderjit S. Dhillon
VLM
19
8
0
16 Nov 2023
Program-Aided Reasoners (better) Know What They Know
Anubha Kabra
Sanketh Rangreji
Yash Mathur
Aman Madaan
Emmy Liu
Graham Neubig
LRM
34
0
0
16 Nov 2023
Symbol-LLM: Towards Foundational Symbol-centric Interface For Large Language Models
Fangzhi Xu
Zhiyong Wu
Qiushi Sun
Siyu Ren
Fei Yuan
Shuai Yuan
Qika Lin
Yu Qiao
Jun Liu
LLMAG
29
33
0
15 Nov 2023
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving
Chang Gao
Haiyun Jiang
Deng Cai
Shuming Shi
Wai Lam
LRM
34
3
0
15 Nov 2023
Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration
Zhenran Xu
Senbao Shi
Baotian Hu
Jindi Yu
Dongfang Li
Min Zhang
Yuxiang Wu
LRM
LLMAG
ALM
66
22
0
14 Nov 2023
On Measuring Faithfulness or Self-consistency of Natural Language Explanations
Letitia Parcalabescu
Anette Frank
LRM
74
22
0
13 Nov 2023
Do large language models and humans have similar behaviors in causal inference with script knowledge?
Xudong Hong
Margarita Ryzhova
Daniel Adrian Biondi
Ram Sarkar
42
5
0
13 Nov 2023
Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations
Zilu Tang
Mayank Agarwal
Alex Shypula
Bailin Wang
Derry Wijaya
Jie Chen
Yoon Kim
LRM
37
15
0
13 Nov 2023
Towards the Law of Capacity Gap in Distilling Language Models
Chen Zhang
Dawei Song
Zheyu Ye
Yan Gao
ELM
38
20
0
13 Nov 2023
BizBench: A Quantitative Reasoning Benchmark for Business and Finance
Rik Koncel-Kedziorski
Michael Krumdick
Viet Dac Lai
Varshini Reddy
Charles Lovering
Chris Tanner
AIMat
35
4
0
11 Nov 2023
Prompt Engineering a Prompt Engineer
Qinyuan Ye
Maxamed Axmed
Reid Pryzant
Fereshte Khani
VLM
LLMAG
LRM
27
28
0
09 Nov 2023
Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Shashank Gupta
Vaishnavi Shrivastava
Ameet Deshpande
Ashwin Kalyan
Peter Clark
Ashish Sabharwal
Tushar Khot
131
101
0
08 Nov 2023
Black-Box Prompt Optimization: Aligning Large Language Models without Model Training
Jiale Cheng
Xiao Liu
Kehan Zheng
Pei Ke
Hongning Wang
Yuxiao Dong
Jie Tang
Minlie Huang
29
79
0
07 Nov 2023
Implicit Chain of Thought Reasoning via Knowledge Distillation
Yuntian Deng
Kiran Prasad
Roland Fernandez
P. Smolensky
Vishrav Chaudhary
Stuart M. Shieber
ReLM
LRM
24
43
0
02 Nov 2023
FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
Yuxin Jiang
Yufei Wang
Xingshan Zeng
Wanjun Zhong
Liangyou Li
Fei Mi
Lifeng Shang
Xin Jiang
Qun Liu
Wei Wang
ALM
15
25
0
31 Oct 2023
M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
Wai-Chung Kwan
Xingshan Zeng
Yufei Wang
Yusen Sun
Liangyou Li
Lifeng Shang
Qun Liu
Kam-Fai Wong
ELM
97
10
0
30 Oct 2023
TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise
Nan He
Hanyu Lai
Chenyang Zhao
Zirui Cheng
Junting Pan
...
Zhaohui Hou
Zhiyuan Huang
Shaoqing Lu
Ding Liang
Mingjie Zhan
LRM
29
13
0
29 Oct 2023
DetermLR: Augmenting LLM-based Logical Reasoning from Indeterminacy to Determinacy
Hongda Sun
Weikai Xu
Wei Liu
Jian Luan
Bin Wang
Shuo Shang
Ji-Rong Wen
Rui Yan
LRM
47
24
0
28 Oct 2023
TarGEN: Targeted Data Generation with Large Language Models
Himanshu Gupta
Kevin Scaria
Ujjwala Anantheswaran
Shreyas Verma
Mihir Parmar
Saurabh Arjun Sawant
Chitta Baral
Swaroop Mishra
SyDa
38
8
0
27 Oct 2023
OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models
Mingfeng Xue
Dayiheng Liu
Kexin Yang
Guanting Dong
Wenqiang Lei
Zheng Yuan
Chang Zhou
Jingren Zhou
LLMAG
22
2
0
25 Oct 2023
PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization
Xinyuan Wang
Chenxi Li
Zhen Wang
Fan Bai
Haotian Luo
Jiayou Zhang
Nebojsa Jojic
Eric P. Xing
Zhiting Hu
31
102
0
25 Oct 2023
Can You Follow Me? Testing Situational Understanding in ChatGPT
Chenghao Yang
Allyson Ettinger
LRM
LLMAG
ELM
112
4
0
24 Oct 2023
Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation
Zeyuan Yang
Peng Li
Yang Liu
LRM
38
21
0
24 Oct 2023
POE: Process of Elimination for Multiple Choice Reasoning
Chenkai Ma
Xinya Du
LRM
25
5
0
24 Oct 2023
S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Models
Fangyu Lei
Qian Liu
Yiming Huang
Shizhu He
Jun Zhao
Kang Liu
ELM
LRM
25
12
0
23 Oct 2023