Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.04023
Cited By
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity
8 February 2023
Yejin Bang
Samuel Cahyawijaya
Nayeon Lee
Wenliang Dai
Dan Su
Bryan Wilie
Holy Lovenia
Ziwei Ji
Tiezheng Yu
Willy Chung
Quyet V. Do
Yan Xu
Pascale Fung
ReLM
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity"
16 / 166 papers shown
Title
Exploring the Limits of ChatGPT for Query or Aspect-based Text Summarization
Xianjun Yang
Yan Li
Xinlu Zhang
Haifeng Chen
Wei Cheng
AI4MH
17
173
0
16 Feb 2023
AI vs. Human -- Differentiation Analysis of Scientific Content Generation
Yongqiang Ma
Jiawei Liu
Fan Yi
Qikai Cheng
Yong Huang
Wei Lu
Xiaozhong Liu
DeLMO
4
56
0
24 Jan 2023
ChatGPT Makes Medicine Easy to Swallow: An Exploratory Case Study on Simplified Radiology Reports
Katharina Jeblick
B. Schachtner
Jakob Dexl
Andreas Mittermeier
Anna Theresa Stüber
...
Tobias Weber
Philipp Wesp
B. Sabel
J. Ricke
Michael Ingrisch
LM&MA
MedIm
111
373
0
30 Dec 2022
Towards Reasoning in Large Language Models: A Survey
Jie Huang
Kevin Chen-Chuan Chang
LM&MA
ELM
LRM
27
582
0
20 Dec 2022
Self-Consistency Improves Chain of Thought Reasoning in Language Models
Xuezhi Wang
Jason W. Wei
Dale Schuurmans
Quoc Le
Ed H. Chi
Sharan Narang
Aakanksha Chowdhery
Denny Zhou
ReLM
BDL
LRM
AI4CE
314
3,237
0
21 Mar 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
313
11,915
0
04 Mar 2022
Commonsense Knowledge Reasoning and Generation with Pre-trained Language Models: A Survey
Prajjwal Bhargava
Vincent Ng
ReLM
LRM
38
62
0
28 Jan 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
355
8,457
0
28 Jan 2022
Description-Driven Task-Oriented Dialog Modeling
Jeffrey Zhao
Raghav Gupta
Yuan Cao
Dian Yu
Mingqiu Wang
Harrison Lee
Abhinav Rastogi
Izhak Shafran
Yonghui Wu
46
64
0
21 Jan 2022
Modeling Event Plausibility with Consistent Conceptual Abstraction
Ian Porada
Kaheer Suleman
Adam Trischler
Jackie C.K. Cheung
111
19
0
20 Apr 2021
Explaining Answers with Entailment Trees
Bhavana Dalvi
Peter Alexander Jansen
Oyvind Tafjord
Zhengnan Xie
Hannah Smith
Leighanna Pipatanangkura
Peter Clark
ReLM
FAtt
LRM
239
184
0
17 Apr 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
255
4,777
0
24 Feb 2021
When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models
Benjamin Muller
Antonis Anastasopoulos
Benoît Sagot
Djamé Seddah
LRM
126
165
0
24 Oct 2020
Knowledge-Grounded Dialogue Generation with Pre-trained Language Models
Xueliang Zhao
Wei Yu Wu
Can Xu
Chongyang Tao
Dongyan Zhao
Rui Yan
188
192
0
17 Oct 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,956
0
20 Apr 2018
Teaching Machines to Read and Comprehend
Karl Moritz Hermann
Tomás Kociský
Edward Grefenstette
L. Espeholt
W. Kay
Mustafa Suleyman
Phil Blunsom
175
3,509
0
10 Jun 2015
Previous
1
2
3
4