Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.13971
Cited By
LLaMA: Open and Efficient Foundation Language Models
27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LLaMA: Open and Efficient Foundation Language Models"
50 / 7,181 papers shown
Title
chatClimate: Grounding Conversational AI in Climate Science
S. Vaghefi
Qian Wang
V. Muccione
Jingwei Ni
Mathias Kraus
...
Tobias Schimanski
Chiara Colesanti-Senni
Nicolas Webersinke
Christrian Huggel
Markus Leippold
KELM
AI4MH
HILM
52
68
0
11 Apr 2023
Graph-ToolFormer: To Empower LLMs with Graph Reasoning Ability via Prompt Augmented by ChatGPT
Jiawei Zhang
LRM
52
77
0
10 Apr 2023
OpenAGI: When LLM Meets Domain Experts
Yingqiang Ge
Wenyue Hua
Kai Mei
Jianchao Ji
Juntao Tan
Shuyuan Xu
Zelong Li
Yongfeng Zhang
VLM
LRM
59
216
0
10 Apr 2023
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models
Emilio Ferrara
SILM
61
249
0
07 Apr 2023
On Efficient Training of Large-Scale Deep Learning Models: A Literature Review
Li Shen
Yan Sun
Zhiyuan Yu
Liang Ding
Xinmei Tian
Dacheng Tao
VLM
44
41
0
07 Apr 2023
Geotechnical Parrot Tales (GPT): Harnessing Large Language Models in geotechnical engineering
Krishna Kumar
LLMAG
AI4CE
19
10
0
04 Apr 2023
RPTQ: Reorder-based Post-training Quantization for Large Language Models
Zhihang Yuan
Lin Niu
Jia-Wen Liu
Wenyu Liu
Xinggang Wang
Yuzhang Shang
Guangyu Sun
Qiang Wu
Jiaxiang Wu
Bingzhe Wu
MQ
46
82
0
03 Apr 2023
CAMEL: Communicative Agents for "Mind" Exploration of Large Language Model Society
Ge Li
Hasan Hammoud
Hani Itani
Dmitrii Khizbullin
Guohao Li
SyDa
ALM
59
443
0
31 Mar 2023
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face
Yongliang Shen
Kaitao Song
Xu Tan
Dongsheng Li
Weiming Lu
Yueting Zhuang
MLLM
50
858
0
30 Mar 2023
BloombergGPT: A Large Language Model for Finance
Shijie Wu
Ozan Irsoy
Steven Lu
Vadim Dabravolski
Mark Dredze
Sebastian Gehrmann
P. Kambadur
David S. Rosenberg
Gideon Mann
AIFin
183
803
0
30 Mar 2023
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
Renrui Zhang
Jiaming Han
Chris Liu
Peng Gao
Aojun Zhou
Xiangfei Hu
Shilin Yan
Pan Lu
Hongsheng Li
Yu Qiao
MLLM
93
757
0
28 Mar 2023
Sigmoid Loss for Language Image Pre-Training
Xiaohua Zhai
Basil Mustafa
Alexander Kolesnikov
Lucas Beyer
CLIP
VLM
48
1,010
0
27 Mar 2023
Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need
Vivien A. Cabannes
Léon Bottou
Yann LeCun
Randall Balestriero
52
13
0
27 Mar 2023
InterviewBot: Real-Time End-to-End Dialogue System to Interview Students for College Admission
Zihao Wang
Nathan Keyes
Terry Crawford
Jinho Choi
36
0
0
27 Mar 2023
An Evaluation of Memory Optimization Methods for Training Neural Networks
Xiaoxuan Liu
Siddharth Jha
Alvin Cheung
34
0
0
26 Mar 2023
Scaling Expert Language Models with Unsupervised Domain Discovery
Suchin Gururangan
Margaret Li
M. Lewis
Weijia Shi
Tim Althoff
Noah A. Smith
Luke Zettlemoyer
MoE
43
46
0
24 Mar 2023
Error Analysis Prompting Enables Human-Like Translation Evaluation in Large Language Models
Qingyu Lu
Baopu Qiu
Liang Ding
Liping Xie
Tom Kocmi
Dacheng Tao
LRM
ALM
ELM
31
111
0
24 Mar 2023
SGFormer: Semantic Graph Transformer for Point Cloud-based 3D Scene Graph Generation
Changsheng Lv
Mengshi Qi
Xia Li
Zhengyuan Yang
Huadong Ma
3DPC
ViT
39
10
0
20 Mar 2023
Exploring ChatGPT's Ability to Rank Content: A Preliminary Study on Consistency with Human Preferences
Yunjie Ji
Yan Gong
Yiping Peng
Chao Ni
Peiyan Sun
Dongyu Pan
Baochang Ma
Xiangang Li
ELM
ALM
AI4MH
34
37
0
14 Mar 2023
Does Synthetic Data Generation of LLMs Help Clinical Text Mining?
Ruixiang Tang
Xiaotian Han
Xiaoqian Jiang
Xia Hu
LM&MA
AI4MH
SyDa
42
172
0
08 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
46
517
0
07 Mar 2023
Prophet: Prompting Large Language Models with Complementary Answer Heuristics for Knowledge-based Visual Question Answering
Zhou Yu
Xuecheng Ouyang
Zhenwei Shao
Mei Wang
Jun Yu
MLLM
94
11
0
03 Mar 2023
Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
Seonghyeon Ye
Hyeonbin Hwang
Sohee Yang
Hyeongu Yun
Yireun Kim
Minjoon Seo
LRM
64
36
0
28 Feb 2023
SAINE: Scientific Annotation and Inference Engine of Scientific Research
Susie Xi Rao
Yi-Lin Tu
P. Egger
32
1
0
28 Feb 2023
Zero-Shot Cross-Lingual Summarization via Large Language Models
Jiaan Wang
Yunlong Liang
Fandong Meng
Beiqi Zou
Zhixu Li
Jianfeng Qu
Jie Zhou
ELM
39
28
0
28 Feb 2023
Testing AI on language comprehension tasks reveals insensitivity to underlying meaning
Vittoria Dentella
Fritz Guenther
Elliot Murphy
G. Marcus
Evelina Leivada
ELM
59
28
0
23 Feb 2023
Active Prompting with Chain-of-Thought for Large Language Models
Shizhe Diao
Pengcheng Wang
Yong Lin
Tong Zhang
ReLM
KELM
LLMAG
LRM
47
122
0
23 Feb 2023
In-context Example Selection with Influences
Nguyen Tai
Eric Wong
44
49
0
21 Feb 2023
A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng
Jianbo Yuan
Lei Yu
Lingpeng Kong
DiffM
41
61
0
11 Feb 2023
IC3: Image Captioning by Committee Consensus
David M. Chan
Austin Myers
Sudheendra Vijayanarasimhan
David A. Ross
John F. Canny
34
17
0
02 Feb 2023
A Survey on Efficient Training of Transformers
Bohan Zhuang
Jing Liu
Zizheng Pan
Haoyu He
Yuetian Weng
Chunhua Shen
36
47
0
02 Feb 2023
Adaptive Machine Translation with Large Language Models
Yasmin Moslem
Rejwanul Haque
John D. Kelleher
Andy Way
AI4CE
35
77
0
30 Jan 2023
Emerging Synergies in Causality and Deep Generative Models: A Survey
Guanglin Zhou
Shaoan Xie
Guang-Yuan Hao
Shiming Chen
Erdun Gao
Xiwei Xu
Chen Wang
Liming Zhu
Lina Yao
Kun Zhang
AI4CE
69
11
0
29 Jan 2023
TikTalk: A Video-Based Dialogue Dataset for Multi-Modal Chitchat in Real World
Hongpeng Lin
Ludan Ruan
Wenke Xia
Peiyu Liu
Jing Wen
...
Di Hu
Ruihua Song
Wayne Xin Zhao
Qin Jin
Zhiwu Lu
VGen
44
10
0
14 Jan 2023
Data Distillation: A Survey
Noveen Sachdeva
Julian McAuley
DD
61
74
0
11 Jan 2023
Does compressing activations help model parallel training?
S. Bian
Dacheng Li
Hongyi Wang
Eric P. Xing
Shivaram Venkataraman
40
5
0
06 Jan 2023
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation
Yue Han
Jiangning Zhang
Zhucun Xue
Chao Xu
Xintian Shen
Yabiao Wang
Chengjie Wang
Yong Liu
Xiangtai Li
62
17
0
03 Jan 2023
Principled and Efficient Transfer Learning of Deep Models via Neural Collapse
Xiao Li
Sheng Liu
Jin-li Zhou
Xin Lu
C. Fernandez‐Granda
Zhihui Zhu
Q. Qu
AAML
35
19
0
23 Dec 2022
Language Models as Inductive Reasoners
Zonglin Yang
Li Dong
Xinya Du
Hao Cheng
Min Zhang
Xiaodong Liu
Jianfeng Gao
Furu Wei
ReLM
LRM
35
34
0
21 Dec 2022
JASMINE: Arabic GPT Models for Few-Shot Learning
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
AbdelRahim Elmadany
Alcides Alcoba Inciarte
Md. Tawkat Islam Khondaker
38
8
0
21 Dec 2022
Does CLIP Bind Concepts? Probing Compositionality in Large Image Models
Martha Lewis
Nihal V. Nayak
Peilin Yu
Qinan Yu
Jack Merullo
Stephen H. Bach
Ellie Pavlick
VLM
OCL
CoGe
46
59
0
20 Dec 2022
Is GPT-3 a Good Data Annotator?
Bosheng Ding
Chengwei Qin
Linlin Liu
Yew Ken Chia
Shafiq Joty
Boyang Albert Li
Lidong Bing
42
237
0
20 Dec 2022
Large Language Models Are Reasoning Teachers
Namgyu Ho
Laura Schmid
Se-Young Yun
ReLM
ELM
LRM
47
330
0
20 Dec 2022
When Federated Learning Meets Pre-trained Language Models' Parameter-Efficient Tuning Methods
Zhuo Zhang
Yuanhang Yang
Yong Dai
Zhuang Li
Zenglin Xu
FedML
74
70
0
20 Dec 2022
On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning
Omar Shaikh
Hongxin Zhang
William B. Held
Michael S. Bernstein
Diyi Yang
ReLM
LRM
64
191
0
15 Dec 2022
A Comprehensive Survey on Enterprise Financial Risk Analysis from Big Data Perspective
Yu Zhao
Huaming Du
Qing Li
Fuzhen Zhuang
Ji Liu
Gang Kou
Gang Kou
63
1
0
28 Nov 2022
Self-Destructing Models: Increasing the Costs of Harmful Dual Uses of Foundation Models
Peter Henderson
E. Mitchell
Christopher D. Manning
Dan Jurafsky
Chelsea Finn
44
47
0
27 Nov 2022
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Guangxuan Xiao
Ji Lin
Mickael Seznec
Hao Wu
Julien Demouth
Song Han
MQ
103
775
0
18 Nov 2022
Deep Emotion Recognition in Textual Conversations: A Survey
Patrícia Pereira
Helena Moniz
Joao Paulo Carvalho
52
15
0
16 Nov 2022
GAMMT: Generative Ambiguity Modeling Using Multiple Transformers
Xingcheng Xu
35
0
0
16 Nov 2022
Previous
1
2
3
...
142
143
144
Next