Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.09288
Cited By
Llama 2: Open Foundation and Fine-Tuned Chat Models
18 July 2023
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
Yasmine Babaei
Nikolay Bashlykov
Soumya Batra
Prajjwal Bhargava
Shruti Bhosale
Daniel M. Bikel
Lukas Blecher
Cristian Canton Ferrer
Moya Chen
Guillem Cucurull
David Esiobu
Jude Fernandes
Jeremy Fu
Wenyin Fu
Brian Fuller
Cynthia Gao
Vedanuj Goswami
Naman Goyal
Anthony Hartshorn
Saghar Hosseini
Rui Hou
Hakan Inan
Marcin Kardas
Viktor Kerkez
Madian Khabsa
Isabel Kloumann
Artem Korenev
Punit Singh Koura
Marie-Anne Lachaux
Thibaut Lavril
Jenya Lee
Diana Liskovich
Yinghai Lu
Yuning Mao
Xavier Martinet
Todor Mihaylov
Pushkar Mishra
Igor Molybog
Yixin Nie
Andrew Poulton
Jeremy Reizenstein
Rashi Rungta
Kalyan Saladi
Alan Schelten
Ruan Silva
Eric Michael Smith
R. Subramanian
Xia Tan
Binh Tang
Ross Taylor
Adina Williams
Jian Xiang Kuan
Puxin Xu
Zhengxu Yan
Iliyan Zarov
Yuchen Zhang
Angela Fan
Melanie Kambadur
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Llama 2: Open Foundation and Fine-Tuned Chat Models"
50 / 7,791 papers shown
Title
Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions
Federico Bianchi
Mirac Suzgun
Giuseppe Attanasio
Paul Röttger
Dan Jurafsky
Tatsunori Hashimoto
James Zou
ALM
LM&MA
LRM
34
183
0
14 Sep 2023
ExpertQA: Expert-Curated Questions and Attributed Answers
Chaitanya Malaviya
Subin Lee
Sihao Chen
Elizabeth Sieber
Mark Yatskar
Dan Roth
ELM
HILM
36
52
0
14 Sep 2023
Tree of Uncertain Thoughts Reasoning for Large Language Models
Shentong Mo
Miao Xin
LRM
AI4CE
14
12
0
14 Sep 2023
Your Code Secret Belongs to Me: Neural Code Completion Tools Can Memorize Hard-Coded Credentials
Yizhan Huang
Yichen Li
Weibin Wu
Jianping Zhang
Michael R. Lyu
31
14
0
14 Sep 2023
SwitchGPT: Adapting Large Language Models for Non-Text Outputs
Xinyu Wang
Bohan Zhuang
Qi Wu
MLLM
47
3
0
14 Sep 2023
Zero-shot Audio Topic Reranking using Large Language Models
Mengjie Qian
Rao Ma
Adian Liusie
Erfan Loweimi
Kate Knill
Mark Gales
37
1
0
14 Sep 2023
Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?
Rishav Hada
Varun Gumma
Adrian de Wynter
Harshita Diddee
Mohamed Ahmed
Monojit Choudhury
Kalika Bali
Sunayana Sitaram
ALM
LM&MA
ELM
35
63
0
14 Sep 2023
Towards Artificial General Intelligence (AGI) in the Internet of Things (IoT): Opportunities and Challenges
Fei Dou
Jin Ye
Geng Yuan
Qin Lu
Wei Niu
...
Hongyue Sun
Yunli Shao
Changying Li
Tianming Liu
Wenzhan Song
AI4CE
37
29
0
14 Sep 2023
Adapted Large Language Models Can Outperform Medical Experts in Clinical Text Summarization
Dave Van Veen
Cara Van Uden
Louis Blankemeier
Jean-Benoit Delbrouck
Asad Aali
...
C. Langlotz
Jason Hom
S. Gatidis
John M. Pauly
Akshay S. Chaudhari
ELM
AI4MH
LM&MA
45
279
0
14 Sep 2023
A Fast Optimization View: Reformulating Single Layer Attention in LLM Based on Tensor and SVM Trick, and Solving It in Matrix Multiplication Time
Yeqi Gao
Zhao Song
Weixin Wang
Junze Yin
26
26
0
14 Sep 2023
PromptASR for contextualized ASR with controllable style
Xiaoyu Yang
Wei Kang
Zengwei Yao
Yifan Yang
Liyong Guo
Fangjun Kuang
Long Lin
Daniel Povey
36
9
0
14 Sep 2023
Pretraining on the Test Set Is All You Need
Rylan Schaeffer
18
28
0
13 Sep 2023
Mitigate Replication and Copying in Diffusion Models with Generalized Caption and Dual Fusion Enhancement
Chenghao Li
Dake Chen
Yuke Zhang
P. Beerel
DiffM
35
7
0
13 Sep 2023
In-Contextual Gender Bias Suppression for Large Language Models
Daisuke Oba
Masahiro Kaneko
Danushka Bollegala
31
8
0
13 Sep 2023
EarthPT: a time series foundation model for Earth Observation
Michael J. Smith
Luke Fleming
James E. Geach
AI4TS
22
7
0
13 Sep 2023
RAIN: Your Language Models Can Align Themselves without Finetuning
Yuhui Li
Fangyun Wei
Jinjing Zhao
Chao Zhang
Hongyang R. Zhang
SILM
44
108
0
13 Sep 2023
Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics
Haoqin Tu
Bingchen Zhao
Chen Wei
Cihang Xie
MLLM
39
14
0
13 Sep 2023
Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding
Rico Sennrich
Jannis Vamvas
Alireza Mohammadshahi
HILM
37
39
0
13 Sep 2023
SafetyBench: Evaluating the Safety of Large Language Models
Zhexin Zhang
Leqi Lei
Lindong Wu
Rui Sun
Yongkang Huang
Chong Long
Xiao Liu
Xuanyu Lei
Jie Tang
Minlie Huang
LRM
LM&MA
ELM
45
92
0
13 Sep 2023
Simultaneous Machine Translation with Large Language Models
Minghan Wang
Jinming Zhao
Thuy-Trang Vu
Fatemeh Shiri
Ehsan Shareghi
Gholamreza Haffari
42
2
0
13 Sep 2023
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
Hao Sun
Alihan Huyuk
M. Schaar
OffRL
LRM
23
28
0
13 Sep 2023
Statistical Rejection Sampling Improves Preference Optimization
Tianqi Liu
Yao-Min Zhao
Rishabh Joshi
Misha Khalman
Mohammad Saleh
Peter J. Liu
Jialu Liu
61
215
0
13 Sep 2023
Text Encoders Lack Knowledge: Leveraging Generative LLMs for Domain-Specific Semantic Textual Similarity
Joseph Gatto
Omar Sharif
Parker Seegmiller
Philip Bohlman
S. Preum
22
8
0
12 Sep 2023
Recovering from Privacy-Preserving Masking with Large Language Models
A. Vats
Zhe Liu
Peng Su
Debjyoti Paul
Yingyi Ma
Yutong Pang
Zeeshan Ahmed
Ozlem Kalinli
31
9
0
12 Sep 2023
The first step is the hardest: Pitfalls of Representing and Tokenizing Temporal Data for Large Language Models
Dimitris Spathis
F. Kawsar
AI4TS
36
18
0
12 Sep 2023
AstroLLaMA: Towards Specialized Foundation Models in Astronomy
Tuan Dung Nguyen
Yuan-Sen Ting
I. Ciucă
Charlie OÑeill
Ze-Chang Sun
...
Alberto Accomazzi
J. P. Naiman
Jesse Cranney
Kevin Schawinski
UniverseTBD
16
20
0
12 Sep 2023
The Moral Machine Experiment on Large Language Models
Kazuhiro Takemoto
27
19
0
12 Sep 2023
Comparing Llama-2 and GPT-3 LLMs for HPC kernels generation
Pedro Valero-Lara
Alexis Huante
Mustafa Al Lail
William F. Godoy
K. Teranishi
Prasanna Balaprakash
Jeffrey S. Vetter
ELM
28
19
0
12 Sep 2023
Large Language Models for Compiler Optimization
Chris Cummins
Volker Seeker
Dejan Grubisic
Mostafa Elhoushi
Youwei Liang
...
Jonas Gehring
Fabian Gloeckle
Kim M. Hazelwood
Gabriel Synnaeve
Hugh Leather
26
48
0
11 Sep 2023
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
Xiang Yue
Xingwei Qu
Ge Zhang
Yao Fu
Wenhao Huang
Huan Sun
Yu-Chuan Su
Wenhu Chen
AIMat
LRM
85
369
0
11 Sep 2023
Privacy Side Channels in Machine Learning Systems
Edoardo Debenedetti
Giorgio Severi
Nicholas Carlini
Christopher A. Choquette-Choo
Matthew Jagielski
Milad Nasr
Eric Wallace
Florian Tramèr
MIALM
51
38
0
11 Sep 2023
An Empirical Study of NetOps Capability of Pre-Trained Large Language Models
Yukai Miao
Yu Bai
Li Chen
Dan Li
Haifeng Sun
...
Dapeng Sun
Xiuting Xu
Qi Zhang
Chao Xiang
Xinchi Li
ELM
19
10
0
11 Sep 2023
Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications
Andrew Zhu
Liam Dugan
Alyssa Hwang
Chris Callison-Burch
VLM
17
8
0
11 Sep 2023
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs
Wenhua Cheng
Weiwei Zhang
Haihao Shen
Yiyang Cai
Xin He
Kaokao Lv
Yi. Liu
MQ
36
22
0
11 Sep 2023
Flesch or Fumble? Evaluating Readability Standard Alignment of Instruction-Tuned Language Models
Joseph Marvin Imperial
Harish Tayyar Madabushi
ELM
40
11
0
11 Sep 2023
Evaluating the Deductive Competence of Large Language Models
S. M. Seals
V. Shalin
ELM
ReLM
LRM
24
8
0
11 Sep 2023
DoG-Instruct: Towards Premium Instruction-Tuning Data via Text-Grounded Instruction Wrapping
Yongrui Chen
Haiyun Jiang
Xinting Huang
Shuming Shi
Guilin Qi
SyDa
10
11
0
11 Sep 2023
Understanding the Impact of Post-Training Quantization on Large Language Models
Somnath Roy
MQ
38
3
0
11 Sep 2023
DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning
Zhengxiang Shi
Aldo Lipani
VLM
31
31
0
11 Sep 2023
Retrieval-Augmented Meta Learning for Low-Resource Text Classification
Rongsheng Li
Yongqian Li
Hai-Tao Zheng
Chaiyut Luoyiching
Hai-Tao Zheng
Nannan Zhou
Hanjing Su
RALM
22
2
0
10 Sep 2023
SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning
Bin Wang
Zhengyuan Liu
Xin Huang
Fangkai Jiao
Yang Ding
Ai Ti Aw
Nancy F. Chen
LRM
32
63
0
09 Sep 2023
Analysis of Disinformation and Fake News Detection Using Fine-Tuned Large Language Model
B. Pavlyshenko
29
13
0
09 Sep 2023
From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
Griffin Adams
Alexander R. Fabbri
Faisal Ladhak
Eric Lehman
Noémie Elhadad
32
53
0
08 Sep 2023
Evaluation and Enhancement of Semantic Grounding in Large Vision-Language Models
Jiaying Lu
Jinmeng Rao
Kezhen Chen
Xiaoyuan Guo
Yawen Zhang
Baochen Sun
Carl Yang
Jie Yang
29
11
0
07 Sep 2023
Evaluation of large language models for discovery of gene set function
Mengzhou Hu
Sahar Alkhairy
Ingoo Lee
Rudolf T. Pillich
Dylan Fong
Kevin Smith
Robin Bachelder
T. Ideker
Dexter Pratt
LM&MA
23
33
0
07 Sep 2023
Large Language Models Are Not Robust Multiple Choice Selectors
Chujie Zheng
Hao Zhou
Fandong Meng
Jie Zhou
Minlie Huang
25
217
0
07 Sep 2023
OpinionGPT: Modelling Explicit Biases in Instruction-Tuned LLMs
Patrick Haller
Ansar Aynetdinov
Alan Akbik
33
24
0
07 Sep 2023
FLM-101B: An Open LLM and How to Train It with
100
K
B
u
d
g
e
t
100K Budget
100
K
B
u
d
g
e
t
Xiang Li
Yiqun Yao
Xin Jiang
Xuezhi Fang
Xuying Meng
...
Li Du
Bowen Qin
Zheng-Wei Zhang
Aixin Sun
Yequan Wang
60
21
0
07 Sep 2023
Supervised Learning and Large Language Model Benchmarks on Mental Health Datasets: Cognitive Distortions and Suicidal Risks in Chinese Social Media
Hongzhi Qi
Qing Zhao
Jianqiang Li
Changwei Song
Wei-dong Zhai
...
Y. Yu
Fan Wang
Huijing Zou
Bing Xiang Yang
Guanghui Fu
AI4MH
29
12
0
07 Sep 2023
From Base to Conversational: Japanese Instruction Dataset and Tuning Large Language Models
Masahiro Suzuki
Masanori Hirano
Hiroki Sakaji
39
6
0
07 Sep 2023
Previous
1
2
3
...
148
149
150
...
154
155
156
Next