Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.09288
Cited By
Llama 2: Open Foundation and Fine-Tuned Chat Models
18 July 2023
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
Yasmine Babaei
Nikolay Bashlykov
Soumya Batra
Prajjwal Bhargava
Shruti Bhosale
Daniel M. Bikel
Lukas Blecher
Cristian Canton Ferrer
Moya Chen
Guillem Cucurull
David Esiobu
Jude Fernandes
Jeremy Fu
Wenyin Fu
Brian Fuller
Cynthia Gao
Vedanuj Goswami
Naman Goyal
Anthony Hartshorn
Saghar Hosseini
Rui Hou
Hakan Inan
Marcin Kardas
Viktor Kerkez
Madian Khabsa
Isabel Kloumann
Artem Korenev
Punit Singh Koura
Marie-Anne Lachaux
Thibaut Lavril
Jenya Lee
Diana Liskovich
Yinghai Lu
Yuning Mao
Xavier Martinet
Todor Mihaylov
Pushkar Mishra
Igor Molybog
Yixin Nie
Andrew Poulton
Jeremy Reizenstein
Rashi Rungta
Kalyan Saladi
Alan Schelten
Ruan Silva
Eric Michael Smith
R. Subramanian
Xia Tan
Binh Tang
Ross Taylor
Adina Williams
Jian Xiang Kuan
Puxin Xu
Zhengxu Yan
Iliyan Zarov
Yuchen Zhang
Angela Fan
Melanie Kambadur
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Llama 2: Open Foundation and Fine-Tuned Chat Models"
50 / 7,792 papers shown
Title
Few-Shot Panoptic Segmentation With Foundation Models
Markus Kappeler
Kürsat Petek
Niclas Vodisch
Wolfram Burgard
Abhinav Valada
31
17
0
19 Sep 2023
OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch
Juntao Li
Zecheng Tang
Yuyang Ding
Pinzheng Wang
Pei Guo
...
Wenliang Chen
Guohong Fu
Qiaoming Zhu
Guodong Zhou
Hao Fei
45
5
0
19 Sep 2023
MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback
Xingyao Wang
Zihan Wang
Jiateng Liu
Yangyi Chen
Lifan Yuan
Hao Peng
Heng Ji
LRM
133
142
0
19 Sep 2023
Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model Evaluation
Yucheng Li
20
29
0
19 Sep 2023
CFGPT: Chinese Financial Assistant with Large Language Model
Jiangtong Li
Hao Wang
Guoxuan Wang
Yang Lei
Dawei Cheng
Zhijun Ding
Changjun Jiang
45
10
0
19 Sep 2023
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training
Dawei Zhu
Nan Yang
Liang Wang
Yifan Song
Wenhao Wu
Furu Wei
Sujian Li
76
78
0
19 Sep 2023
Investigating the Catastrophic Forgetting in Multimodal Large Language Models
Yuexiang Zhai
Shengbang Tong
Xiao Li
Mu Cai
Qing Qu
Yong Jae Lee
Yi Ma
VLM
MLLM
CLL
77
78
0
19 Sep 2023
GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts
Jiahao Yu
Xingwei Lin
Zheng Yu
Xinyu Xing
SILM
119
307
0
19 Sep 2023
Baichuan 2: Open Large-scale Language Models
Ai Ming Yang
Bin Xiao
Bingning Wang
Borong Zhang
Ce Bian
...
Youxin Jiang
Yuchen Gao
Yupeng Zhang
Zenan Zhou
Zhiying Wu
ELM
LRM
77
712
0
19 Sep 2023
Stabilizing RLHF through Advantage Model and Selective Rehearsal
Baolin Peng
Linfeng Song
Ye Tian
Lifeng Jin
Haitao Mi
Dong Yu
40
17
0
18 Sep 2023
Understanding Catastrophic Forgetting in Language Models via Implicit Inference
Suhas Kotha
Jacob Mitchell Springer
Aditi Raghunathan
CLL
42
61
0
18 Sep 2023
Conformal Temporal Logic Planning using Large Language Models
Jun Wang
J. Tong
Kai Liang Tan
Yevgeniy Vorobeychik
Y. Kantaros
LM&Ro
58
20
0
18 Sep 2023
SMART-LLM: Smart Multi-Agent Robot Task Planning using Large Language Models
S. S. Kannan
Vishnunandan L. N. Venkatesh
Byung-Cheol Min
LLMAG
LM&Ro
40
102
0
18 Sep 2023
MindAgent: Emergent Gaming Interaction
Ran Gong
Qiuyuan Huang
Xiaojian Ma
Hoi Vo
Zane Durante
...
Zilong Zheng
Song-Chun Zhu
Demetri Terzopoulos
Fei-Fei Li
Jianfeng Gao
LM&Ro
107
65
0
18 Sep 2023
Prompt a Robot to Walk with Large Language Models
Yen-Jen Wang
Bike Zhang
Jianyu Chen
Koushil Sreenath
LM&Ro
LLMAG
32
49
0
18 Sep 2023
An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models
Yadong Lu
Chunyuan Li
Haotian Liu
Jianwei Yang
Jianfeng Gao
Yelong Shen
MLLM
105
30
0
18 Sep 2023
Speaker attribution in German parliamentary debates with QLoRA-adapted large language models
Tobias Bornheim
Niklas Grieger
Patrick Gustav Blaneck
Stephan Bialonski
19
2
0
18 Sep 2023
AMuRD: Annotated Arabic-English Receipt Dataset for Key Information Extraction and Classification
Abdelrahman Abdallah
Mahmoud Abdalla
Mohamed Elkasaby
Yasser Elbendary
Adam Jatowt
35
0
0
18 Sep 2023
The ParlaSent Multilingual Training Dataset for Sentiment Identification in Parliamentary Proceedings
Michal Mochtak
Peter Rupnik
Nikola Ljubesic
AILaw
23
4
0
18 Sep 2023
Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech
Chien-yu Huang
Ke-Han Lu
Shi Wang
Chi-Yuan Hsiao
Chun-Yi Kuan
...
Roshan S. Sharma
Shinji Watanabe
Bhiksha Ramakrishnan
Shady Shehata
Hung-yi Lee
AuLLM
36
53
0
18 Sep 2023
Pruning Large Language Models via Accuracy Predictor
Yupeng Ji
Yibo Cao
Jiu-si Liu
KELM
36
4
0
18 Sep 2023
LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models
Zecheng Tang
Chenfei Wu
Juntao Li
Nan Duan
3DV
28
9
0
18 Sep 2023
Are You Worthy of My Trust?: A Socioethical Perspective on the Impacts of Trustworthy AI Systems on the Environment and Human Society
Jamell Dacon
SILM
26
1
0
18 Sep 2023
Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM
Bochuan Cao
Yu Cao
Lu Lin
Jinghui Chen
AAML
36
135
0
18 Sep 2023
Augmenting text for spoken language understanding with Large Language Models
Roshan Sharma
Suyoun Kim
Daniel Lazar
Trang Le
Akshat Shrivastava
Kwanghoon Ahn
Piyush Kansal
Leda Sari
Ozlem Kalinli
Michael Seltzer
31
2
0
17 Sep 2023
Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles
Kung-Hsiang Huang
Philippe Laban
Alexander R. Fabbri
Prafulla Kumar Choubey
Chenyu You
Caiming Xiong
Chien-Sheng Wu
21
26
0
17 Sep 2023
Language models are susceptible to incorrect patient self-diagnosis in medical applications
Rojin Ziaei
Samuel Schmidgall
ELM
LM&MA
31
8
0
17 Sep 2023
OWL: A Large Language Model for IT Operations
Hongcheng Guo
Jian Yang
Jiaheng Liu
Liqun Yang
Linzheng Chai
...
Tieqiao Zheng
Liangfan Zheng
Bo Zhang
Ke Xu
Zhoujun Li
VLM
66
41
0
17 Sep 2023
Can Large Language Models Understand Real-World Complex Instructions?
Qi He
Jie Zeng
Wenhao Huang
Lina Chen
Jin Xiao
...
Shisong Chen
Yikai Zhang
Zhouhong Gu
Jiaqing Liang
Yanghua Xiao
ALM
LRM
ELM
98
52
0
17 Sep 2023
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference
Parsa Kavehzadeh
Mojtaba Valipour
Marzieh S. Tahaei
Ali Ghodsi
Boxing Chen
Mehdi Rezagholizadeh
35
6
0
16 Sep 2023
ODSum: New Benchmarks for Open Domain Multi-Document Summarization
Yijie Zhou
Kejian Shi
Wencai Zhang
Yixin Liu
Yilun Zhao
Arman Cohan
RALM
37
2
0
16 Sep 2023
Cross-Lingual Knowledge Editing in Large Language Models
Jiaan Wang
Yunlong Liang
Zengkui Sun
Yu Cao
Jiarong Xu
Fandong Meng
KELM
33
11
0
16 Sep 2023
Contextual Label Projection for Cross-Lingual Structured Prediction
Tanmay Parekh
I-Hung Hsu
Kuan-Hao Huang
Kai-Wei Chang
Nanyun Peng
27
4
0
16 Sep 2023
Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative Models
M. Kamruzzaman
M. M. I. Shovon
Gene Louis Kim
48
25
0
16 Sep 2023
X-PARADE: Cross-Lingual Textual Entailment and Information Divergence across Paragraphs
Juan Diego Rodriguez
Katrin Erk
Greg Durrett
48
4
0
16 Sep 2023
Rethinking Learning Rate Tuning in the Era of Large Language Models
Hongpeng Jin
Wenqi Wei
Xuyu Wang
Wenbin Zhang
Yanzhao Wu
18
11
0
16 Sep 2023
Mining Patents with Large Language Models Elucidates the Chemical Function Landscape
Clayton W. Kosonocky
Claus O. Wilke
E. Marcotte
A. D. Ellington
41
3
0
15 Sep 2023
Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation into Multicultural Proverbs and Sayings
Chen Cecilia Liu
Fajri Koto
Timothy Baldwin
Iryna Gurevych
LRM
32
18
0
15 Sep 2023
Chain-of-Thought Reasoning is a Policy Improvement Operator
Hugh Zhang
David C. Parkes
ReLM
LM&Ro
LRM
31
12
0
15 Sep 2023
When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets
Orion Weller
Kyle Lo
David Wadden
Dawn J Lawrie
Benjamin Van Durme
Arman Cohan
Luca Soldaini
49
19
0
15 Sep 2023
Scaling Laws for Sparsely-Connected Foundation Models
Elias Frantar
C. Riquelme
N. Houlsby
Dan Alistarh
Utku Evci
35
36
0
15 Sep 2023
Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite
Chan-Jan Hsu
Chang-Le Liu
Feng-Ting Liao
Po-Chun Hsu
Yi-Chang Chen
Da-Shan Shiu
ELM
ALM
22
12
0
15 Sep 2023
Segment Anything Model for Brain Tumor Segmentation
Peng Zhang
Yaping Wang
VLM
19
6
0
15 Sep 2023
Reward Engineering for Generating Semi-structured Explanation
Paul Burgess
Wray Buntine
Ehsan Shareghi
LRM
30
0
0
15 Sep 2023
Data Distribution Bottlenecks in Grounding Language Models to Knowledge Bases
Yiheng Shu
Zhiwei Yu
27
3
0
15 Sep 2023
How to Handle Different Types of Out-of-Distribution Scenarios in Computational Argumentation? A Comprehensive and Fine-Grained Field Study
Andreas Waldis
Yufang Hou
Iryna Gurevych
30
2
0
15 Sep 2023
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions
Reo Shimizu
Ryuichi Yamamoto
Masaya Kawamura
Yuma Shirahata
Hironori Doi
Tatsuya Komatsu
Kentaro Tachibana
DiffM
29
20
0
15 Sep 2023
Foundation Model Assisted Automatic Speech Emotion Recognition: Transcribing, Annotating, and Augmenting
Tiantian Feng
Shrikanth Narayanan
37
16
0
15 Sep 2023
Bias in News Summarization: Measures, Pitfalls and Corpora
Julius Steen
Katja Markert
28
4
0
14 Sep 2023
An Empirical Evaluation of Prompting Strategies for Large Language Models in Zero-Shot Clinical Natural Language Processing
Sonish Sivarajkumar
Mark Kelley
Alyssa Samolyk-Mazzanti
Shyam Visweswaran
Yanshan Wang
LM&MA
46
28
0
14 Sep 2023
Previous
1
2
3
...
147
148
149
...
154
155
156
Next