Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.09288
Cited By
Llama 2: Open Foundation and Fine-Tuned Chat Models
18 July 2023
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
Yasmine Babaei
Nikolay Bashlykov
Soumya Batra
Prajjwal Bhargava
Shruti Bhosale
Daniel M. Bikel
Lukas Blecher
Cristian Canton Ferrer
Moya Chen
Guillem Cucurull
David Esiobu
Jude Fernandes
Jeremy Fu
Wenyin Fu
Brian Fuller
Cynthia Gao
Vedanuj Goswami
Naman Goyal
Anthony Hartshorn
Saghar Hosseini
Rui Hou
Hakan Inan
Marcin Kardas
Viktor Kerkez
Madian Khabsa
Isabel Kloumann
Artem Korenev
Punit Singh Koura
Marie-Anne Lachaux
Thibaut Lavril
Jenya Lee
Diana Liskovich
Yinghai Lu
Yuning Mao
Xavier Martinet
Todor Mihaylov
Pushkar Mishra
Igor Molybog
Yixin Nie
Andrew Poulton
Jeremy Reizenstein
Rashi Rungta
Kalyan Saladi
Alan Schelten
Ruan Silva
Eric Michael Smith
R. Subramanian
Xia Tan
Binh Tang
Ross Taylor
Adina Williams
Jian Xiang Kuan
Puxin Xu
Zhengxu Yan
Iliyan Zarov
Yuchen Zhang
Angela Fan
Melanie Kambadur
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Llama 2: Open Foundation and Fine-Tuned Chat Models"
50 / 7,772 papers shown
Title
OctoPack: Instruction Tuning Code Large Language Models
Niklas Muennighoff
Qian Liu
A. Zebaze
Qinkai Zheng
Binyuan Hui
Terry Yue Zhuo
Swayam Singh
Xiangru Tang
Leandro von Werra
Shayne Longpre
VLM
ALM
71
119
0
14 Aug 2023
Position: Key Claims in LLM Research Have a Long Tail of Footnotes
Anna Rogers
A. Luccioni
53
19
0
14 Aug 2023
#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models
Keming Lu
Hongyi Yuan
Zheng Yuan
Runji Lin
Junyang Lin
Chuanqi Tan
Chang Zhou
Jingren Zhou
ALM
LRM
35
65
0
14 Aug 2023
Approximating Human-Like Few-shot Learning with GPT-based Compression
C.-Y. Huang
Yuqing Xie
Zhiying Jiang
Jimmy J. Lin
Ming Li
30
9
0
14 Aug 2023
Building Trust in Conversational AI: A Comprehensive Review and Solution Architecture for Explainable, Privacy-Aware Systems using LLMs and Knowledge Graph
Ahtsham Zafar
V. Parthasarathy
Chan Le Van
Saad Shahid
A. khan
Arsalan Shahid
16
13
0
13 Aug 2023
Token-Scaled Logit Distillation for Ternary Weight Generative Language Models
Minsoo Kim
Sihwa Lee
Jangwhan Lee
S. Hong
Duhyeuk Chang
Wonyong Sung
Jungwook Choi
MQ
24
14
0
13 Aug 2023
MT4CrossOIE: Multi-stage Tuning for Cross-lingual Open Information Extraction
Tongliang Li
Zixiang Wang
Linzheng Chai
Jian Yang
Jiaqi Bai
...
Jiaheng Liu
Hongcheng Guo
Liqun Yang
Hebboul Zine el-abidine
Zhoujun Li
41
3
0
12 Aug 2023
Three Ways of Using Large Language Models to Evaluate Chat
Ondvrej Plátek
Vojtvech Hudevcek
Patrícia Schmidtová
Mateusz Lango
Ondrej Dusek
ALM
19
6
0
12 Aug 2023
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
Youliang Yuan
Wenxiang Jiao
Wenxuan Wang
Jen-tse Huang
Pinjia He
Shuming Shi
Zhaopeng Tu
SILM
76
234
0
12 Aug 2023
Detecting and Preventing Hallucinations in Large Vision Language Models
Anisha Gunjal
Jihan Yin
Erhan Bas
MLLM
VLM
36
156
0
11 Aug 2023
Self-Alignment with Instruction Backtranslation
Xian Li
Ping Yu
Chunting Zhou
Timo Schick
Omer Levy
Luke Zettlemoyer
Jason Weston
M. Lewis
SyDa
29
124
0
11 Aug 2023
Composable Function-preserving Expansions for Transformer Architectures
Andrea Gesmundo
Kaitlin Maile
AI4CE
40
8
0
11 Aug 2023
BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents
Zhiwei Liu
Weiran Yao
Jianguo Zhang
Le Xue
Shelby Heinecke
...
Ran Xu
P. Mùi
Haiquan Wang
Caiming Xiong
Silvio Savarese
LLMAG
39
83
0
11 Aug 2023
Metacognitive Prompting Improves Understanding in Large Language Models
Yuqing Wang
Yun Zhao
ReLM
LRM
44
25
0
10 Aug 2023
TBIN: Modeling Long Textual Behavior Data for CTR Prediction
Shuwei Chen
Xiang Li
Jian Dong
Jin Zhang
Yongkang Wang
Xingxing Wang
20
3
0
09 Aug 2023
In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning
Xiaochuang Han
25
19
0
08 Aug 2023
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions
Juncheng Li
Kaihang Pan
Zhiqi Ge
Minghe Gao
Wei Ji
Wenqiao Zhang
Tat-Seng Chua
Siliang Tang
Hanwang Zhang
Yueting Zhuang
MLLM
35
68
0
08 Aug 2023
Simple synthetic data reduces sycophancy in large language models
Jerry W. Wei
Da Huang
Yifeng Lu
Denny Zhou
Quoc V. Le
33
69
0
07 Aug 2023
Training BERT Models to Carry Over a Coding System Developed on One Corpus to Another
Dalma Galambos
Pál Zsámboki
11
0
0
07 Aug 2023
AgentBench: Evaluating LLMs as Agents
Xiao Liu
Hao Yu
Hanchen Zhang
Yifan Xu
Xuanyu Lei
...
Yu-Chuan Su
Huan Sun
Minlie Huang
Yuxiao Dong
Jie Tang
ELM
LLMAG
37
261
0
07 Aug 2023
Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench
Jen-tse Huang
Man Ho Adrian Lam
E. Li
Shujie Ren
Wenxuan Wang
Wenxiang Jiao
Zhaopeng Tu
Michael R. Lyu
51
40
0
07 Aug 2023
SciGraphQA: A Large-Scale Synthetic Multi-Turn Question-Answering Dataset for Scientific Graphs
Sheng Li
Nima Tajbakhsh
MLLM
13
48
0
07 Aug 2023
LoRA-FA: Memory-efficient Low-rank Adaptation for Large Language Models Fine-tuning
Longteng Zhang
Lin Zhang
S. Shi
Xiangxiang Chu
Bo-wen Li
AI4CE
18
92
0
07 Aug 2023
UniversalNER: Targeted Distillation from Large Language Models for Open Named Entity Recognition
Wenxuan Zhou
Sheng Zhang
Yu Gu
Muhao Chen
Hoifung Poon
30
59
0
07 Aug 2023
TARJAMAT: Evaluation of Bard and ChatGPT on Machine Translation of Ten Arabic Varieties
Karima Kadaoui
Samar Magdy
Abdul Waheed
Md. Tawkat Islam Khondaker
Ahmed Oumar El-Shangiti
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
35
20
0
06 Aug 2023
Pre-Trained Large Language Models for Industrial Control
Lei Song
Chuheng Zhang
Li Zhao
Jiang Bian
LM&Ro
AI4CE
32
12
0
06 Aug 2023
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities
Weihao Yu
Zhengyuan Yang
Linjie Li
Jianfeng Wang
Kevin Qinghong Lin
Zicheng Liu
Xinchao Wang
Lijuan Wang
MLLM
60
615
0
04 Aug 2023
A Survey of Spanish Clinical Language Models
Guillem García Subies
Á. Jiménez
Paloma Martínez
LM&MA
ELM
LRM
29
0
0
04 Aug 2023
Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
Zheng Yuan
Hongyi Yuan
Cheng Li
Guanting Dong
Keming Lu
Chuanqi Tan
Chang Zhou
Jingren Zhou
LRM
ALM
33
167
0
03 Aug 2023
Local Large Language Models for Complex Structured Medical Tasks
V. Bumgardner
Aaron D. Mullen
Samuel E. Armstrong
Caylin D. Hickey
Jeffrey A. Talbert
36
5
0
03 Aug 2023
Flows: Building Blocks of Reasoning and Collaborating AI
Martin Josifoski
Lars Klein
Maxime Peyrard
Nicolas Mario Baldwin
Yifei Li
...
Julian Paul Schnitzler
Yuxing Yao
Jiheng Wei
Debjit Paul
Robert West
AI4CE
41
25
0
02 Aug 2023
XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models
Paul Röttger
Hannah Rose Kirk
Bertie Vidgen
Giuseppe Attanasio
Federico Bianchi
Dirk Hovy
ALM
ELM
AILaw
27
127
0
02 Aug 2023
Do Multilingual Language Models Think Better in English?
Julen Etxaniz
Gorka Azkune
Aitor Soroa Etxabe
Oier López de Lacalle
Mikel Artetxe
LRM
38
58
0
02 Aug 2023
SurveyLM: A platform to explore emerging value perspectives in augmented language models' behaviors
Steve J. Bickley
H. F. Chan
Bang Dao
B. Torgler
Son Tran
19
1
0
01 Aug 2023
Advancing Beyond Identification: Multi-bit Watermark for Large Language Models
Kiyoon Yoo
Wonhyuk Ahn
Nojun Kwak
WaLM
30
17
0
01 Aug 2023
Pretrained deep models outperform GBDTs in Learning-To-Rank under label scarcity
Charlie Hou
K. K. Thekumparampil
Michael Shavlovsky
Giulia Fanti
Yesh Dattatreya
Sujay Sanghavi
LMTD
21
1
0
31 Jul 2023
Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges
Giorgio Franceschelli
Mirco Musolesi
AI4CE
40
20
0
31 Jul 2023
Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection
Jun Yan
Vikas Yadav
Shiyang Li
Lichang Chen
Zheng Tang
Hai Wang
Vijay Srinivasan
Xiang Ren
Hongxia Jin
SILM
28
82
0
31 Jul 2023
Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering
Vaibhav Adlakha
Parishad BehnamGhader
Xing Han Lù
Nicholas Meade
Siva Reddy
30
120
0
31 Jul 2023
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs
Yujia Qin
Shi Liang
Yining Ye
Kunlun Zhu
Lan Yan
...
Jie Zhou
Mark B. Gerstein
Dahai Li
Zhiyuan Liu
Maosong Sun
CLL
ALM
LLMAG
ELM
LM&MA
87
628
0
31 Jul 2023
MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
Enxin Song
Wenhao Chai
Guanhong Wang
Yucheng Zhang
Haoyang Zhou
...
Tianbo Ye
Yanting Zhang
Yang Lu
Lei Li
Gaoang Wang
VLM
MLLM
22
264
0
31 Jul 2023
UniAP: Unifying Inter- and Intra-Layer Automatic Parallelism by Mixed Integer Quadratic Programming
Hao Lin
Ke Wu
Jie Li
Jun Yu Li
Wu-Jun Li
39
1
0
31 Jul 2023
LaFiCMIL: Rethinking Large File Classification from the Perspective of Correlated Multiple Instance Learning
Tiezhu Sun
Weiguo Pian
N. Daoudi
Kevin Allix
Tegawende F. Bissyande
Jacques Klein
31
1
0
30 Jul 2023
RoCar: A Relationship Network-based Evaluation Method to Large Language Models
Ming Wang
Wenfang Wu
Chongyun Gao
Daling Wang
Shi Feng
Yifei Zhang
22
0
0
29 Jul 2023
CHATREPORT: Democratizing Sustainability Disclosure Analysis through LLM-based Tools
Jingwei Ni
J. Bingler
Chiara Colesanti-Senni
Mathias Kraus
Glen Gostlow
...
Qian Wang
Nicolas Webersinke
Tobias Wekhof
Ting Yu
Markus Leippold
37
29
0
28 Jul 2023
Uncertainty in Natural Language Generation: From Theory to Applications
Joris Baan
Nico Daheim
Evgenia Ilia
Dennis Ulmer
Haau-Sing Li
Raquel Fernández
Barbara Plank
Rico Sennrich
Chrysoula Zerva
Wilker Aziz
UQLM
34
40
0
28 Jul 2023
Med-HALT: Medical Domain Hallucination Test for Large Language Models
Ankit Pal
Logesh Kumar Umapathi
Malaikannan Sankarasubbu
HILM
LM&MA
VLM
36
128
0
28 Jul 2023
TrafficSafetyGPT: Tuning a Pre-trained Large Language Model to a Domain-Specific Expert in Transportation Safety
Ou Zheng
Mohamed Abdel-Aty
Dongdong Wang
Chenzhu Wang
Shengxuan Ding
21
20
0
28 Jul 2023
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
Stephen Casper
Xander Davies
Claudia Shi
T. Gilbert
Jérémy Scheurer
...
Erdem Biyik
Anca Dragan
David M. Krueger
Dorsa Sadigh
Dylan Hadfield-Menell
ALM
OffRL
52
473
0
27 Jul 2023
Universal and Transferable Adversarial Attacks on Aligned Language Models
Andy Zou
Zifan Wang
Nicholas Carlini
Milad Nasr
J. Zico Kolter
Matt Fredrikson
94
1,278
0
27 Jul 2023
Previous
1
2
3
...
151
152
153
154
155
156
Next