Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.09288
Cited By
Llama 2: Open Foundation and Fine-Tuned Chat Models
18 July 2023
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
Yasmine Babaei
Nikolay Bashlykov
Soumya Batra
Prajjwal Bhargava
Shruti Bhosale
Daniel M. Bikel
Lukas Blecher
Cristian Canton Ferrer
Moya Chen
Guillem Cucurull
David Esiobu
Jude Fernandes
Jeremy Fu
Wenyin Fu
Brian Fuller
Cynthia Gao
Vedanuj Goswami
Naman Goyal
Anthony Hartshorn
Saghar Hosseini
Rui Hou
Hakan Inan
Marcin Kardas
Viktor Kerkez
Madian Khabsa
Isabel Kloumann
Artem Korenev
Punit Singh Koura
Marie-Anne Lachaux
Thibaut Lavril
Jenya Lee
Diana Liskovich
Yinghai Lu
Yuning Mao
Xavier Martinet
Todor Mihaylov
Pushkar Mishra
Igor Molybog
Yixin Nie
Andrew Poulton
Jeremy Reizenstein
Rashi Rungta
Kalyan Saladi
Alan Schelten
Ruan Silva
Eric Michael Smith
R. Subramanian
Xia Tan
Binh Tang
Ross Taylor
Adina Williams
Jian Xiang Kuan
Puxin Xu
Zhengxu Yan
Iliyan Zarov
Yuchen Zhang
Angela Fan
Melanie Kambadur
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Llama 2: Open Foundation and Fine-Tuned Chat Models"
50 / 7,751 papers shown
Title
Can AI Master Construction Management (CM)? Benchmarking State-of-the-Art Large Language Models on CM Certification Exams
Ruoxin Xiong
Yanyu Wang
Suat Gunhan
Yimin Zhu
Charles Berryman
ELM
26
0
0
04 Apr 2025
How Social is It? A Benchmark for LLMs' Capabilities in Multi-user Multi-turn Social Agent Tasks
Yusen Wu
Junwu Xiong
Xiaotie Deng
LLMAG
38
0
0
04 Apr 2025
Language Models Are Implicitly Continuous
Samuele Marro
Davide Evangelista
X. A. Huang
Emanuele La Malfa
M. Lombardi
Michael Wooldridge
33
0
0
04 Apr 2025
Beyond Progress Measures: Theoretical Insights into the Mechanism of Grokking
Zihan Gu
Ruoyu Chen
Hua Zhang
Yue Hu
Xiaochun Cao
39
0
0
04 Apr 2025
LLM Library Learning Fails: A LEGO-Prover Case Study
Ian Berlot-Attwell
Frank Rudzicz
Xujie Si
ELM
39
0
0
03 Apr 2025
Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models
Mateusz Pach
Shyamgopal Karthik
Quentin Bouniot
Serge Belongie
Zeynep Akata
VLM
69
0
0
03 Apr 2025
DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers
Max Müller-Eberstein
Mike Zhang
Elisa Bassignana
Peter Brunsgaard Trolle
Rob van der Goot
ELM
39
0
0
03 Apr 2025
Beyond Accuracy: The Role of Calibration in Self-Improving Large Language Models
Liangjie Huang
Dawei Li
Huan Liu
Lu Cheng
LRM
34
0
0
03 Apr 2025
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning
Xianwei Zhuang
Yuxin Xie
Yufan Deng
Dongchao Yang
Liming Liang
Jinghan Ru
Yuguo Yin
Yuexian Zou
71
2
0
03 Apr 2025
Language Models Guidance with Multi-Aspect-Cueing: A Case Study for Competitor Analysis
Amir Hadifar
Christopher Ochs
Arjan Van Ewijk
ELM
50
0
0
03 Apr 2025
How Post-Training Reshapes LLMs: A Mechanistic View on Knowledge, Truthfulness, Refusal, and Confidence
Hongzhe Du
Weikai Li
Min Cai
Karim Saraipour
Zimin Zhang
Himabindu Lakkaraju
Yizhou Sun
Shichang Zhang
KELM
56
0
0
03 Apr 2025
LightDefense: A Lightweight Uncertainty-Driven Defense against Jailbreaks via Shifted Token Distribution
Zhuoran Yang
Jie Peng
Zhen Tan
Tianlong Chen
Yanyong Zhang
AAML
44
0
0
02 Apr 2025
Efficient Federated Learning Tiny Language Models for Mobile Network Feature Prediction
Daniel Becking
Ingo Friese
Karsten Müller
Thomas Buchholz
Mandy Galkow-Schneider
Wojciech Samek
D. Marpe
36
0
0
02 Apr 2025
Leveraging Modality Tags for Enhanced Cross-Modal Video Retrieval
A. Fragomeni
Dima Damen
Michael Wray
33
0
0
02 Apr 2025
Investigating and Scaling up Code-Switching for Multilingual Language Model Pre-Training
Zhijun Wang
Jiahuan Li
Hao Zhou
Rongxiang Weng
J. Wang
Xin Huang
Xue Han
Junlan Feng
Chao Deng
Shujian Huang
LRM
50
1
0
02 Apr 2025
Testing Low-Resource Language Support in LLMs Using Language Proficiency Exams: the Case of Luxembourgish
Cedric Lothritz
Jordi Cabot
33
0
0
02 Apr 2025
Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning
Ke Jiang
Wen Jiang
Y. Li
Xiaoyang Tan
OffRL
43
0
0
02 Apr 2025
Rethinking industrial artificial intelligence: a unified foundation framework
Jay Lee
Hanqi Su
AI4CE
41
1
0
02 Apr 2025
Graphically Speaking: Unmasking Abuse in Social Media with Conversation Insights
Célia Nouri
Jean-Philippe Cointet
Chloé Clavel
37
0
0
02 Apr 2025
Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations
Chongjie Si
Zhiyi Shi
Xuehui Wang
Yichen Xiao
Xiaokang Yang
Wei-Ming Shen
AI4CE
65
0
0
01 Apr 2025
The Illusionist's Prompt: Exposing the Factual Vulnerabilities of Large Language Models with Linguistic Nuances
Yining Wang
Yixuan Wang
Xi Li
Mi Zhang
Geng Hong
Min Yang
AAML
HILM
67
0
0
01 Apr 2025
PRISM-0: A Predicate-Rich Scene Graph Generation Framework for Zero-Shot Open-Vocabulary Tasks
Abdelrahman Elskhawy
Mengze Li
Nassir Navab
Benjamin Busam
VLM
54
0
0
01 Apr 2025
MaLAware: Automating the Comprehension of Malicious Software Behaviours using Large Language Models (LLMs)
Bikash Saha
Nanda Rani
Sandeep K. Shukla
36
0
0
01 Apr 2025
Exposing the Ghost in the Transformer: Abnormal Detection for Large Language Models via Hidden State Forensics
Shide Zhou
Kaidi Wang
Ling Shi
Hairu Wang
47
0
0
01 Apr 2025
Synthesized Annotation Guidelines are Knowledge-Lite Boosters for Clinical Information Extraction
Enshuo Hsu
Martin Ugbala
Krishna Kumar Kookal
Zouaidi Kawtar
Nicholas L. Rider
Muhammad F. Walji
Kirk Roberts
29
0
0
01 Apr 2025
MetaLoRA: Tensor-Enhanced Adaptive Low-Rank Fine-tuning
Maolin Wang
Xiangyu Zhao
AI4CE
45
0
0
01 Apr 2025
Making Large Language Models Better Reasoners with Orchestrated Streaming Experiences
Xiangyang Liu
Junliang He
Xipeng Qiu
ReLM
LRM
65
0
0
01 Apr 2025
Self-Routing RAG: Binding Selective Retrieval with Knowledge Verbalization
Di Wu
Jia-Chen Gu
Kai-Wei Chang
Nanyun Peng
34
0
0
01 Apr 2025
Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models
José P. Pombal
Nuno M. Guerreiro
Ricardo Rei
André F. T. Martins
ALM
75
0
0
01 Apr 2025
Fair Dynamic Spectrum Access via Fully Decentralized Multi-Agent Reinforcement Learning
Yubo Zhang
Pedro Botelho
Trevor Gordon
Gil Zussman
I. Kadota
55
0
0
31 Mar 2025
Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs
Lucas Ventura
Antoine Yang
Cordelia Schmid
Gül Varol
39
0
0
31 Mar 2025
WebMap -- Large Language Model-assisted Semantic Link Induction in the Web
Shiraj Pokharel
Georg P. Roßrucker
Mario M. Kubek
KELM
23
0
0
31 Mar 2025
SQuat: Subspace-orthogonal KV Cache Quantization
Hao Wang
Ligong Han
Kai Xu
Akash Srivastava
MQ
51
0
0
31 Mar 2025
STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?
Heng Chang
Yuyao Zhang
Tao Lin
Xiangrui Liu
Wenxiao Cai
Zheng Liu
Bo Zhao
LRM
58
1
0
31 Mar 2025
Agents
Under
Siege
\textit{Agents Under Siege}
Agents Under Siege
: Breaking Pragmatic Multi-Agent LLM Systems with Optimized Prompt Attacks
Rana Muhammad Shahroz Khan
Zhen Tan
Sukwon Yun
Charles Flemming
Tianlong Chen
AAML
LLMAG
Presented at
ResearchTrend Connect | LLMAG
on
23 Apr 2025
96
3
0
31 Mar 2025
Text2Tracks: Prompt-based Music Recommendation via Generative Retrieval
Enrico Palumbo
Gustavo Penha
Andreas Damianou
José Luis Redondo García
Timothy Christopher Heath
Alice Wang
Hugues Bouchard
M. Lalmas
54
0
0
31 Mar 2025
Get the Agents Drunk: Memory Perturbations in Autonomous Agent-based Recommender Systems
Shiyi Yang
Z. Hu
Chen Wang
Tong Yu
Xiwei Xu
Liming Zhu
Lina Yao
AAML
42
0
0
31 Mar 2025
Pay More Attention to the Robustness of Prompt for Instruction Data Mining
Qiang Wang
Dawei Feng
Xu Zhang
Ao Shen
Yang Xu
Bo Ding
H. Wang
AAML
48
0
0
31 Mar 2025
CONGRAD:Conflicting Gradient Filtering for Multilingual Preference Alignment
Jiangnan Li
Thuy-Trang Vu
Christian Herold
Amirhossein Tebbifakhr
Shahram Khadivi
Gholamreza Haffari
33
0
0
31 Mar 2025
LLMs for Explainable AI: A Comprehensive Survey
Ahsan Bilal
David Ebert
Beiyu Lin
72
1
0
31 Mar 2025
A Systematic Evaluation of LLM Strategies for Mental Health Text Analysis: Fine-tuning vs. Prompt Engineering vs. RAG
Arshia Kermani
Veronica Perez-Rosas
Vangelis Metsis
AI4MH
42
0
0
31 Mar 2025
WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization
I. Gevers
Victor De Marez
Luna De Bruyne
Walter Daelemans
37
0
0
31 Mar 2025
Model Hemorrhage and the Robustness Limits of Large Language Models
Ziyang Ma
Zehan Li
L. Zhang
Gui-Song Xia
Bo Du
Liangpei Zhang
Dacheng Tao
59
0
0
31 Mar 2025
Beyond the Reported Cutoff: Where Large Language Models Fall Short on Financial Knowledge
Agam Shah
Liqin Ye
Sebastian Jaskowski
Wei Xu
S. Chava
AIFin
33
0
0
30 Mar 2025
Order Independence With Finetuning
Katrina Brown
Reid McIlroy
35
0
0
30 Mar 2025
Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages
Xabier de Zuazo
Eva Navas
Ibon Saratxaga
Inma Hernáez Rioja
42
0
0
30 Mar 2025
LaViC: Adapting Large Vision-Language Models to Visually-Aware Conversational Recommendation
Hyunsik Jeon
Satoshi Koide
Yu Wang
Zhankui He
Julian McAuley
VLM
49
0
0
30 Mar 2025
Cocktail: Chunk-Adaptive Mixed-Precision Quantization for Long-Context LLM Inference
Wei Tao
Bin Zhang
Xiaoyang Qu
Jiguang Wan
Jianzong Wang
45
1
0
30 Mar 2025
Not All LoRA Parameters Are Essential: Insights on Inference Necessity
Guanhua Chen
Yutong Yao
Ci-Jun Gao
Lidia S. Chao
Feng Wan
Derek F. Wong
39
0
0
30 Mar 2025
Mixture of Routers
Jia-Chen Zhang
Yu-Jie Xiong
Xi-He Qiu
Chun-Ming Xia
Fei Dai
MoE
73
0
0
30 Mar 2025
Previous
1
2
3
...
6
7
8
...
154
155
156
Next