Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.13971
Cited By
LLaMA: Open and Efficient Foundation Language Models
27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"LLaMA: Open and Efficient Foundation Language Models"
50 / 2,593 papers shown
Title
LLM-QFL: Distilling Large Language Model for Quantum Federated Learning
Dev Gurung
Shiva Raj Pokhrel
FedML
211
0
0
24 May 2025
RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models
Yilang Zhang
Bingcong Li
G. Giannakis
248
1
0
24 May 2025
VISTA: Vision-Language Inference for Training-Free Stock Time-Series Analysis
Tina Khezresmaeilzadeh
Parsa Razmara
Seyedarmin Azizi
Mohammad Erfan Sadeghi
Erfan Baghaei Portaghloo
AI4TS
278
0
0
24 May 2025
Multi-Scale Manifold Alignment: A Unified Framework for Enhanced Explainability of Large Language Models
Yukun Zhang
Qi Dong
27
0
0
24 May 2025
ToDRE: Visual Token Pruning via Diversity and Task Awareness for Efficient Large Vision-Language Models
Duo Li
Zuhao Yang
Shijian Lu
VLM
98
0
0
24 May 2025
DialogXpert: Driving Intelligent and Emotion-Aware Conversations through Online Value-Based Reinforcement Learning with LLM Priors
Tazeek Bin Abdur Rakib
Ambuj Mehrish
Lay-Ki Soon
Wern Han Lim
Soujanya Poria
OffRL
60
0
0
23 May 2025
Two-Stage Regularization-Based Structured Pruning for LLMs
Mingkuan Feng
Jinyang Wu
Siyuan Liu
Shuai Zhang
Hongjian Fang
Ruihan Jin
Feihu Che
Pengpeng Shao
Zhengqi Wen
48
0
0
23 May 2025
DetailFusion: A Dual-branch Framework with Detail Enhancement for Composed Image Retrieval
Yuxin Yang
Yinan Zhou
Yuxin Chen
Ziqi Zhang
Zongyang Ma
...
Bing Li
Lin Song
Jun Gao
Peng Li
Weiming Hu
199
0
0
23 May 2025
Fann or Flop: A Multigenre, Multiera Benchmark for Arabic Poetry Understanding in LLMs
Wafa Alghallabi
Ritesh Thawkar
Sara Ghaboura
Ketan More
Omkar Thawakar
Hisham Cholakkal
Salman Khan
Rao Muhammad Anwer
156
0
0
23 May 2025
Understanding Gated Neurons in Transformers from Their Input-Output Functionality
Sebastian Gerstner
Hinrich Schütze
MILM
FAtt
219
0
0
23 May 2025
LLM-BSCVM: An LLM-Based Blockchain Smart Contract Vulnerability Management Framework
Yanli Jin
Chunpei Li
Peng Fan
Peng Liu
Xianxian Li
Chen Liu
Wangjie Qiu
41
0
0
23 May 2025
BLAST: Balanced Sampling Time Series Corpus for Universal Forecasting Models
Zezhi Shao
Yujie Li
Fei Wang
Chengqing Yu
Yisong Fu
Tangwen Qian
Bin Xu
Boyu Diao
Yongjun Xu
Xueqi Cheng
AI4TS
79
0
0
23 May 2025
How Can I Publish My LLM Benchmark Without Giving the True Answers Away?
Takashi Ishida
Thanawat Lodkaew
Ikko Yamane
221
0
0
23 May 2025
CONCORD: Concept-Informed Diffusion for Dataset Distillation
Jianyang Gu
Haonan Wang
Ruoxi Jia
Saeed Vahidian
Vyacheslav Kungurtsev
Wei Jiang
Yiran Chen
DiffM
DD
922
0
0
23 May 2025
Towards Practical Defect-Focused Automated Code Review
Junyi Lu
Lili Jiang
Xiaojia Li
Jianbing Fang
Fengjun Zhang
Li Yang
Chun Zuo
208
0
0
23 May 2025
SLearnLLM: A Self-Learning Framework for Efficient Domain-Specific Adaptation of Large Language Models
Xiang Liu
Zhaoxiang Liu
Peng Wang
Kohou Wang
Huan Hu
Kai Wang
Shiguo Lian
201
0
0
23 May 2025
Rehabilitation Exercise Quality Assessment and Feedback Generation Using Large Language Models with Prompt Engineering
Jessica Tang
Ali Abedi
T. Colella
Shehroz S. Khan
LM&MA
32
0
0
23 May 2025
Simultaneous Modeling of Protein Conformation and Dynamics via Autoregression
Yuning Shen
Lihao Wang
Huizhuo Yuan
Yan Wang
B. Yang
Quanquan Gu
DiffM
AI4CE
173
0
0
23 May 2025
Scaling Recurrent Neural Networks to a Billion Parameters with Zero-Order Optimization
Francois Chaubard
Mykel J. Kochenderfer
MQ
AI4CE
190
0
0
23 May 2025
LLM-Based Emulation of the Radio Resource Control Layer: Towards AI-Native RAN Protocols
Ziming Liu
Bryan Liu
Alvaro Valcarce
Xiaoli Chu
246
1
0
22 May 2025
MARché: Fast Masked Autoregressive Image Generation with Cache-Aware Attention
Chaoyi Jiang
Sungwoo Kim
Lei Gao
Hossein Entezari Zarch
Won Woo Ro
Murali Annavaram
24
0
0
22 May 2025
AdamS: Momentum Itself Can Be A Normalizer for LLM Pretraining and Post-training
Huishuai Zhang
Bohan Wang
Luoxin Chen
ODL
232
0
0
22 May 2025
RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs
Meng-Hao Guo
Xuanyu Chu
Qianrui Yang
Zhe-Han Mo
Yiqing Shen
...
Kiyohiro Nakayama
Zhengyang Geng
Houwen Peng
Han Hu
Shi-Min Hu
LRM
197
0
0
22 May 2025
SC4ANM: Identifying Optimal Section Combinations for Automated Novelty Prediction in Academic Papers
Wenqing Wu
Chengzhi Zhang
Tong Bao
Yi Zhao
219
1
0
22 May 2025
From Surveys to Narratives: Rethinking Cultural Value Adaptation in LLMs
Muhammad Farid Adilazuarda
Chen Cecilia Liu
Iryna Gurevych
Alham Fikri Aji
219
0
0
22 May 2025
Understanding Differential Transformer Unchains Pretrained Self-Attentions
Chaerin Kong
Jiho Jang
Nojun Kwak
88
0
0
22 May 2025
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning
Zebin You
Shen Nie
Xiaolu Zhang
Jun Hu
Jun Zhou
Zhiwu Lu
J. Wen
Chongxuan Li
MLLM
VLM
112
2
0
22 May 2025
LLaMAs Have Feelings Too: Unveiling Sentiment and Emotion Representations in LLaMA Models Through Probing
Dario Di Palma
Alessandro De Bellis
Giovanni Servedio
Vito Walter Anelli
Fedelucio Narducci
Tommaso Di Noia
MILM
76
0
0
22 May 2025
Panoptic Captioning: Seeking An Equivalency Bridge for Image and Text
Kun-Yu Lin
Hongjun Wang
Weining Ren
Kai Han
294
0
0
22 May 2025
FoMoH: A clinically meaningful foundation model evaluation for structured electronic health records
Chao Pang
Vincent Jeanselme
Young Sang Choi
Xinzhuo Jiang
Zilin Jing
...
Yuta Kobayashi
Yanwei Li
Florent Pollet
Karthik Natarajan
Shalmali Joshi
233
0
0
22 May 2025
Content Moderation in TV Search: Balancing Policy Compliance, Relevance, and User Experience
Adeep Hande
Kishorekumar Sundararajan
Sardar Hamidian
Ferhan Ture
20
0
0
22 May 2025
Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model
Mehrdad Ghassabi
Pedram Rostami
Hamidreza Baradaran Kashani
Amirhossein Poursina
Zahra Kazemi
Milad Tavakoli
LM&MA
191
0
0
21 May 2025
Large Language models for Time Series Analysis: Techniques, Applications, and Challenges
Feifei Shi
Xueyan Yin
Kang Wang
Wanyu Tu
Qifu Sun
Huansheng Ning
AI4TS
22
0
0
21 May 2025
Diagnosing our datasets: How does my language model learn clinical information?
Furong Jia
David Sontag
Monica Agrawal
LM&MA
212
1
0
21 May 2025
UWSAM: Segment Anything Model Guided Underwater Instance Segmentation and A Large-scale Benchmark Dataset
Hua Li
Shijie Lian
Zhiyuan Li
Runmin Cong
Sam Kwong
VLM
81
0
0
21 May 2025
A Linear Approach to Data Poisoning
Diego Granziol
Donald Flynn
AAML
192
0
0
21 May 2025
Small Language Models in the Real World: Insights from Industrial Text Classification
Lujun Li
Lama Sleem
Niccolo Gentile
Geoffrey Nichil
Radu State
LLMAG
218
0
0
21 May 2025
From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning
Chen Shani
Dan Jurafsky
Yann LeCun
Ravid Shwartz-Ziv
219
0
0
21 May 2025
Object-Focus Actor for Data-efficient Robot Generalization Dexterous Manipulation
Yihang Li
Tianle Zhang
Xuelong Wei
Jiayi Li
Lin Zhao
Dongchi Huang
Zhirui Fang
Minhua Zheng
Wenjun Dai
Xiaodong He
71
0
0
21 May 2025
HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving
Zhiwen Chen
Bo Leng
Zhuoren Li
Hanming Deng
Guizhe Jin
Ran Yu
Huanxi Wen
231
0
0
21 May 2025
DISCO Balances the Scales: Adaptive Domain- and Difficulty-Aware Reinforcement Learning on Imbalanced Data
Yuhang Zhou
Jing Zhu
Shengyi Qian
Zhuokai Zhao
Xiyao Wang
Xiaoyu Liu
Ming Li
Paiheng Xu
Wei Ai
Furong Huang
95
1
0
21 May 2025
Exploring Jailbreak Attacks on LLMs through Intent Concealment and Diversion
Tiehan Cui
Yanxu Mao
Peipei Liu
Congying Liu
Datao You
AAML
61
1
0
20 May 2025
ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations
Xuecheng Wu
Jiaxing Liu
Danlei Huang
Xiaoyu Li
Yifan Wang
Chen Chen
Liya Ma
Xuezhi Cao
Junxiao Xue
LRM
110
0
0
20 May 2025
PlanGPT-VL: Enhancing Urban Planning with Domain-Specific Vision-Language Models
He Zhu
Junyou Su
Minxin Chen
Wen Wang
Yijie Deng
Guanhua Chen
Wenjia Zhang
197
0
0
20 May 2025
Fragments to Facts: Partial-Information Fragment Inference from LLMs
Lucas Rosenblatt
Bin Han
Robert Wolfe
Bill Howe
AAML
61
0
0
20 May 2025
Large Language Models Implicitly Learn to See and Hear Just By Reading
Prateek Verma
Mert Pilanci
196
0
0
20 May 2025
Exploring Causes of Representational Similarity in Machine Learning Models
Zeyu Michael Li
Hung Anh Vu
Damilola Awofisayo
Emily Wenger
CML
252
0
0
20 May 2025
Do Language Models Use Their Depth Efficiently?
Róbert Csordás
Christopher D. Manning
Christopher Potts
208
2
0
20 May 2025
Policy Contrastive Decoding for Robotic Foundation Models
Shihan Wu
Ji Zhang
Xu Luo
Junlin Xie
Jingkuan Song
Heng Tao Shen
Lianli Gao
OffRL
268
0
0
19 May 2025
A3 : an Analytical Low-Rank Approximation Framework for Attention
Jeffrey T. H. Wong
Cheng Zhang
Xinye Cao
Pedro Gimenes
George A. Constantinides
Wayne Luk
Yiren Zhao
OffRL
MQ
133
1
0
19 May 2025
Previous
1
2
3
...
6
7
8
...
50
51
52
Next