Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.13971
Cited By
LLaMA: Open and Efficient Foundation Language Models
27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LLaMA: Open and Efficient Foundation Language Models"
50 / 4,439 papers shown
Title
An Empirical Study of Many-to-Many Summarization with Large Language Models
Jiaan Wang
Fandong Meng
Zengkui Sun
Yunlong Liang
Yuxuan Cao
Jiarong Xu
Haoxiang Shi
Jie Zhou
5
0
0
19 May 2025
CompBench: Benchmarking Complex Instruction-guided Image Editing
Bohan Jia
Wenxuan Huang
Yuntian Tang
Junbo Qiao
Jincheng Liao
...
Lin Chen
Fei Zhao
Zihan Wang
Yuan Xie
Shaohui Lin
CoGe
10
0
0
18 May 2025
The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models
Linghan Huang
Haolin Jin
Zhaoge Bi
Pengyue Yang
Peizhou Zhao
Taozhao Chen
Xiongfei Wu
Lei Ma
Huaming Chen
AAML
0
0
0
18 May 2025
Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs
Xuannan Liu
Zekun Li
Zheqi He
Peipei Li
Shuhan Xia
Xing Cui
Huaibo Huang
Xi Yang
Ran He
EGVM
AAML
14
0
0
17 May 2025
Fast RoPE Attention: Combining the Polynomial Method and Fast Fourier Transform
Josh Alman
Zhao-quan Song
0
0
0
17 May 2025
REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning?
Chenxi Jiang
Chuhao Zhou
Jianfei Yang
9
0
0
16 May 2025
TCC-Bench: Benchmarking the Traditional Chinese Culture Understanding Capabilities of MLLMs
Pengju Xu
Yan Wang
S. Zhang
Xuan Zhou
Xin Li
...
Fengzhao Li
Shuigeng Zhou
X. Wang
Yi Zhang
Haiying Zhao
VLM
12
0
0
16 May 2025
Extracting Explainable Dates From Medical Images By Reverse-Engineering UNIX Timestamps
Lee Harris
James Bentham
Philippe De Wilde
MedIm
14
0
0
16 May 2025
A
L
L
M
4
A
D
D
\mathcal{A}LLM4ADD
A
LL
M
4
A
DD
: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection
Hao Gu
Jiangyan Yi
Chenglong Wang
Jianhua Tao
Zheng Lian
Jiayi He
Yong Ren
Yujie Chen
Zhengqi Wen
12
0
0
16 May 2025
Context-Aware Probabilistic Modeling with LLM for Multimodal Time Series Forecasting
Yueyang Yao
Jiajun Li
Xingyuan Dai
MengMeng Zhang
Xiaoyan Gong
Fei-Yue Wang
Yisheng Lv
AI4TS
24
0
0
16 May 2025
GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning
Yong-Jin Liu
Shengfang Zhai
Mingzhe Du
Yulin Chen
Tri Cao
...
Xuzhao Li
Kun Wang
Junfeng Fang
Jiaheng Zhang
Bryan Hooi
OffRL
LRM
7
0
0
16 May 2025
Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation
Donghoon Lee
Tung M. Luu
Younghwan Lee
Chang D. Yoo
OffRL
VLM
14
0
0
16 May 2025
Real-Time Verification of Embodied Reasoning for Generative Skill Acquisition
Bo Yue
Shuqi Guo
Kaiyu Hu
Chujiao Wang
Benyou Wang
Kui Jia
Guiliang Liu
LRM
21
0
0
16 May 2025
Accurate KV Cache Quantization with Outlier Tokens Tracing
Yi Su
Yuechi Zhou
Quantong Qiu
Juntao Li
Qingrong Xia
Ping Li
Xinyu Duan
Zhefeng Wang
Min Zhang
MQ
22
0
0
16 May 2025
On Next-Token Prediction in LLMs: How End Goals Determine the Consistency of Decoding Algorithms
Jacob Trauger
Ambuj Tewari
12
0
0
16 May 2025
LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors
Rao Ma
Tongzhou Chen
Kartik Audhkhasi
Bhuvana Ramabhadran
AuLLM
22
0
0
16 May 2025
Modeling cognitive processes of natural reading with transformer-based Language Models
Bruno Bianchi
Fermín Travi
Juan E. Kamienkowski
14
0
0
16 May 2025
Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs
Yaorui Shi
Shihan Li
Chang Wu
Zhiyuan Liu
Fan Zhang
Hengxing Cai
An Zhang
Xinbing Wang
ReLM
LRM
31
0
0
16 May 2025
Improving Assembly Code Performance with Large Language Models via Reinforcement Learning
Anjiang Wei
Tarun Suresh
Huanmi Tan
Yinglun Xu
Gagandeep Singh
Ke Wang
Alex Aiken
7
0
0
16 May 2025
Reinforced Interactive Continual Learning via Real-time Noisy Human Feedback
Yutao Yang
Jie Zhou
Junsong Li
Qianjun Pan
Bihao Zhan
Qin Chen
Xipeng Qiu
Liang He
CLL
29
0
0
15 May 2025
Rethinking Prompt Optimizers: From Prompt Merits to Optimization
Zixiao Zhu
Hanzhang Zhou
Zijian Feng
Tianjiao Li
Chua Jia Jim Deryl
Mak Lee Onn
Gee Wah Ng
Kezhi Mao
LRM
31
0
0
15 May 2025
Coherent Language Reconstruction from Brain Recordings with Flexible Multi-Modal Input Stimuli
Chunyu Ye
Shaonan Wang
AI4CE
19
0
0
15 May 2025
Private Transformer Inference in MLaaS: A Survey
Yang Li
Xinyu Zhou
Yishuo Wang
Liangxin Qian
Jun Zhao
21
0
0
15 May 2025
Multi-Token Prediction Needs Registers
Anastasios Gerontopoulos
Spyros Gidaris
N. Komodakis
24
0
0
15 May 2025
Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors
Ahmed S. Abdelrahman
Mohamed Abdel-Aty
Samgyu Yang
Abdulrahman Faden
29
0
0
15 May 2025
RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs
Vibha Belavadi
Tushar Vatsa
Dewang Sultania
Suhas Suresha
Ishita Verma
Cheng Chen
Tracy Holloway King
Michael Friedrich
SyDa
28
0
0
15 May 2025
AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges
Ranjan Sapkota
Konstantinos I Roumeliotis
Manoj Karkee
AI4TS
24
0
0
15 May 2025
Cross-Image Contrastive Decoding: Precise, Lossless Suppression of Language Priors in Large Vision-Language Models
Jianfei Zhao
Feng Zhang
Xingchen Sun
Chong Feng
MLLM
28
0
0
15 May 2025
Task-Core Memory Management and Consolidation for Long-term Continual Learning
Tianyu Huai
Jie Zhou
Yuxuan Cai
Qin Chen
Wen Wu
Xingjiao Wu
Xipeng Qiu
Liang He
CLL
33
0
0
15 May 2025
Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLMs
Jingcheng Niu
Xingdi Yuan
Tong Wang
Hamidreza Saghir
Amir H. Abdi
27
0
0
14 May 2025
Flash-VL 2B: Optimizing Vision-Language Model Performance for Ultra-Low Latency and High Throughput
Bo Zhang
Shuo Li
Runhe Tian
Yang Yang
Jixin Tang
Jinhao Zhou
Lin Ma
VLM
30
0
0
14 May 2025
WavReward: Spoken Dialogue Models With Generalist Reward Evaluators
Shengpeng Ji
Tianle Liang
Yongqian Li
Jialong Zuo
Minghui Fang
...
Xize Cheng
Siqi Zheng
Jin Xu
Junyang Lin
Zhou Zhao
AuLLM
ALM
33
0
0
14 May 2025
Recent Advances in Medical Imaging Segmentation: A Survey
Fares Bougourzi
Abdenour Hadid
OOD
44
0
0
14 May 2025
How Hungry is AI? Benchmarking Energy, Water, and Carbon Footprint of LLM Inference
Nidhal Jegham
Marwen Abdelatti
Lassad Elmoubarki
Abdeltawab Hendawi
26
0
0
14 May 2025
STORYANCHORS: Generating Consistent Multi-Scene Story Frames for Long-Form Narratives
Bo Wang
Haoyang Huang
Zhiying Lu
Fengyuan Liu
Guoqing Ma
Jianlong Yuan
Y. Zhang
Nan Duan
Daxin Jiang
VGen
34
0
0
13 May 2025
LM-Scout: Analyzing the Security of Language Model Integration in Android Apps
Muhammad Ibrahim
Gűliz Seray Tuncay
Z. Berkay Celik
Aravind Machiry
Antonio Bianchi
31
0
0
13 May 2025
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving
Zongchuang Zhao
Haoyu Fu
Dingkang Liang
Xin Zhou
Dingyuan Zhang
Hongwei Xie
Bing Wang
Xiang Bai
MLLM
VLM
49
0
0
13 May 2025
CAD-Coder:Text-Guided CAD Files Code Generation
Changqi He
Shuhan Zhang
Liguo Zhang
Jiajun Miao
29
0
0
13 May 2025
Direct Density Ratio Optimization: A Statistically Consistent Approach to Aligning Large Language Models
Rei Higuchi
Taiji Suzuki
33
0
0
12 May 2025
Circuit Partitioning Using Large Language Models for Quantum Compilation and Simulations
Pranav Sinha
Sumit Kumar Jha
Sunny Raj
34
0
0
12 May 2025
Tagging fully hadronic exotic decays of the vectorlike
B
\mathbf{B}
B
quark using a graph neural network
Jai Bardhan
Tanumoy Mandal
Subhadip Mitra
Cyrin Neeraj
Mihir Rawat
28
0
0
12 May 2025
MilChat: Introducing Chain of Thought Reasoning and GRPO to a Multimodal Small Language Model for Remote Sensing
Aybora Koksal
Aydin Alatan
LRM
29
0
0
12 May 2025
SpecRouter: Adaptive Routing for Multi-Level Speculative Decoding in Large Language Models
Hang Wu
Jianian Zhu
Yongqian Li
Haojie Wang
Biao Hou
Jidong Zhai
40
0
0
12 May 2025
AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection
Kai Hua
Steven Wu
Ge Zhang
Ke Shen
LRM
28
0
0
12 May 2025
Continuous Visual Autoregressive Generation via Score Maximization
Chenze Shao
Fandong Meng
Jie Zhou
DiffM
31
0
0
12 May 2025
Assessing and Mitigating Medical Knowledge Drift and Conflicts in Large Language Models
Weiyi Wu
Xinwen Xu
Chongyang Gao
Xingjian Diao
Siting Li
Lucas A. Salas
Jiang Gui
26
0
0
12 May 2025
Visually Interpretable Subtask Reasoning for Visual Question Answering
Yu Cheng
A. Goel
Hakan Bilen
LRM
31
0
0
12 May 2025
Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets
Weiyu Li
Xuanyang Zhang
Zheng Sun
Di Qi
Hao Li
...
Zeming Li
Gang Yu
Xiangyu Zhang
Daxin Jiang
Ping Tan
43
0
0
12 May 2025
OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit
Arun S. Maiya
KELM
29
0
0
12 May 2025
Implementing Long Text Style Transfer with LLMs through Dual-Layered Sentence and Paragraph Structure Extraction and Mapping
Yusen Wu
Xiaotie Deng
19
0
0
11 May 2025
1
2
3
4
...
87
88
89
Next