2302.13971
Cited By
LLaMA: Open and Efficient Foundation Language Models
27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
Papers citing "LLaMA: Open and Efficient Foundation Language Models"
50 / 2,584 papers shown
Title
RATE-Nav: Region-Aware Termination Enhancement for Zero-shot Object Navigation with Vision-Language Models
Junjie Li
Nan Zhang
Xiaoyang Qu
Kai Lu
Guokuan Li
Jiguang Wan
Jianzong Wang
54
0
0
03 Jun 2025
Comparing LLM-generated and human-authored news text using formal syntactic theory
Olga Zamaraeva
Dan Flickinger
Francis Bond
Carlos Gómez-Rodríguez
57
0
0
02 Jun 2025
Growing Through Experience: Scaling Episodic Grounding in Language Models
Chunhui Zhang
Sirui Wang
Z. Ouyang
Xiangchi Yuan
Soroush Vosoughi
CLL
70
1
0
02 Jun 2025
Natural, Artificial, and Human Intelligences
E. Pothos
Dominic Widdows
21
0
0
02 Jun 2025
GRAM: Generative Recommendation via Semantic-aware Multi-granular Late Fusion
Sunkyung Lee
Minjin Choi
Eunseong Choi
Hye-young Kim
Jongwuk Lee
VLM
65
0
0
02 Jun 2025
ShapeLLM-Omni: A Native Multimodal LLM for 3D Generation and Understanding
Junliang Ye
Zhengyi Wang
Ruowen Zhao
Shenghao Xie
Jun Zhu
56
0
0
02 Jun 2025
Unraveling Spatio-Temporal Foundation Models via the Pipeline Lens: A Comprehensive Review
Yuchen Fang
Hao Miao
Yuxuan Liang
Liwei Deng
Yue Cui
...
Yan Zhao
T. Pedersen
Christian S. Jensen
Xiaofang Zhou
Kai Zheng
AI4TS
AI4CE
70
0
0
02 Jun 2025
FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens
Yiming Zhong
Yumeng Liu
Chuyang Xiao
Zemin Yang
Youzhuo Wang
Yufei Zhu
Ye-ling Shi
Yujing Sun
X. Zhu
Yuexin Ma
57
0
0
02 Jun 2025
Revisiting LRP: Positional Attribution as the Missing Ingredient for Transformer Explainability
Yarden Bakish
Itamar Zimerman
Hila Chefer
Lior Wolf
22
0
0
02 Jun 2025
Multiverse Through Deepfakes: The MultiFakeVerse Dataset of Person-Centric Visual and Conceptual Manipulations
Parul Gupta
Shreya Ghosh
Tom Gedeon
Thanh-Toan Do
Abhinav Dhall
50
0
0
01 Jun 2025
Generic Token Compression in Multimodal Large Language Models from an Explainability Perspective
Lei Lei
Jie Gu
Xiaokang Ma
Chu Tang
Jingmin Chen
Tong Xu
43
1
0
01 Jun 2025
CC-Tuning: A Cross-Lingual Connection Mechanism for Improving Joint Multilingual Supervised Fine-Tuning
Yangfan Ye
Xiaocheng Feng
Zekun Yuan
Xiachong Feng
L. Qin
...
Yunfei Lu
Xiaohui Yan
Duyu Tang
Dandan Tu
Bing Qin
37
0
0
01 Jun 2025
Doubly Robust Alignment for Large Language Models
Erhan Xu
Kai Ye
Hongyi Zhou
Luhan Zhu
Francesco Quinzan
Chengchun Shi
44
0
0
01 Jun 2025
From Plain Text to Poetic Form: Generating Metrically-Constrained Sanskrit Verses
Manoj Balaji Jagadeeshan
S. Bhatia
Pretam Ray
Harshul Raj Surana
A. Prathosh
Priya Mishra
Annarao Kulkarni
Ganesh Ramakrishnan
Prathosh AP
Pawan Goyal
45
0
0
01 Jun 2025
Fighting Fire with Fire (F3): A Training-free and Efficient Visual Adversarial Example Purification Method in LVLMs
Yudong Zhang
Ruobing Xie
Yiqing Huang
Jiansheng Chen
Xingwu Sun
Zhanhui Kang
Di Wang
Yu Wang
AAML
49
0
0
01 Jun 2025
How Programming Concepts and Neurons Are Shared in Code Language Models
Amir Hossein Kargaran
Yihong Liu
François Yvon
Hinrich Schütze
43
0
0
01 Jun 2025
Attention Retrieves, MLP Memorizes: Disentangling Trainable Components in the Transformer
Yihe Dong
Lorenzo Noci
Mikhail Khodak
Mufan Li
62
0
0
01 Jun 2025
Leveraging Large Language Models for Sarcastic Speech Annotation in Sarcasm Detection
Zhu Li
Yuqing Zhang
Xiyuan Gao
Shekhar Nayak
Matt Coler
25
0
0
01 Jun 2025
Dynamic Chunking and Selection for Reading Comprehension of Ultra-Long Context in Large Language Models
Boheng Sheng
Jiacheng Yao
Meicong Zhang
Guoxiu He
RALM
45
0
0
01 Jun 2025
FedRPCA: Enhancing Federated LoRA Aggregation Using Robust PCA
Divyansh Jhunjhunwala
Arian Raje
Madan Ravi Ganesh
Chaithanya Kumar Mummadi
Chaoqun Dong
Jiawei Zhou
Wan-Yi Lin
Gauri Joshi
Zhenzhen Li
48
0
0
01 Jun 2025
DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity Environments
Chiyu Zhang
Marc-Alexandre Cote
Michael Albada
Anush Sankaran
Jack W. Stokes
Tong Wang
Amir H. Abdi
William Blum
Muhammad Abdul-Mageed
LLMAG
AAML
ELM
58
0
0
31 May 2025
Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data
Shaoxiong Ji
Zihao Li
Jaakko Paavola
Indraneil Paul
Hengyu Luo
Jörg Tiedemann
CLL
52
0
0
31 May 2025
Exploring In-context Example Generation for Machine Translation
Dohyun Lee
Seungil Lee
Chanwoo Yang
Yujin Baek
Jaegul Choo
33
0
0
31 May 2025
A Brain Graph Foundation Model: Pre-Training and Prompt-Tuning for Any Atlas and Disorder
Xinxu Wei
K. Zhao
Yong Jiao
Lifang He
Yu Zhang
AI4CE
22
0
0
31 May 2025
SATA-BENCH: Select All That Apply Benchmark for Multiple Choice Questions
Weijie Xu
Shixian Cui
Xi Fang
Chi Xue
Stephanie Eckman
Chandan K. Reddy
ELM
37
0
0
31 May 2025
PMF-CEC: Phoneme-augmented Multimodal Fusion for Context-aware ASR Error Correction with Error-specific Selective Decoding
Jiajun He
Tomoki Toda
24
1
0
31 May 2025
Video Signature: In-generation Watermarking for Latent Video Diffusion Models
Yu Huang
Junhao Chen
Qi Zheng
Hanqian Li
Shuliang Liu
Xuming Hu
DiffM
WIGM
VGen
51
0
0
31 May 2025
Guiding Generative Storytelling with Knowledge Graphs
Zhijun Pan
Antonios Andronis
Eva Hayek
Oscar AP Wilkinson
Ilya Lasy
Annette Parry
Guy Gadney
Tim J. Smith
Mick Grierson
30
0
0
30 May 2025
CLaSp: In-Context Layer Skip for Self-Speculative Decoding
Longze Chen
Renke Shan
Huiming Wang
Lu Wang
Ziqiang Liu
Run Luo
Jiawei Wang
Hamid Alinejad-Rokny
Min Yang
37
0
0
30 May 2025
Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces
Gen Luo
Ganlin Yang
Ziyang Gong
Guanzhou Chen
Haonan Duan
...
Wenhai Wang
Jifeng Dai
Yu Qiao
Rongrong Ji
X. Zhu
LM&Ro
35
1
0
30 May 2025
Drop Dropout on Single-Epoch Language Model Pretraining
Houjun Liu
John Bauer
Christopher D. Manning
LRM
36
0
0
30 May 2025
Learn from the Past: Fast Sparse Indexing for Large Language Model Decoding
Feiyu Yao
Qian Wang
12
0
0
30 May 2025
Can LLMs Understand Unvoiced Speech? Exploring EMG-to-Text Conversion with LLMs
Payal Mohapatra
Akash Pandey
Xiaoyuan Zhang
Qi Zhu
AuLLM
25
0
0
30 May 2025
On Fairness of Task Arithmetic: The Role of Task Vectors
Hiroki Naganuma
Kotaro Yoshida
Laura Gomezjurado Gonzalez
Takafumi Horie
Yuji Naraki
Ryotaro Shimizu
MoMe
18
0
0
30 May 2025
BIMA: Bijective Maximum Likelihood Learning Approach to Hallucination Prediction and Mitigation in Large Vision-Language Models
Huu-Thien Tran
Thanh-Dat Truong
Khoa Luu
MLLM
18
0
0
30 May 2025
Benchmarking Foundation Models for Zero-Shot Biometric Tasks
Redwan Sony
Parisa Farmanifard
Hamzeh Alzwairy
Nitish Shukla
Arun Ross
CVBM
VLM
56
0
0
30 May 2025
ReCalKV: Low-Rank KV Cache Compression via Head Reordering and Offline Calibration
Xianglong Yan
Zhiteng Li
Tianao Zhang
Linghe Kong
Yulun Zhang
Xiaokang Yang
57
0
0
30 May 2025
Revisiting Epistemic Markers in Confidence Estimation: Can Markers Accurately Reflect Large Language Models' Uncertainty?
Jiayu Liu
Qing Zong
Weiqi Wang
Yangqiu Song
38
0
0
30 May 2025
MDPO: Multi-Granularity Direct Preference Optimization for Mathematical Reasoning
Yunze Lin
LRM
15
0
0
30 May 2025
LLM-powered Query Expansion for Enhancing Boundary Prediction in Language-driven Action Localization
Zirui Shang
Xinxiao Wu
Shuo Yang
44
0
0
30 May 2025
Multiple LLM Agents Debate for Equitable Cultural Alignment
Dayeon Ki
Rachel Rudinger
Tianyi Zhou
Marine Carpuat
LLMAG
33
0
0
30 May 2025
Decoding Knowledge Attribution in Mixture-of-Experts: A Framework of Basic-Refinement Collaboration and Efficiency Analysis
Junzhuo Li
Bo Wang
Xiuze Zhou
Peijie Jiang
Jia Liu
Xuming Hu
MoE
49
0
0
30 May 2025
LittleBit: Ultra Low-Bit Quantization via Latent Factorization
Banseok Lee
Dongkyu Kim
Youngcheon You
Youngmin Kim
MQ
23
0
0
30 May 2025
Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting
Wei Chen
Jiahao Zhang
Haipeng Zhu
Boyan Xu
Zijian Li
Keli Zhang
Junjian Ye
Ruichu Cai
41
1
0
30 May 2025
Overfitting has a limitation: a model-independent generalization error bound based on Rényi entropy
Atsushi Suzuki
32
0
0
30 May 2025
MELT: Towards Automated Multimodal Emotion Data Annotation by Leveraging LLM Embedded Knowledge
Xin Jing
Jiadong Wang
Iosif Tsangko
Andreas Triantafyllopoulos
Björn Schuller
29
0
0
30 May 2025
ComposeRAG: A Modular and Composable RAG for Corpus-Grounded Multi-Hop Question Answering
Ruofan Wu
Youngwon Lee
Fan Shu
Danmei Xu
Seung-won Hwang
Z. Yao
Y. He
Feng Yan
LRM
27
0
0
30 May 2025
GradPower: Powering Gradients for Faster Language Model Pre-Training
Mingze Wang
Jinbo Wang
Jiaqi Zhang
Wei Wang
Peng Pei
Xunliang Cai
Weinan E
Lei Wu
58
0
0
30 May 2025
Bi-Manual Joint Camera Calibration and Scene Representation
Haozhan Tang
Tianyi Zhang
Matthew Johnson-Roberson
Weiming Zhi
40
0
0
30 May 2025
Invariant Link Selector for Spatial-Temporal Out-of-Distribution Problem
Katherine Tieu
Dongqi Fu
Jun Wu
Jingrui He
OOD
OODD
CML
56
3
0
30 May 2025