Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.13971
Cited By
LLaMA: Open and Efficient Foundation Language Models
27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LLaMA: Open and Efficient Foundation Language Models"
50 / 7,027 papers shown
Title
FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement
Ian Huang
Yanan Bao
Karen Truong
Howard Zhou
Cordelia Schmid
Leonidas J. Guibas
Alireza Fathi
3DV
35
2
0
06 Mar 2025
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model
Wenke Huang
Jian Liang
Xianda Guo
Yiyang Fang
Guancheng Wan
...
Bin Yang
He Li
Jiawei Shao
Mang Ye
Bo Du
OffRL
LRM
MLLM
KELM
VLM
73
1
0
06 Mar 2025
SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing
Xiangchao Yan
Shiyang Feng
Jiakang Yuan
Renqiu Xia
Bin Wang
Bo Zhang
Junlin Wu
73
2
0
06 Mar 2025
AOLO: Analysis and Optimization For Low-Carbon Oriented Wireless Large Language Model Services
Xiaoqi Wang
Hongyang Du
Yuehong Gao
Dong In Kim
71
0
0
06 Mar 2025
Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models
Benyamin Jamialahmadi
Parsa Kavehzadeh
Mehdi Rezagholizadeh
Parsa Farinneya
Hossein Rajabzadeh
A. Jafari
Boxing Chen
Marzieh S. Tahaei
52
0
0
06 Mar 2025
Benchmarking Large Language Models on Multiple Tasks in Bioinformatics NLP with Prompting
Jiyue Jiang
Pengan Chen
Jinqiao Wang
Dongchen He
Ziqin Wei
...
Yimin Fan
Xiangyu Shi
Jimeng Sun
Chuan Wu
Yuan Li
LM&MA
57
1
0
06 Mar 2025
Addressing Overprescribing Challenges: Fine-Tuning Large Language Models for Medication Recommendation Tasks
Zihao Zhao
Chenxiao Fan
Chongming Gao
Fuli Feng
Xiangnan He
LM&MA
AI4MH
82
0
0
05 Mar 2025
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders
Kristian Kuznetsov
Laida Kushnareva
Polina Druzhinina
Anton Razzhigaev
Anastasia Voznyuk
Irina Piontkovskaya
Evgeny Burnaev
Serguei Barannikov
44
0
0
05 Mar 2025
Effective LLM Knowledge Learning via Model Generalization
Mingkang Zhu
Xi Chen
Junyao Xing
Bei Yu
Hengshuang Zhao
Jiaya Jia
72
0
0
05 Mar 2025
Developing and Utilizing a Large-Scale Cantonese Dataset for Multi-Tasking in Large Language Models
Jiyue Jiang
Alfred Kar Yin Truong
Yuxiao Chen
Qinghang Bao
Sheng Wang
Pengan Chen
Jinqiao Wang
Lingpeng Kong
Yu Li
Chuan Wu
ALM
61
0
0
05 Mar 2025
BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving
Katharina Winter
Mark Azer
Fabian B. Flohr
61
0
0
05 Mar 2025
PacketCLIP: Multi-Modal Embedding of Network Traffic and Language for Cybersecurity Reasoning
Ryozo Masukawa
Sanggeon Yun
SungHeon Jeong
Wenjun Huang
Yang Ni
Ian Bryant
Nathaniel D. Bastian
Mohsen Imani
58
0
0
05 Mar 2025
LLM as GNN: Graph Vocabulary Learning for Text-Attributed Graph Foundation Models
Xi Zhu
Haochen Xue
Ziwei Zhao
Wujiang Xu
Jingyuan Huang
Minghao Guo
Qifan Wang
Kaixiong Zhou
Yongfeng Zhang
72
2
0
05 Mar 2025
DSVD: Dynamic Self-Verify Decoding for Faithful Generation in Large Language Models
Y. Guo
Yuchen Yang
Zhe Chen
Pingjie Wang
Yusheng Liao
Yujie Zhang
Yanfeng Wang
Yu Wang
HILM
83
0
0
05 Mar 2025
RiskAgent: Autonomous Medical AI Copilot for Generalist Risk Prediction
Fenglin Liu
Jinge Wu
Hongjian Zhou
Xiao Gu
Soheila Molaei
A. Thakur
Lei A. Clifton
Honghan Wu
David Clifton
LM&MA
51
0
0
05 Mar 2025
HeTGB: A Comprehensive Benchmark for Heterophilic Text-Attributed Graphs
Shujie Li
Yuxia Wu
Chuan Shi
Yuan Fang
49
0
0
05 Mar 2025
Predicting Space Tourism Demand Using Explainable AI
Tan-Hanh Pham
Jingchen Bi
Rodrigo Mesa-Arangom
Kim-Doang Nguyen
62
0
0
05 Mar 2025
DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models
Saeed Ranjbar Alvar
Gursimran Singh
Mohammad Akbari
Yong Zhang
VLM
82
0
0
04 Mar 2025
Zero-Shot Complex Question-Answering on Long Scientific Documents
Wanting Wang
RALM
71
0
0
04 Mar 2025
ATLaS: Agent Tuning via Learning Critical Steps
Zhixun Chen
Ming Li
Yuanmin Huang
Yali Du
Meng Fang
Dinesh Manocha
90
3
0
04 Mar 2025
MindBridge: Scalable and Cross-Model Knowledge Editing via Memory-Augmented Modality
Shuaike Li
Kai Zhang
Qiang Liu
Enhong Chen
KELM
83
1
0
04 Mar 2025
Rewarding Doubt: A Reinforcement Learning Approach to Confidence Calibration of Large Language Models
Paul Stangel
D. Bani-Harouni
Chantal Pellegrini
Ege Özsoy
Kamilia Zaripova
Matthias Keicher
Nassir Navab
36
1
0
04 Mar 2025
Add-One-In: Incremental Sample Selection for Large Language Models via a Choice-Based Greedy Paradigm
Zehan Li
Yuhao Du
Xiaoqi Jiao
Yiwen Guo
Yuege Feng
Xiang Wan
Anningzhe Gao
Jinpeng Hu
68
0
0
04 Mar 2025
AugFL: Augmenting Federated Learning with Pretrained Models
Sheng Yue
Zerui Qin
Yongheng Deng
Ju Ren
Yaoxue Zhang
Junshan Zhang
FedML
87
0
0
04 Mar 2025
Zero-Shot Multi-Label Classification of Bangla Documents: Large Decoders Vs. Classic Encoders
Souvika Sarkar
M. Hasan
S. Karmaker
44
0
0
04 Mar 2025
An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning
Wei Sun
Qianlong Du
Fuwei Cui
Jiajun Zhang
OffRL
LRM
45
0
0
04 Mar 2025
Beyond Cosine Decay: On the effectiveness of Infinite Learning Rate Schedule for Continual Pre-training
Paul Janson
Vaibhav Singh
Paria Mehrbod
Adam Ibrahim
Irina Rish
Eugene Belilovsky
Benjamin Thérien
CLL
83
0
0
04 Mar 2025
LoRA-Null: Low-Rank Adaptation via Null Space for Large Language Models
Pengwei Tang
Yue Liu
Dongjie Zhang
Xing Wu
Debing Zhang
70
0
0
04 Mar 2025
SemGeoMo: Dynamic Contextual Human Motion Generation with Semantic and Geometric Guidance
Peishan Cong
Ziyi Wang
Y. Ma
Xiangyu Yue
59
1
0
03 Mar 2025
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
Hao Tang
Chenwei Xie
Haiyang Wang
Xiaoyi Bao
Tingyu Weng
Pandeng Li
Yun Zheng
Liwei Wang
ObjD
VLM
67
0
0
03 Mar 2025
Neural ODE Transformers: Analyzing Internal Dynamics and Adaptive Fine-tuning
Anh Tong
Thanh Nguyen-Tang
Dongeun Lee
Duc Nguyen
Toan M. Tran
David Hall
Cheongwoong Kang
Jaesik Choi
40
0
0
03 Mar 2025
LLMInit: A Free Lunch from Large Language Models for Selective Initialization of Recommendation
Weizhi Zhang
Liangwei Yang
Wooseong Yang
Henry Peng Zou
Yuqing Liu
Ke Xu
Sourav Medya
Philip S. Yu
71
2
0
03 Mar 2025
Scaling Law Phenomena Across Regression Paradigms: Multiple and Kernel Approaches
Yifang Chen
Xuyang Guo
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao Song
73
3
0
03 Mar 2025
PEO: Improving Bi-Factorial Preference Alignment with Post-Training Policy Extrapolation
Yuxuan Liu
47
0
0
03 Mar 2025
Evaluating LLMs' Assessment of Mixed-Context Hallucination Through the Lens of Summarization
Siya Qi
Rui Cao
Yulan He
Zheng Yuan
HILM
66
0
0
03 Mar 2025
A Zero-Shot Learning Approach for Ephemeral Gully Detection from Remote Sensing using Vision Language Models
Seyed Mohamad Ali Tousi
Ramy M. A. Farag
Jacket Demby's
Gbenga Omotara
John A. Lory
Guilherme N. DeSouza
250
0
0
03 Mar 2025
Rethinking Data: Towards Better Performing Domain-Specific Small Language Models
Boris Nazarov
Darya Frolova
Yackov Lubarsky
Alexei Gaissinski
Pavel Kisilev
ALM
78
1
0
03 Mar 2025
Compositional Reasoning with Transformers, RNNs, and Chain of Thought
Gilad Yehudai
Noah Amsel
Joan Bruna
LRM
70
1
0
03 Mar 2025
Provable Benefits of Task-Specific Prompts for In-context Learning
Xiangyu Chang
Yingcong Li
Muti Kara
Samet Oymak
Amit K. Roy-Chowdhury
65
0
0
03 Mar 2025
SwiLTra-Bench: The Swiss Legal Translation Benchmark
Joel Niklaus
Jakob Merane
Luka Nenadic
Sina Ahmadi
Yingqiang Gao
...
Matthew Guillod
Robin Mamié
Daniel Brunner
Julio Pereyra
Niko Grupen
AILaw
ELM
84
1
0
03 Mar 2025
Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh
Fajri Koto
Rituraj Joshi
Nurdaulet Mukhituly
Yunhong Wang
Zhuohan Xie
...
Avraham Sheinin
Natalia Vassilieva
Neha Sengupta
Larry Murray
Preslav Nakov
ALM
KELM
51
0
0
03 Mar 2025
Alchemist: Towards the Design of Efficient Online Continual Learning System
Yuyang Huang
Yuhan Liu
Haryadi S. Gunawi
Beibin Li
Changho Hwang
CLL
OnRL
112
0
0
03 Mar 2025
EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test
Yuhui Li
Fangyun Wei
Chao Zhang
Hongyang R. Zhang
123
6
0
03 Mar 2025
Liger: Linearizing Large Language Models to Gated Recurrent Structures
Disen Lan
Weigao Sun
Jiaxi Hu
Jusen Du
Yu Cheng
69
0
0
03 Mar 2025
Revisiting Large Language Model Pruning using Neuron Semantic Attribution
Yizhuo Ding
Xinwei Sun
Yanwei Fu
Guosheng Hu
66
0
0
03 Mar 2025
Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Xiang Wang
Mingqi Jiang
Zejun Ma
Ziyu Zhang
Shixuan Liu
...
Zhifei Li
Xie Chen
Lei Xie
Yu Guo
Wei Xue
86
13
0
03 Mar 2025
Do GFlowNets Transfer? Case Study on the Game of 24/42
Adesh Gupta
Abhinav Kumar
Mansi Gupta
Paras Chopra
105
0
0
03 Mar 2025
Instruct-of-Reflection: Enhancing Large Language Models Iterative Reflection Capabilities via Dynamic-Meta Instruction
Liping Liu
Chunhong Zhang
Likang Wu
Chuang Zhao
Zheng Hu
Ming He
Jianping Fan
LLMAG
LRM
46
1
0
02 Mar 2025
Predictive Data Selection: The Data That Predicts Is the Data That Teaches
Kashun Shum
Yuanmin Huang
Hongjian Zou
Qi Ding
Yixuan Liao
Xiao Chen
Qian Liu
Junxian He
67
2
0
02 Mar 2025
Patient-Level Anatomy Meets Scanning-Level Physics: Personalized Federated Low-Dose CT Denoising Empowered by Large Language Model
Ziyuan Yang
Yingyu Chen
Junyao Xing
Hongming Shan
Yang Chen
Yi Zhang
52
1
0
02 Mar 2025
Previous
1
2
3
...
18
19
20
...
139
140
141
Next