Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.13971
Cited By
LLaMA: Open and Efficient Foundation Language Models
27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"LLaMA: Open and Efficient Foundation Language Models"
50 / 2,584 papers shown
Title
Convergence of Clipped-SGD for Convex
(
L
0
,
L
1
)
(L_0,L_1)
(
L
0
,
L
1
)
-Smooth Optimization with Heavy-Tailed Noise
S. Chezhegov
Aleksandr Beznosikov
Samuel Horváth
Eduard A. Gorbunov
29
0
0
27 May 2025
Ratas framework: A comprehensive genai-based approach to rubric-based marking of real-world textual exams
Masoud Safilian
Amin Beheshti
Stephen Elbourn
21
0
0
27 May 2025
Leveraging LLM and Self-Supervised Training Models for Speech Recognition in Chinese Dialects: A Comparative Analysis
Tianyi Xu
Hongjie Chen
Wang Qing
Lv Hang
Jian Kang
Li Jie
Zhennan Lin
Yongxiang Li
Xie Lei
19
0
0
27 May 2025
DenseLoRA: Dense Low-Rank Adaptation of Large Language Models
Lin Mu
Xiaoyu Wang
Li Ni
Yang Li
Zhize Wu
Peiquan Jin
Yiwen Zhang
ALM
AI4CE
33
0
0
27 May 2025
Explainability of Large Language Models using SMILE: Statistical Model-agnostic Interpretability with Local Explanations
Zeinab Dehghani
Koorosh Aslansefat
Adil Khan
Mohammed Naveed Akram
MILM
LRM
134
0
0
27 May 2025
What happens when generative AI models train recursively on each others' generated outputs?
Hung Ahn Vu
Galen Reeves
Emily Wenger
51
0
0
27 May 2025
MLE-STAR: Machine Learning Engineering Agent via Search and Targeted Refinement
Jaehyun Nam
Jinsung Yoon
Jiefeng Chen
Jinwoo Shin
Sercan Ö. Arık
Tomas Pfister
LLMAG
13
0
0
27 May 2025
Mitigating Hallucination in Large Vision-Language Models via Adaptive Attention Calibration
Mehrdad Fazli
Bowen Wei
Ziwei Zhu
VLM
194
0
0
27 May 2025
Are Language Models Consequentialist or Deontological Moral Reasoners?
Keenan Samway
Max Kleiman-Weiner
David Guzman Piedrahita
Rada Mihalcea
Bernhard Schölkopf
Zhijing Jin
ELM
LRM
28
0
0
27 May 2025
Right Side Up? Disentangling Orientation Understanding in MLLMs with Fine-grained Multi-axis Perception Tasks
Keanu Nichols
Nazia Tasnim
Yuting Yan
Nicholas Ikechukwu
Elva Zou
Deepti Ghadiyaram
Bryan A. Plummer
78
0
0
27 May 2025
In Search of Adam's Secret Sauce
Antonio Orvieto
Robert Gower
33
1
0
27 May 2025
Hierarchical Tree Search-based User Lifelong Behavior Modeling on Large Language Model
Yu Xia
Rui Zhong
Hao Gu
Wei Yang
Chi Lu
Peng Jiang
Kun Gai
225
0
0
26 May 2025
MSD-LLM: Predicting Ship Detention in Port State Control Inspections with Large Language Model
Jiongchao Jin
Xiuju Fu
Xiaowei Gao
Tao Cheng
Ran Yan
AI4CE
191
0
0
26 May 2025
Correlating instruction-tuning (in multimodal models) with vision-language processing (in the brain)
Subba Reddy Oota
Akshett Rai Jindal
Ishani Mondal
Khushbu Pahwa
Satya Sai Srinath Namburi
Manish Shrivastava
M. Singh
Bapi S. Raju
Manish Gupta
48
1
0
26 May 2025
BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs
Guilong Lu
Xuntao Guo
Rongjunchen Zhang
Wenqiao Zhu
Ji Liu
ELM
57
0
0
26 May 2025
Distilling Closed-Source LLM's Knowledge for Locally Stable and Economic Biomedical Entity Linking
Yihao Ai
Zhiyuan Ning
Weiwei Dai
P. Wang
Yi Du
Wenjuan Cui
Kunpeng Liu
Yuanchun Zhou
46
1
0
26 May 2025
Progressive Scaling Visual Object Tracking
Jack Hong
Shilin Yan
Zehao Xiao
Jiayin Cai
Xiaolong Jiang
Yao Hu
Henghui Ding
81
0
0
26 May 2025
DiffNMR: Advancing Inpainting of Randomly Sampled Nuclear Magnetic Resonance Signals
Sen Yan
Fabrizio Gabellieri
Etienne Goffinet
Filippo Castiglione
Thomas Launey
DiffM
MedIm
42
1
0
26 May 2025
Learning a Pessimistic Reward Model in RLHF
Yinglun Xu
Hangoo Kang
Tarun Suresh
Yuxuan Wan
Gagandeep Singh
OffRL
63
0
0
26 May 2025
CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward
Yandong Guan
Xilin Wang
Xingxi Ming
Jing Zhang
Dong Xu
Qian Yu
3DV
LRM
34
0
0
26 May 2025
Grounding Language with Vision: A Conditional Mutual Information Calibrated Decoding Strategy for Reducing Hallucinations in LVLMs
Hao Fang
Changle Zhou
Jiawei Kong
Kuofeng Gao
Bin Chen
Tao Liang
Guojun Ma
Shu-Tao Xia
MLLM
115
0
0
26 May 2025
In-Context Brush: Zero-shot Customized Subject Insertion with Context-Aware Latent Space Manipulation
Yu Xu
Fan Tang
You Wu
Lin Gao
Oliver Deussen
Hongbin Yan
Jintao Li
Juan Cao
Tong-Yee Lee
DiffM
49
0
0
26 May 2025
ResSVD: Residual Compensated SVD for Large Language Model Compression
Haolei Bai
Siyong Jian
Tuo Liang
Yu Yin
Huan Wang
46
0
0
26 May 2025
Graceful Forgetting in Generative Language Models
Chunyang Jiang
Chi-Min Chan
Yiyang Cai
Yulong Liu
Wei Xue
Yike Guo
MoMe
CLL
KELM
40
0
0
26 May 2025
Small Language Models: Architectures, Techniques, Evaluation, Problems and Future Adaptation
Tanjil Hasan Sakib
Md. Tanzib Hosain
Md. Kishor Morol
ALM
45
0
0
26 May 2025
FieldWorkArena: Agentic AI Benchmark for Real Field Work Tasks
Atsunori Moteki
S. Masui
Fan Yang
Yueqi Song
Yonatan Bisk
...
Ikuo Kusajima
Yasuto Watanabe
Hiroyuki Ishida
Jun Takahashi
Shan Jiang
29
0
0
26 May 2025
DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue
Yichun Feng
Jiawei Wang
Lu Zhou
Yixue Li
OffRL
LM&MA
217
0
0
26 May 2025
LLM Meets Scene Graph: Can Large Language Models Understand and Generate Scene Graphs? A Benchmark and Empirical Study
Dongil Yang
Minjin Kim
Sunghwan Kim
Beong-woo Kwak
Minjun Park
Jinseok Hong
Woontack Woo
Jinyoung Yeo
60
0
0
26 May 2025
GenKI: Enhancing Open-Domain Question Answering with Knowledge Integration and Controllable Generation in Large Language Models
Tingjia Shen
Hao Wang
Chuan Qin
Ruijun Sun
Yang Song
Defu Lian
Hengshu Zhu
Enhong Chen
55
0
0
26 May 2025
LlamaSeg: Image Segmentation via Autoregressive Mask Generation
Jiru Deng
Tengjin Weng
Tianyu Yang
Wenhan Luo
Zhiheng Li
Wenhao Jiang
VLM
149
0
0
26 May 2025
T^2Agent A Tool-augmented Multimodal Misinformation Detection Agent with Monte Carlo Tree Search
Xing Cui
Yueying Zou
Zekun Li
Peipei Li
Xinyuan Xu
Xuannan Liu
Huaibo Huang
Ran He
233
0
0
26 May 2025
ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs
Pooneh Mousavi
Yingzhi Wang
Mirco Ravanelli
Cem Subakan
AuLLM
74
0
0
26 May 2025
MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval
Rong-Cheng Tu
Zhao Jin
Jingyi Liao
Xiao Luo
Yingjie Wang
Li Shen
Dacheng Tao
115
0
0
26 May 2025
Minimalist Softmax Attention Provably Learns Constrained Boolean Functions
Jerry Yao-Chieh Hu
Xiwen Zhang
Maojiang Su
Zhao Song
Han Liu
MLT
243
1
0
26 May 2025
InFact: Informativeness Alignment for Improved LLM Factuality
Roi Cohen
Russa Biswas
Gerard de Melo
20
0
0
26 May 2025
TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization
Dingyu Yao
Bowen Shen
Zheng Lin
Wei Liu
Jian Luan
Bin Wang
Weiping Wang
MQ
47
0
0
26 May 2025
MA-RAG: Multi-Agent Retrieval-Augmented Generation via Collaborative Chain-of-Thought Reasoning
Thang Nguyen
Peter Chin
Yu-Wing Tai
LRM
80
1
0
26 May 2025
CloneShield: A Framework for Universal Perturbation Against Zero-Shot Voice Cloning
Renyuan Li
Zhibo Liang
Haichuan Zhang
Tianyu Shi
Zhiyuan Cheng
Jia Shi
Carl Yang
Mingjie Tang
AAML
172
0
0
25 May 2025
SpokenNativQA: Multilingual Everyday Spoken Queries for LLMs
Firoj Alam
Md. Arid Hasan
Shammur A. Chowdhury
81
0
0
25 May 2025
OpenHOI: Open-World Hand-Object Interaction Synthesis with Multimodal Large Language Model
Zhenhao Zhang
Ye-ling Shi
Lingxiao Yang
Suting Ni
Qi Ye
Jingya Wang
74
0
0
25 May 2025
From Generation to Detection: A Multimodal Multi-Task Dataset for Benchmarking Health Misinformation
Zhihao Zhang
Yiran Zhang
Xiyue Zhou
Liting Huang
Imran Razzak
Preslav Nakov
Usman Naseem
24
0
0
24 May 2025
Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models
Haoyuan Sun
Jiaqi Wu
Bo Xia
Yifu Luo
Yifei Zhao
Kai Qin
Xufei Lv
Tiantian Zhang
Yongzhe Chang
Xueqian Wang
OffRL
LRM
209
0
0
24 May 2025
KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning
Zhendong Mi
Qitao Tan
Xiaodong Yu
Zining Zhu
Geng Yuan
Shaoyi Huang
206
0
0
24 May 2025
VISTA: Vision-Language Inference for Training-Free Stock Time-Series Analysis
Tina Khezresmaeilzadeh
Parsa Razmara
Seyedarmin Azizi
Mohammad Erfan Sadeghi
Erfan Baghaei Portaghloo
AI4TS
276
0
0
24 May 2025
BTC-LLM: Efficient Sub-1-Bit LLM Quantization via Learnable Transformation and Binary Codebook
Hao Gu
Lujun Li
Zheyu Wang
B. Liu
Qiyuan Zhu
Sirui Han
Yike Guo
MQ
20
0
0
24 May 2025
GainRAG: Preference Alignment in Retrieval-Augmented Generation through Gain Signal Synthesis
Yi Jiang
Sendong Zhao
Jianbo Li
Haochun Wang
Bing Qin
RALM
186
0
0
24 May 2025
REGen: Multimodal Retrieval-Embedded Generation for Long-to-Short Video Editing
Weihan Xu
Yimeng Ma
Jingyue Huang
Yang Li
Wenye Ma
Taylor Berg-Kirkpatrick
Julian McAuley
Paul Pu Liang
Hao-Wen Dong
DiffM
VGen
182
0
0
24 May 2025
RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models
Yilang Zhang
Bingcong Li
G. Giannakis
246
1
0
24 May 2025
ToDRE: Visual Token Pruning via Diversity and Task Awareness for Efficient Large Vision-Language Models
Duo Li
Zuhao Yang
Shijian Lu
VLM
96
0
0
24 May 2025
Multi-Scale Manifold Alignment: A Unified Framework for Enhanced Explainability of Large Language Models
Yukun Zhang
Qi Dong
27
0
0
24 May 2025
Previous
1
2
3
...
5
6
7
...
50
51
52
Next