ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.13971
  4. Cited By
LLaMA: Open and Efficient Foundation Language Models

LLaMA: Open and Efficient Foundation Language Models

27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
    ALM
    PILM
ArXivPDFHTML

Papers citing "LLaMA: Open and Efficient Foundation Language Models"

50 / 5,726 papers shown
Title
A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling
A Multimodal Recaptioning Framework to Account for Perceptual Diversity in Multilingual Vision-Language Modeling
Kyle Buettner
Jacob Emmerson
Adriana Kovashka
25
0
0
19 Apr 2025
Hypothetical Documents or Knowledge Leakage? Rethinking LLM-based Query Expansion
Hypothetical Documents or Knowledge Leakage? Rethinking LLM-based Query Expansion
Yejun Yoon
Jaeyoon Jung
Seunghyun Yoon
Kunwoo Park
27
0
0
19 Apr 2025
Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models
Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models
Xinlin Zhuang
Jiahui Peng
Ren Ma
Yucheng Wang
Tianyi Bai
Xingjian Wei
Jiantao Qiu
Chi Zhang
Ying Qian
Conghui He
53
0
0
19 Apr 2025
Manipulating Multimodal Agents via Cross-Modal Prompt Injection
Manipulating Multimodal Agents via Cross-Modal Prompt Injection
Le Wang
Zonghao Ying
Tianyuan Zhang
Siyuan Liang
Shengshan Hu
Mingchuan Zhang
A. Liu
Xianglong Liu
AAML
33
1
0
19 Apr 2025
Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models
Empirical Evaluation of Knowledge Distillation from Transformers to Subquadratic Language Models
Patrick Haller
Jonas Golde
Alan Akbik
29
0
0
19 Apr 2025
HF4Rec: Human-Like Feedback-Driven Optimization Framework for Explainable Recommendation
HF4Rec: Human-Like Feedback-Driven Optimization Framework for Explainable Recommendation
Jiakai Tang
Jingsen Zhang
Zihang Tian
Xueyang Feng
Lei Wang
Xu Chen
OffRL
162
0
0
19 Apr 2025
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners
Yuhang Liu
Pengxiang Li
C. Xie
Xavier Hu
Xiaotian Han
Shengyu Zhang
Hongxia Yang
Fei Wu
LLMAG
LM&Ro
LRM
AI4CE
72
2
0
19 Apr 2025
Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Calibration
Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Calibration
Yiming Li
Hanwen Du
Yongbin Li
Junchen Fu
Chunxiao Li
Ziyi Zhuang
Jiakang Li
Yongxin Ni
AI4TS
29
0
0
19 Apr 2025
Scaling LLaNA: Advancing NeRF-Language Understanding Through Large-Scale Training
Scaling LLaNA: Advancing NeRF-Language Understanding Through Large-Scale Training
Andrea Amaduzzi
Pierluigi Zama Ramirez
Giuseppe Lisanti
Samuele Salti
Luigi Di Stefano
34
0
0
18 Apr 2025
MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space
MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space
Yicheng Chen
Yining Li
Kai Hu
Zerun Ma
Haochen Ye
Kai Chen
34
0
0
18 Apr 2025
Zero-Shot Industrial Anomaly Segmentation with Image-Aware Prompt Generation
Zero-Shot Industrial Anomaly Segmentation with Image-Aware Prompt Generation
SoYoung Park
Hyewon Lee
M. Choi
Seunghoon Han
Jong-Ryul Lee
Sungsu Lim
Tae-Ho Kim
VLM
57
0
0
18 Apr 2025
Analysing the Robustness of Vision-Language-Models to Common Corruptions
Analysing the Robustness of Vision-Language-Models to Common Corruptions
Muhammad Usama
Syeda Aishah Asim
Syed Bilal Ali
Syed Talal Wasim
Umair Bin Mansoor
VLM
36
0
0
18 Apr 2025
DMind Benchmark: Toward a Holistic Assessment of LLM Capabilities across the Web3 Domain
DMind Benchmark: Toward a Holistic Assessment of LLM Capabilities across the Web3 Domain
Miracle Master
Rainy Sun
Anya Reese
Joey Ouyang
Alex Chen
...
James Yi
Garry Zhao
Tony Ling
Hobert Wong
Lowes Yang
ALM
ELM
77
0
0
18 Apr 2025
CacheFormer: High Attention-Based Segment Caching
CacheFormer: High Attention-Based Segment Caching
Sushant Singh
A. Mahmood
41
0
0
18 Apr 2025
It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization
It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization
Ali Behrouz
Meisam Razaviyayn
Peilin Zhong
Vahab Mirrokni
41
0
0
17 Apr 2025
Memorization: A Close Look at Books
Memorization: A Close Look at Books
Iris Ma
Ian Domingo
A. Krone-Martins
Pierre Baldi
Cristina V. Lopes
29
0
0
17 Apr 2025
Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback
Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback
Nearchos Potamitis
Akhil Arora
LRM
49
0
0
17 Apr 2025
SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs
SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs
Haoxuan Li
Yi Bin
Yunshan Ma
Guoqing Wang
Yuqing Yang
See-Kiong Ng
Tat-Seng Chua
VLM
34
0
0
17 Apr 2025
GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning
GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning
Liangyu Xu
Yingxiu Zhao
Jiadong Wang
Yingyao Wang
Bu Pi
...
Jihao Gu
Xinfeng Li
Xiaoyong Zhu
Jun Song
Jian Xu
LRM
192
1
0
17 Apr 2025
VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models
VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models
Haojian Huang
Haodong Chen
Shengqiong Wu
Meng Luo
Jinlan Fu
Xinya Du
Hao Zhang
Hao Fei
AI4TS
175
0
0
17 Apr 2025
EarthGPT-X: Enabling MLLMs to Flexibly and Comprehensively Understand Multi-Source Remote Sensing Imagery
EarthGPT-X: Enabling MLLMs to Flexibly and Comprehensively Understand Multi-Source Remote Sensing Imagery
Wei Zhang
Miaoxin Cai
Yaqian Ning
Tianze Zhang
Yin Zhuang
He Chen
Jun Li
Xuerui Mao
38
0
0
17 Apr 2025
A Survey on Cross-Modal Interaction Between Music and Multimodal Data
A Survey on Cross-Modal Interaction Between Music and Multimodal Data
Sifei Li
Mining Tan
Feier Shen
Minyan Luo
Zijiao Yin
Fan Tang
W. Dong
Changsheng Xu
69
0
0
17 Apr 2025
VLMGuard-R1: Proactive Safety Alignment for VLMs via Reasoning-Driven Prompt Optimization
VLMGuard-R1: Proactive Safety Alignment for VLMs via Reasoning-Driven Prompt Optimization
Menglan Chen
Xianghe Pang
Jingjing Dong
Wenhao Wang
Yaxin Du
Siheng Chen
LRM
39
0
0
17 Apr 2025
Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving
Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving
Shumin Wang
Zhuoran Yang
Liwen Wang
ZhiPeng Tang
Heng Li
Lehan Pan
Sha Zhang
Jie Peng
Jianmin Ji
Y. Zhang
3DPC
46
0
0
17 Apr 2025
Simplifying Graph Transformers
Simplifying Graph Transformers
Liheng Ma
Soumyasundar Pal
Yingxue Zhang
Philip Torr
Mark J. Coates
28
0
0
17 Apr 2025
Can Masked Autoencoders Also Listen to Birds?
Can Masked Autoencoders Also Listen to Birds?
Lukas Rauch
Ilyass Moummad
René Heinrich
Alexis Joly
Bernhard Sick
Christoph Scholz
31
0
0
17 Apr 2025
Code Copycat Conundrum: Demystifying Repetition in LLM-based Code Generation
Code Copycat Conundrum: Demystifying Repetition in LLM-based Code Generation
Mingwei Liu
Juntao Li
Ying Wang
Xueying Du
Zuoyu Ou
...
Xin Su
Zifei Shan
Fangming Zou
Xin Peng
Yiling Lou
38
0
0
17 Apr 2025
Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations
Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations
Yiyou Sun
Y. Gai
Lijie Chen
Abhilasha Ravichander
Yejin Choi
D. Song
HILM
57
0
0
17 Apr 2025
Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training
Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training
Xinsong Zhang
Yarong Zeng
Xinting Huang
Hu Hu
Runquan Xie
Han Hu
Zhanhui Kang
MLLM
VLM
55
0
0
17 Apr 2025
MAIN: Mutual Alignment Is Necessary for instruction tuning
MAIN: Mutual Alignment Is Necessary for instruction tuning
Fanyi Yang
Jianfeng Liu
Xinsong Zhang
Haoyu Liu
Xixin Cao
Yuefeng Zhan
H. Sun
Weiwei Deng
Feng Sun
Qi Zhang
ALM
27
0
0
17 Apr 2025
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Shizhe Diao
Yu Yang
Y. Fu
Xin Dong
Dan Su
...
Hongxu Yin
M. Patwary
Yingyan
Jan Kautz
Pavlo Molchanov
40
0
0
17 Apr 2025
CM3AE: A Unified RGB Frame and Event-Voxel/-Frame Pre-training Framework
CM3AE: A Unified RGB Frame and Event-Voxel/-Frame Pre-training Framework
Wentao Wu
Xinyu Wang
Chenglong Li
Bo Jiang
Jin Tang
Bin Luo
Qi Liu
34
0
0
17 Apr 2025
A Complex-valued SAR Foundation Model Based on Physically Inspired Representation Learning
A Complex-valued SAR Foundation Model Based on Physically Inspired Representation Learning
M. D. Wang
Hanbo Bi
Yingchao Feng
Linlin Xin
Shuo Gong
Tianqi Wang
Zhiyuan Yan
Peijin Wang
Wenhui Diao
Xian Sun
36
0
0
16 Apr 2025
AnomalyR1: A GRPO-based End-to-end MLLM for Industrial Anomaly Detection
AnomalyR1: A GRPO-based End-to-end MLLM for Industrial Anomaly Detection
Yuhao Chao
Jie Liu
J. Tang
Gangshan Wu
35
1
0
16 Apr 2025
Can GPT tell us why these images are synthesized? Empowering Multimodal Large Language Models for Forensics
Can GPT tell us why these images are synthesized? Empowering Multimodal Large Language Models for Forensics
Yiran He
Yun Cao
Bowen Yang
Zeyu Zhang
31
0
0
16 Apr 2025
Position: The Most Expensive Part of an LLM should be its Training Data
Position: The Most Expensive Part of an LLM should be its Training Data
Nikhil Kandpal
Colin Raffel
31
0
0
16 Apr 2025
Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language Models
Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language Models
Zhanglin Wu
Tengfei Song
Ning Xie
Mengli Zhu
Weidong Zhang
...
Pengfei Li
Chong Li
Junhao Zhu
Hao Yang
Shiliang Sun
46
2
0
16 Apr 2025
Could Thinking Multilingually Empower LLM Reasoning?
Could Thinking Multilingually Empower LLM Reasoning?
Changjiang Gao
Xu Huang
Wenhao Zhu
Shujian Huang
Lei Li
Fei Yuan
LRM
32
0
0
16 Apr 2025
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
Bingjie Gao
Xinyu Gao
Xiaoxue Wu
Yujie Zhou
Yu Qiao
Li Niu
Xinyuan Chen
Yaohui Wang
76
0
0
16 Apr 2025
Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs
Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs
Hyungwoo Lee
Kihyun Kim
Jinwoo Kim
Jungmin So
Myung-Hoon Cha
H. Kim
James J. Kim
Youngjae Kim
37
0
0
16 Apr 2025
Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification
Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification
Yue Li
L. Zhang
34
0
0
16 Apr 2025
Multilingual Contextualization of Large Language Models for Document-Level Machine Translation
Multilingual Contextualization of Large Language Models for Document-Level Machine Translation
Miguel Moura Ramos
Patrick Fernandes
Sweta Agrawal
André F.T. Martins
71
0
0
16 Apr 2025
DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis
DeepMLF: Multimodal language model with learnable tokens for deep fusion in sentiment analysis
Efthymios Georgiou
V. Katsouros
Yannis Avrithis
Alexandros Potamianos
24
1
0
15 Apr 2025
Bridging Distribution Gaps in Time Series Foundation Model Pretraining with Prototype-Guided Normalization
Bridging Distribution Gaps in Time Series Foundation Model Pretraining with Prototype-Guided Normalization
Peiliang Gong
Emadeldeen Eldele
Min Wu
Zhenghua Chen
Xiaoli Li
Daoqiang Zhang
AI4TS
AI4CE
44
0
0
15 Apr 2025
LayoutCoT: Unleashing the Deep Reasoning Potential of Large Language Models for Layout Generation
LayoutCoT: Unleashing the Deep Reasoning Potential of Large Language Models for Layout Generation
Hengyu Shi
Junhao Su
Huansheng Ning
Xiaoming Wei
Jialin Gao
3DV
AI4TS
LRM
57
0
0
15 Apr 2025
VEXP: A Low-Cost RISC-V ISA Extension for Accelerated Softmax Computation in Transformers
VEXP: A Low-Cost RISC-V ISA Extension for Accelerated Softmax Computation in Transformers
Run Wang
Gamze Islamoglu
Andrea Belano
Viviane Potocnik
Francesco Conti
Angelo Garofalo
Luca Benini
31
0
0
15 Apr 2025
Making Acoustic Side-Channel Attacks on Noisy Keyboards Viable with LLM-Assisted Spectrograms' "Typo" Correction
Making Acoustic Side-Channel Attacks on Noisy Keyboards Viable with LLM-Assisted Spectrograms' "Typo" Correction
Seyyed Ali Ayati
Jin Hyun Park
Yichen Cai
Marcus Botacin
31
0
0
15 Apr 2025
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL
Junke Wang
Zhi Tian
Xinyu Wang
Xinyu Zhang
Weilin Huang
Zuxuan Wu
Yu Jiang
VGen
55
6
0
15 Apr 2025
Fine-Tuning Large Language Models on Quantum Optimization Problems for Circuit Generation
Fine-Tuning Large Language Models on Quantum Optimization Problems for Circuit Generation
Linus Jern
Valter Uotila
Cong Yu
Bo Zhao
MQ
LRM
27
0
0
15 Apr 2025
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers
Hongkang Li
Yihua Zhang
Shuai Zhang
Hao Wu
Sijia Liu
Pin-Yu Chen
MoMe
69
4
0
15 Apr 2025
Previous
123...567...113114115
Next