ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.13971
  4. Cited By
LLaMA: Open and Efficient Foundation Language Models

LLaMA: Open and Efficient Foundation Language Models

27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
    ALMPILM
ArXiv (abs)PDFHTML

Papers citing "LLaMA: Open and Efficient Foundation Language Models"

50 / 2,584 papers shown
Title
MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts
MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts
Wei Tao
Haocheng Lu
Xiaoyang Qu
Bin Zhang
Kai Lu
Jiguang Wan
Jianzong Wang
MQMoE
18
0
0
09 Jun 2025
Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations
Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations
Yizhen Li
Dell Zhang
Xuelong Li
Yiqing Shen
VLM
19
0
0
09 Jun 2025
HAIBU-ReMUD: Reasoning Multimodal Ultrasound Dataset and Model Bridging to General Specific Domains
HAIBU-ReMUD: Reasoning Multimodal Ultrasound Dataset and Model Bridging to General Specific Domains
Shijie Wang
Yilun Zhang
Zeyu Lai
Dexing Kong
24
0
0
09 Jun 2025
LeVo: High-Quality Song Generation with Multi-Preference Alignment
LeVo: High-Quality Song Generation with Multi-Preference Alignment
Shun Lei
Yaoxun Xu
Zhiwei Lin
Huaicheng Zhang
Wei Tan
...
Chenyu Yang
Haina Zhu
Shuai Wang
Zhiyong Wu
Dong Yu
47
0
0
09 Jun 2025
GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition
GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition
Jingchao Wang
Haote Yang
Jiang Wu
Yifan He
Xingjian Wei
...
Lingli Ge
Lijun Wu
Bin Wang
Dahua Lin
Conghui He
24
0
0
09 Jun 2025
Enhancing Watermarking Quality for LLMs via Contextual Generation States Awareness
Enhancing Watermarking Quality for LLMs via Contextual Generation States Awareness
Peiru Yang
Xintian Li
Wanchun Ni
Jinhua Yin
Huili Wang
Guoshun Nan
Shangguang Wang
Yongfeng Huang
Tao Qi
24
0
0
09 Jun 2025
ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving
Yongkang Li
Kaixin Xiong
Xiangyu Guo
Fang Li
Sixu Yan
...
Bing Wang
Guang Chen
Hangjun Ye
Wenyu Liu
Xinggang Wang
VLM
48
0
0
09 Jun 2025
Reward Model Interpretability via Optimal and Pessimal Tokens
Reward Model Interpretability via Optimal and Pessimal Tokens
Brian Christian
Hannah Rose Kirk
Jessica A.F. Thompson
Christopher Summerfield
Tsvetomira Dumbalska
AAML
17
0
0
08 Jun 2025
Evaluating and Improving Robustness in Large Language Models: A Survey and Future Directions
Evaluating and Improving Robustness in Large Language Models: A Survey and Future Directions
Kun Zhang
Le Wu
Kui Yu
Guangyi Lv
Dacao Zhang
AAMLELM
32
0
0
08 Jun 2025
MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks
MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks
Sanjoy Chowdhury
Mohamed Elmoghany
Yohan Abeysinghe
Junjie Fei
Sayan Nag
Salman Khan
Mohamed Elhoseiny
Dinesh Manocha
33
0
0
08 Jun 2025
Robotic Policy Learning via Human-assisted Action Preference Optimization
Robotic Policy Learning via Human-assisted Action Preference Optimization
Wenke Xia
Yichu Yang
Hongtao Wu
Xiao Ma
Tao Kong
Di Hu
33
0
0
08 Jun 2025
RARL: Improving Medical VLM Reasoning and Generalization with Reinforcement Learning and LoRA under Data and Hardware Constraints
RARL: Improving Medical VLM Reasoning and Generalization with Reinforcement Learning and LoRA under Data and Hardware Constraints
Tan-Hanh Pham
Chris Ngo
OffRLLRM
26
0
0
07 Jun 2025
FREE: Fast and Robust Vision Language Models with Early Exits
FREE: Fast and Robust Vision Language Models with Early Exits
Divya J. Bajpai
M. Hanawal
VLM
17
0
0
07 Jun 2025
MarginSel : Max-Margin Demonstration Selection for LLMs
MarginSel : Max-Margin Demonstration Selection for LLMs
Rajeev Bhatt Ambati
James Lester
Shashank Srivastava
Snigdha Chaturvedi
33
0
0
07 Jun 2025
Can Quantized Audio Language Models Perform Zero-Shot Spoofing Detection?
Can Quantized Audio Language Models Perform Zero-Shot Spoofing Detection?
Bikash Dutta
Rishabh Ranjan
Shyam Sathvik
Mayank Vatsa
Richa Singh
18
0
0
07 Jun 2025
Masked Language Models are Good Heterogeneous Graph Generalizers
Masked Language Models are Good Heterogeneous Graph Generalizers
Jinyu Yang
Cheng Yang
Shanyuan Cui
Zeyuan Guo
Liangwei Yang
Muhan Zhang
Chuan Shi
70
0
0
06 Jun 2025
Optimizing Recall or Relevance? A Multi-Task Multi-Head Approach for Item-to-Item Retrieval in Recommendation
Optimizing Recall or Relevance? A Multi-Task Multi-Head Approach for Item-to-Item Retrieval in Recommendation
Jiang Zhang
Sumit Kumar
Wei Chang
Yubo Wang
Feng Zhang
Weize Mao
Hanchao Yu
Aashu Singh
Min Li
Qifan Wang
53
0
0
06 Jun 2025
(LiFT) Lightweight Fitness Transformer: A language-vision model for Remote Monitoring of Physical Training
(LiFT) Lightweight Fitness Transformer: A language-vision model for Remote Monitoring of Physical Training
A. Postlmayr
P. Cosman
S. Dey
22
0
0
06 Jun 2025
Elementary Math Word Problem Generation using Large Language Models
Elementary Math Word Problem Generation using Large Language Models
Nimesh Ariyarathne
Harshani Bandara
Yasith Heshan
Omega Gamage
Surangika Ranathunga
...
Gayathri Lihinikaduarachchi
Tharoosha Vihidun
Meenambika Chandirakumar
Sanujen Premakumar
Sanjula Gathsara
AI4Ed
70
0
0
06 Jun 2025
Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning
Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning
Yuheng Lei
Sitong Mao
Shunbo Zhou
Hongyuan Zhang
Xuelong Li
Ping Luo
CLL
42
0
0
06 Jun 2025
MIRIAD: Augmenting LLMs with millions of medical query-response pairs
MIRIAD: Augmenting LLMs with millions of medical query-response pairs
Qinyue Zheng
Salman Abdullah
Sam Rawal
C. Zakka
Sophie Ostmeier
Maximilian Purk
E. Reis
Eric J. Topol
J. Leskovec
Michael Moor
LM&MAAI4MH
66
1
0
06 Jun 2025
Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias
Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias
Yuanzhe Hu
Kinshuk Goel
Vlad Killiakov
Yaoqing Yang
61
2
0
06 Jun 2025
Large Language Models are Demonstration Pre-Selectors for Themselves
Large Language Models are Demonstration Pre-Selectors for Themselves
Jiarui Jin
Yuwei Wu
Haoxuan Li
Xiaoting He
Weinan Zhang
Y. Yang
Yong Yu
Jun Wang
Mengyue Yang
65
0
0
06 Jun 2025
Mitigating Catastrophic Forgetting with Adaptive Transformer Block Expansion in Federated Fine-Tuning
Mitigating Catastrophic Forgetting with Adaptive Transformer Block Expansion in Federated Fine-Tuning
Yujia Huo
Jianchun Liu
Hongli Xu
Zhenguo Ma
Shilong Wang
Liusheng Huang
CLL
45
0
0
06 Jun 2025
RecGPT: A Foundation Model for Sequential Recommendation
RecGPT: A Foundation Model for Sequential Recommendation
Yangqin Jiang
Xubin Ren
Lianghao Xia
Da Luo
Kangyi Lin
Chao Huang
LRM
109
0
0
06 Jun 2025
SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
Jiahui Wang
Z. Liu
Yongming Rao
Jiwen Lu
VLMLRM
168
0
0
05 Jun 2025
On the Comprehensibility of Multi-structured Financial Documents using LLMs and Pre-processing Tools
Shivani Upadhyay
Messiah Ataey
Shariyar Murtuza
Yifan Nie
Jimmy J. Lin
94
0
0
05 Jun 2025
Simulating LLM-to-LLM Tutoring for Multilingual Math Feedback
Junior Cedric Tonga
KV Aditya Srivatsa
Kaushal Kumar Maurya
Fajri Koto
Ekaterina Kochmar
LRM
100
0
0
05 Jun 2025
LLM-based phoneme-to-grapheme for phoneme-based speech recognition
Te Ma
Min Bi
Saierdaer Yusuyin
Hao Huang
Zhijian Ou
172
0
0
05 Jun 2025
MANBench: Is Your Multimodal Model Smarter than Human?
MANBench: Is Your Multimodal Model Smarter than Human?
Han Zhou
Qitong Xu
Yiheng Dong
Xin Yang
19
0
0
04 Jun 2025
A Generative Adaptive Replay Continual Learning Model for Temporal Knowledge Graph Reasoning
A Generative Adaptive Replay Continual Learning Model for Temporal Knowledge Graph Reasoning
Zhiyu Zhang
Wei Chen
Youfang Lin
Huaiyu Wan
OffRLCLL
114
0
0
04 Jun 2025
Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis
Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis
Kejian Zhu
Shangqing Tu
Zhuoran Jin
Lei Hou
Juanzi Li
Jun Zhao
KELM
86
0
0
04 Jun 2025
Accurate Sublayer Pruning for Large Language Models by Exploiting Latency and Tunability Information
Accurate Sublayer Pruning for Large Language Models by Exploiting Latency and Tunability Information
Seungcheol Park
Sojin Lee
Jongjin Kim
Jinsik Lee
Hyunjik Jo
U. Kang
75
2
0
04 Jun 2025
Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision
Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision
Tomoya Yoshida
Shuhei Kurita
Taichi Nishimura
Shinsuke Mori
77
0
0
04 Jun 2025
TokAlign: Efficient Vocabulary Adaptation via Token Alignment
TokAlign: Efficient Vocabulary Adaptation via Token Alignment
Chong Li
Jiajun Zhang
Chengqing Zong
VLM
59
0
0
04 Jun 2025
ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices
ComRoPE: Scalable and Robust Rotary Position Embedding Parameterized by Trainable Commuting Angle Matrices
Hao Yu
Tangyu Jiang
Shuning Jia
Shannan Yan
Shunning Liu
Haolong Qian
Guanghao Li
Shuting Dong
Huaisong Zhang
Chun Yuan
102
0
0
04 Jun 2025
Should LLM Safety Be More Than Refusing Harmful Instructions?
Should LLM Safety Be More Than Refusing Harmful Instructions?
Utsav Maskey
Mark Dras
Usman Naseem
64
0
0
03 Jun 2025
Beyond Text Compression: Evaluating Tokenizers Across Scales
Beyond Text Compression: Evaluating Tokenizers Across Scales
Jonas F. Lotz
António V. Lopes
Stephan Peitz
Hendra Setiawan
Leonardo Emili
57
0
0
03 Jun 2025
METok: Multi-Stage Event-based Token Compression for Efficient Long Video Understanding
METok: Multi-Stage Event-based Token Compression for Efficient Long Video Understanding
Mengyue Wang
Shuo Chen
Kristian Kersting
Volker Tresp
Yunpu Ma
VLM
60
0
0
03 Jun 2025
KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider
KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider
Jiahao Wang
Jinbo Han
Xingda Wei
Sijie Shen
Dingyan Zhang
Chenguang Fang
Rong Chen
Wenyuan Yu
Haibo Chen
69
1
0
03 Jun 2025
Native-Resolution Image Synthesis
Native-Resolution Image Synthesis
Zidong Wang
Lei Bai
Xiangyu Yue
Wanli Ouyang
Yiyuan Zhang
74
0
0
03 Jun 2025
Rethinking the effects of data contamination in Code Intelligence
Rethinking the effects of data contamination in Code Intelligence
Zhen Yang
Hongyi Lin
Yifan He
Jie Xu
Zeyu Sun
Shuo Liu
P. Wang
Zhongxing Yu
Qingyuan Liang
50
0
0
03 Jun 2025
RATE-Nav: Region-Aware Termination Enhancement for Zero-shot Object Navigation with Vision-Language Models
RATE-Nav: Region-Aware Termination Enhancement for Zero-shot Object Navigation with Vision-Language Models
Junjie Li
Nan Zhang
Xiaoyang Qu
Kai Lu
Guokuan Li
Jiguang Wan
Jianzong Wang
54
0
0
03 Jun 2025
Heterogeneous Group-Based Reinforcement Learning for LLM-based Multi-Agent Systems
Heterogeneous Group-Based Reinforcement Learning for LLM-based Multi-Agent Systems
Guanzhong Chen
Shaoxiong Yang
Chao Li
Wei Liu
Jian Luan
Zenglin Xu
76
0
0
03 Jun 2025
Cell-o1: Training LLMs to Solve Single-Cell Reasoning Puzzles with Reinforcement Learning
Cell-o1: Training LLMs to Solve Single-Cell Reasoning Puzzles with Reinforcement Learning
Yin Fang
Qiao Jin
Guangzhi Xiong
Bowen Jin
Xianrui Zhong
Siru Ouyang
Aidong Zhang
Jiawei Han
Zhiyong Lu
ReLMOffRLLRM
50
0
0
03 Jun 2025
ATAG: AI-Agent Application Threat Assessment with Attack Graphs
ATAG: AI-Agent Application Threat Assessment with Attack Graphs
Parth Atulbhai Gandhi
Akansha Shukla
David Tayouri
Beni Ifland
Yuval Elovici
Rami Puzis
A. Shabtai
LLMAG
64
0
0
03 Jun 2025
KARE-RAG: Knowledge-Aware Refinement and Enhancement for RAG
KARE-RAG: Knowledge-Aware Refinement and Enhancement for RAG
Yongjian Li
HaoCheng Chu
Yukun Yan
Zhenghao Liu
S. Yu
Zheni Zeng
Ruobing Wang
Sen Song
Zhiyuan Liu
Maosong Sun
47
0
0
03 Jun 2025
One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL
One Missing Piece for Open-Source Reasoning Models: A Dataset to Mitigate Cold-Starting Short CoT LLMs in RL
Hyungjoo Chae
Dongjin Kang
J. Kim
Beong-woo Kwak
Sunghyun Park
Haeju Park
Jinyoung Yeo
M. Lee
Kyungjae Lee
ReLMLRM
51
0
0
03 Jun 2025
EssayBench: Evaluating Large Language Models in Multi-Genre Chinese Essay Writing
EssayBench: Evaluating Large Language Models in Multi-Genre Chinese Essay Writing
Fan Gao
Dongyuan Li
Ding Xia
Fei Mi
Yasheng Wang
Lifeng Shang
Baojun Wang
ELM
42
0
0
03 Jun 2025
Rethinking Dynamic Networks and Heterogeneous Computing with Automatic Parallelization
Rethinking Dynamic Networks and Heterogeneous Computing with Automatic Parallelization
Ruilong Wu
Xinjiao Li
Yisu Wang
Xinyu Chen
Dirk Kutscher
59
0
0
03 Jun 2025
Previous
123456...505152
Next