ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.13971
  4. Cited By
LLaMA: Open and Efficient Foundation Language Models

LLaMA: Open and Efficient Foundation Language Models

27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
    ALM
    PILM
ArXivPDFHTML

Papers citing "LLaMA: Open and Efficient Foundation Language Models"

50 / 7,023 papers shown
Title
Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding
Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding
Benjamin Bergner
Andrii Skliar
Amelie Royer
Tijmen Blankevoort
Yuki Markus Asano
B. Bejnordi
58
5
0
26 Feb 2024
PhyGrasp: Generalizing Robotic Grasping with Physics-informed Large
  Multimodal Models
PhyGrasp: Generalizing Robotic Grasping with Physics-informed Large Multimodal Models
Dingkun Guo
Yuqi Xiang
Shuqi Zhao
Xinghao Zhu
Masayoshi Tomizuka
Mingyu Ding
Wei Zhan
42
10
0
26 Feb 2024
Set the Clock: Temporal Alignment of Pretrained Language Models
Set the Clock: Temporal Alignment of Pretrained Language Models
Bowen Zhao
Zander Brumbaugh
Yizhong Wang
Hanna Hajishirzi
Noah A. Smith
CLL
KELM
41
11
0
26 Feb 2024
CARTE: Pretraining and Transfer for Tabular Learning
CARTE: Pretraining and Transfer for Tabular Learning
Myung Jun Kim
Léo Grinsztajn
Gaël Varoquaux
LMTD
67
13
0
26 Feb 2024
A Comprehensive Evaluation of Quantization Strategies for Large Language
  Models
A Comprehensive Evaluation of Quantization Strategies for Large Language Models
Renren Jin
Jiangcun Du
Wuwei Huang
Wei Liu
Jian Luan
Bin Wang
Deyi Xiong
MQ
34
31
0
26 Feb 2024
CodeChameleon: Personalized Encryption Framework for Jailbreaking Large
  Language Models
CodeChameleon: Personalized Encryption Framework for Jailbreaking Large Language Models
Huijie Lv
Xiao Wang
Yuan Zhang
Caishuang Huang
Shihan Dou
Junjie Ye
Tao Gui
Qi Zhang
Xuanjing Huang
AAML
44
29
0
26 Feb 2024
Navigating Complexity: Orchestrated Problem Solving with Multi-Agent
  LLMs
Navigating Complexity: Orchestrated Problem Solving with Multi-Agent LLMs
Sumedh Rasal
E. Hauer
32
0
0
26 Feb 2024
GigaPevt: Multimodal Medical Assistant
GigaPevt: Multimodal Medical Assistant
Pavel Blinov
Konstantin Egorov
Ivan Sviridov
Nikolay Ivanov
S. Botman
Evgeniy Tagin
Stepan Kudin
Galina Zubkova
Andrey Savchenko
32
0
0
26 Feb 2024
Long-Context Language Modeling with Parallel Context Encoding
Long-Context Language Modeling with Parallel Context Encoding
Howard Yen
Tianyu Gao
Danqi Chen
40
43
0
26 Feb 2024
Multi-Bit Distortion-Free Watermarking for Large Language Models
Multi-Bit Distortion-Free Watermarking for Large Language Models
Massieh Kordi Boroujeny
Ya Jiang
Kai Zeng
Brian L. Mark
WaLM
VLM
48
4
0
26 Feb 2024
CLAP: Learning Transferable Binary Code Representations with Natural
  Language Supervision
CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision
Hao Wang
Zeyu Gao
Chao Zhang
Zihan Sha
Mingyang Sun
Yuchen Zhou
Wenyu Zhu
Wenju Sun
Han Qiu
Xiangwei Xiao
40
18
0
26 Feb 2024
LLM-based Privacy Data Augmentation Guided by Knowledge Distillation
  with a Distribution Tutor for Medical Text Classification
LLM-based Privacy Data Augmentation Guided by Knowledge Distillation with a Distribution Tutor for Medical Text Classification
Yiping Song
Juhua Zhang
Zhiliang Tian
Yuxin Yang
Minlie Huang
Dongsheng Li
41
10
0
26 Feb 2024
LLMArena: Assessing Capabilities of Large Language Models in Dynamic
  Multi-Agent Environments
LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments
Junzhe Chen
Xuming Hu
Shuodi Liu
Shiyu Huang
Weijuan Tu
Zhaofeng He
Lijie Wen
ELM
LLMAG
50
10
0
26 Feb 2024
mEdIT: Multilingual Text Editing via Instruction Tuning
mEdIT: Multilingual Text Editing via Instruction Tuning
Vipul Raheja
Dimitris Alikaniotis
Vivek Kulkarni
Bashar Alhafni
Dhruv Kumar
VLM
43
6
0
26 Feb 2024
Defending LLMs against Jailbreaking Attacks via Backtranslation
Defending LLMs against Jailbreaking Attacks via Backtranslation
Yihan Wang
Zhouxing Shi
Andrew Bai
Cho-Jui Hsieh
AAML
40
33
0
26 Feb 2024
Language-Specific Neurons: The Key to Multilingual Capabilities in Large
  Language Models
Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models
Tianyi Tang
Wenyang Luo
Haoyang Huang
Dongdong Zhang
Xiaolei Wang
Xin Zhao
Furu Wei
Ji-Rong Wen
66
50
0
26 Feb 2024
MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in
  Intellectual Property
MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property
Shiwen Ni
Minghuan Tan
Yuelin Bai
Fuqiang Niu
Min Yang
...
Xiaojun Chen
Chengming Li
Xiping Hu
Ye Li
Jianping Fan
61
7
0
26 Feb 2024
An Integrated Data Processing Framework for Pretraining Foundation
  Models
An Integrated Data Processing Framework for Pretraining Foundation Models
Yiding Sun
Feng Wang
Yutao Zhu
Wayne Xin Zhao
Jiaxin Mao
81
4
0
26 Feb 2024
CodeS: Towards Building Open-source Language Models for Text-to-SQL
CodeS: Towards Building Open-source Language Models for Text-to-SQL
Haoyang Li
Jing Zhang
Hanbing Liu
Ju Fan
Xiaokang Zhang
Jun Zhu
Renjie Wei
Hongyan Pan
Cuiping Li
Hong Chen
ELM
AI4TS
50
98
0
26 Feb 2024
BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning
  of SAM
BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM
Li Zhang
Youwei Liang
Ruiyi Zhang
Amirhosein Javadi
Pengtao Xie
VLM
29
8
0
26 Feb 2024
Personalized Federated Instruction Tuning via Neural Architecture Search
Personalized Federated Instruction Tuning via Neural Architecture Search
Peng Zhang
Yingbo Zhou
Ming Hu
Junxian Feng
Jiawen Weng
Mingsong Chen
FedML
45
4
0
26 Feb 2024
UniRetriever: Multi-task Candidates Selection for Various
  Context-Adaptive Conversational Retrieval
UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational Retrieval
Hongru Wang
Boyang Xue
Baohang Zhou
Rui Wang
Fei Mi
Weichao Wang
Yasheng Wang
Kam-Fai Wong
50
3
0
26 Feb 2024
ChatMusician: Understanding and Generating Music Intrinsically with LLM
ChatMusician: Understanding and Generating Music Intrinsically with LLM
Ti-Fen Pan
Hanfeng Lin
Yi Wang
Zeyue Tian
Shangda Wu
...
Gus Xia
Roger Dannenberg
Wei Xue
Shiyin Kang
Yike Guo
101
36
0
25 Feb 2024
PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
PeriodicLoRA: Breaking the Low-Rank Bottleneck in LoRA Optimization
Xiangdi Meng
Damai Dai
Weiyao Luo
Zhe Yang
Shaoxiang Wu
Xiaochen Wang
Peiyi Wang
Qingxiu Dong
Liang Chen
Zhifang Sui
114
11
0
25 Feb 2024
AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D
  Talking Face Generation
AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation
Yasheng Sun
Wenqing Chu
Hang Zhou
Kaisiyuan Wang
Hideki Koike
42
5
0
25 Feb 2024
InstructEdit: Instruction-based Knowledge Editing for Large Language
  Models
InstructEdit: Instruction-based Knowledge Editing for Large Language Models
Ningyu Zhang
Bo Tian
Siyuan Cheng
Xiaozhuan Liang
Yi Hu
Kouying Xue
Yanjie Gou
Xi Chen
Huajun Chen
KELM
62
4
0
25 Feb 2024
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis
Yao Mu
Junting Chen
Qinglong Zhang
Shoufa Chen
Qiaojun Yu
...
Wenhai Wang
Jifeng Dai
Yu Qiao
Mingyu Ding
Ping Luo
51
22
0
25 Feb 2024
StochCA: A Novel Approach for Exploiting Pretrained Models with
  Cross-Attention
StochCA: A Novel Approach for Exploiting Pretrained Models with Cross-Attention
SeungWon Seo
Suho Lee
Sangheum Hwang
43
0
0
25 Feb 2024
UrbanGPT: Spatio-Temporal Large Language Models
UrbanGPT: Spatio-Temporal Large Language Models
Zhonghang Li
Lianghao Xia
Jiabin Tang
Yong-mei Xu
Lei Shi
Long Xia
Dawei Yin
Chao Huang
AI4TS
42
38
0
25 Feb 2024
Evaluating Robustness of Generative Search Engine on Adversarial Factual
  Questions
Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions
Xuming Hu
Xiaochuan Li
Junzhe Chen
Hai-Tao Zheng
Yangning Li
...
Yasheng Wang
Qun Liu
Lijie Wen
Philip S. Yu
Zhijiang Guo
AAML
ELM
37
5
0
25 Feb 2024
GraphWiz: An Instruction-Following Language Model for Graph Problems
GraphWiz: An Instruction-Following Language Model for Graph Problems
Nuo Chen
Yuhan Li
Jianheng Tang
Jia Li
50
28
0
25 Feb 2024
Building Flexible Machine Learning Models for Scientific Computing at
  Scale
Building Flexible Machine Learning Models for Scientific Computing at Scale
Tianyu Chen
Haoyi Zhou
Ying Li
Hao Wang
Chonghan Gao
Shanghang Zhang
Jianxin Li
AI4CE
37
0
0
25 Feb 2024
LoRA Meets Dropout under a Unified Framework
LoRA Meets Dropout under a Unified Framework
Sheng Wang
Liheng Chen
Jiyue Jiang
Boyang Xue
Lingpeng Kong
Chuan Wu
31
14
0
25 Feb 2024
Sustainable Supercomputing for AI: GPU Power Capping at HPC Scale
Sustainable Supercomputing for AI: GPU Power Capping at HPC Scale
Dan Zhao
S. Samsi
Joseph McDonald
Baolin Li
David Bestor
Michael Jones
Devesh Tiwari
V. Gadepally
55
17
0
25 Feb 2024
PRP: Propagating Universal Perturbations to Attack Large Language Model
  Guard-Rails
PRP: Propagating Universal Perturbations to Attack Large Language Model Guard-Rails
Neal Mangaokar
Ashish Hooda
Jihye Choi
Shreyas Chandrashekaran
Kassem Fawaz
Somesh Jha
Atul Prakash
AAML
35
35
0
24 Feb 2024
Prompt Perturbation Consistency Learning for Robust Language Models
Prompt Perturbation Consistency Learning for Robust Language Models
Yao Qiang
Subhrangshu Nandi
Ninareh Mehrabi
Greg Ver Steeg
Anoop Kumar
Anna Rumshisky
Aram Galstyan
48
6
0
24 Feb 2024
OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining
OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining
Fanjin Zhang
Shijie Shi
Yifan Zhu
Bo Chen
Yukuo Cen
...
Huihui Yuan
Jian Song
Xiaoyan Li
Yuxiao Dong
Jie Tang
47
16
0
24 Feb 2024
Look Before You Leap: Problem Elaboration Prompting Improves
  Mathematical Reasoning in Large Language Models
Look Before You Leap: Problem Elaboration Prompting Improves Mathematical Reasoning in Large Language Models
Haoran Liao
Jidong Tian
Shaohua Hu
Hao He
Yaohui Jin
ReLM
LRM
46
1
0
24 Feb 2024
Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM
  Fine-Tuning
Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning
Yong Liu
Zirui Zhu
Chaoyu Gong
Minhao Cheng
Cho-Jui Hsieh
Yang You
MoE
50
16
0
24 Feb 2024
Intelligent Director: An Automatic Framework for Dynamic Visual
  Composition using ChatGPT
Intelligent Director: An Automatic Framework for Dynamic Visual Composition using ChatGPT
Sixiao Zheng
Jingyang Huo
Yu Wang
Yanwei Fu
VGen
DiffM
44
1
0
24 Feb 2024
Hal-Eval: A Universal and Fine-grained Hallucination Evaluation
  Framework for Large Vision Language Models
Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models
Chaoya Jiang
Wei Ye
Mengfan Dong
Hongrui Jia
Haiyang Xu
Mingshi Yan
Ji Zhang
Shikun Zhang
VLM
MLLM
48
15
0
24 Feb 2024
Query Augmentation by Decoding Semantics from Brain Signals
Query Augmentation by Decoding Semantics from Brain Signals
Ziyi Ye
Jingtao Zhan
Qingyao Ai
Yiqun Liu
Maarten de Rijke
Christina Lioma
Tuukka Ruotsalo
52
0
0
24 Feb 2024
Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical
  Study
Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study
ZHAOYUE SUN
Gabriele Pergola
Byron C. Wallace
Yulan He
LM&MA
42
13
0
24 Feb 2024
MegaScale: Scaling Large Language Model Training to More Than 10,000
  GPUs
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs
Ziheng Jiang
Yanghua Peng
Yinmin Zhong
Qi Huang
Yangrui Chen
...
Zhe Li
X. Jia
Jia-jun Ye
Xin Jin
Xin Liu
LRM
46
105
0
23 Feb 2024
How Do Nonlinear Transformers Learn and Generalize in In-Context
  Learning?
How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?
Hongkang Li
Meng Wang
Songtao Lu
Xiaodong Cui
Pin-Yu Chen
MLT
51
16
0
23 Feb 2024
AutoMMLab: Automatically Generating Deployable Models from Language
  Instructions for Computer Vision Tasks
AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks
Zekang Yang
Wang Zeng
Sheng Jin
Chao Qian
Ping Luo
Wentao Liu
MLLM
VLM
66
8
0
23 Feb 2024
GPTVQ: The Blessing of Dimensionality for LLM Quantization
GPTVQ: The Blessing of Dimensionality for LLM Quantization
M. V. Baalen
Andrey Kuzmin
Markus Nagel
Peter Couperus
Cédric Bastoul
E. Mahurin
Tijmen Blankevoort
Paul N. Whatmough
MQ
41
28
0
23 Feb 2024
ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and
  Two-Phase Partition
ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition
Lu Ye
Ze Tao
Yong Huang
Yang Li
34
26
0
23 Feb 2024
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and
  Context-Aware Visual Speech Processing
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing
Jeong Hun Yeo
Seunghee Han
Minsu Kim
Y. Ro
61
759
0
23 Feb 2024
Fine-tuning CLIP Text Encoders with Two-step Paraphrasing
Fine-tuning CLIP Text Encoders with Two-step Paraphrasing
Hyunjae Kim
Seunghyun Yoon
Trung Bui
Handong Zhao
Quan Tran
Franck Dernoncourt
Jaewoo Kang
CLIP
27
2
0
23 Feb 2024
Previous
123...9899100...139140141
Next