ResearchTrend.AI
LLaMA: Open and Efficient Foundation Language Models

27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
    ALM
    PILM

Papers citing "LLaMA: Open and Efficient Foundation Language Models"

50 / 5,796 papers shown
Title
MEP: Multiple Kernel Learning Enhancing Relative Positional Encoding Length Extrapolation
Weiguo Gao
36
1
0
26 Mar 2024
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition
Chenhongyi Yang
Zehui Chen
Miguel Espinosa
Linus Ericsson
Zhenyu Wang
Jiaming Liu
Elliot J. Crowley
Mamba
39
89
0
26 Mar 2024
Language Models for Text Classification: Is In-Context Learning Enough?
A. Edwards
Jose Camacho-Collados
LRM
49
18
0
26 Mar 2024
Towards a Zero-Data, Controllable, Adaptive Dialog System
Dirk Vath
Lindsey Vanderlyn
Ngoc Thang Vu
50
2
0
26 Mar 2024
m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt
Jian Yang
Hongcheng Guo
Yuwei Yin
Jiaqi Bai
Bing Wang
Jiaheng Liu
Xinnian Liang
Linzheng Chai
Liqun Yang
Zhoujun Li
40
9
0
26 Mar 2024
Naive Bayes-based Context Extension for Large Language Models
Jianlin Su
Murtadha Ahmed
Wenbo Luo
Abhishek Rao
Denny Zhou
Hyeontaek Lim
39
5
0
26 Mar 2024
KC-GenRe: A Knowledge-constrained Generative Re-ranking Method Based on Large Language Models for Knowledge Graph Completion
Yilin Wang
Minghao Hu
Zhen Huang
Dongsheng Li
Dong Yang
Xicheng Lu
27
2
0
26 Mar 2024
Provably Secure Disambiguating Neural Linguistic Steganography
Yuang Qi
Kejiang Chen
Kai Zeng
Weiming Zhang
Neng H. Yu
26
2
0
26 Mar 2024
DGoT: Dynamic Graph of Thoughts for Scientific Abstract Generation
Xinyu Ning
Yutong Zhao
Yitong Liu
Hongwen Yang
VLM
32
1
0
26 Mar 2024
LM-Combiner: A Contextual Rewriting Model for Chinese Grammatical Error Correction
Yixuan Wang
Baoxin Wang
Yijun Liu
Dayong Wu
Wanxiang Che
KELM
49
1
0
26 Mar 2024
Residual-based Language Models are Free Boosters for Biomedical Imaging
Zhixin Lai
Jing Wu
Suiyao Chen
Yucheng Zhou
N. Hovakimyan
MedIm
41
30
0
26 Mar 2024
Sketch2Prototype: Rapid Conceptual Design Exploration and Prototyping with Generative AI
Kristen M. Edwards
Brandon Man
Faez Ahmed
39
17
0
26 Mar 2024
The Unreasonable Ineffectiveness of the Deeper Layers
Andrey Gromov
Kushal Tirumala
Hassan Shapourian
Paolo Glorioso
Daniel A. Roberts
52
83
0
26 Mar 2024
SeSaMe: A Framework to Simulate Self-Reported Ground Truth for Mental Health Sensing Studies
Akshat Choube
V. D. Swain
Varun Mishra
45
1
0
25 Mar 2024
The Strong Pull of Prior Knowledge in Large Language Models and Its Impact on Emotion Recognition
Georgios Chochlakis
Alexandros Potamianos
Kristina Lerman
Shrikanth Narayanan
40
5
0
25 Mar 2024
Semantic Ranking for Automated Adversarial Technique Annotation in Security Text
Udesh Kumarasinghe
Ahmed Lekssays
Husrev Taha Sencar
Sabri Boughorbel
Charitha Elvitigala
Preslav Nakov
32
6
0
25 Mar 2024
Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
Hao Shao
Shengju Qian
Han Xiao
Guanglu Song
Zhuofan Zong
Letian Wang
Yu Liu
Hongsheng Li
VGen
LRM
MLLM
66
39
0
25 Mar 2024
Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text
Junshu Tang
Yanhong Zeng
Ke Fan
Xuheng Wang
Bo Dai
Kai Chen
Lizhuang Ma
26
7
0
25 Mar 2024
CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment
Feiteng Fang
Liang Zhu
Min Yang
Xi Feng
Jinchang Hou
Qixuan Zhao
Chengming Li
Xiping Hu
Ruifeng Xu
32
0
0
25 Mar 2024
Conversational Grounding: Annotation and Analysis of Grounding Acts and Grounding Units
Biswesh Mohapatra
Seemab Hassan
Laurent Romary
Justine Cassell
37
5
0
25 Mar 2024
Elysium: Exploring Object-level Perception in Videos via MLLM
Hang Wang
Yanjie Wang
Yongjie Ye
Yuxiang Nie
Can Huang
MLLM
42
19
0
25 Mar 2024
KIT-19: A Comprehensive Korean Instruction Toolkit on 19 Tasks for Fine-Tuning Korean Large Language Models
Dongjun Jang
Sungjoo Byun
Hyemi Jo
Hyopil Shin
ALM
23
0
0
25 Mar 2024
Re2LLM: Reflective Reinforcement Large Language Model for Session-based Recommendation
Ziyan Wang
Yingpeng Du
Zhu Sun
Haoyan Chua
Kaidong Feng
Wenya Wang
Jie Zhang
LRM
KELM
36
5
0
25 Mar 2024
An Experiment with the Use of ChatGPT for LCSH Subject Assignment on Electronic Theses and Dissertations
Eric H. C. Chow
TJ Kao
Xiaoli Li
24
3
0
25 Mar 2024
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance
Jiasheng Ye
Peiju Liu
Tianxiang Sun
Yunhua Zhou
Jun Zhan
Xipeng Qiu
57
64
0
25 Mar 2024
AIOS: LLM Agent Operating System
Kai Mei
Zelong Li
Wujiang Xu
Wenyue Hua
Mingyu Jin
Shuyuan Xu
Ruosong Ye
Yingqiang Ge
Yongfeng Zhang
LLMAG
30
17
0
25 Mar 2024
Large Language Models Offer an Alternative to the Traditional Approach of Topic Modelling
Yida Mu
Chun Dong
Kalina Bontcheva
Xingyi Song
31
19
0
24 Mar 2024
A Little Leak Will Sink a Great Ship: Survey of Transparency for Large Language Models from Start to Finish
Masahiro Kaneko
Timothy Baldwin
PILM
34
3
0
24 Mar 2024
A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters
Chunyu Xue
Weihao Cui
Han Zhao
Quan Chen
Shulai Zhang
Peng Yang
Jing Yang
Shaobo Li
Minyi Guo
56
2
0
24 Mar 2024
Qibo: A Large Language Model for Traditional Chinese Medicine
Heyi Zhang
Xin Wang
Zhaopeng Meng
Zhe Chen
Pengwei Zhuang
Yongzhe Jia
Dawei Xu
Wenbin Guo
LM&MA
42
10
0
24 Mar 2024
Monotonic Paraphrasing Improves Generalization of Language Model Prompting
Qin Liu
Fei Wang
Nan Xu
Tianyi Yan
Tao Meng
Muhao Chen
LRM
43
7
0
24 Mar 2024
CBT-LLM: A Chinese Large Language Model for Cognitive Behavioral Therapy-based Mental Health Question Answering
Hongbin Na
AI4MH
50
11
0
24 Mar 2024
Understanding Emergent Abilities of Language Models from the Loss Perspective
Zhengxiao Du
Aohan Zeng
Yuxiao Dong
Jie Tang
UQCV
LRM
70
46
0
23 Mar 2024
Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention
Bin Gao
Zhuomin He
Puru Sharma
Qingxuan Kang
Djordje Jevdjic
Junbo Deng
Xingkun Yang
Zhou Yu
Pengfei Zuo
71
45
0
23 Mar 2024
SensoryT5: Infusing Sensorimotor Norms into T5 for Enhanced Fine-grained Emotion Classification
Yuhan Xia
Qingqing Zhao
Yunfei Long
Ge Xu
Jia Wang
20
0
0
22 Mar 2024
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
Yuzhang Shang
Mu Cai
Bingxin Xu
Yong Jae Lee
Yan Yan
VLM
55
107
0
22 Mar 2024
Neural Plasticity-Inspired Multimodal Foundation Model for Earth Observation
Zhitong Xiong
Yi Wang
Fahong Zhang
Adam J. Stewart
Joelle Hanna
Damian Borth
Ioannis Papoutsis
B. L. Saux
Gustau Camps-Valls
Xiao Xiang Zhu
AI4CE
81
12
0
22 Mar 2024
Comprehensive Reassessment of Large-Scale Evaluation Outcomes in LLMs: A Multifaceted Statistical Approach
Kun Sun
Rong Wang
Anders Sogaard
37
3
0
22 Mar 2024
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
Orion Weller
Benjamin Chang
Sean MacAvaney
Kyle Lo
Arman Cohan
Benjamin Van Durme
Dawn J Lawrie
Luca Soldaini
63
30
0
22 Mar 2024
Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models
Qiong Wu
Weihao Ye
Yiyi Zhou
Xiaoshuai Sun
Rongrong Ji
MoE
52
1
0
22 Mar 2024
MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection
Taeheon Kim
Sangyun Chung
Damin Yeom
Youngjoon Yu
Hak Gu Kim
Y. Ro
38
2
0
22 Mar 2024
On the Convergence of Adam under Non-uniform Smoothness: Separability from SGDM and Beyond
Bohan Wang
Huishuai Zhang
Qi Meng
Ruoyu Sun
Zhi-Ming Ma
Wei Chen
37
7
0
22 Mar 2024
Construction of a Japanese Financial Benchmark for Large Language Models
Masanori Hirano
31
10
0
22 Mar 2024
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Nicholas Lee
Thanakul Wattanawong
Sehoon Kim
K. Mangalam
Sheng Shen
Gopala Anumanchipalli
Michael W. Mahoney
Kurt Keutzer
A. Gholami
69
46
0
22 Mar 2024
Generative Active Learning for Image Synthesis Personalization
Xu-Lu Zhang
Wengyu Zhang
Xiao Wei
Jinlin Wu
Zhaoxiang Zhang
Zhen Lei
Qing Li
110
2
0
22 Mar 2024
Comprehensive Evaluation and Insights into the Use of Large Language Models in the Automation of Behavior-Driven Development Acceptance Test Formulation
Shanthi Karpurapu
Sravanthy Myneni
Unnati Nettur
Likhit Sagar Gajja
Dave Burke
Tom Stiehm
Jeffery Payne
LM&MA
37
5
0
22 Mar 2024
KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation
Xindi Luo
Zequn Sun
Jing-xin Zhao
Zhe Zhao
Wei Hu
KELM
24
4
0
22 Mar 2024
Stance Reasoner: Zero-Shot Stance Detection on Social Media with Explicit Reasoning
Maksym Taranukhin
Vered Shwartz
E. Milios
LRM
44
6
0
22 Mar 2024
Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery
Guan-Feng Wang
Long Bai
Wan Jun Nah
Jie Wang
Zhaoxi Zhang
Zhen Chen
Jinlin Wu
Mobarakol Islam
Hongbin Liu
Hongliang Ren
46
14
0
22 Mar 2024
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
Renrui Zhang
Dongzhi Jiang
Yichi Zhang
Haokun Lin
Ziyu Guo
...
Aojun Zhou
Pan Lu
Kai-Wei Chang
Peng Gao
Hongsheng Li
34
173
0
21 Mar 2024