ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.13971
  4. Cited By
LLaMA: Open and Efficient Foundation Language Models

LLaMA: Open and Efficient Foundation Language Models

27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
    ALM
    PILM
ArXivPDFHTML

Papers citing "LLaMA: Open and Efficient Foundation Language Models"

50 / 7,027 papers shown
Title
Mixture-of-Experts Meets Instruction Tuning:A Winning Combination for
  Large Language Models
Mixture-of-Experts Meets Instruction Tuning:A Winning Combination for Large Language Models
Sheng Shen
Le Hou
Yan-Quan Zhou
Nan Du
Shayne Longpre
...
Vincent Zhao
Hongkun Yu
Kurt Keutzer
Trevor Darrell
Denny Zhou
ALM
MoE
45
55
0
24 May 2023
Emergent inabilities? Inverse scaling over the course of pretraining
Emergent inabilities? Inverse scaling over the course of pretraining
J. Michaelov
Benjamin Bergen
LRM
ReLM
22
3
0
24 May 2023
ExpertPrompting: Instructing Large Language Models to be Distinguished Experts
ExpertPrompting: Instructing Large Language Models to be Distinguished Experts
Benfeng Xu
An Yang
Junyang Lin
Quang Wang
Chang Zhou
Yongdong Zhang
Zhendong Mao
ALM
47
133
0
24 May 2023
LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond
Philippe Laban
Wojciech Kry'sciñski
Divyansh Agarwal
Alexander R. Fabbri
Caiming Xiong
Chenyu You
Chien-Sheng Wu
ALM
HILM
38
33
0
23 May 2023
Domain-Expanded ASTE: Rethinking Generalization in Aspect Sentiment
  Triplet Extraction
Domain-Expanded ASTE: Rethinking Generalization in Aspect Sentiment Triplet Extraction
Yew Ken Chia
Hui Chen
Wei Han
Guizhen Chen
Sharifah Mahani Aljunied
Soujanya Poria
Lidong Bing
45
3
0
23 May 2023
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model
  Pre-training
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training
Hong Liu
Zhiyuan Li
David Leo Wright Hall
Percy Liang
Tengyu Ma
VLM
60
133
0
23 May 2023
RET-LLM: Towards a General Read-Write Memory for Large Language Models
RET-LLM: Towards a General Read-Write Memory for Large Language Models
Ali Modarressi
Ayyoob Imani
Mohsen Fayyaz
Hinrich Schütze
KELM
LLMAG
21
34
0
23 May 2023
LLM-powered Data Augmentation for Enhanced Cross-lingual Performance
LLM-powered Data Augmentation for Enhanced Cross-lingual Performance
Chenxi Whitehouse
Monojit Choudhury
Alham Fikri Aji
SyDa
LRM
37
71
0
23 May 2023
Active Learning Principles for In-Context Learning with Large Language
  Models
Active Learning Principles for In-Context Learning with Large Language Models
Katerina Margatina
Timo Schick
Nikolaos Aletras
Jane Dwivedi-Yu
37
39
0
23 May 2023
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long
  Form Text Generation
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
Sewon Min
Kalpesh Krishna
Xinxi Lyu
M. Lewis
Wen-tau Yih
Pang Wei Koh
Mohit Iyyer
Luke Zettlemoyer
Hannaneh Hajishirzi
HILM
ALM
91
619
0
23 May 2023
ManiTweet: A New Benchmark for Identifying Manipulation of News on
  Social Media
ManiTweet: A New Benchmark for Identifying Manipulation of News on Social Media
Kung-Hsiang Huang
Hou Pong Chan
Kathleen McKeown
Heng Ji
44
1
0
23 May 2023
Label Words are Anchors: An Information Flow Perspective for
  Understanding In-Context Learning
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
Lean Wang
Lei Li
Damai Dai
Deli Chen
Hao Zhou
Fandong Meng
Jie Zhou
Xu Sun
42
176
0
23 May 2023
Memory-Efficient Fine-Tuning of Compressed Large Language Models via
  sub-4-bit Integer Quantization
Memory-Efficient Fine-Tuning of Compressed Large Language Models via sub-4-bit Integer Quantization
Jeonghoon Kim
J. H. Lee
Sungdong Kim
Joonsuk Park
Kang Min Yoo
S. Kwon
Dongsoo Lee
MQ
49
100
0
23 May 2023
Out-of-Distribution Generalization in Text Classification: Past,
  Present, and Future
Out-of-Distribution Generalization in Text Classification: Past, Present, and Future
Linyi Yang
Yangqiu Song
Xuan Ren
Chenyang Lyu
Yidong Wang
Lingqiao Liu
Jindong Wang
Jennifer Foster
Yue Zhang
OOD
44
2
0
23 May 2023
DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of
  Machine-Generated Text
DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text
Jinyan Su
Terry Yue Zhuo
Di Wang
Preslav Nakov
DeLMO
63
126
0
23 May 2023
OlaGPT: Empowering LLMs With Human-like Problem-Solving Abilities
OlaGPT: Empowering LLMs With Human-like Problem-Solving Abilities
Yuanzhen Xie
Tao Xie
Mingxiong Lin
Wen-Ke Wei
Chenglin Li
Beibei Kong
Lei Chen
Chengxiang Zhuo
Bo Hu
Zang Li
RALM
LLMAG
LRM
38
6
0
23 May 2023
A Trip Towards Fairness: Bias and De-Biasing in Large Language Models
A Trip Towards Fairness: Bias and De-Biasing in Large Language Models
Leonardo Ranaldi
Elena Sofia Ruzzetti
Davide Venditti
Dario Onorati
Fabio Massimo Zanzotto
45
35
0
23 May 2023
Learning from Mistakes via Cooperative Study Assistant for Large
  Language Models
Learning from Mistakes via Cooperative Study Assistant for Large Language Models
Danqing Wang
Lei Li
37
6
0
23 May 2023
CombLM: Adapting Black-Box Language Models through Small Fine-Tuned
  Models
CombLM: Adapting Black-Box Language Models through Small Fine-Tuned Models
Aitor Ormazabal
Mikel Artetxe
Eneko Agirre
50
19
0
23 May 2023
Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large
  Language Models
Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models
Alfonso Amayuelas
Kyle Wong
Liangming Pan
Wenhu Chen
Wenjie Wang
44
25
0
23 May 2023
MemeCap: A Dataset for Captioning and Interpreting Memes
MemeCap: A Dataset for Captioning and Interpreting Memes
EunJeong Hwang
Vered Shwartz
VLM
27
36
0
23 May 2023
Polyglot or Not? Measuring Multilingual Encyclopedic Knowledge in
  Foundation Models
Polyglot or Not? Measuring Multilingual Encyclopedic Knowledge in Foundation Models
Tim Schott
Daniel Furman
Shreshta Bhat
ELM
40
4
0
23 May 2023
LM vs LM: Detecting Factual Errors via Cross Examination
LM vs LM: Detecting Factual Errors via Cross Examination
Roi Cohen
May Hamri
Mor Geva
Amir Globerson
HILM
46
121
0
22 May 2023
Multi-Task Instruction Tuning of LLaMa for Specific Scenarios: A
  Preliminary Study on Writing Assistance
Multi-Task Instruction Tuning of LLaMa for Specific Scenarios: A Preliminary Study on Writing Assistance
Yue Zhang
Leyang Cui
Deng Cai
Xinting Huang
Tao Fang
Wei Bi
ALM
38
36
0
22 May 2023
SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim
  Verification on Scientific Tables
SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables
Xinyuan Lu
Liangming Pan
Qian Liu
Preslav Nakov
Min-Yen Kan
LMTD
49
25
0
22 May 2023
Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM
  Inference Pipeline
Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline
Zangwei Zheng
Xiaozhe Ren
Fuzhao Xue
Yang Luo
Xin Jiang
Yang You
49
58
0
22 May 2023
InheritSumm: A General, Versatile and Compact Summarizer by Distilling
  from GPT
InheritSumm: A General, Versatile and Compact Summarizer by Distilling from GPT
Yichong Xu
Ruochen Xu
Dan Iter
Yang Liu
Shuohang Wang
Chenguang Zhu
Michael Zeng
21
10
0
22 May 2023
Making Language Models Better Tool Learners with Execution Feedback
Making Language Models Better Tool Learners with Execution Feedback
Shuofei Qiao
Honghao Gui
Chengfei Lv
Qianghuai Jia
Huajun Chen
Ningyu Zhang
LLMAG
51
46
0
22 May 2023
AudioToken: Adaptation of Text-Conditioned Diffusion Models for
  Audio-to-Image Generation
AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation
Guy Yariv
Itai Gat
Lior Wolf
Yossi Adi
Idan Schwartz
DiffM
44
21
0
22 May 2023
Textually Pretrained Speech Language Models
Textually Pretrained Speech Language Models
Michael Hassid
Tal Remez
Tu Nguyen
Itai Gat
Alexis Conneau
...
Alexandre Défossez
Gabriel Synnaeve
Emmanuel Dupoux
Roy Schwartz
Yossi Adi
VLM
SyDa
51
54
0
22 May 2023
GPT-SW3: An Autoregressive Language Model for the Nordic Languages
GPT-SW3: An Autoregressive Language Model for the Nordic Languages
Ariel Ekgren
Amaru Cuba Gyllensten
Felix Stollenwerk
Joey Öhman
T. Isbister
Evangelia Gogoulou
F. Carlsson
Alice Heiman
Judit Casademont
Magnus Sahlgren
34
13
0
22 May 2023
Lion: Adversarial Distillation of Proprietary Large Language Models
Lion: Adversarial Distillation of Proprietary Large Language Models
Yuxin Jiang
Chunkit Chan
Yin Hua
Wei Wang
ALM
40
23
0
22 May 2023
Can We Edit Factual Knowledge by In-Context Learning?
Can We Edit Factual Knowledge by In-Context Learning?
Ce Zheng
Lei Li
Qingxiu Dong
Yuxuan Fan
Zhiyong Wu
Jingjing Xu
Baobao Chang
KELM
39
187
0
22 May 2023
llm-japanese-dataset v0: Construction of Japanese Chat Dataset for Large
  Language Models and its Methodology
llm-japanese-dataset v0: Construction of Japanese Chat Dataset for Large Language Models and its Methodology
Masanori Hirano
Masahiro Suzuki
Hiroki Sakaji
23
6
0
22 May 2023
TheoremQA: A Theorem-driven Question Answering dataset
TheoremQA: A Theorem-driven Question Answering dataset
Wenhu Chen
Ming Yin
Max W.F. Ku
Pan Lu
Yixin Wan
Xueguang Ma
Jianyu Xu
Xinyi Wang
Tony Xia
AIMat
38
125
0
21 May 2023
DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated
  Text Detection
DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text Detection
Xiao Yu
Yuang Qi
Kejiang Chen
Guoqiang Chen
Xi Yang
Pengyuan Zhu
Xiuwei Shang
Weiming Zhang
Neng H. Yu
DeLMO
28
11
0
21 May 2023
Teaching the Pre-trained Model to Generate Simple Texts for Text
  Simplification
Teaching the Pre-trained Model to Generate Simple Texts for Text Simplification
Renliang Sun
Wei Xu
Xiaojun Wan
CLL
29
19
0
21 May 2023
Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition
  with Auxiliary Refined Knowledge
Prompting ChatGPT in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge
Jinyuan Li
Han Li
Zhufeng Pan
Di Sun
Jiahao Wang
Wenkun Zhang
Gang Pan
52
22
0
20 May 2023
OPT-R: Exploring the Role of Explanations in Finetuning and Prompting
  for Reasoning Skills of Large Language Models
OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models
Badr AlKhamissi
Siddharth Verma
Ping Yu
Zhijing Jin
Asli Celikyilmaz
Mona T. Diab
LRM
ReLM
35
10
0
19 May 2023
Scaling laws for language encoding models in fMRI
Scaling laws for language encoding models in fMRI
Richard Antonello
Aditya R. Vaidya
Alexander G. Huth
MedIm
35
59
0
19 May 2023
Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning
  and Coding with LLMs
Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs
Pranjal Aggarwal
Aman Madaan
Yiming Yang
Mausam
LRM
41
38
0
19 May 2023
Solving NLP Problems through Human-System Collaboration: A
  Discussion-based Approach
Solving NLP Problems through Human-System Collaboration: A Discussion-based Approach
Masahiro Kaneko
Graham Neubig
Naoaki Okazaki
44
6
0
19 May 2023
Examining Inter-Consistency of Large Language Models Collaboration: An
  In-depth Analysis via Debate
Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via Debate
Kai Xiong
Xiao Ding
Yixin Cao
Ting Liu
Bing Qin
26
61
0
19 May 2023
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via
  Tool Embeddings
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings
Shibo Hao
Tianyang Liu
Zhen Wang
Zhiting Hu
RALM
LLMAG
76
174
0
19 May 2023
Empower Large Language Model to Perform Better on Industrial
  Domain-Specific Question Answering
Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering
Fangkai Yang
Pu Zhao
Zezhong Wang
Lu Wang
Jue Zhang
Mohit Garg
Qingwei Lin
Saravan Rajmohan
Dongmei Zhang
39
47
0
19 May 2023
Data Redaction from Conditional Generative Models
Data Redaction from Conditional Generative Models
Zhifeng Kong
Kamalika Chaudhuri
KELM
26
7
0
18 May 2023
Writing your own book: A method for going from closed to open book QA to
  improve robustness and performance of smaller LLMs
Writing your own book: A method for going from closed to open book QA to improve robustness and performance of smaller LLMs
Giorgi Kokaia
Pratyush Sinha
Yutong Jiang
N. Boujemaa
25
0
0
18 May 2023
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions
  with Large Language Model
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
Siyuan Huang
Zhengkai Jiang
Hao Dong
Yu Qiao
Peng Gao
Hongsheng Li
LM&Ro
37
93
0
18 May 2023
SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal
  Conversational Abilities
SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities
Dong Zhang
Shimin Li
Xin Zhang
Jun Zhan
Pengyu Wang
Yaqian Zhou
Xipeng Qiu
AuLLM
MLLM
62
304
0
18 May 2023
Are Large Language Models Fit For Guided Reading?
Are Large Language Models Fit For Guided Reading?
Peter Ochieng
LM&MA
ELM
AI4Ed
32
2
0
18 May 2023
Previous
123...137138139140141
Next