2302.13971
LLaMA: Open and Efficient Foundation Language Models
27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
Papers citing "LLaMA: Open and Efficient Foundation Language Models"
50 / 7,181 papers shown
Title
When MOE Meets LLMs: Parameter Efficient Fine-tuning for Multi-task Medical Applications
Qidong Liu
Xian Wu
Xiangyu Zhao
Yuanshao Zhu
Derong Xu
Feng Tian
Yefeng Zheng
MoE
56
66
0
21 Oct 2023
Transductive Learning for Textual Few-Shot Classification in API-based Embedding Models
Pierre Colombo
Victor Pellegrain
Malik Boudiaf
Victor Storchan
Myriam Tami
Ismail Ben Ayed
Céline Hudelot
Pablo Piantanida
43
8
0
21 Oct 2023
On Bilingual Lexicon Induction with Large Language Models
Yaoyiran Li
Anna Korhonen
Ivan Vulić
53
3
0
21 Oct 2023
Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs
Young-Suk Lee
Md Arafat Sultan
Yousef El-Kurdi
Tahira Naseem
Asim Munawar
Radu Florian
Salim Roukos
Ramón Fernández Astudillo
SyDa
32
6
0
21 Oct 2023
Evaluation Metrics in the Era of GPT-4: Reliably Evaluating Large Language Models on Sequence to Sequence Tasks
Andrea Sottana
Bin Liang
Kai Zou
Zheng Yuan
ALM
ELM
LM&MA
51
55
0
20 Oct 2023
Three Questions Concerning the Use of Large Language Models to Facilitate Mathematics Learning
An-Zi Yen
Wei-Ling Hsu
LRM
AI4Ed
38
10
0
20 Oct 2023
Make Your Decision Convincing! A Unified Two-Stage Framework: Self-Attribution and Decision-Making
Yanrui Du
Sendong Zhao
Hao Wang
Yuhan Chen
Rui Bai
Zewen Qiang
Muzhen Cai
Bing Qin
28
0
0
20 Oct 2023
MarineGPT: Unlocking Secrets of Ocean to the Public
Ziqiang Zheng
Jipeng Zhang
Tuan-Anh Vu
Shizhe Diao
Yue Him Wong Tim
Sai-Kit Yeung
55
13
0
20 Oct 2023
ReLM: Leveraging Language Models for Enhanced Chemical Reaction Prediction
Yaorui Shi
An Zhang
Enzhi Zhang
Zhiyuan Liu
Xiang Wang
AI4CE
37
24
0
20 Oct 2023
She had Cobalt Blue Eyes: Prompt Testing to Create Aligned and Sustainable Language Models
Veronica Chatrath
Oluwanifemi Bamgbose
Shaina Raza
ALM
ELM
36
1
0
20 Oct 2023
Controlled Randomness Improves the Performance of Transformer Models
Tobias Deußer
Cong Zhao
Wolfgang Krämer
David Leonhard
Christian Bauckhage
R. Sifa
34
1
0
20 Oct 2023
Mind the instructions: a holistic evaluation of consistency and interactions in prompt-based learning
Lucas Weber
Elia Bruni
Dieuwke Hupkes
48
26
0
20 Oct 2023
Ask Language Model to Clean Your Noisy Translation Data
Quinten Bolding
Baohao Liao
Brandon James Denis
Jun Luo
Christof Monz
45
6
0
20 Oct 2023
Tuna: Instruction Tuning using Feedback from Large Language Models
Haoran Li
Yiran Liu
Xingxing Zhang
Wei Lu
Furu Wei
ALM
46
3
0
20 Oct 2023
Democratizing Reasoning Ability: Tailored Learning from Large Language Model
Zhaoyang Wang
Shaohan Huang
Yuxuan Liu
Jiahai Wang
Minghui Song
...
Haizhen Huang
Furu Wei
Weiwei Deng
Feng Sun
Qi Zhang
LRM
70
11
0
20 Oct 2023
Zero-Shot Sharpness-Aware Quantization for Pre-trained Language Models
Miaoxi Zhu
Qihuang Zhong
Li Shen
Liang Ding
Juhua Liu
Bo Du
Dacheng Tao
MQ
VLM
34
1
0
20 Oct 2023
Test-Time Self-Adaptive Small Language Models for Question Answering
Soyeong Jeong
Jinheon Baek
Sukmin Cho
Sung Ju Hwang
Jong C. Park
39
2
0
20 Oct 2023
Interpreting Indirect Answers to Yes-No Questions in Multiple Languages
Zijie Wang
Md Mosharaf Hossain
Shivam Mathur
Terry Cruz Melo
Kadir Bulut Ozler
...
Jacob Quintero
MohammadHossein Rezaei
Shreya Nupur Shakya
Md Nayem Uddin
Eduardo Blanco
40
1
0
20 Oct 2023
SALMONN: Towards Generic Hearing Abilities for Large Language Models
Changli Tang
Wenyi Yu
Guangzhi Sun
Xianzhao Chen
Tian Tan
Wei Li
Lu Lu
Zejun Ma
Chao Zhang
LM&MA
AuLLM
47
223
0
20 Oct 2023
Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds
Sipeng Zheng
Jiazheng Liu
Yicheng Feng
Zongqing Lu
47
29
0
20 Oct 2023
Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking
Shengyao Zhuang
Bing Liu
Bevan Koopman
Guido Zuccon
RALM
37
47
0
20 Oct 2023
The Less the Merrier? Investigating Language Representation in Multilingual Models
H. Nigatu
A. Tonja
Jugal Kalita
49
0
0
20 Oct 2023
Enhancing Zero-Shot Crypto Sentiment with Fine-tuned Language Model and Prompt Engineering
Rahman S. M. Wahidur
Ishmam Tashdeed
Manjit Kaur
Heung-No Lee
ALM
51
17
0
20 Oct 2023
Exploring the Impact of Corpus Diversity on Financial Pretrained Language Models
Jaeyoung Choe
Keonwoong Noh
Nayeon Kim
Seyun Ahn
Woohwan Jung
57
4
0
20 Oct 2023
NameGuess: Column Name Expansion for Tabular Data
Jiani Zhang
Zhengyuan Shen
Balasubramaniam Srinivasan
Shen Wang
Huzefa Rangwala
George Karypis
27
4
0
19 Oct 2023
Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries
Yiqiao Jin
Mohit Chandra
Gaurav Verma
Yibo Hu
Munmun De Choudhury
Srijan Kumar
LM&MA
ELM
100
70
0
19 Oct 2023
Frozen Transformers in Language Models Are Effective Visual Encoder Layers
Ziqi Pang
Ziyang Xie
Yunze Man
Yu-Xiong Wang
83
26
0
19 Oct 2023
AutoMix: Automatically Mixing Language Models
Pranjal Aggarwal
Aman Madaan
Ankit Anand
Srividya Pranavi Potharaju
Swaroop Mishra
...
Karthik Kappaganthu
Yiming Yang
Shyam Upadhyay
Manaal Faruqui
Mausam
42
20
0
19 Oct 2023
SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving
Xueliang Zhao
Xinting Huang
Wei Bi
Lingpeng Kong
LRM
53
0
0
19 Oct 2023
StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding
Cheng Jiayang
Lin Qiu
Tszho Chan
Tianqing Fang
Weiqi Wang
...
Qipeng Guo
Hongming Zhang
Yangqiu Song
Yue Zhang
Zheng Zhang
59
30
0
19 Oct 2023
The Locality and Symmetry of Positional Encodings
Lihu Chen
Gaël Varoquaux
Fabian M. Suchanek
51
0
0
19 Oct 2023
Knowledge-Augmented Language Model Verification
Jinheon Baek
Soyeong Jeong
Minki Kang
Jong C. Park
Sung Ju Hwang
RALM
47
13
0
19 Oct 2023
AgentTuning: Enabling Generalized Agent Abilities for LLMs
Aohan Zeng
Mingdao Liu
Rui Lu
Bowen Wang
Xiao Liu
Yuxiao Dong
Jie Tang
LM&MA
ALM
LLMAG
32
166
0
19 Oct 2023
Boosting Inference Efficiency: Unleashing the Power of Parameter-Shared Pre-trained Language Models
Weize Chen
Xiaoyue Xu
Xu Han
Yankai Lin
Ruobing Xie
Zhiyuan Liu
Maosong Sun
Jie Zhou
37
0
0
19 Oct 2023
MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter
Zhiyuan Liu
Changhao Nai
Yancheng Luo
Hao Fei
Yixin Cao
Kenji Kawaguchi
Xiang Wang
Tat-Seng Chua
43
86
0
19 Oct 2023
Are Structural Concepts Universal in Transformer Language Models? Towards Interpretable Cross-Lingual Generalization
Ningyu Xu
Qi Zhang
Jingting Ye
Menghan Zhang
Xuanjing Huang
41
4
0
19 Oct 2023
Safe RLHF: Safe Reinforcement Learning from Human Feedback
Josef Dai
Xuehai Pan
Ruiyang Sun
Jiaming Ji
Xinbo Xu
Mickel Liu
Yizhou Wang
Yaodong Yang
64
317
0
19 Oct 2023
On the Optimization and Generalization of Multi-head Attention
Puneesh Deora
Rouzbeh Ghaderi
Hossein Taheri
Christos Thrampoulidis
MLT
57
34
0
19 Oct 2023
Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing
Yue Guo
Zian Xu
Yi Yang
ELM
34
9
0
19 Oct 2023
Reliable Academic Conference Question Answering: A Study Based on Large Language Model
Zhiwei Huang
Long Jin
Junjie Wang
Mingchen Tu
Yin Hua
Zhiqiang Liu
Jiawei Meng
Hua-zeng Chen
Wen Zhang
47
0
0
19 Oct 2023
GraphGPT: Graph Instruction Tuning for Large Language Models
Jiabin Tang
Yuhao Yang
Wei Wei
Lei Shi
Lixin Su
Suqi Cheng
Dawei Yin
Chao Huang
47
129
0
19 Oct 2023
Contrastive Learning for Inference in Dialogue
Etsuko Ishii
Yan Xu
Bryan Wilie
Ziwei Ji
Holy Lovenia
Willy Chung
Pascale Fung
40
0
0
19 Oct 2023
Know Where to Go: Make LLM a Relevant, Responsible, and Trustworthy Searcher
Xiang Shi
Jiawei Liu
Yinpeng Liu
Qikai Cheng
Wei Lu
RALM
HILM
KELM
29
6
0
19 Oct 2023
PoisonPrompt: Backdoor Attack on Prompt-based Large Language Models
Hongwei Yao
Jian Lou
Zhan Qin
SILM
AAML
71
33
0
19 Oct 2023
MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models
Deepak Nathani
David Wang
Liangming Pan
Wenjie Wang
KELM
LRM
ReLM
28
10
0
19 Oct 2023
Large Language Models for Code Analysis: Do LLMs Really Do Their Job?
Chongzhou Fang
Ning Miao
Shaurya Srivastav
Jialin Liu
Ruoyu Zhang
...
Asmita Asmita
Ryan Tsang
Najmeh Nazari
Han Wang
Houman Homayoun
38
41
0
18 Oct 2023
Document-Level Language Models for Machine Translation
Frithjof Petrick
Christian Herold
Pavel Petrushkov
Shahram Khadivi
Hermann Ney
37
9
0
18 Oct 2023
Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly
Sheng Lu
Shan Chen
Yingya Li
Danielle Bitterman
G. Savova
Iryna Gurevych
24
0
0
18 Oct 2023
InferDPT: Privacy-Preserving Inference for Black-box Large Language Model
Meng Tong
Kejiang Chen
Jie Zhang
Yuang Qi
Weiming Zhang
Neng H. Yu
Tianwei Zhang
Zhikun Zhang
SILM
58
2
0
18 Oct 2023
DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning
Abhaysinh Zala
Han Lin
Jaemin Cho
Mohit Bansal
48
14
0
18 Oct 2023