Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.13971
Cited By
LLaMA: Open and Efficient Foundation Language Models
27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LLaMA: Open and Efficient Foundation Language Models"
50 / 7,076 papers shown
Title
CoT-BERT: Enhancing Unsupervised Sentence Representation through Chain-of-Thought
Bowen Zhang
Kehua Chang
Chunping Li
SSL
55
6
0
20 Sep 2023
Embed-Search-Align: DNA Sequence Alignment using Transformer Models
Pavan Holur
Kenneth Enevoldsen
Shreyas Rajesh
L. Mboning
Thalia Georgiou
Louis-S. Bouchard
Matteo Pellegrini
V. Roychowdhury
37
0
0
20 Sep 2023
In-Context Learning for Text Classification with Many Labels
Aristides Milios
Siva Reddy
Dzmitry Bahdanau
27
34
0
19 Sep 2023
A Family of Pretrained Transformer Language Models for Russian
Dmitry Zmitrovich
Alexander Abramov
Andrey Kalmykov
Maria Tikhonova
Ekaterina Taktasheva
...
Vitalii Kadulin
Sergey Markov
Tatiana Shavrina
Vladislav Mikhailov
Alena Fenogenova
38
26
0
19 Sep 2023
GPT4AIGChip: Towards Next-Generation AI Accelerator Design Automation via Large Language Models
Yonggan Fu
Yongan Zhang
Zhongzhi Yu
Sixu Li
Zhifan Ye
Chaojian Li
Cheng Wan
Ying Lin
51
64
0
19 Sep 2023
Language Modeling Is Compression
Grégoire Delétang
Anian Ruoss
Paul-Ambroise Duquenne
Elliot Catt
Tim Genewein
...
Wenliang Kevin Li
Matthew Aitchison
Laurent Orseau
Marcus Hutter
J. Veness
AI4CE
64
136
0
19 Sep 2023
Model Leeching: An Extraction Attack Targeting LLMs
Lewis Birch
William Hackett
Stefan Trawicki
N. Suri
Peter Garraghan
52
13
0
19 Sep 2023
Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill
Wenzhe Cai
Siyuan Huang
Guangran Cheng
Yuxing Long
Peng Gao
Changyin Sun
Hao Dong
LM&Ro
30
42
0
19 Sep 2023
Baichuan 2: Open Large-scale Language Models
Ai Ming Yang
Bin Xiao
Bingning Wang
Borong Zhang
Ce Bian
...
Youxin Jiang
Yuchen Gao
Yupeng Zhang
Guosheng Dong
Zhiying Wu
ELM
LRM
77
718
0
19 Sep 2023
Pruning Large Language Models via Accuracy Predictor
Yupeng Ji
Yibo Cao
Jiu-si Liu
KELM
42
4
0
18 Sep 2023
Optimal Scene Graph Planning with Large Language Model Guidance
Zhirui Dai
Arash Asgharivaskasi
T. Duong
Shusen Lin
Maria-Elizabeth Tzes
George Pappas
Nikolay Atanasov
LM&Ro
47
19
0
17 Sep 2023
Contrastive Decoding Improves Reasoning in Large Language Models
Sean O'Brien
Mike Lewis
SyDa
LRM
ReLM
37
33
0
17 Sep 2023
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference
Parsa Kavehzadeh
Mojtaba Valipour
Marzieh S. Tahaei
Ali Ghodsi
Boxing Chen
Mehdi Rezagholizadeh
40
6
0
16 Sep 2023
PatFig: Generating Short and Long Captions for Patent Figures
Dana Aubakirova
Kim Gerdes
Lufei Liu
17
9
0
15 Sep 2023
Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics
Haoqin Tu
Bingchen Zhao
Chen Wei
Cihang Xie
MLLM
46
14
0
13 Sep 2023
Can Whisper perform speech-based in-context learning?
Siyin Wang
Chao-Han Huck Yang
Ji Wu
Chao Zhang
41
26
0
13 Sep 2023
Cognitive Mirage: A Review of Hallucinations in Large Language Models
Hongbin Ye
Tong Liu
Aijia Zhang
Wei Hua
Weiqiang Jia
HILM
53
77
0
13 Sep 2023
CONVERSER: Few-Shot Conversational Dense Retrieval with Synthetic Data Generation
Chao-Wei Huang
Chen-Yu Hsu
Tsung-Yuan Hsu
Chen-An Li
Yun-Nung Chen
29
5
0
13 Sep 2023
Balanced and Explainable Social Media Analysis for Public Health with Large Language Models
Yan Jiang
Ruihong Qiu
Yi Zhang
Peng Zhang
32
7
0
12 Sep 2023
Can you text what is happening? Integrating pre-trained language encoders into trajectory prediction models for autonomous driving
Ali Keysan
Andreas Look
Eitan Kosman
Gonca Gürsun
Jörg Wagner
Yu Yao
Barbara Rakitsch
35
29
0
11 Sep 2023
Evaluating Chatbots to Promote Users' Trust -- Practices and Open Problems
Biplav Srivastava
Kausik Lakkaraju
T. Koppel
Vignesh Narayanan
Ashish Kundu
Sachindra Joshi
42
2
0
09 Sep 2023
Zero-Shot Robustification of Zero-Shot Models
Dyah Adila
Changho Shin
Lin Cai
Frederic Sala
51
20
0
08 Sep 2023
Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in Chinese
Hao Wang
Sendong Zhao
Zewen Qiang
Zijian Li
Nuwa Xi
...
Haoqiang Guo
Yuhan Chen
Haoming Xu
Bing Qin
Ting Liu
LM&MA
AI4MH
39
19
0
08 Sep 2023
Manifold-based Verbalizer Space Re-embedding for Tuning-free Prompt-based Classification
Hao Wang
Sendong Zhao
Chi-Liang Liu
Nuwa Xi
Muzhen Cai
Bing Qin
Ting Liu
31
1
0
08 Sep 2023
ImageBind-LLM: Multi-modality Instruction Tuning
Jiaming Han
Renrui Zhang
Wenqi Shao
Peng Gao
Peng Xu
...
Yafei Wen
Xiaoxin Chen
Xiangyu Yue
Hongsheng Li
Yu Qiao
MLLM
54
119
0
07 Sep 2023
FLM-101B: An Open LLM and How to Train It with
100
K
B
u
d
g
e
t
100K Budget
100
K
B
u
d
g
e
t
Xiang Li
Yiqun Yao
Xin Jiang
Xuezhi Fang
Xuying Meng
...
Li Du
Bowen Qin
Zheng Zhang
Aixin Sun
Yequan Wang
60
22
0
07 Sep 2023
Enhancing Pipeline-Based Conversational Agents with Large Language Models
Mina Foosherian
Hendrik Purwins
Purna Rathnayake
Touhidul Alam
Rui Teimao
K. Thoben
LLMAG
37
2
0
07 Sep 2023
Exploring an LM to generate Prolog Predicates from Mathematics Questions
Xiaocheng Yang
Yik-Cheung Tam
ReLM
LRM
24
0
0
07 Sep 2023
Evaluating ChatGPT as a Recommender System: A Rigorous Approach
Dario Di Palma
Giovanni Maria Biancofiore
Vito Walter Anelli
Fedelucio Narducci
Tommaso Di Noia
E. Sciascio
ALM
59
28
0
07 Sep 2023
SAM3D: Segment Anything Model in Volumetric Medical Images
Nhat-Tan Bui
Dinh-Hieu Hoang
Minh-Triet Tran
Gianfranco Doretto
Donald Adjeroh
Brijesh Patel
Arabinda Choudhary
Ngan Le
MedIm
40
40
0
07 Sep 2023
From Base to Conversational: Japanese Instruction Dataset and Tuning Large Language Models
Masahiro Suzuki
Masanori Hirano
Hiroki Sakaji
53
6
0
07 Sep 2023
Norm Tweaking: High-performance Low-bit Quantization of Large Language Models
Liang Li
Qingyuan Li
Bo Zhang
Xiangxiang Chu
MQ
52
29
0
06 Sep 2023
GPT Can Solve Mathematical Problems Without a Calculator
Zhiyong Yang
Ming Ding
Qingsong Lv
Zhihuan Jiang
Zehai He
Yuyi Guo
Jinfeng Bai
Jie Tang
RALM
LRM
52
53
0
06 Sep 2023
CIEM: Contrastive Instruction Evaluation Method for Better Instruction Tuning
Hongyu Hu
Jiyuan Zhang
Minyi Zhao
Zhenbang Sun
MLLM
30
43
0
05 Sep 2023
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
Yue Zhang
Yafu Li
Leyang Cui
Deng Cai
Lemao Liu
...
Longyue Wang
Anh Tuan Luu
Wei Bi
Freda Shi
Shuming Shi
RALM
LRM
HILM
55
528
0
03 Sep 2023
eDKM: An Efficient and Accurate Train-time Weight Clustering for Large Language Models
Minsik Cho
Keivan Alizadeh Vahid
Qichen Fu
Saurabh N. Adya
C. C. D. Mundo
Mohammad Rastegari
Devang Naik
Peter Zatloukal
MQ
40
6
0
02 Sep 2023
Taken out of context: On measuring situational awareness in LLMs
Lukas Berglund
Asa Cooper Stickland
Mikita Balesni
Max Kaufmann
Meg Tong
Tomasz Korbak
Daniel Kokotajlo
Owain Evans
LLMAG
LRM
34
63
0
01 Sep 2023
YaRN: Efficient Context Window Extension of Large Language Models
Bowen Peng
Jeffrey Quesnelle
Honglu Fan
Enrico Shippole
OSLM
32
232
0
31 Aug 2023
Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models
Yupan Huang
Zaiqiao Meng
Fangyu Liu
Yixuan Su
Nigel Collier
Yutong Lu
MLLM
41
22
0
31 Aug 2023
Text-to-OverpassQL: A Natural Language Interface for Complex Geodata Querying of OpenStreetMap
Michael Staniek
Raphael Schumann
Maike Zufle
Stefan Riezler
40
6
0
30 Aug 2023
Quantifying and Analyzing Entity-level Memorization in Large Language Models
Zhenhong Zhou
Jiuyang Xiang
Chao-Yi Chen
Sen Su
PILM
38
8
0
30 Aug 2023
When Do Program-of-Thoughts Work for Reasoning?
Zhen Bi
Ningyu Zhang
Yinuo Jiang
Shumin Deng
Guozhou Zheng
Huajun Chen
LRM
59
20
0
29 Aug 2023
AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
Zhaopeng Gu
Bingke Zhu
Guibo Zhu
Yingying Chen
Ming Tang
Jinqiao Wang
VLM
MLLM
42
103
0
29 Aug 2023
CoVR: Learning Composed Video Retrieval from Web Video Captions
Lucas Ventura
Antoine Yang
Cordelia Schmid
Gül Varol
22
26
0
28 Aug 2023
Mobile Foundation Model as Firmware
Jinliang Yuan
Chenchen Yang
Dongqi Cai
Shihe Wang
Xin Yuan
...
Di Zhang
Hanzi Mei
Xianqing Jia
Shangguang Wang
Mengwei Xu
55
19
0
28 Aug 2023
RecMind: Large Language Model Powered Agent For Recommendation
Yancheng Wang
Ziyan Jiang
Zheng Chen
Fan Yang
Yingxue Zhou
Eunah Cho
Xing Fan
Xiaojiang Huang
Yanbin Lu
Yingzhen Yang
LLMAG
LM&Ro
LRM
52
93
0
28 Aug 2023
Spoken Language Intelligence of Large Language Models for Language Learning
Linkai Peng
Baorian Nuchged
Yingming Gao
ELM
64
4
0
28 Aug 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
Jing Liu
81
31
0
27 Aug 2023
Exploring Large Language Models for Knowledge Graph Completion
Liang Yao
Jiazhen Peng
Chengsheng Mao
Yuan Luo
42
37
0
26 Aug 2023
Evolution of ESG-focused DLT Research: An NLP Analysis of the Literature
Walter Hernandez Cruz
K. Tylinski
Alastair Moore
Niall Roche
Nikhil Vadgama
Horst Treiblmaier
J. Shangguan
Paolo Tasca
Jiahua Xu
33
2
0
23 Aug 2023
Previous
1
2
3
...
133
134
135
...
140
141
142
Next