2302.13971
LLaMA: Open and Efficient Foundation Language Models
27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
Papers citing
"LLaMA: Open and Efficient Foundation Language Models"
50 / 7,185 papers shown
Title
Generative Language Models Exhibit Social Identity Biases
Tiancheng Hu
Yara Kyrychenko
Steve Rathje
Nigel Collier
S. V. D. Linden
Jon Roozenbeek
43
42
0
24 Oct 2023
MindLLM: Pre-training Lightweight Large Language Model from Scratch, Evaluations and Domain Applications
Yizhe Yang
Huashan Sun
Jiawei Li
Runheng Liu
Yinghao Li
Yuhang Liu
Heyan Huang
Yang Gao
ALM
LRM
25
8
0
24 Oct 2023
BLESS: Benchmarking Large Language Models on Sentence Simplification
Tannon Kew
Alison Chi
Laura Vásquez-Rodríguez
Sweta Agrawal
Dennis Aumiller
Fernando Alva-Manchego
Teven Le Scao
56
25
0
24 Oct 2023
Learning From Free-Text Human Feedback -- Collect New Datasets Or Extend Existing Ones?
Dominic Petrak
N. Moosavi
Ye Tian
Nikolai Rozanov
Iryna Gurevych
47
6
0
24 Oct 2023
Large Language Models are Temporal and Causal Reasoners for Video Question Answering
Dohwan Ko
Ji Soo Lee
Wooyoung Kang
Byungseok Roh
Hyunwoo J. Kim
LRM
46
34
0
24 Oct 2023
RAPL: A Relation-Aware Prototype Learning Approach for Few-Shot Document-Level Relation Extraction
Shiao Meng
Xuming Hu
Aiwei Liu
Shuang Li
Fukun Ma
Yawen Yang
Lijie Wen
61
7
0
24 Oct 2023
Variator: Accelerating Pre-trained Models with Plug-and-Play Compression Modules
Chaojun Xiao
Yuqi Luo
Wenbin Zhang
Pengle Zhang
Xu Han
...
Zhengyan Zhang
Ruobing Xie
Zhiyuan Liu
Maosong Sun
Jie Zhou
40
0
0
24 Oct 2023
Retrieval-based Knowledge Transfer: An Effective Approach for Extreme Large Language Model Compression
Jiduan Liu
Jiahao Liu
Qifan Wang
Jingang Wang
Xunliang Cai
Dongyan Zhao
Ran Wang
Rui Yan
43
4
0
24 Oct 2023
POE: Process of Elimination for Multiple Choice Reasoning
Chenkai Ma
Xinya Du
LRM
30
5
0
24 Oct 2023
TCRA-LLM: Token Compression Retrieval Augmented Large Language Model for Inference Cost Reduction
Junyi Liu
Liangzhi Li
Tong Xiang
Bowen Wang
Yiming Qian
43
32
0
24 Oct 2023
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model
Kaiyan Zhang
Ning Ding
Biqing Qi
Xuekai Zhu
Xinwei Long
Bowen Zhou
66
4
0
24 Oct 2023
A Communication Theory Perspective on Prompting Engineering Methods for Large Language Models
Yuanfeng Song
Yuanqin He
Xuefang Zhao
Hanlin Gu
Di Jiang
Haijun Yang
Lixin Fan
Qiang Yang
45
3
0
24 Oct 2023
The Janus Interface: How Fine-Tuning in Large Language Models Amplifies the Privacy Risks
Xiaoyi Chen
Siyuan Tang
Rui Zhu
Shijun Yan
Lei Jin
Zihao Wang
Liya Su
Zhikun Zhang
Xiaofeng Wang
Haixu Tang
AAML
PILM
29
17
0
24 Oct 2023
Leveraging Large Language Models for Enhanced Product Descriptions in eCommerce
Jianghong Zhou
Bo Liu
Jhalak Nilesh Acharya
Yao Hong
Kuang-chih Lee
Musen Wen
6
5
0
24 Oct 2023
LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery
Tianyi Chen
Tianyu Ding
Badal Yadav
Ilya Zharkov
Luming Liang
49
28
0
24 Oct 2023
A Review of Reinforcement Learning for Natural Language Processing, and Applications in Healthcare
Ying Liu
Haozhu Wang
Huixue Zhou
Mingchen Li
Yu Hou
Sicheng Zhou
Fang Wang
Rama Hoetzlein
Rui Zhang
OffRL
LM&MA
16
1
0
23 Oct 2023
Specialist or Generalist? Instruction Tuning for Specific NLP Tasks
Chufan Shi
Yixuan Su
Cheng Yang
Yujiu Yang
Deng Cai
63
18
0
23 Oct 2023
S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Models
Fangyu Lei
Qian Liu
Yiming Huang
Shizhu He
Jun Zhao
Kang Liu
ELM
LRM
38
12
0
23 Oct 2023
SpecTr: Fast Speculative Decoding via Optimal Transport
Ziteng Sun
A. Suresh
Jae Hun Ro
Ahmad Beirami
Himanshu Jain
Felix X. Yu
64
71
0
23 Oct 2023
Location-Aware Visual Question Generation with Lightweight Models
Nicholas Collin Suwono
Justin Chih-Yao Chen
Tun-Min Hung
T. Huang
I-Bin Liao
Yung-Hui Li
Lun-Wei Ku
Shao-Hua Sun
28
4
0
23 Oct 2023
GRENADE: Graph-Centric Language Model for Self-Supervised Representation Learning on Text-Attributed Graphs
Yichuan Li
Kaize Ding
Kyumin Lee
SSL
40
25
0
23 Oct 2023
LLM-in-the-loop: Leveraging Large Language Model for Thematic Analysis
Shih-Chieh Dai
Aiping Xiong
Lun-Wei Ku
52
71
0
23 Oct 2023
Federated Learning of Large Language Models with Parameter-Efficient Prompt Tuning and Adaptive Optimization
Tianshi Che
Ji Liu
Yang Zhou
Jiaxiang Ren
Jiwen Zhou
Victor S. Sheng
H. Dai
Dejing Dou
38
51
0
23 Oct 2023
SLOG: A Structural Generalization Benchmark for Semantic Parsing
Bingzhi Li
L. Donatelli
Alexander Koller
Tal Linzen
Yuekun Yao
Najoung Kim
46
15
0
23 Oct 2023
Did the Neurons Read your Book? Document-level Membership Inference for Large Language Models
Matthieu Meeus
Shubham Jain
Marek Rei
Yves-Alexandre de Montjoye
MIALM
34
30
0
23 Oct 2023
When Language Models Fall in Love: Animacy Processing in Transformer Language Models
Michael Hanna
Yonatan Belinkov
Sandro Pezzelle
35
11
0
23 Oct 2023
Assessing Step-by-Step Reasoning against Lexical Negation: A Case Study on Syllogism
Mengyu Ye
Tatsuki Kuribayashi
Jun Suzuki
Goro Kobayashi
Hiroaki Funayama
LRM
57
8
0
23 Oct 2023
Geographical Erasure in Language Generation
Pola Schwöbel
Jacek Golebiowski
Michele Donini
Cédric Archambeau
Danish Pruthi
34
5
0
23 Oct 2023
MCC-KD: Multi-CoT Consistent Knowledge Distillation
Hongzhan Chen
Siyue Wu
Xiaojun Quan
Rui Wang
Ming Yan
Ji Zhang
LRM
33
17
0
23 Oct 2023
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions
Junchao Wu
Shu Yang
Runzhe Zhan
Yulin Yuan
Derek F. Wong
Lidia S. Chao
DeLMO
37
25
0
23 Oct 2023
Reasoning about Ambiguous Definite Descriptions
Stefan F. Schouten
Peter Bloem
Ilia Markov
Piek Vossen
LRM
UQLM
21
0
0
23 Oct 2023
Extending Input Contexts of Language Models through Training on Segmented Sequences
Petros Karypis
Julian McAuley
George Karypis
37
0
0
23 Oct 2023
Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
Tengxiao Liu
Qipeng Guo
Yuqing Yang
Xiangkun Hu
Yue Zhang
Xipeng Qiu
Zheng Zhang
LRM
LLMAG
26
31
0
23 Oct 2023
Large Search Model: Redefining Search Stack in the Era of LLMs
Liang Wang
Nan Yang
Xiaolong Huang
Linjun Yang
Rangan Majumder
Furu Wei
LRM
KELM
50
14
0
23 Oct 2023
FedSplitX: Federated Split Learning for Computationally-Constrained Heterogeneous Clients
Jiyun Shin
Jinhyun Ahn
Honggu Kang
Joonhyuk Kang
FedML
68
7
0
23 Oct 2023
Exploring the Boundaries of GPT-4 in Radiology
Qianchu Liu
Stephanie L. Hyland
Shruthi Bannur
Kenza Bouzid
Daniel Coelho De Castro
...
Anja Thieme
A. Nori
M. Lungren
Ozan Oktay
Javier Alvarez-Valle
LM&MA
AI4CE
56
37
0
23 Oct 2023
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models
Tianrui Guan
Fuxiao Liu
Xiyang Wu
Ruiqi Xian
Zongxia Li
...
Lichang Chen
Furong Huang
Yaser Yacoob
Dinesh Manocha
VLM
MLLM
48
164
0
23 Oct 2023
Language Models Hallucinate, but May Excel at Fact Verification
Jian Guan
Jesse Dodge
David Wadden
Minlie Huang
Hao Peng
LRM
HILM
45
29
0
23 Oct 2023
Meaning Representations from Trajectories in Autoregressive Models
Tian Yu Liu
Matthew Trager
Alessandro Achille
Pramuditha Perera
Luca Zancato
Stefano Soatto
34
14
0
23 Oct 2023
The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages
Chiyu Zhang
Khai Duy Doan
Qisheng Liao
Muhammad Abdul-Mageed
43
6
0
23 Oct 2023
Evaluating Large Language Models on Controlled Generation Tasks
Jiao Sun
Yufei Tian
Wangchunshu Zhou
Nan Xu
Qian Hu
Rahul Gupta
John Wieting
Nanyun Peng
Xuezhe Ma
LRM
ELM
47
61
0
23 Oct 2023
PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter
Haoyan Yang
Zhitao Li
Yong Zhang
Jianzong Wang
Ning Cheng
Ming Li
Jing Xiao
RALM
19
29
0
23 Oct 2023
Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
Buse Giledereli
Jiaoda Li
Yu Fei
Alessandro Stolfo
Wangchunshu Zhou
Guangtao Zeng
Antoine Bosselut
Mrinmaya Sachan
LRM
59
42
0
23 Oct 2023
Vision Language Models in Autonomous Driving: A Survey and Outlook
Xingcheng Zhou
Mingyu Liu
Ekim Yurtsever
B. L. Žagar
Walter Zimmer
Hu Cao
Alois C. Knoll
VLM
44
46
0
22 Oct 2023
Evaluating Subjective Cognitive Appraisals of Emotions from Large Language Models
Hongli Zhan
Desmond C. Ong
Junyi Jessy Li
94
7
0
22 Oct 2023
From Static to Dynamic: A Continual Learning Framework for Large Language Models
Mingzhe Du
Anh Tuan Luu
Bin Ji
See-kiong Ng
11
2
0
22 Oct 2023
LUNA: A Model-Based Universal Analysis Framework for Large Language Models
Da Song
Xuan Xie
Jiayang Song
Derui Zhu
Yuheng Huang
Felix Juefei Xu
Lei Ma
ALM
40
4
0
22 Oct 2023
PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation
Gaurav Sahu
Olga Vechtomova
Dzmitry Bahdanau
I. Laradji
VLM
68
24
0
22 Oct 2023
An In-Context Schema Understanding Method for Knowledge Base Question Answering
Yantao Liu
Zixuan Li
Xiaolong Jin
Yucan Guo
Long Bai
Saiping Guan
Jiafeng Guo
Xueqi Cheng
37
1
0
22 Oct 2023
Orthogonal Subspace Learning for Language Model Continual Learning
Xiao Wang
Tianze Chen
Qiming Ge
Han Xia
Rong Bao
Rui Zheng
Qi Zhang
Tao Gui
Xuanjing Huang
CLL
127
94
0
22 Oct 2023