Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.13971
Cited By
LLaMA: Open and Efficient Foundation Language Models
27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LLaMA: Open and Efficient Foundation Language Models"
50 / 5,823 papers shown
Title
Let Storytelling Tell Vivid Stories: An Expressive and Fluent Multimodal Storyteller
Chuanqi Zang
Jiji Tang
Rongsheng Zhang
Zeng Zhao
Tangjie Lv
Mingtao Pei
Wei Liang
35
3
0
12 Mar 2024
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning
Bingqian Lin
Yunshuang Nie
Ziming Wei
Jiaqi Chen
Shikui Ma
Jianhua Han
Hang Xu
Xiaojun Chang
Xiaodan Liang
LM&Ro
LRM
67
21
0
12 Mar 2024
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression
Xin Wang
Yu Zheng
Zhongwei Wan
Mi Zhang
MQ
57
44
0
12 Mar 2024
(
N
,
K
)
\mathbf{(N,K)}
(
N
,
K
)
-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Yufeng Zhang
Liyu Chen
Boyi Liu
Yingxiang Yang
Qiwen Cui
Yunzhe Tao
Hongxia Yang
111
0
0
11 Mar 2024
UPS: Efficiently Building Foundation Models for PDE Solving via Cross-Modal Adaptation
Junhong Shen
Tanya Marwah
Ameet Talwalkar
AI4CE
54
4
0
11 Mar 2024
MAP-Elites with Transverse Assessment for Multimodal Problems in Creative Domains
Marvin Zammit
Antonios Liapis
Georgios N. Yannakakis
34
1
0
11 Mar 2024
ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis
Yanming Liu
Xinyue Peng
Tianyu Du
Jianwei Yin
Weihao Liu
Xuhong Zhang
LRM
35
16
0
11 Mar 2024
Exploring Large Language Models and Hierarchical Frameworks for Classification of Large Unstructured Legal Documents
Nishchal Prasad
M. Boughanem
T. Dkaki
AILaw
ELM
40
5
0
11 Mar 2024
Development of a Reliable and Accessible Caregiving Language Model (CaLM)
B. Parmanto
Bayu Aryoyudanta
Wilbert Soekinto
Agus Setiawan
Yuhan Wang
Haomin Hu
Andi Saptono
Yong K Choi
37
0
0
11 Mar 2024
ACT-MNMT Auto-Constriction Turning for Multilingual Neural Machine Translation
Shaojie Dai
Xin Liu
Ping Luo
Yue Yu
LRM
42
1
0
11 Mar 2024
Stealing Part of a Production Language Model
Nicholas Carlini
Daniel Paleka
Krishnamurthy Dvijotham
Thomas Steinke
Jonathan Hayase
...
Arthur Conmy
Itay Yona
Eric Wallace
David Rolnick
Florian Tramèr
MLAU
AAML
32
74
0
11 Mar 2024
Guiding Clinical Reasoning with Large Language Models via Knowledge Seeds
Jiageng Wu
Xian Wu
Jie Yang
LRM
ELM
48
8
0
11 Mar 2024
Academically intelligent LLMs are not necessarily socially intelligent
Ruoxi Xu
Hongyu Lin
Xianpei Han
Le Sun
Yingfei Sun
ELM
37
6
0
11 Mar 2024
DNNShield: Embedding Identifiers for Deep Neural Network Ownership Verification
Jasper Stang
T. Krauß
Alexandra Dmitrienko
30
0
0
11 Mar 2024
Unraveling the Mystery of Scaling Laws: Part I
Hui Su
Zhi Tian
Xiaoyu Shen
Xunliang Cai
36
19
0
11 Mar 2024
QuantTune: Optimizing Model Quantization with Adaptive Outlier-Driven Fine Tuning
Jiun-Man Chen
Yu-Hsuan Chao
Yu-Jie Wang
Ming-Der Shieh
Chih-Chung Hsu
Wei-Fen Lin
MQ
45
1
0
11 Mar 2024
Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language Models
Weihang Su
Changyue Wang
Qingyao Ai
Hu Yiran
Zhijing Wu
Yujia Zhou
Yiqun Liu
HILM
52
28
0
11 Mar 2024
What Makes Quantization for Large Language Models Hard? An Empirical Study from the Lens of Perturbation
Zhuocheng Gong
Jiahao Liu
Jingang Wang
Xunliang Cai
Dongyan Zhao
Rui Yan
MQ
35
8
0
11 Mar 2024
InfiBench: Evaluating the Question-Answering Capabilities of Code Large Language Models
Linyi Li
Shijie Geng
Zhenwen Li
Yibo He
Hao Yu
Ziyue Hua
Guanghan Ning
Siwei Wang
Tao Xie
Hongxia Yang
ELM
37
2
0
11 Mar 2024
LIEDER: Linguistically-Informed Evaluation for Discourse Entity Recognition
Xiaomeng Zhu
Robert Frank
41
0
0
10 Mar 2024
No Language is an Island: Unifying Chinese and English in Financial Large Language Models, Instruction Data, and Benchmarks
Gang Hu
Ke Qin
Chenhan Yuan
Min Peng
Alejandro Lopez-Lira
Benyou Wang
Sophia Ananiadou
Wanlong Yu
Jimin Huang
Qianqian Xie
32
4
0
10 Mar 2024
TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision
Ruiwen Zhou
Yingxuan Yang
Kangrui Chen
Ying Wen
Wenhao Wang
Chunling Xi
Guoqiang Xu
Jiliang Tang
Lingjuan Lyu
LLMAG
32
8
0
10 Mar 2024
Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models
Minjie Zhu
Yichen Zhu
Xin Liu
Ning Liu
Zhiyuan Xu
Yaxin Peng
Chaomin Shen
Zhicai Ou
Feifei Feng
Jian Tang
VLM
57
20
0
10 Mar 2024
In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model
Junhui Yin
Xinyu Zhang
Lin Wu
Xianghua Xie
Xiaojie Wang
VPVLM
VLM
MLLM
44
2
0
10 Mar 2024
Large Language Models on Fine-grained Emotion Detection Dataset with Data Augmentation and Transfer Learning
Kaipeng Wang
Zhi Jing
Yongye Su
Yikun Han
42
3
0
10 Mar 2024
Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue
Jian Wang
Dongding Lin
Wenjie Li
32
2
0
10 Mar 2024
HAM-TTS: Hierarchical Acoustic Modeling for Token-Based Zero-Shot Text-to-Speech with Model and Data Scaling
Chunhui Wang
Chang Zeng
Bowen Zhang
Ziyang Ma
Yefan Zhu
Zifeng Cai
Jian Zhao
Zhonglin Jiang
Yong Chen
SyDa
44
5
0
09 Mar 2024
S
2
\textbf{S}^2
S
2
IP-LLM: Semantic Space Informed Prompt Learning with LLM for Time Series Forecasting
Zijie Pan
Yushan Jiang
Sahil Garg
Anderson Schneider
Yuriy Nevmyvaka
Dongjin Song
AI4TS
55
7
0
09 Mar 2024
FLAP: Flow-Adhering Planning with Constrained Decoding in LLMs
Shamik Roy
Sailik Sengupta
Daniele Bonadiman
Saab Mansour
Arshit Gupta
29
5
0
09 Mar 2024
SPAFormer: Sequential 3D Part Assembly with Transformers
Boshen Xu
Sipeng Zheng
Qin Jin
49
2
0
09 Mar 2024
OmniJet-
α
α
α
: The first cross-task foundation model for particle physics
Joschka Birk
Anna Hallin
Gregor Kasieczka
AI4CE
48
22
0
08 Mar 2024
GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
Hao Kang
Qingru Zhang
Souvik Kundu
Geonhwa Jeong
Zaoxing Liu
Tushar Krishna
Tuo Zhao
MQ
49
79
0
08 Mar 2024
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Haoyu Lu
Wen Liu
Bo Zhang
Bing-Li Wang
Kai Dong
...
Yaofeng Sun
Chengqi Deng
Hanwei Xu
Zhenda Xie
Chong Ruan
VLM
41
304
0
08 Mar 2024
Part-aware Personalized Segment Anything Model for Patient-Specific Segmentation
Chenhui Zhao
Liyue Shen
VLM
47
3
0
08 Mar 2024
Explaining Pre-Trained Language Models with Attribution Scores: An Analysis in Low-Resource Settings
Wei Zhou
Heike Adel
Hendrik Schuff
Ngoc Thang Vu
LRM
38
2
0
08 Mar 2024
Debiasing Multimodal Large Language Models
Yi-Fan Zhang
Weichen Yu
Qingsong Wen
Xue Wang
Zhang Zhang
Liang Wang
Rong Jin
Tien-Ping Tan
55
4
0
08 Mar 2024
Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation
Xiaoying Zhang
Jean-François Ton
Wei Shen
Hongning Wang
Yang Liu
39
14
0
08 Mar 2024
Synthetic data generation for system identification: leveraging knowledge transfer from similar systems
Dario Piga
Matteo Rufolo
Gabriele Maroni
Manas Mejari
Marco Forgione
33
1
0
08 Mar 2024
Rule-driven News Captioning
Ning Xu
Tingting Zhang
Hongshuo Tian
An-An Liu
68
0
0
08 Mar 2024
Multimodal Infusion Tuning for Large Models
Hao Sun
Yu Song
Xinyao Yu
Jiaqing Liu
Yen-Wei Chen
Lanfen Lin
VLM
40
0
0
08 Mar 2024
XPSR: Cross-modal Priors for Diffusion-based Image Super-Resolution
Yunpeng Qu
Kun Yuan
Kai Zhao
Qizhi Xie
Jinhua Hao
Ming Sun
Chao Zhou
27
17
0
08 Mar 2024
Defending Against Unforeseen Failure Modes with Latent Adversarial Training
Stephen Casper
Lennart Schulze
Oam Patel
Dylan Hadfield-Menell
AAML
62
30
0
08 Mar 2024
MEIT: Multi-Modal Electrocardiogram Instruction Tuning on Large Language Models for Report Generation
Zhongwei Wan
Che Liu
Xin Wang
Chaofan Tao
Hui Shen
Zhenwu Peng
Jie Fu
Rossella Arcucci
Huaxiu Yao
Mi Zhang
57
7
0
07 Mar 2024
A Survey on Human-AI Teaming with Large Pre-Trained Models
Vanshika Vats
Marzia Binta Nizam
Minghao Liu
Ziyuan Wang
Richard Ho
...
Celeste Shen
Rachel Shen
Nafisa Hussain
Kesav Ravichandran
James Davis
LM&MA
50
8
0
07 Mar 2024
A Safe Harbor for AI Evaluation and Red Teaming
Shayne Longpre
Sayash Kapoor
Kevin Klyman
Ashwin Ramaswami
Rishi Bommasani
...
Daniel Kang
Sandy Pentland
Arvind Narayanan
Percy Liang
Peter Henderson
57
38
0
07 Mar 2024
iScore: Visual Analytics for Interpreting How Language Models Automatically Score Summaries
Adam Joseph Coscia
Langdon Holmes
Wesley Morris
Joon Suh Choi
Scott Crossley
Alex Endert
33
6
0
07 Mar 2024
Common 7B Language Models Already Possess Strong Math Capabilities
Chen Li
Weiqi Wang
Jingcheng Hu
Yixuan Wei
Nanning Zheng
Han Hu
Zheng-Wei Zhang
Houwen Peng
ALM
LRM
45
78
0
07 Mar 2024
Yi: Open Foundation Models by 01.AI
01. AI
Alex Young
01.AI Alex Young
Bei Chen
Chao Li
...
Yue Wang
Yuxuan Cai
Zhenyu Gu
Zhiyuan Liu
Zonghong Dai
OSLM
LRM
150
512
0
07 Mar 2024
QAQ: Quality Adaptive Quantization for LLM KV Cache
Shichen Dong
Wenfang Cheng
Jiayu Qin
Wei Wang
MQ
51
34
0
07 Mar 2024
Where does In-context Translation Happen in Large Language Models
Suzanna Sia
David Mueller
Kevin Duh
LRM
43
0
0
07 Mar 2024
Previous
1
2
3
...
93
94
95
...
115
116
117
Next