arXiv:2302.13971
LLaMA: Open and Efficient Foundation Language Models
27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
Papers citing
"LLaMA: Open and Efficient Foundation Language Models"
50 / 7,178 papers shown
Title
Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User Queries
Jonathan Fürst
Catherine Kosten
Farhad Nooralahzadeh
Yi Zhang
Kurt Stockinger
LMTD
24
7
0
13 Feb 2024
Implicit Bias in Noisy-SGD: With Applications to Differentially Private Training
Tom Sander
Maxime Sylvestre
Alain Durmus
36
1
0
13 Feb 2024
LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents
Jae-Woo Choi
Youngwoo Yoon
Hyobin Ong
Jaehong Kim
Minsu Jang
34
14
0
13 Feb 2024
LLaGA: Large Language and Graph Assistant
Runjin Chen
Tong Zhao
Ajay Jaiswal
Neil Shah
Zhangyang Wang
37
59
0
13 Feb 2024
World Model on Million-Length Video And Language With Blockwise RingAttention
Hao Liu
Wilson Yan
Matei A. Zaharia
Pieter Abbeel
VGen
59
68
0
13 Feb 2024
Addressing cognitive bias in medical language models
Samuel Schmidgall
Carl Harris
Ime Essien
Daniel Olshvang
Tawsifur Rahman
Ji Woong Kim
Rojin Ziaei
Jason K. Eshraghian
Peter M Abadir
Rama Chellappa
ELM
43
25
0
12 Feb 2024
Large Language Models as Agents in Two-Player Games
Yang Liu
Peng Sun
Hang Li
LLMAG
45
3
0
12 Feb 2024
Efficient and Scalable Fine-Tune of Language Models for Genome Understanding
Huixin Zhan
Ying Nian Wu
Zijun Zhang
ALM
35
1
0
12 Feb 2024
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model
Ahmet Üstün
Viraat Aryabumi
Zheng-Xin Yong
Wei-Yin Ko
Daniel D'souza
...
Shayne Longpre
Niklas Muennighoff
Marzieh Fadaee
Julia Kreutzer
Sara Hooker
ALM
ELM
SyDa
LRM
40
200
0
12 Feb 2024
Tailoring Education with GenAI: A New Horizon in Lesson Planning
K. Karpouzis
Dimitris Pantazatos
Joanna Taouki
Kalliopi Meli
30
9
0
12 Feb 2024
Differentially Private Zeroth-Order Methods for Scalable Large Language Model Finetuning
Zhicheng Liu
Jian Lou
Wenxuan Bao
Yihan Hu
Baochun Li
Zhan Qin
K. Ren
47
7
0
12 Feb 2024
Tuning-Free Stochastic Optimization
Ahmed Khaled
Chi Jin
37
7
0
12 Feb 2024
TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection
Hui Liu
Wenya Wang
Haoru Li
Haoliang Li
49
3
0
12 Feb 2024
Towards an Understanding of Stepwise Inference in Transformers: A Synthetic Graph Navigation Model
Mikail Khona
Maya Okawa
Jan Hula
Rahul Ramesh
Kento Nishi
Robert P. Dick
Ekdeep Singh Lubana
Hidenori Tanaka
51
5
0
12 Feb 2024
Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models
Jiacheng Ye
Shansan Gong
Liheng Chen
Lin Zheng
Jiahui Gao
...
Chuan Wu
Xin Jiang
Zhenguo Li
Wei Bi
Lingpeng Kong
DiffM
LRM
AI4CE
64
14
0
12 Feb 2024
Asking Multimodal Clarifying Questions in Mixed-Initiative Conversational Search
Yifei Yuan
Clemencia Siro
Mohammad Aliannejadi
Maarten de Rijke
Wai Lam
39
8
0
12 Feb 2024
Anchor-based Large Language Models
Jianhui Pang
Fanghua Ye
Derek F. Wong
Xin He
Wanshun Chen
Longyue Wang
KELM
82
9
0
12 Feb 2024
Only the Curve Shape Matters: Training Foundation Models for Zero-Shot Multivariate Time Series Forecasting through Next Curve Shape Prediction
Cheng Feng
Long Huang
Denis Krompass
AI4TS
48
5
0
12 Feb 2024
Dólares or Dollars? Unraveling the Bilingual Prowess of Financial LLMs Between Spanish and English
Xiao Zhang
Ruoyu Xiang
Chenhan Yuan
Duanyu Feng
Weiguang Han
...
Xiao-Yang Liu
Sophia Ananiadou
Min Peng
Jimin Huang
Qianqian Xie
46
6
0
12 Feb 2024
VisLingInstruct: Elevating Zero-Shot Learning in Multi-Modal Language Models with Autonomous Instruction Optimization
Dongsheng Zhu
Xunzhu Tang
Weidong Han
Jinghui Lu
Yukun Zhao
Guoliang Xing
Junfeng Wang
Dawei Yin
VLM
MLLM
60
9
0
12 Feb 2024
Exploring Perceptual Limitation of Multimodal Large Language Models
Jiarui Zhang
Jinyi Hu
Mahyar Khayatkhoei
Filip Ilievski
Maosong Sun
LRM
43
10
0
12 Feb 2024
Insights into Natural Language Database Query Errors: From Attention Misalignment to User Handling Strategies
Zheng Ning
Yuan Tian
Zheng Zhang
Tianyi Zhang
Tao Li
76
6
0
11 Feb 2024
Previously on the Stories: Recap Snippet Identification for Story Reading
JiangNan Li
Qiujing Wang
Liyan Xu
Wenjie Pang
Mo Yu
Zheng Lin
Weiping Wang
Jie Zhou
52
3
0
11 Feb 2024
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy
Simon Ging
M. A. Bravo
Thomas Brox
VLM
75
11
0
11 Feb 2024
CPSDBench: A Large Language Model Evaluation Benchmark and Baseline for Chinese Public Security Domain
Xin Tong
Bo Jin
Zhi Lin
Binjun Wang
Ting Yu
Qiang Cheng
ELM
53
0
0
11 Feb 2024
A Benchmark for Multi-modal Foundation Models on Low-level Vision: from Single Images to Pairs
Zicheng Zhang
Haoning Wu
Erli Zhang
Guangtao Zhai
Weisi Lin
VLM
29
8
0
11 Feb 2024
Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue
Jian Wang
Chak Tou Leong
Jiashuo Wang
Dongding Lin
Wenjie Li
Xiao-Yong Wei
62
8
0
10 Feb 2024
OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning
Rui Ye
Wenhao Wang
Jingyi Chai
Dihan Li
Zexi Li
Yinda Xu
Yaxin Du
Yanfeng Wang
Siheng Chen
ALM
FedML
AIFin
29
82
0
10 Feb 2024
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
Yuchen Hu
Chen Chen
Chao-Han Huck Yang
Ruizhe Li
Dong Zhang
Zhehuai Chen
Eng Siong Chng
25
21
0
10 Feb 2024
Sentinels of the Stream: Unleashing Large Language Models for Dynamic Packet Classification in Software Defined Networks -- Position Paper
Shariq Murtuza
43
1
0
10 Feb 2024
Reasoning Grasping via Multimodal Large Language Model
Shiyu Jin
Jinxuan Xu
Yutian Lei
Liangjun Zhang
LRM
49
20
0
09 Feb 2024
Is it safe to cross? Interpretable Risk Assessment with GPT-4V for Safety-Aware Street Crossing
Hochul Hwang
Sunjae Kwon
Yekyung Kim
Donghyun Kim
37
11
0
09 Feb 2024
Large Language Models for Captioning and Retrieving Remote Sensing Images
João Daniel Silva
João Magalhães
D. Tuia
Bruno Martins
46
29
0
09 Feb 2024
StruQ: Defending Against Prompt Injection with Structured Queries
Sizhe Chen
Julien Piet
Chawin Sitawarin
David Wagner
SILM
AAML
40
73
0
09 Feb 2024
LLaVA-Docent: Instruction Tuning with Multimodal Large Language Model to Support Art Appreciation Education
Unggi Lee
Minji Jeon
Yunseo Lee
Gyuri Byun
Yoorim Son
Jaeyoon Shin
Hongkyu Ko
Hyeoncheol Kim
32
9
0
09 Feb 2024
On the Efficacy of Eviction Policy for Key-Value Constrained Generative Language Model Inference
Siyu Ren
Kenny Q. Zhu
31
28
0
09 Feb 2024
ScreenAgent: A Vision Language Model-driven Computer Control Agent
Runliang Niu
Jindong Li
Shiqi Wang
Yali Fu
Xiyu Hu
Xueyuan Leng
He Kong
Yi Chang
Qi Wang
LLMAG
MLLM
LM&Ro
68
42
0
09 Feb 2024
Learn To be Efficient: Build Structured Sparsity in Large Language Models
Haizhong Zheng
Xiaoyan Bai
Xueshen Liu
Z. Morley Mao
Beidi Chen
Fan Lai
Atul Prakash
74
12
0
09 Feb 2024
ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling
Siming Yan
Min Bai
Weifeng Chen
Xiong Zhou
Qixing Huang
Erran L. Li
VLM
25
19
0
09 Feb 2024
FL-NAS: Towards Fairness of NAS for Resource Constrained Devices via Large Language Models
Ruiyang Qin
Yuting Hu
Zheyu Yan
Jinjun Xiong
Ahmed Abbasi
Yiyu Shi
39
7
0
09 Feb 2024
EntGPT: Entity Linking with Generative Large Language Models
Yifan Ding
Amrit Poudel
Qingkai Zeng
Tim Weninger
Balaji Veeramani
Sanmitra Bhattacharya
ReLM
KELM
LRM
51
4
0
09 Feb 2024
Large Language Models: A Survey
Shervin Minaee
Tomas Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALM
LM&MA
ELM
139
385
0
09 Feb 2024
Rethinking Data Selection for Supervised Fine-Tuning
Ming Shen
34
17
0
08 Feb 2024
Animated Stickers: Bringing Stickers to Life with Video Diffusion
David Yan
Winnie Zhang
Luxin Zhang
Anmol Kalia
Dingkang Wang
...
Guan Pang
Ali K. Thabet
Peter Vajda
Amy Bearman
Licheng Yu
VGen
DiffM
78
2
0
08 Feb 2024
DiscDiff: Latent Diffusion Model for DNA Sequence Generation
Zehui Li
Yuhao Ni
W. Beardall
Guoxuan Xia
Akashaditya Das
Guy-Bart Stan
Yiren Zhao
40
7
0
08 Feb 2024
An Interactive Agent Foundation Model
Zane Durante
Bidipta Sarkar
Ran Gong
Rohan Taori
Yusuke Noda
...
Katsushi Ikeuchi
Fei-Fei Li
Jianfeng Gao
Naoki Wake
Qiuyuan Huang
LM&Ro
AI4CE
LLMAG
91
16
0
08 Feb 2024
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Xing Han Lù
Zdeněk Kasner
Siva Reddy
39
63
0
08 Feb 2024
On the Convergence of Zeroth-Order Federated Tuning for Large Language Models
Zhenqing Ling
Daoyuan Chen
Liuyi Yao
Yaliang Li
Ying Shen
FedML
66
13
0
08 Feb 2024
Large Language Model Meets Graph Neural Network in Knowledge Distillation
Shengxiang Hu
Guobing Zou
Song Yang
Yanglan Gan
Bofeng Zhang
Yixin Chen
59
7
0
08 Feb 2024
How do Transformers perform In-Context Autoregressive Learning?
Michael E. Sander
Raja Giryes
Taiji Suzuki
Mathieu Blondel
Gabriel Peyré
52
9
0
08 Feb 2024