Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.13971
Cited By
LLaMA: Open and Efficient Foundation Language Models
27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LLaMA: Open and Efficient Foundation Language Models"
50 / 5,825 papers shown
Title
Derivative-Free Optimization for Low-Rank Adaptation in Large Language Models
Feihu Jin
Yin Liu
Ying Tan
35
3
0
04 Mar 2024
Differentially Private Synthetic Data via Foundation Model APIs 2: Text
Chulin Xie
Zinan Lin
A. Backurs
Sivakanth Gopi
Da Yu
...
Haotian Jiang
Huishuai Zhang
Yin Tat Lee
Bo Li
Sergey Yekhanin
SyDa
63
34
0
04 Mar 2024
SynCode: LLM Generation with Grammar Augmentation
Shubham Ugare
Tarun Suresh
Hangoo Kang
Sasa Misailovic
Gagandeep Singh
40
12
0
03 Mar 2024
Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey
Qizhi Pei
Lijun Wu
Kaiyuan Gao
Jinhua Zhu
Yue Wang
Zun Wang
Tao Qin
Rui Yan
AI4CE
62
19
0
03 Mar 2024
GuardT2I: Defending Text-to-Image Models from Adversarial Prompts
Yijun Yang
Ruiyuan Gao
Xiao Yang
Qiang Xu
Qiang Xu
32
15
0
03 Mar 2024
GPTSee: Enhancing Moment Retrieval and Highlight Detection via Description-Based Similarity Features
Yunzhuo Sun
Yifang Xu
Zien Xie
Yukun Shu
Sidan Du
38
6
0
03 Mar 2024
MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Zhende Song
Chenchen Wang
Jiamu Sheng
C. Zhang
Gang Yu
Jiayuan Fan
Tao Chen
VGen
37
19
0
03 Mar 2024
Automatic Question-Answer Generation for Long-Tail Knowledge
Rohan Kumar
Youngmin Kim
Sunitha Ravi
Haitian Sun
Christos Faloutsos
Ruslan Salakhutdinov
Minji Yoon
25
8
0
03 Mar 2024
Large Language Multimodal Models for 5-Year Chronic Disease Cohort Prediction Using EHR Data
Jun-En Ding
Phan Nguyen Minh Thao
Wen-Chih Peng
Jian-Zhe Wang
Chun-Cheng Chug
...
Dongsheng Luo
Chi-Te Wang
Pei-fu Chen
Feng Liu
Fang-Ming Hung
LM&MA
49
4
0
02 Mar 2024
IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact
Ruikang Liu
Haoli Bai
Haokun Lin
Yuening Li
Han Gao
Zheng-Jun Xu
Lu Hou
Jun Yao
Chun Yuan
MQ
23
29
0
02 Mar 2024
API Is Enough: Conformal Prediction for Large Language Models Without Logit-Access
Jiayuan Su
Jing Luo
Hongwei Wang
Lu Cheng
84
16
0
02 Mar 2024
Pseudo-Label Calibration Semi-supervised Multi-Modal Entity Alignment
Luyao Wang
Pengnian Qi
Xigang Bao
Chunlai Zhou
Biao Qin
41
9
0
02 Mar 2024
Balancing Exploration and Exploitation in LLM using Soft RLLF for Enhanced Negation Understanding
Ha-Thanh Nguyen
Ken Satoh
55
3
0
02 Mar 2024
STAR: Constraint LoRA with Dynamic Active Learning for Data-Efficient Fine-Tuning of Large Language Models
Linhai Zhang
Jialong Wu
Deyu Zhou
Guoqiang Xu
30
4
0
02 Mar 2024
FaiMA: Feature-aware In-context Learning for Multi-domain Aspect-based Sentiment Analysis
Songhua Yang
Xinke Jiang
Hanjie Zhao
Wenxuan Zeng
Hongde Liu
Yuxiang Jia
39
4
0
02 Mar 2024
MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
Vithursan Thangarasa
Mahmoud Salem
Shreyas Saxena
Kevin Leong
Joel Hestness
Sean Lie
MedIm
40
1
0
01 Mar 2024
Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training
Qingyan Guo
Rui Wang
Junliang Guo
Xu Tan
Jiang Bian
Yujiu Yang
LRM
24
5
0
01 Mar 2024
DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models
Kedi Chen
Qin Chen
Jie Zhou
Yishen He
Liang He
HILM
50
1
0
01 Mar 2024
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
Xiangxiang Chu
Jianlin Su
Bo Zhang
Chunhua Shen
MLLM
49
10
0
01 Mar 2024
HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding
Zhaorun Chen
Zhuokai Zhao
Hongyin Luo
Huaxiu Yao
Bo Li
Jiawei Zhou
MLLM
46
60
0
01 Mar 2024
Gender Bias in Large Language Models across Multiple Languages
Jinman Zhao
Yitian Ding
Chen Jia
Yining Wang
Zifan Qian
34
25
0
01 Mar 2024
Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Lei Li
Yuqi Wang
Runxin Xu
Peiyi Wang
Xiachong Feng
Lingpeng Kong
Qi Liu
45
51
0
01 Mar 2024
Improving Socratic Question Generation using Data Augmentation and Preference Optimization
Nischal Ashok Kumar
Andrew Lan
43
8
0
01 Mar 2024
Large Convolutional Model Tuning via Filter Subspace
Wei Chen
Zichen Miao
Qiang Qiu
62
3
0
01 Mar 2024
Standardizing the Measurement of Text Diversity: A Tool and a Comparative Analysis of Scores
Chantal Shaib
Joe Barrow
Jiuding Sun
Alexa F. Siu
Byron C. Wallace
A. Nenkova
79
33
0
01 Mar 2024
LLM-Ensemble: Optimal Large Language Model Ensemble Method for E-commerce Product Attribute Value Extraction
Chenhao Fang
Xiaohan Li
Zezhong Fan
Jianpeng Xu
Kaushiki Nag
Evren Körpeoglu
Sushant Kumar
Kannan Achan
19
36
0
29 Feb 2024
UNITS: A Unified Multi-Task Time Series Model
Shanghua Gao
Teddy Koker
Owen Queen
Thomas Hartvigsen
Theodoros Tsiligkaridis
Marinka Zitnik
AI4TS
54
17
0
29 Feb 2024
FAC
2
^2
2
E: Better Understanding Large Language Model Capabilities by Dissociating Language and Cognition
Xiaoqiang Wang
Bang Liu
Lingfei Wu
44
0
0
29 Feb 2024
Resonance RoPE: Improving Context Length Generalization of Large Language Models
Suyuchen Wang
I. Kobyzev
Peng Lu
Mehdi Rezagholizadeh
Bang Liu
43
11
0
29 Feb 2024
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Tsai-Shien Chen
Aliaksandr Siarohin
Willi Menapace
Ekaterina Deyneka
Hsiang-wei Chao
...
Yuwei Fang
Hsin-Ying Lee
Jian Ren
Ming-Hsuan Yang
Sergey Tulyakov
VGen
89
180
0
29 Feb 2024
The All-Seeing Project V2: Towards General Relation Comprehension of the Open World
Weiyun Wang
Yiming Ren
Hao Luo
Tiantong Li
Chenxiang Yan
...
Qingyun Li
Lewei Lu
Xizhou Zhu
Yu Qiao
Jifeng Dai
MLLM
55
48
0
29 Feb 2024
Retrieval-Augmented Generation for AI-Generated Content: A Survey
Penghao Zhao
Hailin Zhang
Qinhan Yu
Zhengren Wang
Yunteng Geng
Fangcheng Fu
Ling Yang
Wentao Zhang
Jie Jiang
Bin Cui
3DV
132
235
0
29 Feb 2024
Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models
Chao Qian
Jie Zhang
Wei Yao
Dongrui Liu
Zhen-fei Yin
Yu Qiao
Yong Liu
Jing Shao
LLMSV
LRM
57
13
0
29 Feb 2024
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
Soham De
Samuel L. Smith
Anushan Fernando
Aleksandar Botev
George-Christian Muraru
...
David Budden
Yee Whye Teh
Razvan Pascanu
Nando de Freitas
Çağlar Gülçehre
Mamba
61
117
0
29 Feb 2024
On the Scaling Laws of Geographical Representation in Language Models
Nathan Godey
Eric Villemonte de la Clergerie
Benoît Sagot
51
6
0
29 Feb 2024
Navigating Hallucinations for Reasoning of Unintentional Activities
Shresth Grover
Vibhav Vineet
Yogesh S Rawat
LRM
57
1
0
29 Feb 2024
OpenMedLM: Prompt engineering can out-perform fine-tuning in medical question-answering with open-source large language models
Jenish Maharjan
A. Garikipati
N. Singh
Leo Cyrus
Mayank Sharma
M. Ciobanu
G. Barnes
R. Thapa
Q. Mao
R. Das
LM&MA
ELM
42
26
0
29 Feb 2024
Exploring the Potential of Large Language Models for Improving Digital Forensic Investigation Efficiency
Akila Wickramasekara
Frank Breitinger
Mark Scanlon
52
8
0
29 Feb 2024
SEED: Customize Large Language Models with Sample-Efficient Adaptation for Code Generation
Xue Jiang
Yihong Dong
Zhi Jin
Ge Li
VLM
47
4
0
29 Feb 2024
WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Jiantao Qiu
Haijun Lv
Zhenjiang Jin
Rui Wang
Wenchang Ning
...
Zhongying Tu
Lin Dahua
Yu Qiao
Hang Yan
Conghui He
49
6
0
29 Feb 2024
PlanGPT: Enhancing Urban Planning with Tailored Language Model and Efficient Retrieval
He Zhu
Wenjia Zhang
Nuoxian Huang
Boyang Li
Luyao Niu
...
Yicheng Tao
Junyou Su
Zhaoya Gong
Chenyu Fang
Xing Liu
LLMAG
58
10
0
29 Feb 2024
GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
Qintong Li
Leyang Cui
Xueliang Zhao
Lingpeng Kong
Wei Bi
LRM
45
49
0
29 Feb 2024
RiNALMo: General-Purpose RNA Language Models Can Generalize Well on Structure Prediction Tasks
Rafael Josip Penić
Tin Vlasic
Roland G. Huber
Yue Wan
M. Šikić
AI4CE
24
27
0
29 Feb 2024
Improving Legal Judgement Prediction in Romanian with Long Text Encoders
Mihai Masala
Traian Rebedea
Horia Velicu
AILaw
43
2
0
29 Feb 2024
Beyond Language Models: Byte Models are Digital World Simulators
Shangda Wu
Xu Tan
Zili Wang
Rui Wang
Xiaobing Li
Maosong Sun
35
12
0
29 Feb 2024
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model
Hao-Ran Cheng
Erjia Xiao
Jindong Gu
Le Yang
Jinhao Duan
Jize Zhang
Jiahang Cao
Kaidi Xu
Renjing Xu
39
6
0
29 Feb 2024
Think Fast, Think Slow, Think Critical: Designing an Automated Propaganda Detection Tool
L. Zavolokina
Kilian Sprenkamp
Zoya Katashinskaya
Daniel Gordon Jones
Gerhard Schwabe
45
13
0
29 Feb 2024
VIXEN: Visual Text Comparison Network for Image Difference Captioning
Alexander Black
Jing Shi
Yifei Fai
Tu Bui
John Collomosse
52
5
0
29 Feb 2024
A SOUND APPROACH: Using Large Language Models to generate audio descriptions for egocentric text-audio retrieval
Andreea-Maria Oncescu
João F. Henriques
Andrew Zisserman
Samuel Albanie
A. Sophia Koepke
38
5
0
29 Feb 2024
Percept, Chat, and then Adapt: Multimodal Knowledge Transfer of Foundation Models for Open-World Video Recognition
Boyu Chen
Siran Chen
Kunchang Li
Qinglin Xu
Yu Qiao
Yali Wang
34
3
0
29 Feb 2024
Previous
1
2
3
...
95
96
97
...
115
116
117
Next