ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.13971
  4. Cited By
LLaMA: Open and Efficient Foundation Language Models

LLaMA: Open and Efficient Foundation Language Models

27 February 2023
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
Timothée Lacroix
Baptiste Rozière
Naman Goyal
Eric Hambro
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
    ALM
    PILM
ArXivPDFHTML

Papers citing "LLaMA: Open and Efficient Foundation Language Models"

50 / 5,821 papers shown
Title
Don't Half-listen: Capturing Key-part Information in Continual Instruction Tuning
Don't Half-listen: Capturing Key-part Information in Continual Instruction Tuning
Yongquan He
Xuancheng Huang
Xuancheng Huang
Peng Zhang
CLL
ALM
72
5
0
15 Mar 2024
Recurrent Drafter for Fast Speculative Decoding in Large Language Models
Recurrent Drafter for Fast Speculative Decoding in Large Language Models
Aonan Zhang
Chong-Jun Wang
Yi Wang
Xuanyu Zhang
Yunfei Cheng
37
17
0
14 Mar 2024
FakeWatch: A Framework for Detecting Fake News to Ensure Credible
  Elections
FakeWatch: A Framework for Detecting Fake News to Ensure Credible Elections
Shaina Raza
Tahniat Khan
Veronica Chatrath
Drai Paulen-Patterson
Mizanur Rahman
Oluwanifemi Bamgbose
28
4
0
14 Mar 2024
Video Mamba Suite: State Space Model as a Versatile Alternative for
  Video Understanding
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
Guo Chen
Yifei Huang
Jilan Xu
Baoqi Pei
Zhe Chen
Zhiqi Li
Jiahao Wang
Kunchang Li
Tong Lu
Limin Wang
Mamba
64
73
0
14 Mar 2024
Reawakening knowledge: Anticipatory recovery from catastrophic
  interference via structured training
Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training
Yanlai Yang
Matt Jones
Michael C. Mozer
Mengye Ren
68
1
0
14 Mar 2024
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Brandon McKinzie
Zhe Gan
J. Fauconnier
Sam Dodge
Bowen Zhang
...
Zirui Wang
Ruoming Pang
Peter Grasch
Alexander Toshev
Yinfei Yang
MLLM
43
189
0
14 Mar 2024
uaMix-MAE: Efficient Tuning of Pretrained Audio Transformers with
  Unsupervised Audio Mixtures
uaMix-MAE: Efficient Tuning of Pretrained Audio Transformers with Unsupervised Audio Mixtures
Afrina Tabassum
Dung N. Tran
Trung D. Q. Dang
Ismini Lourentzou
K. Koishida
50
0
0
14 Mar 2024
Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text
  Transformation
Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Yunhao Gou
Kai Chen
Zhili Liu
Lanqing Hong
Hang Xu
Zhenguo Li
Dit-Yan Yeung
James T. Kwok
Yu Zhang
MLLM
46
42
0
14 Mar 2024
OpenGraph: Open-Vocabulary Hierarchical 3D Graph Representation in
  Large-Scale Outdoor Environments
OpenGraph: Open-Vocabulary Hierarchical 3D Graph Representation in Large-Scale Outdoor Environments
Yinan Deng
Jiahui Wang
Jingyu Zhao
Xinyu Tian
Guangyan Chen
Yi Yang
Yufeng Yue
3DV
40
13
0
14 Mar 2024
GiT: Towards Generalist Vision Transformer through Universal Language
  Interface
GiT: Towards Generalist Vision Transformer through Universal Language Interface
Haiyang Wang
Hao Tang
Li Jiang
Shaoshuai Shi
Muhammad Ferjad Naeem
Hongsheng Li
Bernt Schiele
Liwei Wang
VLM
46
10
0
14 Mar 2024
BurstAttention: An Efficient Distributed Attention Framework for
  Extremely Long Sequences
BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences
Sun Ao
Weilin Zhao
Xu Han
Cheng Yang
Zhiyuan Liu
Chuan Shi
Maosong Sun
GNN
40
8
0
14 Mar 2024
UniCode: Learning a Unified Codebook for Multimodal Large Language
  Models
UniCode: Learning a Unified Codebook for Multimodal Large Language Models
Sipeng Zheng
Bohan Zhou
Yicheng Feng
Ye Wang
Zongqing Lu
VLM
MLLM
46
7
0
14 Mar 2024
A Continued Pretrained LLM Approach for Automatic Medical Note
  Generation
A Continued Pretrained LLM Approach for Automatic Medical Note Generation
Dong Yuan
Eti Rastogi
Gautam Naik
Sree Prasanna Rajagopal
Sagar Goyal
Fen Zhao
Jai Chintagunta
Jeff Ward
LM&MA
AI4MH
45
20
0
14 Mar 2024
RAGGED: Towards Informed Design of Retrieval Augmented Generation
  Systems
RAGGED: Towards Informed Design of Retrieval Augmented Generation Systems
Jennifer Hsia
Afreen Shaikh
Zhiruo Wang
Graham Neubig
RALM
27
10
0
14 Mar 2024
The First to Know: How Token Distributions Reveal Hidden Knowledge in
  Large Vision-Language Models?
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?
Qinyu Zhao
Ming Xu
Kartik Gupta
Akshay Asthana
Liang Zheng
Stephen Gould
29
8
0
14 Mar 2024
ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning
ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning
Ahmed Masry
Mehrad Shahmohammadi
Md. Rizwan Parvez
Enamul Hoque
Chenyu You
50
31
0
14 Mar 2024
VisionGPT: Vision-Language Understanding Agent Using Generalized
  Multimodal Framework
VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework
Chris Kelly
Luhui Hu
Bang Yang
Yu Tian
Deshun Yang
Cindy Yang
Zaoshan Huang
Zihao Li
Jiayin Hu
Yuexian Zou
47
9
0
14 Mar 2024
Second-Order Information Matters: Revisiting Machine Unlearning for
  Large Language Models
Second-Order Information Matters: Revisiting Machine Unlearning for Large Language Models
Kang Gu
Md. Rafi Ur Rashid
Najrin Sultana
Shagufta Mehnaz
MU
47
6
0
13 Mar 2024
Teaching Machines to Code: Smart Contract Translation with LLMs
Teaching Machines to Code: Smart Contract Translation with LLMs
Rabimba Karanjai
Lei Xu
Weidong Shi
45
6
0
13 Mar 2024
Simple and Scalable Strategies to Continually Pre-train Large Language
  Models
Simple and Scalable Strategies to Continually Pre-train Large Language Models
Adam Ibrahim
Benjamin Thérien
Kshitij Gupta
Mats L. Richter
Quentin Anthony
Timothée Lesort
Eugene Belilovsky
Irina Rish
KELM
CLL
46
54
0
13 Mar 2024
Strengthening Multimodal Large Language Model with Bootstrapped
  Preference Optimization
Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization
Renjie Pi
Tianyang Han
Wei Xiong
Jipeng Zhang
Runtao Liu
Rui Pan
Tong Zhang
MLLM
50
34
0
13 Mar 2024
Do Language Models Care About Text Quality? Evaluating Web-Crawled
  Corpora Across 11 Languages
Do Language Models Care About Text Quality? Evaluating Web-Crawled Corpora Across 11 Languages
Rik van Noord
Taja Kuzman
Peter Rupnik
Nikola Ljubesic
Miquel Espla-Gomis
Gema Ramírez-Sánchez
Antonio Toral
ALM
40
2
0
13 Mar 2024
Token Alignment via Character Matching for Subword Completion
Token Alignment via Character Matching for Subword Completion
Ben Athiwaratkun
Shiqi Wang
Mingyue Shang
Yuchen Tian
Zijian Wang
Sujan Kumar Gonugondla
Sanjay Krishna Gouda
Rob Kwiatowski
Ramesh Nallapati
Bing Xiang
50
4
0
13 Mar 2024
Bifurcated Attention: Accelerating Massively Parallel Decoding with
  Shared Prefixes in LLMs
Bifurcated Attention: Accelerating Massively Parallel Decoding with Shared Prefixes in LLMs
Ben Athiwaratkun
Sujan Kumar Gonugondla
Sanjay Krishna Gouda
Haifeng Qian
Hantian Ding
...
Liangfu Chen
Parminder Bhatia
Ramesh Nallapati
Sudipta Sengupta
Bing Xiang
59
4
0
13 Mar 2024
Language models scale reliably with over-training and on downstream
  tasks
Language models scale reliably with over-training and on downstream tasks
S. Gadre
Georgios Smyrnis
Vaishaal Shankar
Suchin Gururangan
Mitchell Wortsman
...
Y. Carmon
Achal Dave
Reinhard Heckel
Niklas Muennighoff
Ludwig Schmidt
ALM
ELM
LRM
108
42
0
13 Mar 2024
Masked Generative Story Transformer with Character Guidance and Caption
  Augmentation
Masked Generative Story Transformer with Character Guidance and Caption Augmentation
Christos Papadimitriou
Giorgos Filandrianos
Maria Lymperaiou
Giorgos Stamou
DiffM
102
1
0
13 Mar 2024
SMART: Submodular Data Mixture Strategy for Instruction Tuning
SMART: Submodular Data Mixture Strategy for Instruction Tuning
Kowndinya Renduchintala
S. Bhatia
Ganesh Ramakrishnan
49
3
0
13 Mar 2024
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large
  Language Model
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model
Cheng Chen
Sitong Su
Xu Luo
Hengtao Shen
Lianli Gao
Jingkuan Song
CLL
42
13
0
13 Mar 2024
LLM-Assisted Light: Leveraging Large Language Model Capabilities for
  Human-Mimetic Traffic Signal Control in Complex Urban Environments
LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments
Maonan Wang
Aoyu Pang
Yuheng Kan
Man-On Pun
Chung Shue Chen
Bo Huang
43
18
0
13 Mar 2024
StreamingDialogue: Prolonged Dialogue Learning via Long Context
  Compression with Minimal Losses
StreamingDialogue: Prolonged Dialogue Learning via Long Context Compression with Minimal Losses
Jia-Nan Li
Quan Tu
Cunli Mao
Zhengtao Yu
Ji-Rong Wen
Rui Yan
OffRL
29
3
0
13 Mar 2024
TINA: Think, Interaction, and Action Framework for Zero-Shot Vision
  Language Navigation
TINA: Think, Interaction, and Action Framework for Zero-Shot Vision Language Navigation
Dingbang Li
Wenzhou Chen
Xin Lin
LLMAG
LM&Ro
49
4
0
13 Mar 2024
Learning to Watermark LLM-generated Text via Reinforcement Learning
Learning to Watermark LLM-generated Text via Reinforcement Learning
Xiaojun Xu
Yuanshun Yao
Yang Liu
29
10
0
13 Mar 2024
MoleculeQA: A Dataset to Evaluate Factual Accuracy in Molecular
  Comprehension
MoleculeQA: A Dataset to Evaluate Factual Accuracy in Molecular Comprehension
Xingyu Lu
He Cao
Zijing Liu
Shengyuan Bai
Leqing Chen
Yuan Yao
Hai-Tao Zheng
Yu Li
HILM
23
7
0
13 Mar 2024
Large Language Models are Contrastive Reasoners
Large Language Models are Contrastive Reasoners
Liang Yao
ReLM
ELM
LRM
47
2
0
13 Mar 2024
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation
Minbin Huang
Yanxin Long
Xinchi Deng
Ruihang Chu
Jiangfeng Xiong
Xiaodan Liang
Hong Cheng
Qinglin Lu
Wei Liu
MLLM
EGVM
65
8
0
13 Mar 2024
Mechanics of Next Token Prediction with Self-Attention
Mechanics of Next Token Prediction with Self-Attention
Yingcong Li
Yixiao Huang
M. E. Ildiz
A. S. Rawat
Samet Oymak
42
28
0
12 Mar 2024
CHAI: Clustered Head Attention for Efficient LLM Inference
CHAI: Clustered Head Attention for Efficient LLM Inference
Saurabh Agarwal
Bilge Acun
Basil Homer
Mostafa Elhoushi
Yejin Lee
Shivaram Venkataraman
Dimitris Papailiopoulos
Carole-Jean Wu
60
8
0
12 Mar 2024
Big City Bias: Evaluating the Impact of Metropolitan Size on
  Computational Job Market Abilities of Language Models
Big City Bias: Evaluating the Impact of Metropolitan Size on Computational Job Market Abilities of Language Models
Charlie Campanella
Rob van der Goot
28
0
0
12 Mar 2024
LG-Traj: LLM Guided Pedestrian Trajectory Prediction
LG-Traj: LLM Guided Pedestrian Trajectory Prediction
Pranav Singh Chib
Pravendra Singh
30
11
0
12 Mar 2024
Beyond Text: Frozen Large Language Models in Visual Signal Comprehension
Beyond Text: Frozen Large Language Models in Visual Signal Comprehension
Lei Zhu
Fangyun Wei
Yanye Lu
MLLM
VLM
52
18
0
12 Mar 2024
Rethinking Generative Large Language Model Evaluation for Semantic
  Comprehension
Rethinking Generative Large Language Model Evaluation for Semantic Comprehension
Fangyun Wei
Xi Chen
Linzi Luo
ELM
ALM
LRM
38
7
0
12 Mar 2024
The Missing Piece in Model Editing: A Deep Dive into the Hidden Damage
  Brought By Model Editing
The Missing Piece in Model Editing: A Deep Dive into the Hidden Damage Brought By Model Editing
Jianchen Wang
Zhouhong Gu
Xiaoxuan Zhu
Lin Zhang
Haoning Ye
Zhuozhi Xiong
Hongwei Feng
Yanghua Xiao
KELM
43
2
0
12 Mar 2024
Fine-tuning Large Language Models with Sequential Instructions
Fine-tuning Large Language Models with Sequential Instructions
Hanxu Hu
Simon Yu
Pinzhen Chen
Edoardo Ponti
ALM
LRM
81
15
0
12 Mar 2024
Masked AutoDecoder is Effective Multi-Task Vision Generalist
Masked AutoDecoder is Effective Multi-Task Vision Generalist
Han Qiu
Jiaxing Huang
Peng Gao
Lewei Lu
Xiaoqin Zhang
Shijian Lu
51
4
0
12 Mar 2024
Harder Tasks Need More Experts: Dynamic Routing in MoE Models
Harder Tasks Need More Experts: Dynamic Routing in MoE Models
Quzhe Huang
Zhenwei An
Zhuang Nan
Mingxu Tao
Chen Zhang
...
Kun Xu
Kun Xu
Liwei Chen
Songfang Huang
Yansong Feng
MoE
42
26
0
12 Mar 2024
Characterization of Large Language Model Development in the Datacenter
Characterization of Large Language Model Development in the Datacenter
Qi Hu
Zhisheng Ye
Zerui Wang
Guoteng Wang
Mengdie Zhang
...
Dahua Lin
Xiaolin Wang
Yingwei Luo
Yonggang Wen
Tianwei Zhang
56
45
0
12 Mar 2024
SIFiD: Reassess Summary Factual Inconsistency Detection with LLM
SIFiD: Reassess Summary Factual Inconsistency Detection with LLM
Jiuding Yang
Hui Liu
Weidong Guo
Zhuwei Rao
Yu-Syuan Xu
Di Niu
HILM
29
0
0
12 Mar 2024
Towards Faithful Explanations: Boosting Rationalization with Shortcuts
  Discovery
Towards Faithful Explanations: Boosting Rationalization with Shortcuts Discovery
Linan Yue
Qi Liu
Yichao Du
Li Wang
Weibo Gao
Yanqing An
34
5
0
12 Mar 2024
Let Storytelling Tell Vivid Stories: An Expressive and Fluent Multimodal
  Storyteller
Let Storytelling Tell Vivid Stories: An Expressive and Fluent Multimodal Storyteller
Chuanqi Zang
Jiji Tang
Rongsheng Zhang
Zeng Zhao
Tangjie Lv
Mingtao Pei
Wei Liang
35
3
0
12 Mar 2024
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning
Bingqian Lin
Yunshuang Nie
Ziming Wei
Jiaqi Chen
Shikui Ma
Jianhua Han
Hang Xu
Xiaojun Chang
Xiaodan Liang
LM&Ro
LRM
67
21
0
12 Mar 2024
Previous
123...929394...115116117
Next