Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.01068
Cited By
OPT: Open Pre-trained Transformer Language Models
2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM
OSLM
AI4CE
Re-assign community
ArXiv
PDF
HTML
Papers citing
"OPT: Open Pre-trained Transformer Language Models"
50 / 2,460 papers shown
Title
PG-Video-LLaVA: Pixel Grounding Large Video-Language Models
Shehan Munasinghe
Rusiru Thushara
Muhammad Maaz
H. Rasheed
Salman Khan
Mubarak Shah
Fahad Khan
VLM
MLLM
37
34
0
22 Nov 2023
On the Calibration of Large Language Models and Alignment
Chiwei Zhu
Benfeng Xu
Quan Wang
Yongdong Zhang
Zhendong Mao
82
34
0
22 Nov 2023
Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus
Tianhang Zhang
Lin Qiu
Qipeng Guo
Cheng Deng
Yue Zhang
Zheng Zhang
Cheng Zhou
Xinbing Wang
Luoyi Fu
HILM
79
49
0
22 Nov 2023
Towards Better Parameter-Efficient Fine-Tuning for Large Language Models: A Position Paper
Chengyu Wang
Junbing Yan
Wei Zhang
Jun Huang
ALM
47
3
0
22 Nov 2023
Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
Zhen Zhao
Jingqun Tang
Chunhui Lin
Binghong Wu
Can Huang
Hao Liu
Xin Tan
Zhizhong Zhang
Yuan Xie
41
23
0
22 Nov 2023
Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue
Aron Molnar
Jaap Jumelet
Mario Giulianelli
Arabella J. Sinclair
40
2
0
21 Nov 2023
Oasis: Data Curation and Assessment System for Pretraining of Large Language Models
Tong Zhou
Yubo Chen
Pengfei Cao
Kang Liu
Jun Zhao
Shengping Liu
31
3
0
21 Nov 2023
Advancing Transformer Architecture in Long-Context Large Language Models: A Comprehensive Survey
Yunpeng Huang
Jingwei Xu
Junyu Lai
Zixu Jiang
Taolue Chen
...
Xiaoxing Ma
Lijuan Yang
Zhou Xin
Shupeng Li
Penghao Zhao
LLMAG
KELM
49
56
0
21 Nov 2023
AcademicGPT: Empowering Academic Research
Shufa Wei
Xiaolong Xu
Xianbiao Qi
Xi Yin
Jun Xia
...
Chihao Dai
Lihua Wang
Xiaohui Liu
Lei Zhang
Yutao Xie
LM&MA
52
3
0
21 Nov 2023
Applications of Large Scale Foundation Models for Autonomous Driving
Yu Huang
Yue Chen
Zhu Li
ELM
AI4CE
LRM
ALM
LM&Ro
66
15
0
20 Nov 2023
LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning
Han Guo
P. Greengard
Eric P. Xing
Yoon Kim
MQ
43
44
0
20 Nov 2023
VLM-Eval: A General Evaluation on Video Large Language Models
Shuailin Li
Yuang Zhang
Yucheng Zhao
Qiuyue Wang
Fan Jia
Yingfei Liu
Tiancai Wang
MLLM
ELM
44
2
0
20 Nov 2023
Event Camera Data Dense Pre-training
Yan Yang
Liyuan Pan
Liu Liu
40
4
0
20 Nov 2023
Self-Supervised Pretraining for Heterogeneous Hypergraph Neural Networks
Abdalgader Abubaker
Takanori Maehara
M. Nimishakavi
Vassilis Plachouras
SSL
45
0
0
19 Nov 2023
RecExplainer: Aligning Large Language Models for Explaining Recommendation Models
Yuxuan Lei
Jianxun Lian
Jing Yao
Xu Huang
Defu Lian
Xing Xie
LRM
35
5
0
18 Nov 2023
An Embodied Generalist Agent in 3D World
Jiangyong Huang
Silong Yong
Xiaojian Ma
Xiongkun Linghu
Puhao Li
Yan Wang
Qing Li
Song-Chun Zhu
Baoxiong Jia
Siyuan Huang
LM&Ro
31
139
0
18 Nov 2023
Modality-invariant and Specific Prompting for Multimodal Human Perception Understanding
Hao Sun
Ziwei Niu
Xinyao Yu
Jiaqing Liu
Yen-Wei Chen
Lanfen Lin
39
0
0
17 Nov 2023
FREE: The Foundational Semantic Recognition for Modeling Environmental Ecosystems
Shiyuan Luo
Juntong Ni
Shengyu Chen
Runlong Yu
Yiqun Xie
Licheng Liu
Zhenong Jin
Huaxiu Yao
Xiaowei Jia
44
8
0
17 Nov 2023
Hijacking Large Language Models via Adversarial In-Context Learning
Yao Qiang
Xiangyu Zhou
Dongxiao Zhu
37
33
0
16 Nov 2023
The Curious Decline of Linguistic Diversity: Training Language Models on Synthetic Text
Yanzhu Guo
Guokan Shang
Michalis Vazirgiannis
Chloé Clavel
39
52
0
16 Nov 2023
Can Language Model Moderators Improve the Health of Online Discourse?
Hyundong Justin Cho
Shuai Liu
Taiwei Shi
Darpan Jain
Basem Rizk
...
Zixun Lu
Nuan Wen
Jonathan Gratch
Emilio Ferrera
Jonathan May
AI4MH
45
13
0
16 Nov 2023
Investigating Data Contamination in Modern Benchmarks for Large Language Models
Chunyuan Deng
Yilun Zhao
Xiangru Tang
Mark B. Gerstein
Arman Cohan
AAML
ELM
37
53
0
16 Nov 2023
Knowledge Plugins: Enhancing Large Language Models for Domain-Specific Recommendations
Jing Yao
Wei Xu
Jianxun Lian
Xiting Wang
Xiaoyuan Yi
Xing Xie
ALM
38
19
0
16 Nov 2023
Large Language Models are Few-Shot Training Example Generators: A Case Study in Fallacy Recognition
Tariq Alhindi
Smaranda Muresan
Preslav Nakov
HILM
LRM
47
5
0
16 Nov 2023
A Speed Odyssey for Deployable Quantization of LLMs
Qingyuan Li
Ran Meng
Yiduo Li
Bo Zhang
Liang Li
Yifan Lu
Xiangxiang Chu
Yerui Sun
Yuchen Xie
MQ
67
7
0
16 Nov 2023
SQATIN: Supervised Instruction Tuning Meets Question Answering for Improved Dialogue NLU
E. Razumovskaia
Goran Glavaš
Anna Korhonen
Ivan Vulić
LRM
34
2
0
16 Nov 2023
Assessing Translation capabilities of Large Language Models involving English and Indian Languages
Vandan Mujadia
Ashok Urlana
Yash Bhaskar
Penumalla Aditya Pavani
Kukkapalli Shravya
Parameswari Krishnamurthy
D. Sharma
ELM
196
7
0
15 Nov 2023
Mind's Mirror: Distilling Self-Evaluation Capability and Comprehensive Thinking from Large Language Models
Weize Liu
Guocong Li
Kai Zhang
Bang Du
Qiyuan Chen
Xuming Hu
Hongxia Xu
Jintai Chen
Jian Wu
LRM
18
6
0
15 Nov 2023
How Vocabulary Sharing Facilitates Multilingualism in LLaMA?
Fei Yuan
Shuai Yuan
Zhiyong Wu
Lei Li
47
10
0
15 Nov 2023
Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts
Yunshi Lan
Xiang Li
Xin Liu
Yang Li
Wei Qin
Weining Qian
LRM
ReLM
46
26
0
15 Nov 2023
A Robust Semantics-based Watermark for Large Language Model against Paraphrasing
Jie Ren
Han Xu
Yiding Liu
Yingqian Cui
Shuaiqiang Wang
Dawei Yin
Jiliang Tang
OffRL
33
46
0
15 Nov 2023
Efficient Continual Pre-training for Building Domain Specific Large Language Models
Yong Xie
Karan Aggarwal
Aitzaz Ahmad
CLL
47
21
0
14 Nov 2023
Zero-shot audio captioning with audio-language model guidance and audio context keywords
Leonard Salewski
Stefan Fauth
A. Sophia Koepke
Zeynep Akata
37
10
0
14 Nov 2023
Anti-LM Decoding for Zero-shot In-context Machine Translation
Suzanna Sia
Alexandra DeLucia
Kevin Duh
46
2
0
14 Nov 2023
A Survey of Confidence Estimation and Calibration in Large Language Models
Jiahui Geng
Fengyu Cai
Yuxia Wang
Heinz Koeppl
Preslav Nakov
Iryna Gurevych
UQCV
46
57
0
14 Nov 2023
REST: Retrieval-Based Speculative Decoding
Zhenyu He
Zexuan Zhong
Tianle Cai
Jason D. Lee
Di He
RALM
28
81
0
14 Nov 2023
Unlock the Power: Competitive Distillation for Multi-Modal Large Language Models
Xinwei Li
Li Lin
Shuai Wang
Chen Qian
25
3
0
14 Nov 2023
Vision-Language Instruction Tuning: A Review and Analysis
Chen Li
Yixiao Ge
Dian Li
Ying Shan
VLM
41
12
0
14 Nov 2023
Insights into Classifying and Mitigating LLMs' Hallucinations
Alessandro Bruno
P. Mazzeo
Aladine Chetouani
Marouane Tliba
M. A. Kerkouri
HILM
56
10
0
14 Nov 2023
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Peng Jin
Ryuichi Takanobu
Caiwan Zhang
Xiaochun Cao
Li-ming Yuan
MLLM
41
227
0
14 Nov 2023
Explainable History Distillation by Marked Temporal Point Process
Sishun Liu
Ke Deng
Yan Wang
Xiuzhen Zhang
35
0
0
13 Nov 2023
SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models
Ziyi Lin
Chris Liu
Renrui Zhang
Peng Gao
Longtian Qiu
...
Siyuan Huang
Yichi Zhang
Xuming He
Hongsheng Li
Yu Qiao
MLLM
VLM
33
214
0
13 Nov 2023
To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning
Junke Wang
Lingchen Meng
Zejia Weng
Bo He
Zuxuan Wu
Yu-Gang Jiang
MLLM
VLM
38
94
0
13 Nov 2023
Leveraging Multiple Teachers for Test-Time Adaptation of Language-Guided Classifiers
Kangda Wei
Sayan Ghosh
Rakesh R Menon
Shashank Srivastava
40
2
0
13 Nov 2023
Psychometric Predictive Power of Large Language Models
Tatsuki Kuribayashi
Yohei Oseki
Timothy Baldwin
LM&MA
37
3
0
13 Nov 2023
Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers
Haowen Pan
Yixin Cao
Xiaozhi Wang
Xun Yang
Meng Wang
KELM
44
25
0
13 Nov 2023
Controlled Text Generation for Black-box Language Models via Score-based Progressive Editor
Sangwon Yu
Changmin Lee
Hojin Lee
Sungroh Yoon
37
0
0
13 Nov 2023
Do large language models and humans have similar behaviors in causal inference with script knowledge?
Xudong Hong
Margarita Ryzhova
Daniel Adrian Biondi
Ram Sarkar
47
5
0
13 Nov 2023
What Large Language Models Bring to Text-rich VQA?
Xuejing Liu
Wei Tang
Xinzhe Ni
Jinghui Lu
Rui Zhao
Zechao Li
Fei Tan
24
9
0
13 Nov 2023
Exploring the Factual Consistency in Dialogue Comprehension of Large Language Models
Shuaijie She
Shujian Huang
Xingyun Wang
Yanke Zhou
Jiajun Chen
ELM
HILM
22
0
0
13 Nov 2023
Previous
1
2
3
...
27
28
29
...
48
49
50
Next