ResearchTrend.AI
OPT: Open Pre-trained Transformer Language Models


2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
    VLM
    OSLM
    AI4CE

Papers citing "OPT: Open Pre-trained Transformer Language Models"

50 / 2,454 papers shown
Title
CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers
Longwei Zou
Qingyang Wang
Han Zhao
Jiangang Kong
Yi Yang
Yangdong Deng
50
0
0
10 Apr 2024
FairPair: A Robust Evaluation of Biases in Language Models through Paired Perturbations
Jane Dwivedi-Yu
Raaz Dwivedi
Timo Schick
43
2
0
09 Apr 2024
MORPHeus: a Multimodal One-armed Robot-assisted Peeling System with Human Users In-the-loop
Ruolin Ye
Yifei Hu
Yuhan Bian
Luke Kulm
T. Bhattacharjee
56
6
0
09 Apr 2024
FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models
Zhuohao Yu
Chang Gao
Wenjin Yao
Yidong Wang
Zhengran Zeng
Wei Ye
Jindong Wang
Yue Zhang
Shikun Zhang
46
1
0
09 Apr 2024
Privacy Preserving Prompt Engineering: A Survey
Kennedy Edemacu
Xintao Wu
63
18
0
09 Apr 2024
MoReVQA: Exploring Modular Reasoning Models for Video Question Answering
Juhong Min
Shyamal Buch
Arsha Nagrani
Minsu Cho
Cordelia Schmid
LRM
46
21
0
09 Apr 2024
Language Models on a Diet: Cost-Efficient Development of Encoders for Closely-Related Languages via Additional Pretraining
Nikola Ljubesic
Vít Suchomel
Peter Rupnik
Taja Kuzman
Rik van Noord
CLL
35
5
0
08 Apr 2024
Interpreting Themes from Educational Stories
Yigeng Zhang
Fabio A. González
Thamar Solorio
64
0
0
08 Apr 2024
DLoRA: Distributed Parameter-Efficient Fine-Tuning Solution for Large Language Model
Chao Gao
Sai Qian Zhang
ALM
123
7
0
08 Apr 2024
StockGPT: A GenAI Model for Stock Prediction and Trading
Dat Mai
AIFin
AI4TS
48
5
0
07 Apr 2024
Facial Affective Behavior Analysis with Instruction Tuning
Yifan Li
Anh Dao
Wentao Bao
Zhen Tan
Tianlong Chen
Huan Liu
Yu Kong
CVBM
67
15
0
07 Apr 2024
Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers
Libo Qin
Qiguang Chen
Yuhang Zhou
Zhi Chen
Hai-Tao Zheng
Lizi Liao
Min Li
Wanxiang Che
Philip S. Yu
LRM
57
36
0
07 Apr 2024
GenEARL: A Training-Free Generative Framework for Multimodal Event Argument Role Labeling
Hritik Bansal
Po-Nien Kung
P. Brantingham
Weisheng Wang
Miao Zheng
VLM
36
1
0
07 Apr 2024
PhyloLM : Inferring the Phylogeny of Large Language Models and Predicting their Performances in Benchmarks
Nicolas Yax
Pierre-Yves Oudeyer
Stefano Palminteri
67
5
0
06 Apr 2024
Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models
Songtao Jiang
Yan Zhang
Chenyi Zhou
Yeying Jin
Yang Feng
Jian Wu
Zuozhu Liu
LRM
VLM
50
4
0
06 Apr 2024
Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind
Hongchuan Zeng
Hongshen Xu
Lu Chen
Kai Yu
59
5
0
06 Apr 2024
Koala: Key frame-conditioned long video-LLM
Reuben Tan
Ximeng Sun
Ping Hu
Jui-hsien Wang
Hanieh Deilamsalehy
Bryan A. Plummer
Bryan C. Russell
Kate Saenko
38
36
0
05 Apr 2024
BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models
Hongyu Lin
Alexis Conneau
Alan Akbik
40
5
0
05 Apr 2024
Simple Techniques for Enhancing Sentence Embeddings in Generative Language Models
Bowen Zhang
Kehua Chang
Chunping Li
42
11
0
05 Apr 2024
FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping
Ajay Jaiswal
Bodun Hu
Lu Yin
Yeonju Ro
Shiwei Liu
Tianlong Chen
Aditya Akella
58
12
0
05 Apr 2024
Outlier-Efficient Hopfield Layers for Large Transformer-Based Models
Jerry Yao-Chieh Hu
Pei-Hsuan Chang
Haozheng Luo
Hong-Yu Chen
Weijian Li
Wei-Po Wang
Han Liu
49
26
0
04 Apr 2024
LongVLM: Efficient Long Video Understanding via Large Language Models
Yuetian Weng
Mingfei Han
Haoyu He
Xiaojun Chang
Bohan Zhuang
VLM
68
57
0
04 Apr 2024
Towards Pareto Optimal Throughput in Small Language Model Serving
Pol G. Recasens
Yue Zhu
Chen Wang
Eun Kyung Lee
Olivier Tardieu
Alaa Youssef
Jordi Torres
Josep Ll. Berral
40
4
0
04 Apr 2024
Robust Pronoun Fidelity with English LLMs: Are they Reasoning, Repeating, or Just Biased?
Vagrant Gautam
Eileen Bingert
D. Zhu
Anne Lauscher
Dietrich Klakow
45
8
0
04 Apr 2024
The Impact of Unstated Norms in Bias Analysis of Language Models
Farnaz Kohankhaki
D. B. Emerson
David B. Emerson
Laleh Seyyed-Kalantari
Faiza Khan Khattak
62
1
0
04 Apr 2024
BCAmirs at SemEval-2024 Task 4: Beyond Words: A Multimodal and Multilingual Exploration of Persuasion in Memes
Amirhossein Abaskohi
AmirHossein Dabiri Aghdam
Lele Wang
Giuseppe Carenini
20
1
0
03 Apr 2024
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
Keyu Tian
Yi-Xin Jiang
Zehuan Yuan
Bingyue Peng
Liwei Wang
VGen
55
263
0
03 Apr 2024
Automatic Prompt Selection for Large Language Models
Viet-Tung Do
Van-Khanh Hoang
Duy-Hung Nguyen
Shahab Sabahi
Jeff Yang
Hajime Hotta
Minh-Tien Nguyen
Hung Le
35
6
0
03 Apr 2024
Unblind Text Inputs: Predicting Hint-text of Text Input in Mobile Apps via LLM
Zhe Liu
Chunyang Chen
Junjie Wang
Mengzhuo Chen
Boyu Wu
Yuekai Huang
Jun Hu
Qing Wang
37
10
0
03 Apr 2024
Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models
Taiqiang Wu
Chaofan Tao
Jiahao Wang
Zhe Zhao
Ngai Wong
ALM
54
15
0
03 Apr 2024
Towards Large Language Model driven Reference-less Translation Evaluation for English and Indian Languages
Vandan Mujadia
Pruthwik Mishra
Arafat Ahsan
D. Sharma
ELM
42
2
0
03 Apr 2024
Benchmarking Large Language Models for Persian: A Preliminary Study Focusing on ChatGPT
Amirhossein Abaskohi
Sara Baruni
Mostafa Masoudi
Nesa Abbasi
Mohammad Hadi Babalou
...
Samin Mahdizadeh Sani
Nikoo Naghavian
Danial Namazifard
Pouya Sadeghi
Yadollah Yaghoobzadeh
LRM
32
4
0
03 Apr 2024
Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language Models
Jingyang Zhang
Jingwei Sun
Eric C. Yeats
Ouyang Yang
Martin Kuo
Jianyi Zhang
Hao Frank Yang
Hai "Helen" Li
43
43
0
03 Apr 2024
READ: Improving Relation Extraction from an ADversarial Perspective
Dawei Li
William Hogan
Jingbo Shang
AAML
36
0
0
02 Apr 2024
Deconstructing In-Context Learning: Understanding Prompts via Corruption
Namrata Shivagunde
Vladislav Lialin
Sherin Muckatira
Anna Rumshisky
38
2
0
02 Apr 2024
Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts
Zhuo Chen
Xinyu Wang
Yong-jia Jiang
Pengjun Xie
Fei Huang
Kewei Tu
RALM
29
2
0
02 Apr 2024
VLRM: Vision-Language Models act as Reward Models for Image Captioning
Maksim Dzabraev
Alexander Kunitsyn
Andrei Ivaniuta
VLM
MLLM
31
3
0
02 Apr 2024
Minimize Quantization Output Error with Bias Compensation
Cheng Gong
Haoshuai Zheng
Mengting Hu
Zheng Lin
Deng-Ping Fan
Yuzhi Zhang
Tao Li
MQ
38
2
0
02 Apr 2024
A Survey on Large Language Model-Based Game Agents
Sihao Hu
Tiansheng Huang
Gaowen Liu
Ramana Rao Kompella
Gaowen Liu
Selim Furkan Tekin
Yichang Xu
Zachary Yahn
Ling Liu
LLMAG
LM&Ro
AI4CE
LM&MA
71
52
0
02 Apr 2024
CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models
Xuechen Liang
Meiling Tao
Yinghui Xia
Yiting Xie
Jun Wang
JingSong Yang
LLMAG
33
12
0
02 Apr 2024
Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models
Yuxin Wen
Leo Marchyok
Sanghyun Hong
Jonas Geiping
Tom Goldstein
Nicholas Carlini
SILM
AAML
39
9
0
01 Apr 2024
ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback
Zhenyu Hou
Yiin Niu
Zhengxiao Du
Xiaohan Zhang
Xiao Liu
...
Qinkai Zheng
Minlie Huang
Hongning Wang
Jie Tang
Yuxiao Dong
ALM
42
18
0
01 Apr 2024
A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias
Yuemei Xu
Ling Hu
Jiayi Zhao
Zihan Qiu
Yuqi Ye
Hanwen Gu
LRM
32
37
0
01 Apr 2024
LLaMA-Excitor: General Instruction Tuning via Indirect Feature Interaction
Bo Zou
Chao Yang
Yu Qiao
Chengbin Quan
Youjian Zhao
47
6
0
01 Apr 2024
Learning by Correction: Efficient Tuning Task for Zero-Shot Generative Vision-Language Reasoning
Rongjie Li
Yu Wu
Xuming He
MLLM
LRM
VLM
30
2
0
01 Apr 2024
From Robustness to Improved Generalization and Calibration in Pre-trained Language Models
Josip Jukić
Jan Snajder
53
0
0
31 Mar 2024
Extensive Self-Contrast Enables Feedback-Free Language Model Alignment
Xiao Liu
Xixuan Song
Yuxiao Dong
Jie Tang
SyDa
36
5
0
31 Mar 2024
Harnessing the Power of Large Language Model for Uncertainty Aware Graph Processing
Zhenyu Qian
Yiming Qian
Yuting Song
Fei Gao
Hai Jin
Chen Yu
Xia Xie
53
0
0
31 Mar 2024
MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks
Letian Peng
Zilong Wang
Feng Yao
Zihan Wang
Jingbo Shang
SyDa
46
13
0
30 Mar 2024
DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation
Aru Maekawa
Satoshi Kosugi
Kotaro Funakoshi
Manabu Okumura
DD
59
10
0
30 Mar 2024