ResearchTrend.AI
arXiv: 2205.01068
OPT: Open Pre-trained Transformer Language Models


2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
    VLM
    OSLM
    AI4CE

Papers citing "OPT: Open Pre-trained Transformer Language Models"

50 / 2,459 papers shown
Title
FACTIFY3M: A Benchmark for Multimodal Fact Verification with Explainability through 5W Question-Answering
Megha Chakraborty
Khusbu Pahwa
Anku Rani
Shreyas Chatterjee
Dwip Dalal
...
Shreyash Mishra
K. Sensharma
Aman Chadha
Amit P. Sheth
Amitava Das
DiffM
37
7
0
22 May 2023
Can We Edit Factual Knowledge by In-Context Learning?
Ce Zheng
Lei Li
Qingxiu Dong
Yuxuan Fan
Zhiyong Wu
Jingjing Xu
Baobao Chang
KELM
39
187
0
22 May 2023
A Frustratingly Simple Decoding Method for Neural Text Generation
Haoran Yang
Deng Cai
Huayang Li
Wei Bi
Wai Lam
Shuming Shi
51
11
0
22 May 2023
Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models
Oana Ignat
Zhijing Jin
Artem Abzaliev
Laura Biester
Santiago Castro
...
Verónica Pérez-Rosas
Siqi Shen
Zekun Wang
Winston Wu
Rada Mihalcea
LRM
46
6
0
21 May 2023
Explaining How Transformers Use Context to Build Predictions
Javier Ferrando
Gerard I. Gállego
Ioannis Tsiamas
Marta R. Costa-jussá
37
32
0
21 May 2023
TheoremQA: A Theorem-driven Question Answering dataset
Wenhu Chen
Ming Yin
Max W.F. Ku
Pan Lu
Yixin Wan
Xueguang Ma
Jianyu Xu
Xinyi Wang
Tony Xia
AIMat
38
125
0
21 May 2023
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models
Yijia Zhang
Lingran Zhao
Shijie Cao
Wenqiang Wang
Ting Cao
Fan Yang
Mao Yang
Shanghang Zhang
Ningyi Xu
MQ
29
17
0
21 May 2023
Paragraph-level Citation Recommendation based on Topic Sentences as Queries
Zoran Medic
Jan Snajder
3DV
31
0
0
20 May 2023
XuanYuan 2.0: A Large Chinese Financial Chat Model with Hundreds of Billions Parameters
Xuanyu Zhang
Qing Yang
Dongliang Xu
ALM
OSLM
37
97
0
19 May 2023
OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models
Badr AlKhamissi
Siddharth Verma
Ping Yu
Zhijing Jin
Asli Celikyilmaz
Mona T. Diab
LRM
ReLM
35
10
0
19 May 2023
Evaluation of medium-large Language Models at zero-shot closed book generative question answering
René Peinl
Johannes Wirth
ELM
26
7
0
19 May 2023
Scaling laws for language encoding models in fMRI
Richard Antonello
Aditya R. Vaidya
Alexander G. Huth
MedIm
35
59
0
19 May 2023
Reducing Sequence Length by Predicting Edit Operations with Large Language Models
Masahiro Kaneko
Naoaki Okazaki
28
4
0
19 May 2023
RCOT: Detecting and Rectifying Factual Inconsistency in Reasoning by Reversing Chain-of-Thought
Tianci Xue
Ziqi Wang
Zhenhailong Wang
Chi Han
Pengfei Yu
Heng Ji
KELM
LRM
48
33
0
19 May 2023
CCGen: Explainable Complementary Concept Generation in E-Commerce
Jie Huang
Yifan Gao
Zheng Li
Jingfeng Yang
Yangqiu Song
Chao Zhang
Zining Zhu
Haoming Jiang
Kevin Chen-Chuan Chang
Bing Yin
3DV
LRM
37
5
0
19 May 2023
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
Wen Wang
Zhe Chen
Xiaokang Chen
Jiannan Wu
Xizhou Zhu
...
Ping Luo
Tong Lu
Jie Zhou
Yu Qiao
Jifeng Dai
MLLM
VLM
38
464
0
18 May 2023
Efficient Prompting via Dynamic In-Context Learning
Wangchunshu Zhou
Yuchen Eleanor Jiang
Ryan Cotterell
Mrinmaya Sachan
34
19
0
18 May 2023
Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors
Kai Zhang
Bernal Jiménez Gutiérrez
Yu-Chuan Su
31
67
0
18 May 2023
LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation
Yujie Lu
Xianjun Yang
Xiujun Li
Xinze Wang
William Yang Wang
EGVM
57
73
0
18 May 2023
Learning In-context Learning for Named Entity Recognition
Jiawei Chen
Yaojie Lu
Hongyu Lin
Jie Lou
Wei Jia
Dai Dai
Hua Wu
Boxi Cao
Xianpei Han
Le Sun
NAI
53
19
0
18 May 2023
MedBLIP: Bootstrapping Language-Image Pre-training from 3D Medical Images and Texts
Qiuhui Chen
Xinyue Hu
Zirui Wang
Yi Hong
LM&MA
MedIm
30
35
0
18 May 2023
Paxion: Patching Action Knowledge in Video-Language Foundation Models
Zhenhailong Wang
Ansel Blume
Sha Li
Genglin Liu
Jaemin Cho
Zineng Tang
Joey Tianyi Zhou
Heng Ji
KELM
VGen
27
26
0
18 May 2023
Language Models Meet World Models: Embodied Experiences Enhance Language Models
Jiannan Xiang
Tianhua Tao
Yi Gu
Tianmin Shu
Zirui Wang
Zichao Yang
Zhiting Hu
ALM
LLMAG
LM&Ro
CLL
50
94
0
18 May 2023
Token-wise Decomposition of Autoregressive Language Model Hidden States for Analyzing Model Predictions
Byung-Doh Oh
William Schuler
31
2
0
17 May 2023
Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt
Zhaozhuo Xu
Zirui Liu
Beidi Chen
Yuxin Tang
Jue Wang
Kaixiong Zhou
Xia Hu
Anshumali Shrivastava
MQ
37
29
0
17 May 2023
Statistical Knowledge Assessment for Large Language Models
Qingxiu Dong
Jingjing Xu
Lingpeng Kong
Zhifang Sui
Lei Li
HILM
47
6
0
17 May 2023
PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering
Xiaoman Zhang
Chaoyi Wu
Ziheng Zhao
Weixiong Lin
Ya Zhang
Yanfeng Wang
Weidi Xie
LM&MA
53
157
0
17 May 2023
Large-Scale Text Analysis Using Generative Language Models: A Case Study in Discovering Public Value Expressions in AI Patents
Sergio Pelaez
Gaurav Verma
Barbara Ribeiro
P. Shapira
31
13
0
17 May 2023
FACE: Evaluating Natural Language Generation with Fourier Analysis of Cross-Entropy
Zuhao Yang
Yingfang Yuan
Yang Xu
Shuo Zhan
Huajun Bai
Kefan Chen
CVBM
30
4
0
17 May 2023
M3KE: A Massive Multi-Level Multi-Subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Chuang Liu
Renren Jin
Yuqi Ren
Linhao Yu
Tianyu Dong
...
Peiyi Zhang
Qingqing Lyu
Xiaowen Su
Qun Liu
Deyi Xiong
ELM
ALM
16
24
0
17 May 2023
MemoryBank: Enhancing Large Language Models with Long-Term Memory
Wanjun Zhong
Lianghong Guo
Qi-Fei Gao
He Ye
Yanlin Wang
LLMAG
RALM
KELM
30
125
0
17 May 2023
Can Language Models Solve Graph Problems in Natural Language?
Heng Wang
Shangbin Feng
Tianxing He
Zhaoxuan Tan
Xiaochuang Han
Yulia Tsvetkov
ReLM
LRM
29
182
0
17 May 2023
Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models
Shangbin Feng
Weijia Shi
Yuyang Bai
Vidhisha Balachandran
Tianxing He
Yulia Tsvetkov
KELM
55
31
0
17 May 2023
"I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation
Anaelia Ovalle
Palash Goyal
Jwala Dhamala
Zachary Jaggers
Kai-Wei Chang
Aram Galstyan
R. Zemel
Rahul Gupta
40
61
0
17 May 2023
Explaining black box text modules in natural language with language models
Chandan Singh
Aliyah R. Hsu
Richard Antonello
Shailee Jain
Alexander G. Huth
Bin-Xia Yu
Jianfeng Gao
MILM
39
49
0
17 May 2023
SpecInfer: Accelerating Generative Large Language Model Serving with Tree-based Speculative Inference and Verification
Xupeng Miao
Gabriele Oliaro
Zhihao Zhang
Xinhao Cheng
Zeyu Wang
...
Chunan Shi
Zhuoming Chen
Daiyaan Arfeen
Reyna Abhyankar
Zhihao Jia
LRM
68
122
0
16 May 2023
What In-Context Learning "Learns" In-Context: Disentangling Task Recognition and Task Learning
Jane Pan
Tianyu Gao
Howard Chen
Danqi Chen
30
111
0
16 May 2023
SatLM: Satisfiability-Aided Language Models Using Declarative Prompting
Xi Ye
Qiaochu Chen
Işıl Dillig
Greg Durrett
ReLM
ReCod
LRM
40
64
0
16 May 2023
StructGPT: A General Framework for Large Language Model to Reason over Structured Data
Jinhao Jiang
Kun Zhou
Zican Dong
Keming Ye
Wayne Xin Zhao
Ji-Rong Wen
LRM
LMTD
RALM
55
265
0
16 May 2023
Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models
Zhimin Chen
Longlong Jing
Yingwei Li
Bing Li
37
31
0
15 May 2023
Knowledge Rumination for Pre-trained Language Models
Yunzhi Yao
Peng Wang
Shengyu Mao
Chuanqi Tan
Fei Huang
Huajun Chen
Ningyu Zhang
KELM
37
3
0
15 May 2023
Assessing Hidden Risks of LLMs: An Empirical Study on Robustness, Consistency, and Credibility
Wen-song Ye
Mingfeng Ou
Tianyi Li
Yipeng Chen
Xuetao Ma
...
Sai Wu
Jie Fu
Gang Chen
Haobo Wang
Jiaqi Zhao
46
36
0
15 May 2023
Text Classification via Large Language Models
Xiaofei Sun
Xiaoya Li
Jiwei Li
Fei Wu
Shangwei Guo
Tianwei Zhang
Guoyin Wang
RALM
LRM
45
139
0
15 May 2023
STORYWARS: A Dataset and Instruction Tuning Baselines for Collaborative Story Understanding and Generation
Yulun Du
Lydia B. Chilton
43
8
0
14 May 2023
Make Prompt-based Black-Box Tuning Colorful: Boosting Model Generalization from Three Orthogonal Perspectives
Qiushi Sun
Chengcheng Han
Nuo Chen
Renyu Zhu
Jing Gong
Xiang Li
Ming Gao
VLM
27
8
0
14 May 2023
Is ChatGPT Fair for Recommendation? Evaluating Fairness in Large Language Model Recommendation
Jizhi Zhang
Keqin Bao
Yang Zhang
Wenjie Wang
Fuli Feng
Xiangnan He
LRM
ALM
35
158
0
12 May 2023
Measuring Progress in Fine-grained Vision-and-Language Understanding
Emanuele Bugliarello
Laurent Sartran
Aishwarya Agrawal
Lisa Anne Hendricks
Aida Nematzadeh
VLM
36
22
0
12 May 2023
MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers
L. Yu
Daniel Simig
Colin Flaherty
Armen Aghajanyan
Luke Zettlemoyer
M. Lewis
32
84
0
12 May 2023
Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting
Haoyang Huang
Tianyi Tang
Dongdong Zhang
Wayne Xin Zhao
Ting Song
Yan Xia
Furu Wei
LRM
40
157
0
11 May 2023
Recommendation as Instruction Following: A Large Language Model Empowered Recommendation Approach
Junjie Zhang
Ruobing Xie
Yupeng Hou
Wayne Xin Zhao
Leyu Lin
Ji-Rong Wen
44
203
0
11 May 2023