ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2205.01068
  4. Cited By
OPT: Open Pre-trained Transformer Language Models

OPT: Open Pre-trained Transformer Language Models

2 May 2022
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
Shuohui Chen
Christopher Dewan
Mona T. Diab
Xian Li
Xi Lin
Todor Mihaylov
Myle Ott
Sam Shleifer
Kurt Shuster
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
    VLM
    OSLM
    AI4CE
ArXivPDFHTML

Papers citing "OPT: Open Pre-trained Transformer Language Models"

50 / 2,468 papers shown
Title
LLM4SGG: Large Language Models for Weakly Supervised Scene Graph
  Generation
LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation
Kibum Kim
Kanghoon Yoon
Jaeyeong Jeon
Yeonjun In
Jinyoung Moon
Donghyun Kim
Chanyoung Park
39
15
0
16 Oct 2023
Generative Calibration for In-context Learning
Generative Calibration for In-context Learning
Zhongtao Jiang
Yuanzhe Zhang
Cao Liu
Jun Zhao
Kang Liu
178
18
0
16 Oct 2023
Repetition In Repetition Out: Towards Understanding Neural Text
  Degeneration from the Data Perspective
Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
Huayang Li
Tian Lan
Z. Fu
Deng Cai
Lemao Liu
Nigel Collier
Taro Watanabe
Yixuan Su
46
15
0
16 Oct 2023
AdaLomo: Low-memory Optimization with Adaptive Learning Rate
AdaLomo: Low-memory Optimization with Adaptive Learning Rate
Kai Lv
Hang Yan
Qipeng Guo
Haijun Lv
Xipeng Qiu
ODL
32
21
0
16 Oct 2023
Let's reward step by step: Step-Level reward model as the Navigators for
  Reasoning
Let's reward step by step: Step-Level reward model as the Navigators for Reasoning
Qianli Ma
Haotian Zhou
Tingkai Liu
Jianbo Yuan
Pengfei Liu
Yang You
Hongxia Yang
LRM
40
45
0
16 Oct 2023
Bridging Code Semantic and LLMs: Semantic Chain-of-Thought Prompting for Code Generation
Yingwei Ma
Yue Yu
Shanshan Li
Yu Jiang
Yong Guo
Yuanliang Zhang
Yutao Xie
Xiangke Liao
36
5
0
16 Oct 2023
FATE-LLM: A Industrial Grade Federated Learning Framework for Large
  Language Models
FATE-LLM: A Industrial Grade Federated Learning Framework for Large Language Models
Tao Fan
Yan Kang
Guoqiang Ma
Weijing Chen
Wenbin Wei
Lixin Fan
Qiang Yang
45
62
0
16 Oct 2023
Bootstrap Your Own Skills: Learning to Solve New Tasks with Large
  Language Model Guidance
Bootstrap Your Own Skills: Learning to Solve New Tasks with Large Language Model Guidance
Jesse Zhang
Jiahui Zhang
Karl Pertsch
Ziyi Liu
Xiang Ren
Minsuk Chang
Shao-Hua Sun
Joseph J Lim
LLMAG
LM&Ro
115
61
0
16 Oct 2023
Assessing the Reliability of Large Language Model Knowledge
Assessing the Reliability of Large Language Model Knowledge
Weixuan Wang
Barry Haddow
Alexandra Birch
Wei Peng
KELM
HILM
80
15
0
15 Oct 2023
VLIS: Unimodal Language Models Guide Multimodal Language Generation
VLIS: Unimodal Language Models Guide Multimodal Language Generation
Jiwan Chung
Youngjae Yu
VLM
42
1
0
15 Oct 2023
Beyond Segmentation: Road Network Generation with Multi-Modal LLMs
Beyond Segmentation: Road Network Generation with Multi-Modal LLMs
Sumedh Rasal
Sanjay K. Boddhu
50
5
0
15 Oct 2023
DPZero: Private Fine-Tuning of Language Models without Backpropagation
DPZero: Private Fine-Tuning of Language Models without Backpropagation
Liang Zhang
Bingcong Li
K. K. Thekumparampil
Sewoong Oh
Niao He
33
11
0
14 Oct 2023
MiniGPT-v2: large language model as a unified interface for
  vision-language multi-task learning
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Jun Chen
Deyao Zhu
Xiaoqian Shen
Xiang Li
Zechun Liu
Pengchuan Zhang
Raghuraman Krishnamoorthi
Vikas Chandra
Yunyang Xiong
Mohamed Elhoseiny
MLLM
168
448
0
14 Oct 2023
Large Language Model Unlearning
Large Language Model Unlearning
Yuanshun Yao
Xiaojun Xu
Yang Liu
MU
54
116
0
14 Oct 2023
Unsupervised Domain Adaption for Neural Information Retrieval
Unsupervised Domain Adaption for Neural Information Retrieval
Carlos Dominguez
Jon Ander Campos
Eneko Agirre
Gorka Azkune
32
0
0
13 Oct 2023
Dialogue Chain-of-Thought Distillation for Commonsense-aware
  Conversational Agents
Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents
Hyungjoo Chae
Yongho Song
Kai Tzu-iunn Ong
Taeyoon Kwon
Minjin Kim
Youngjae Yu
Dongha Lee
Dongyeop Kang
Jinyoung Yeo
LRM
34
39
0
13 Oct 2023
User Inference Attacks on Large Language Models
User Inference Attacks on Large Language Models
Nikhil Kandpal
Krishna Pillutla
Alina Oprea
Peter Kairouz
Christopher A. Choquette-Choo
Zheng Xu
SILM
AAML
76
15
0
13 Oct 2023
QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language
  Models
QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language Models
Saleh Ashkboos
Ilia Markov
Elias Frantar
Tingxuan Zhong
Xincheng Wang
Jie Ren
Torsten Hoefler
Dan Alistarh
MQ
SyDa
126
22
0
13 Oct 2023
Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs
Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs
Yuxin Zhang
Lirui Zhao
Mingbao Lin
Yunyun Sun
Yiwu Yao
Xingjia Han
Jared Tanner
Shiwei Liu
Rongrong Ji
SyDa
45
40
0
13 Oct 2023
SeqXGPT: Sentence-Level AI-Generated Text Detection
SeqXGPT: Sentence-Level AI-Generated Text Detection
Pengyu Wang
Linyang Li
Ke Ren
Botian Jiang
Dong Zhang
Xipeng Qiu
DeLMO
39
52
0
13 Oct 2023
Search-Adaptor: Embedding Customization for Information Retrieval
Search-Adaptor: Embedding Customization for Information Retrieval
Jinsung Yoon
Sercan O. Arik
Yanfei Chen
Tomas Pfister
27
2
0
12 Oct 2023
Toward Joint Language Modeling for Speech Units and Text
Toward Joint Language Modeling for Speech Units and Text
Ju-Chieh Chou
Chung-Ming Chien
Wei-Ning Hsu
Karen Livescu
Arun Babu
Alexis Conneau
Alexei Baevski
Michael Auli
VLM
33
20
0
12 Oct 2023
HoneyBee: Progressive Instruction Finetuning of Large Language Models
  for Materials Science
HoneyBee: Progressive Instruction Finetuning of Large Language Models for Materials Science
Yu Song
Santiago Miret
Huan Zhang
Bang Liu
ALM
32
19
0
12 Oct 2023
DistillSpec: Improving Speculative Decoding via Knowledge Distillation
DistillSpec: Improving Speculative Decoding via Knowledge Distillation
Yongchao Zhou
Kaifeng Lyu
A. S. Rawat
A. Menon
Afshin Rostamizadeh
Sanjiv Kumar
Jean-François Kagy
Rishabh Agarwal
55
84
0
12 Oct 2023
Towards Robust Multi-Modal Reasoning via Model Selection
Towards Robust Multi-Modal Reasoning via Model Selection
Xiangyan Liu
Rongxue Li
Wei Ji
Tao Lin
LLMAG
LRM
50
3
0
12 Oct 2023
Context Compression for Auto-regressive Transformers with Sentinel
  Tokens
Context Compression for Auto-regressive Transformers with Sentinel Tokens
Siyu Ren
Qi Jia
Kenny Q. Zhu
24
11
0
12 Oct 2023
Composite Backdoor Attacks Against Large Language Models
Composite Backdoor Attacks Against Large Language Models
Hai Huang
Zhengyu Zhao
Michael Backes
Yun Shen
Yang Zhang
AAML
46
42
0
11 Oct 2023
LLM4Vis: Explainable Visualization Recommendation using ChatGPT
LLM4Vis: Explainable Visualization Recommendation using ChatGPT
Lei Wang
Songheng Zhang
Yun Wang
Ee-Peng Lim
Yong Wang
LRM
36
39
0
11 Oct 2023
Transformers for Green Semantic Communication: Less Energy, More
  Semantics
Transformers for Green Semantic Communication: Less Energy, More Semantics
Shubhabrata Mukherjee
Cory Beard
Sejun Song
40
1
0
11 Oct 2023
Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented
  Models
Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented Models
Luiza Amador Pozzobon
Beyza Ermis
Patrick Lewis
Sara Hooker
64
20
0
11 Oct 2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and
  Domain-Specificity
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity
Cunxiang Wang
Xiaoze Liu
Yuanhao Yue
Xiangru Tang
Tianhang Zhang
...
Linyi Yang
Jindong Wang
Xing Xie
Zheng Zhang
Yue Zhang
HILM
KELM
55
188
0
11 Oct 2023
Multimodal Graph Learning for Generative Tasks
Multimodal Graph Learning for Generative Tasks
Minji Yoon
Jing Yu Koh
Bryan Hooi
Ruslan Salakhutdinov
35
8
0
11 Oct 2023
How Do Large Language Models Capture the Ever-changing World Knowledge?
  A Review of Recent Advances
How Do Large Language Models Capture the Ever-changing World Knowledge? A Review of Recent Advances
Zihan Zhang
Meng Fang
Lingxi Chen
Mohammad-Reza Namazi-Rad
Jun Wang
KELM
29
21
0
11 Oct 2023
QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources
QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources
Zhikai Li
Xiaoxuan Liu
Banghua Zhu
Zhen Dong
Qingyi Gu
Kurt Keutzer
MQ
37
7
0
11 Oct 2023
Generating and Evaluating Tests for K-12 Students with Language Model
  Simulations: A Case Study on Sentence Reading Efficiency
Generating and Evaluating Tests for K-12 Students with Language Model Simulations: A Case Study on Sentence Reading Efficiency
E. Zelikman
Wanjing Anya Ma
Jasmine E. Tran
Diyi Yang
Jason D. Yeatman
Nick Haber
AI4Ed
32
9
0
10 Oct 2023
Sheared LLaMA: Accelerating Language Model Pre-training via Structured
  Pruning
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Mengzhou Xia
Tianyu Gao
Zhiyuan Zeng
Danqi Chen
55
278
0
10 Oct 2023
Understanding the Effects of RLHF on LLM Generalisation and Diversity
Understanding the Effects of RLHF on LLM Generalisation and Diversity
Robert Kirk
Ishita Mediratta
Christoforos Nalmpantis
Jelena Luketina
Eric Hambro
Edward Grefenstette
Roberta Raileanu
AI4CE
ALM
121
127
0
10 Oct 2023
Hexa: Self-Improving for Knowledge-Grounded Dialogue System
Hexa: Self-Improving for Knowledge-Grounded Dialogue System
DaeJin Jo
D. W. Nam
Gunsoo Han
Kyoung-Woon On
Taehwan Kwon
Seungeun Rho
Sungwoong Kim
19
0
0
10 Oct 2023
A Semantic Invariant Robust Watermark for Large Language Models
A Semantic Invariant Robust Watermark for Large Language Models
Aiwei Liu
Leyi Pan
Xuming Hu
Shiao Meng
Lijie Wen
WaLM
60
58
0
10 Oct 2023
Compressing Context to Enhance Inference Efficiency of Large Language
  Models
Compressing Context to Enhance Inference Efficiency of Large Language Models
Yucheng Li
Bo Dong
Chenghua Lin
Frank Guerin
19
60
0
09 Oct 2023
LLM for SoC Security: A Paradigm Shift
LLM for SoC Security: A Paradigm Shift
Dipayan Saha
Shams Tarek
Katayoon Yahyaei
S. Saha
Jingbo Zhou
M. Tehranipoor
Farimah Farahmandi
88
48
0
09 Oct 2023
NEFTune: Noisy Embeddings Improve Instruction Finetuning
NEFTune: Noisy Embeddings Improve Instruction Finetuning
Neel Jain
Ping Yeh-Chiang
Yuxin Wen
John Kirchenbauer
Hong-Min Chu
...
Avi Schwarzschild
Aniruddha Saha
Micah Goldblum
Jonas Geiping
Tom Goldstein
31
77
0
09 Oct 2023
GraphLLM: Boosting Graph Reasoning Ability of Large Language Model
GraphLLM: Boosting Graph Reasoning Ability of Large Language Model
Ziwei Chai
Tianjie Zhang
Liang Wu
Kaiqiao Han
Xiaohai Hu
Xuanwen Huang
Yang Yang
AI4MH
LRM
25
59
0
09 Oct 2023
A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative
  Models
A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative Models
Sebastian G. Gruber
Florian Buettner
UQCV
UD
35
1
0
09 Oct 2023
Cabbage Sweeter than Cake? Analysing the Potential of Large Language
  Models for Learning Conceptual Spaces
Cabbage Sweeter than Cake? Analysing the Potential of Large Language Models for Learning Conceptual Spaces
Usashi Chatterjee
Amit Gajbhiye
Steven Schockaert
34
3
0
09 Oct 2023
Parrot Mind: Towards Explaining the Complex Task Reasoning of Pretrained
  Large Language Models with Template-Content Structure
Parrot Mind: Towards Explaining the Complex Task Reasoning of Pretrained Large Language Models with Template-Content Structure
Haotong Yang
Fanxu Meng
Zhouchen Lin
Muhan Zhang
LRM
48
2
0
09 Oct 2023
Empower Nested Boolean Logic via Self-Supervised Curriculum Learning
Empower Nested Boolean Logic via Self-Supervised Curriculum Learning
Hongqiu Wu
Linfeng Liu
Haizhen Zhao
Min Zhang
LRM
AI4CE
NAI
ELM
48
7
0
09 Oct 2023
Negative Object Presence Evaluation (NOPE) to Measure Object
  Hallucination in Vision-Language Models
Negative Object Presence Evaluation (NOPE) to Measure Object Hallucination in Vision-Language Models
Holy Lovenia
Wenliang Dai
Samuel Cahyawijaya
Ziwei Ji
Pascale Fung
MLLM
41
52
0
09 Oct 2023
Explainable Claim Verification via Knowledge-Grounded Reasoning with
  Large Language Models
Explainable Claim Verification via Knowledge-Grounded Reasoning with Large Language Models
Haoran Wang
Kai Shu
LRM
50
22
0
08 Oct 2023
ChatRadio-Valuer: A Chat Large Language Model for Generalizable
  Radiology Report Generation Based on Multi-institution and Multi-system Data
ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data
Tianyang Zhong
Wei Zhao
Yutong Zhang
Yi Pan
Peixin Dong
...
Dinggang Shen
Jun-Feng Han
Tianming Liu
Jun Liu
Tuo Zhang
MedIm
LM&MA
55
14
0
08 Oct 2023
Previous
123...303132...484950
Next