ResearchTrend.AI
Pretrained Language Models for Text Generation: A Survey

14 January 2022
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
    AI4CE

Papers citing "Pretrained Language Models for Text Generation: A Survey"

50 / 137 papers shown
NutriGen: Personalized Meal Plan Generator Leveraging Large Language Models to Enhance Dietary and Nutritional Adherence
Saman Khamesian
Asiful Arefeen
Stephanie M. Carpenter
Hassan Ghasemzadeh
86
0
0
28 Feb 2025
Consistency of Responses and Continuations Generated by Large Language Models on Social Media
Wenlu Fan
Yinlin Zhu
Chenyang Wang
Bin Wang
Wentao Xu
117
1
0
14 Jan 2025
BeliN: A Novel Corpus for Bengali Religious News Headline Generation using Contextual Feature Fusion
Md Osama
Ashim Dey
Kawsar Ahmed
Muhammad Ashad Kabir
115
0
0
03 Jan 2025
Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner
Yitong Zhou
Mingyue Cheng
Qingyang Mao
Qi Liu
F. Xu
LMTD
77
0
0
30 Dec 2024
GPT for Games: An Updated Scoping Review (2020-2024)
Daijin Yang
Erica Kleinman
Casper Harteveld
LLMAG
AI4TS
AI4CE
122
3
0
01 Nov 2024
Natural Language Processing for the Legal Domain: A Survey of Tasks, Datasets, Models, and Challenges
Farid Ariai
Gianluca Demartini
ELM
AILaw
VLM
71
5
0
25 Oct 2024
Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies
Liwen Wang
Sheng Chen
Linnan Jiang
Shu Pan
Runze Cai
Sen Yang
Fei Yang
114
5
0
24 Oct 2024
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
104
19
0
06 Oct 2024
DiffZOO: A Purely Query-Based Black-Box Attack for Red-teaming Text-to-Image Generative Model via Zeroth Order Optimization
Pucheng Dang
Xing Hu
Dong Li
Rui Zhang
Qi Guo
Kaidi Xu
DiffM
77
5
0
18 Aug 2024
Large Language Model Enhanced Knowledge Representation Learning: A Survey
Xin Wang
Zirui Chen
Haofen Wang
Leong Hou U
Zhao Li
Wenbin Guo
KELM
123
3
0
01 Jul 2024
IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages
Ananya B. Sai
Vignesh Nagarajan
Tanay Dixit
Raj Dabre
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
100
22
0
20 Dec 2022
Fast Inference from Transformers via Speculative Decoding
Yaniv Leviathan
Matan Kalman
Yossi Matias
LRM
118
702
0
30 Nov 2022
Contrastive Decoding: Open-ended Text Generation as Optimization
Xiang Lisa Li
Ari Holtzman
Daniel Fried
Percy Liang
Jason Eisner
Tatsunori Hashimoto
Luke Zettlemoyer
M. Lewis
95
358
0
27 Oct 2022
ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation
Junyi Li
Tianyi Tang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
59
17
0
24 Oct 2022
GLM-130B: An Open Bilingual Pre-trained Model
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng Zhang
Yuxiao Dong
Jie Tang
BDL
LRM
344
1,091
0
05 Oct 2022
Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning
Elias Frantar
Sidak Pal Singh
Dan Alistarh
MQ
94
236
0
24 Aug 2022
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
Tim Dettmers
M. Lewis
Younes Belkada
Luke Zettlemoyer
MQ
83
650
0
15 Aug 2022
What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization?
Thomas Wang
Adam Roberts
Daniel Hesslow
Teven Le Scao
Hyung Won Chung
Iz Beltagy
Julien Launay
Colin Raffel
97
174
0
12 Apr 2022
Teaching language models to support answers with verified quotes
Jacob Menick
Maja Trebacz
Vladimir Mikulik
John Aslanides
Francis Song
...
Mia Glaese
Susannah Young
Lucy Campbell-Gillingham
G. Irving
Nat McAleese
ELM
RALM
294
265
0
21 Mar 2022
Rethinking and Refining the Distinct Metric
Siyang Liu
Sahand Sabour
Yinhe Zheng
Pei Ke
Xiaoyan Zhu
Minlie Huang
56
12
0
28 Feb 2022
LongT5: Efficient Text-To-Text Transformer for Long Sequences
Mandy Guo
Joshua Ainslie
David C. Uthus
Santiago Ontanon
Jianmo Ni
Yun-hsuan Sung
Yinfei Yang
VLM
57
313
0
15 Dec 2021
Towards a Unified View of Parameter-Efficient Transfer Learning
Junxian He
Chunting Zhou
Xuezhe Ma
Taylor Berg-Kirkpatrick
Graham Neubig
AAML
129
935
0
08 Oct 2021
Leveraging Pretrained Models for Automatic Summarization of Doctor-Patient Conversations
Longxiang Zhang
Renato M. P. Negrinho
Arindam Ghosh
V. Jagannathan
H. Hassanzadeh
Thomas Schaaf
Matthew R. Gormley
LM&MA
AI4MH
115
68
0
24 Sep 2021
Enriching and Controlling Global Semantics for Text Summarization
Thong Nguyen
Anh Tuan Luu
Truc Lu
Tho Quan
41
35
0
22 Sep 2021
A Plug-and-Play Method for Controlled Text Generation
Damian Pascual
Béni Egressy
Clara Meister
Ryan Cotterell
Roger Wattenhofer
113
93
0
20 Sep 2021
Mitigating Data Scarceness through Data Synthesis, Augmentation and Curriculum for Abstractive Summarization
Ahmed Magooda
Diane Litman
63
5
0
17 Sep 2021
Topic-Aware Contrastive Learning for Abstractive Dialogue Summarization
Junpeng Liu
Yanyan Zou
Hainan Zhang
Hongshen Chen
Zhuoye Ding
Caixia Yuan
Xiaojie Wang
49
66
0
10 Sep 2021
AfroMT: Pretraining Strategies and Reproducible Benchmarks for Translation of 8 African Languages
Machel Reid
Junjie Hu
Graham Neubig
Y. Matsuo
122
33
0
10 Sep 2021
A Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue Generation
Shilei Liu
Xiaofeng Zhao
Bochao Li
Feiliang Ren
Longhui Zhang
Shujuan Yin
57
32
0
09 Sep 2021
IndicBART: A Pre-trained Model for Indic Natural Language Generation
Raj Dabre
Himani Shrotriya
Anoop Kunchukuttan
Ratish Puduppully
Mitesh M. Khapra
Pratyush Kumar
101
74
0
07 Sep 2021
DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization
Ming Zhong
Yang Liu
Yichong Xu
Chenguang Zhu
Michael Zeng
VLM
AI4CE
70
127
0
06 Sep 2021
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
Ofir Press
Noah A. Smith
M. Lewis
321
755
0
27 Aug 2021
End-to-End Dense Video Captioning with Parallel Decoding
Teng Wang
Ruimao Zhang
Zhichao Lu
Feng Zheng
Ran Cheng
Ping Luo
3DV
69
183
0
17 Aug 2021
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing
Pengfei Liu
Weizhe Yuan
Jinlan Fu
Zhengbao Jiang
Hiroaki Hayashi
Graham Neubig
VLM
SyDa
200
3,971
0
28 Jul 2021
A Survey on Dialogue Summarization: Recent Advances and New Frontiers
Xiachong Feng
Xiaocheng Feng
Bing Qin
62
100
0
07 Jul 2021
Attention Bottlenecks for Multimodal Fusion
Arsha Nagrani
Shan Yang
Anurag Arnab
A. Jansen
Cordelia Schmid
Chen Sun
98
565
0
30 Jun 2021
Neural Machine Translation for Low-Resource Languages: A Survey
Surangika Ranathunga
E. Lee
Marjana Prifti Skenduli
Ravi Shekhar
Mehreen Alam
Rishemjit Kaur
104
245
0
29 Jun 2021
CPM-2: Large-scale Cost-effective Pre-trained Language Models
Zhengyan Zhang
Yuxian Gu
Xu Han
Shengqi Chen
Chaojun Xiao
...
Minlie Huang
Wentao Han
Yang Liu
Xiaoyan Zhu
Maosong Sun
MoE
68
87
0
20 Jun 2021
Pre-Trained Models: Past, Present and Future
Xu Han
Zhengyan Zhang
Ning Ding
Yuxian Gu
Xiao Liu
...
Jie Tang
Ji-Rong Wen
Jinhui Yuan
Wayne Xin Zhao
Jun Zhu
AIFin
MQ
AI4MH
136
849
0
14 Jun 2021
FastSeq: Make Sequence Generation Faster
Yu Yan
Fei Hu
Jiusheng Chen
Nikhil Bhendawade
Ting Ye
Yeyun Gong
Nan Duan
Desheng Cui
Bingyu Chi
Ruifei Zhang
VLM
46
15
0
08 Jun 2021
Few-shot Knowledge Graph-to-Text Generation with Pretrained Language Models
Junyi Li
Tianyi Tang
Wayne Xin Zhao
Zhicheng Wei
N. Yuan
Ji-Rong Wen
53
48
0
03 Jun 2021
DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Text Generation
Xinyu Hua
Ashwin Sreevatsa
Lu Wang
23
23
0
01 Jun 2021
Cross-Lingual Abstractive Summarization with Limited Parallel Resources
Yu Bai
Yang Gao
Heyan Huang
82
52
0
28 May 2021
BASS: Boosting Abstractive Summarization with Unified Semantic Graph
Wenhao Wu
Wei Li
Xinyan Xiao
Jiachen Liu
Ziqiang Cao
Sujian Li
Hua Wu
Haifeng Wang
60
45
0
25 May 2021
Contrastive Learning for Many-to-many Multilingual Neural Machine Translation
Xiao Pan
Mingxuan Wang
Liwei Wu
Lei Li
65
206
0
20 May 2021
Knowledge-based Review Generation by Coherence Enhanced Text Planning
Junyi Li
Wayne Xin Zhao
Zhicheng Wei
Nicholas Jing Yuan
Ji-Rong Wen
64
21
0
09 May 2021
PanGu-α: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation
Wei Zeng
Xiaozhe Ren
Teng Su
Hui Wang
Yi-Lun Liao
...
Gaojun Fan
Yaowei Wang
Xuefeng Jin
Qun Liu
Yonghong Tian
ALM
MoE
AI4CE
69
213
0
26 Apr 2021
RoFormer: Enhanced Transformer with Rotary Position Embedding
Jianlin Su
Yu Lu
Shengfeng Pan
Ahmed Murtadha
Bo Wen
Yunfeng Liu
268
2,443
0
20 Apr 2021
Structure-Aware Abstractive Conversation Summarization via Discourse and Action Graphs
Jiaao Chen
Diyi Yang
59
98
0
16 Apr 2021
Efficient Attentions for Long Document Summarization
L. Huang
Shuyang Cao
Nikolaus Nova Parulian
Heng Ji
Lu Wang
127
286
0
05 Apr 2021