ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.13005
22
8

TextBox 2.0: A Text Generation Library with Pre-trained Language Models

26 December 2022
Tianyi Tang
Junyi Li
Z. Chen
Yiwen Hu
Zhuohao Yu
Wen-Dao Dai
Zican Dong
Xiaoxue Cheng
Yuhao Wang
Wayne Xin Zhao
J. Nie
Ji-Rong Wen
ArXivPDFHTML
Abstract

To facilitate research on text generation, this paper presents a comprehensive and unified library, TextBox 2.0, focusing on the use of pre-trained language models (PLMs). To be comprehensive, our library covers 131313 common text generation tasks and their corresponding 838383 datasets and further incorporates 454545 PLMs covering general, translation, Chinese, dialogue, controllable, distilled, prompting, and lightweight PLMs. We also implement 444 efficient training strategies and provide 444 generation objectives for pre-training new PLMs from scratch. To be unified, we design the interfaces to support the entire research pipeline (from data loading to training and evaluation), ensuring that each step can be fulfilled in a unified way. Despite the rich functionality, it is easy to use our library, either through the friendly Python API or command line. To validate the effectiveness of our library, we conduct extensive experiments and exemplify four types of research scenarios. The project is released at the link: https://github.com/RUCAIBox/TextBox.

View on arXiv
Comments on this paper