ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2401.17268
25
18

Weaver: Foundation Models for Creative Writing

30 January 2024
Tiannan Wang
Jiamin Chen
Qingrui Jia
Shuai Wang
Ruoyu Fang
Huilin Wang
Zhaowei Gao
Chunzhao Xie
Chuou Xu
Jihong Dai
Yibin Liu
Jialong Wu
Shengwei Ding
Long Li
Zhiwei Huang
Xinle Deng
Teng Yu
Gangan Ma
Han Xiao
Z. Chen
Danjun Xiang
Yunxia Wang
Yuanyuan Zhu
Yi Xiao
Jing Wang
Yiru Wang
Siran Ding
Jiayang Huang
Jiayi Xu
Yilihamujiang Tayier
Zhenyu Hu
Yuan Gao
Chengfeng Zheng
Yu-Jie Ye
Yihan Li
Lei Wan
Xinyue Jiang
Yujie Wang
Siyuan Cheng
Zhule Song
Xiangru Tang
Xiaohua Xu
Ningyu Zhang
Huajun Chen
Yuchen Eleanor Jiang
Wangchunshu Zhou
ArXivPDFHTML
Abstract

This work introduces Weaver, our first family of large language models (LLMs) dedicated to content creation. Weaver is pre-trained on a carefully selected corpus that focuses on improving the writing capabilities of large language models. We then fine-tune Weaver for creative and professional writing purposes and align it to the preference of professional writers using a suit of novel methods for instruction data synthesis and LLM alignment, making it able to produce more human-like texts and follow more diverse instructions for content creation. The Weaver family consists of models of Weaver Mini (1.8B), Weaver Base (6B), Weaver Pro (14B), and Weaver Ultra (34B) sizes, suitable for different applications and can be dynamically dispatched by a routing agent according to query complexity to balance response quality and computation cost. Evaluation on a carefully curated benchmark for assessing the writing capabilities of LLMs shows Weaver models of all sizes outperform generalist LLMs several times larger than them. Notably, our most-capable Weaver Ultra model surpasses GPT-4, a state-of-the-art generalist LLM, on various writing scenarios, demonstrating the advantage of training specialized LLMs for writing purposes. Moreover, Weaver natively supports retrieval-augmented generation (RAG) and function calling (tool usage). We present various use cases of these abilities for improving AI-assisted writing systems, including integration of external knowledge bases, tools, or APIs, and providing personalized writing assistance. Furthermore, we discuss and summarize a guideline and best practices for pre-training and fine-tuning domain-specific LLMs.

View on arXiv
Comments on this paper