ResearchTrend.AI
Pre-trained Summarization Distillation

24 October 2020
Sam Shleifer
Alexander M. Rush

Papers citing "Pre-trained Summarization Distillation"

22 / 22 papers shown
Outlier Reduction with Gated Attention for Improved Post-training Quantization in Large Sequence-to-sequence Speech Foundation Models
Dominik Wagner
Ilja Baumann
K. Riedhammer
Tobias Bocklet
MQ
30
1
0
16 Jun 2024
Initializing Models with Larger Ones
Zhiqiu Xu
Yanjie Chen
Kirill Vishniakov
Yida Yin
Zhiqiang Shen
Trevor Darrell
Lingjie Liu
Zhuang Liu
33
17
0
30 Nov 2023
Boosting Summarization with Normalizing Flows and Aggressive Training
Yu Yang
Xiaotong Shen
AI4CE
TPM
21
0
0
01 Nov 2023
f-Divergence Minimization for Sequence-Level Knowledge Distillation
Yuqiao Wen
Zichao Li
Wenyu Du
Lili Mou
30
53
0
27 Jul 2023
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering
Yung-Sung Chuang
Wei Fang
Shang-Wen Li
Wen-tau Yih
James R. Glass
LRM
22
12
0
26 May 2023
MuLER: Detailed and Scalable Reference-based Evaluation
Taelin Karidi
Leshem Choshen
Gal Patel
Omri Abend
38
0
0
24 May 2023
A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training
Nitay Calderon
Subhabrata Mukherjee
Roi Reichart
Amir Kantor
36
17
0
03 May 2023
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning
Zhen Wang
Rameswar Panda
Leonid Karlinsky
Rogerio Feris
Huan Sun
Yoon Kim
VLM
VPVLM
33
107
0
06 Mar 2023
Distillation of encoder-decoder transformers for sequence labelling
M. Farina
D. Pappadopulo
Anant Gupta
Leslie Huang
Ozan Irsoy
Thamar Solorio
VLM
103
3
0
10 Feb 2023
CrisisLTLSum: A Benchmark for Local Crisis Event Timeline Extraction and Summarization
Hossein Rajaby Faghihi
Bashar Alhafni
Kecheng Zhang
Shihao Ran
Joel R. Tetreault
A. Jaimes
16
9
0
25 Oct 2022
Referee: Reference-Free Sentence Summarization with Sharper Controllability through Symbolic Knowledge Distillation
Melanie Sclar
Peter West
Sachin Kumar
Yulia Tsvetkov
Yejin Choi
20
19
0
25 Oct 2022
Harnessing Abstractive Summarization for Fact-Checked Claim Detection
Varad Bhatnagar
Diptesh Kanojia
Kameswari Chebrolu
HILM
27
8
0
10 Sep 2022
Neural Generation Meets Real People: Building a Social, Informative Open-Domain Dialogue Agent
Ethan A. Chi
Ashwin Paranjape
A. See
Caleb Chiam
Trenton Chang
...
Dilara Soylu
Jillian Tang
A. Narayan
Giovanni Campagna
Christopher D. Manning
39
7
0
25 Jul 2022
Multi-LexSum: Real-World Summaries of Civil Rights Lawsuits at Multiple Granularities
Zejiang Shen
Kyle Lo
L. Yu
N. Dahlberg
Margo Schlanger
Doug Downey
ELM
AILaw
37
43
0
22 Jun 2022
An End-to-End Dialogue Summarization System for Sales Calls
Abedelkadir Asi
Song Wang
Roy Eisenstadt
Dean Geckt
Yarin Kuper
Yi Mao
Royi Ronen
28
16
0
27 Apr 2022
EHRKit: A Python Natural Language Processing Toolkit for Electronic Health Record Texts
Irene Z Li
Keen You
Yujie Qiao
Lucas Huang
Chia-Chun Hsieh
Benjamin Rosand
Xiangru Tang
Dragomir R. Radev
40
4
0
13 Apr 2022
Efficient Large Scale Language Modeling with Mixtures of Experts
Mikel Artetxe
Shruti Bhosale
Naman Goyal
Todor Mihaylov
Myle Ott
...
Jeff Wang
Luke Zettlemoyer
Mona T. Diab
Zornitsa Kozareva
Ves Stoyanov
MoE
61
188
0
20 Dec 2021
Summarize-then-Answer: Generating Concise Explanations for Multi-hop Reading Comprehension
Naoya Inoue
H. Trivedi
Steven K. Sinha
Niranjan Balasubramanian
Kentaro Inui
58
14
0
14 Sep 2021
Block Pruning For Faster Transformers
François Lagunas
Ella Charlaix
Victor Sanh
Alexander M. Rush
VLM
16
218
0
10 Sep 2021
FNet: Mixing Tokens with Fourier Transforms
James Lee-Thorp
Joshua Ainslie
Ilya Eckstein
Santiago Ontanon
26
517
0
09 May 2021
BERT-of-Theseus: Compressing BERT by Progressive Module Replacing
Canwen Xu
Wangchunshu Zhou
Tao Ge
Furu Wei
Ming Zhou
221
197
0
07 Feb 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,959
0
20 Apr 2018