ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.12356
  4. Cited By
News Summarization and Evaluation in the Era of GPT-3
v1v2 (latest)

News Summarization and Evaluation in the Era of GPT-3

26 September 2022
Tanya Goyal
Junyi Jessy Li
Greg Durrett
    ELM
ArXiv (abs)PDFHTML

Papers citing "News Summarization and Evaluation in the Era of GPT-3"

50 / 71 papers shown
Title
Sequence-level Large Language Model Training with Contrastive Preference Optimization
Sequence-level Large Language Model Training with Contrastive Preference Optimization
Zhili Feng
Dhananjay Ram
Cole Hawkins
Aditya Rawal
Jinman Zhao
Sheng Zha
102
1
0
23 Feb 2025
Evaluating Large Language Models for Public Health Classification and Extraction Tasks
Evaluating Large Language Models for Public Health Classification and Extraction Tasks
Joshua Harris
Timothy Laurence
Leo Loman
Fan Grayson
Toby Nonnenmacher
...
Hamish Mohammed
Thomas Finnie
Luke Hounsome
Michael Borowitz
Steven Riley
LM&MAAI4MH
145
5
0
20 Feb 2025
Evaluating Small Language Models for News Summarization: Implications and Factors Influencing Performance
Evaluating Small Language Models for News Summarization: Implications and Factors Influencing Performance
Borui Xu
Yao Chen
Zeyi Wen
Weiguo Liu
Bingsheng He
175
2
0
02 Feb 2025
Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models
Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models
Jingwei Yi
Yueqi Xie
Bin Zhu
Emre Kiciman
Guangzhong Sun
Xing Xie
Fangzhao Wu
AAML
156
82
0
28 Jan 2025
Zero-Shot Strategies for Length-Controllable Summarization
Zero-Shot Strategies for Length-Controllable Summarization
Fabian Retkowski
A. Waibel
134
4
0
31 Dec 2024
Modeling Story Expectations to Understand Engagement: A Generative Framework Using LLMs
Modeling Story Expectations to Understand Engagement: A Generative Framework Using LLMs
Hortense Fong
George Gui
HAI
138
0
0
13 Dec 2024
Coverage-based Fairness in Multi-document Summarization
Coverage-based Fairness in Multi-document Summarization
Haoyuan Li
Yusen Zhang
Rui Zhang
Snigdha Chaturvedi
169
0
0
11 Dec 2024
BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression
BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression
Yuankai Li
Jia-Chen Gu
Di Wu
Kai-Wei Chang
Nanyun Peng
RALMMQ
70
0
0
20 Oct 2024
Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both
Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both
Abhijnan Nath
Changsoo Jung
Ethan Seefried
Nikhil Krishnaswamy
475
4
0
11 Oct 2024
Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback
Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback
Fatemeh Pesaran Zadeh
Juyeon Kim
Jin-Hwa Kim
Gunhee Kim
ALM
111
5
0
05 Oct 2024
How to Train Long-Context Language Models (Effectively)
How to Train Long-Context Language Models (Effectively)
Tianyu Gao
Alexander Wettig
Howard Yen
Danqi Chen
RALM
174
48
0
03 Oct 2024
Cascading Large Language Models for Salient Event Graph Generation
Cascading Large Language Models for Salient Event Graph Generation
Xingwei Tan
Yuxiang Zhou
Gabriele Pergola
Yulan He
121
1
0
26 Jun 2024
PlagBench: Exploring the Duality of Large Language Models in Plagiarism Generation and Detection
PlagBench: Exploring the Duality of Large Language Models in Plagiarism Generation and Detection
Jooyoung Lee
Toshini Agrawal
Adaku Uchendu
Thai V. Le
Jinghui Chen
Dongwon Lee
168
1
0
24 Jun 2024
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods
Hanlei Jin
Yang Zhang
Dan Meng
Jun Wang
Jinghua Tan
244
96
0
05 Mar 2024
From Transcripts to Insights: Uncovering Corporate Risks Using Generative AI
From Transcripts to Insights: Uncovering Corporate Risks Using Generative AI
Alex G. Kim
Maximilian Muhn
Valeri V. Nikolaev
118
10
0
26 Oct 2023
GLoRE: Evaluating Logical Reasoning of Large Language Models
GLoRE: Evaluating Logical Reasoning of Large Language Models
Hanmeng Liu
Zhiyang Teng
Ruoxi Ning
Jian Liu
Qiji Zhou
Yuexin Zhang
Yue Zhang
ReLMELMLRM
148
8
0
13 Oct 2023
Sweeping Heterogeneity with Smart MoPs: Mixture of Prompts for LLM Task Adaptation
Sweeping Heterogeneity with Smart MoPs: Mixture of Prompts for LLM Task Adaptation
Chen Dun
Mirian Hipolito Garcia
Guoqing Zheng
Ahmed Hassan Awadallah
Anastasios Kyrillidis
Robert Sim
192
6
0
04 Oct 2023
Bloated Disclosures: Can ChatGPT Help Investors Process Information?
Bloated Disclosures: Can ChatGPT Help Investors Process Information?
Alex G. Kim
Maximilian Muhn
Valeri V. Nikolaev
93
34
0
17 Jun 2023
APPLS: Evaluating Evaluation Metrics for Plain Language Summarization
APPLS: Evaluating Evaluation Metrics for Plain Language Summarization
Yue Guo
Tal August
Gondy Leroy
T. Cohen
Lucy Lu Wang
139
9
0
23 May 2023
Z-Code++: A Pre-trained Language Model Optimized for Abstractive
  Summarization
Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization
Pengcheng He
Baolin Peng
Liyang Lu
Song Wang
Jie Mei
...
Chenguang Zhu
Wayne Xiong
Michael Zeng
Jianfeng Gao
Xuedong Huang
103
47
0
21 Aug 2022
Self-critiquing models for assisting human evaluators
Self-critiquing models for assisting human evaluators
William Saunders
Catherine Yeh
Jeff Wu
Steven Bills
Ouyang Long
Jonathan Ward
Jan Leike
ALMELM
114
306
0
12 Jun 2022
The Unreliability of Explanations in Few-shot Prompting for Textual
  Reasoning
The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning
Xi Ye
Greg Durrett
ReLMLRM
83
185
0
06 May 2022
Re-Examining System-Level Correlations of Automatic Summarization
  Evaluation Metrics
Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics
Daniel Deutsch
Rotem Dror
Dan Roth
59
45
0
21 Apr 2022
Spurious Correlations in Reference-Free Evaluation of Text Generation
Spurious Correlations in Reference-Free Evaluation of Text Generation
Esin Durmus
Faisal Ladhak
Tatsunori Hashimoto
62
31
0
21 Apr 2022
PaLM: Scaling Language Modeling with Pathways
PaLM: Scaling Language Modeling with Pathways
Aakanksha Chowdhery
Sharan Narang
Jacob Devlin
Maarten Bosma
Gaurav Mishra
...
Kathy Meier-Hellstern
Douglas Eck
J. Dean
Slav Petrov
Noah Fiedel
PILMLRM
537
6,301
0
05 Apr 2022
BRIO: Bringing Order to Abstractive Summarization
BRIO: Bringing Order to Abstractive Summarization
Yixin Liu
Pengfei Liu
Dragomir R. Radev
Graham Neubig
100
286
0
31 Mar 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLMALM
897
13,228
0
04 Mar 2022
Rethinking the Role of Demonstrations: What Makes In-Context Learning
  Work?
Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?
Sewon Min
Xinxi Lyu
Ari Holtzman
Mikel Artetxe
M. Lewis
Hannaneh Hajishirzi
Luke Zettlemoyer
LLMAGLRM
193
1,501
0
25 Feb 2022
Repairing the Cracked Foundation: A Survey of Obstacles in Evaluation
  Practices for Generated Text
Repairing the Cracked Foundation: A Survey of Obstacles in Evaluation Practices for Generated Text
Sebastian Gehrmann
Elizabeth Clark
Thibault Sellam
ELMAI4CE
149
193
0
14 Feb 2022
QAFactEval: Improved QA-Based Factual Consistency Evaluation for
  Summarization
QAFactEval: Improved QA-Based Factual Consistency Evaluation for Summarization
Alexander R. Fabbri
Chien-Sheng Wu
Wenhao Liu
Caiming Xiong
HILM
94
218
0
16 Dec 2021
SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in
  Summarization
SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization
Philippe Laban
Tobias Schnabel
Paul N. Bennett
Marti A. Hearst
HILM
107
398
0
18 Nov 2021
Training Dynamics for Text Summarization Models
Training Dynamics for Text Summarization Models
Tanya Goyal
Jiacheng Xu
Junjie Li
Greg Durrett
125
32
0
15 Oct 2021
ASPECTNEWS: Aspect-Oriented Summarization of News Documents
ASPECTNEWS: Aspect-Oriented Summarization of News Documents
Ojas Ahuja
Jiacheng Xu
A. Gupta
Kevin Horecka
Greg Durrett
104
46
0
15 Oct 2021
Multitask Prompted Training Enables Zero-Shot Task Generalization
Multitask Prompted Training Enables Zero-Shot Task Generalization
Victor Sanh
Albert Webson
Colin Raffel
Stephen H. Bach
Lintang Sutawika
...
T. Bers
Stella Biderman
Leo Gao
Thomas Wolf
Alexander M. Rush
LRM
355
1,710
0
15 Oct 2021
The Perils of Using Mechanical Turk to Evaluate Open-Ended Text
  Generation
The Perils of Using Mechanical Turk to Evaluate Open-Ended Text Generation
Marzena Karpinska
Nader Akoury
Mohit Iyyer
281
108
0
14 Sep 2021
An Exploratory Study on Long Dialogue Summarization: What Works and
  What's Next
An Exploratory Study on Long Dialogue Summarization: What Works and What's Next
Yusen Zhang
Ansong Ni
Tao Yu
Rui Zhang
Chenguang Zhu
Budhaditya Deb
Asli Celikyilmaz
Ahmed Hassan Awadallah
Dragomir R. Radev
RALM
130
58
0
10 Sep 2021
Finetuned Language Models Are Zero-Shot Learners
Finetuned Language Models Are Zero-Shot Learners
Jason W. Wei
Maarten Bosma
Vincent Zhao
Kelvin Guu
Adams Wei Yu
Brian Lester
Nan Du
Andrew M. Dai
Quoc V. Le
ALMUQCV
254
3,789
0
03 Sep 2021
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated
  Text
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated Text
Elizabeth Clark
Tal August
Sofia Serrano
Nikita Haduong
Suchin Gururangan
Noah A. Smith
DeLMO
133
415
0
30 Jun 2021
BookSum: A Collection of Datasets for Long-form Narrative Summarization
BookSum: A Collection of Datasets for Long-form Narrative Summarization
Wojciech Kry'sciñski
Nazneen Rajani
Divyansh Agarwal
Caiming Xiong
Dragomir R. Radev
RALM
109
154
0
18 May 2021
Understanding Factuality in Abstractive Summarization with FRANK: A
  Benchmark for Factuality Metrics
Understanding Factuality in Abstractive Summarization with FRANK: A Benchmark for Factuality Metrics
Artidoro Pagnoni
Vidhisha Balachandran
Yulia Tsvetkov
HILM
282
311
0
27 Apr 2021
Cross-Task Generalization via Natural Language Crowdsourcing
  Instructions
Cross-Task Generalization via Natural Language Crowdsourcing Instructions
Swaroop Mishra
Daniel Khashabi
Chitta Baral
Hannaneh Hajishirzi
LRM
173
753
0
18 Apr 2021
Annotating and Modeling Fine-grained Factuality in Summarization
Annotating and Modeling Fine-grained Factuality in Summarization
Tanya Goyal
Greg Durrett
HILM
75
153
0
09 Apr 2021
QuestEval: Summarization Asks for Fact-based Evaluation
QuestEval: Summarization Asks for Fact-based Evaluation
Thomas Scialom
Paul-Alexis Dray
Patrick Gallinari
Sylvain Lamprier
Benjamin Piwowarski
Jacopo Staiano
Alex Jinpeng Wang
HILM
71
276
0
23 Mar 2021
GENIE: Toward Reproducible and Standardized Human Evaluation for Text
  Generation
GENIE: Toward Reproducible and Standardized Human Evaluation for Text Generation
Daniel Khashabi
Gabriel Stanovsky
Jonathan Bragg
Nicholas Lourie
Jungo Kasai
Yejin Choi
Noah A. Smith
Daniel S. Weld
115
21
0
17 Jan 2021
CTRLsum: Towards Generic Controllable Text Summarization
CTRLsum: Towards Generic Controllable Text Summarization
Junxian He
Wojciech Kry'sciñski
Bryan McCann
Nazneen Rajani
Caiming Xiong
270
142
0
08 Dec 2020
Metrics also Disagree in the Low Scoring Range: Revisiting Summarization
  Evaluation Metrics
Metrics also Disagree in the Low Scoring Range: Revisiting Summarization Evaluation Metrics
Manik Bhandari
Pranav Narayan Gour
A. Ashfaq
Pengfei Liu
61
17
0
08 Nov 2020
With Little Power Comes Great Responsibility
With Little Power Comes Great Responsibility
Dallas Card
Peter Henderson
Urvashi Khandelwal
Robin Jia
Kyle Mahowald
Dan Jurafsky
273
118
0
13 Oct 2020
Evaluating Factuality in Generation with Dependency-level Entailment
Evaluating Factuality in Generation with Dependency-level Entailment
Tanya Goyal
Greg Durrett
128
151
0
12 Oct 2020
Towards Question-Answering as an Automatic Metric for Evaluating the
  Content Quality of a Summary
Towards Question-Answering as an Automatic Metric for Evaluating the Content Quality of a Summary
Daniel Deutsch
Tania Bedrax-Weiss
Dan Roth
83
113
0
01 Oct 2020
SummEval: Re-evaluating Summarization Evaluation
SummEval: Re-evaluating Summarization Evaluation
Alexander R. Fabbri
Wojciech Kry'sciñski
Bryan McCann
Caiming Xiong
R. Socher
Dragomir R. Radev
HILM
119
722
0
24 Jul 2020
12
Next