ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2301.13848
  4. Cited By
Benchmarking Large Language Models for News Summarization

Benchmarking Large Language Models for News Summarization

31 January 2023
Tianyi Zhang
Faisal Ladhak
Esin Durmus
Percy Liang
Kathleen McKeown
Tatsunori B. Hashimoto
    ELM
ArXivPDFHTML

Papers citing "Benchmarking Large Language Models for News Summarization"

50 / 300 papers shown
Title
Empowering Meta-Analysis: Leveraging Large Language Models for Scientific Synthesis
Jawad Ibn Ahad
Rafeed Mohammad Sultan
Abraham Kaikobad
Fuad Rahman
M. R. Amin
Nabeel Mohammed
Shafin Rahman
42
0
0
16 Nov 2024
Towards Optimizing a Retrieval Augmented Generation using Large Language
  Model on Academic Data
Towards Optimizing a Retrieval Augmented Generation using Large Language Model on Academic Data
Anum Afzal
Juraj Vladika
Gentrit Fazlija
Andrei Staradubets
Florian Matthes
RALM
36
0
0
13 Nov 2024
Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset
Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset
Khaoula Chehbouni
Jonathan Colaço-Carr
Yash More
Jackie CK Cheung
G. Farnadi
78
0
0
12 Nov 2024
LIFBench: Evaluating the Instruction Following Performance and Stability
  of Large Language Models in Long-Context Scenarios
LIFBench: Evaluating the Instruction Following Performance and Stability of Large Language Models in Long-Context Scenarios
Xiaodong Wu
Minhao Wang
Yichen Liu
Xiaoming Shi
He Yan
Xiangju Lu
Junmin Zhu
Wei Zhang
159
3
0
11 Nov 2024
Does This Summary Answer My Question? Modeling Query-Focused Summary
  Readers with Rational Speech Acts
Does This Summary Answer My Question? Modeling Query-Focused Summary Readers with Rational Speech Acts
Cesare Spinoso-Di Piano
Jackie Chi Kit Cheung
24
0
0
10 Nov 2024
Summarization of Opinionated Political Documents with Varied
  Perspectives
Summarization of Opinionated Political Documents with Varied Perspectives
Nicholas Deas
Kathleen McKeown
19
0
0
06 Nov 2024
Understanding the Effects of Human-written Paraphrases in LLM-generated
  Text Detection
Understanding the Effects of Human-written Paraphrases in LLM-generated Text Detection
Hiu Ting Lau
Arkaitz Zubiaga
DeLMO
45
1
0
06 Nov 2024
Beemo: Benchmark of Expert-edited Machine-generated Outputs
Beemo: Benchmark of Expert-edited Machine-generated Outputs
Ekaterina Artemova
Jason Samuel Lucas
Saranya Venkatraman
Jooyoung Lee
Sergei Tilga
Adaku Uchendu
Vladislav Mikhailov
DeLMO
MoE
68
4
0
06 Nov 2024
On Positional Bias of Faithfulness for Long-form Summarization
On Positional Bias of Faithfulness for Long-form Summarization
David Wan
Jesse Vig
Joey Tianyi Zhou
Shafiq R. Joty
HILM
56
3
0
31 Oct 2024
GraphAide: Advanced Graph-Assisted Query and Reasoning System
GraphAide: Advanced Graph-Assisted Query and Reasoning System
Sumit Purohit
George Chin
Patrick S Mackey
Joseph A Cottam
37
0
0
29 Oct 2024
A Bayesian Approach to Harnessing the Power of LLMs in Authorship
  Attribution
A Bayesian Approach to Harnessing the Power of LLMs in Authorship Attribution
Zhengmian Hu
Tong Zheng
Heng Huang
BDL
29
2
0
29 Oct 2024
Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by Betting
Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by Betting
Can Chen
Jun-Kun Wang
DeLMO
42
0
0
29 Oct 2024
Prompting and Fine-Tuning of Small LLMs for Length-Controllable
  Telephone Call Summarization
Prompting and Fine-Tuning of Small LLMs for Length-Controllable Telephone Call Summarization
David Thulke
Yingbo Gao
Rricha Jalota
Christian Dugast
Hermann Ney
29
3
0
24 Oct 2024
DomainSum: A Hierarchical Benchmark for Fine-Grained Domain Shift in
  Abstractive Text Summarization
DomainSum: A Hierarchical Benchmark for Fine-Grained Domain Shift in Abstractive Text Summarization
Haohan Yuan
Haopeng Zhang
26
1
0
21 Oct 2024
DiscoGraMS: Enhancing Movie Screen-Play Summarization using Movie Character-Aware Discourse Graph
DiscoGraMS: Enhancing Movie Screen-Play Summarization using Movie Character-Aware Discourse Graph
Maitreya Prafulla Chitale
Uday Bindal
Rajakrishnan Rajkumar
Rahul Mishra
29
0
0
18 Oct 2024
Disentangling Likes and Dislikes in Personalized Generative Explainable
  Recommendation
Disentangling Likes and Dislikes in Personalized Generative Explainable Recommendation
Ryotaro Shimizu
Takashi Wada
Yu Wang
Johannes Kruse
Sean O'Brien
...
Yuya Yoshikawa
Yuki Saito
Fugee Tsung
M. Goto
Julian McAuley
31
0
0
17 Oct 2024
TemporalBench: Benchmarking Fine-grained Temporal Understanding for
  Multimodal Video Models
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
Mu Cai
Reuben Tan
Jianrui Zhang
Bocheng Zou
Kai Zhang
...
Yao Dou
J. Park
Jianfeng Gao
Yong Jae Lee
Jianwei Yang
44
12
0
14 Oct 2024
MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and
  Mapping With a Dynamic and Static Object Discriminator
MLP-SLAM: Multilayer Perceptron-Based Simultaneous Localization and Mapping With a Dynamic and Static Object Discriminator
Taozhe Li
Wei Sun
34
1
0
14 Oct 2024
HSR-Enhanced Sparse Attention Acceleration
HSR-Enhanced Sparse Attention Acceleration
Bo Chen
Yingyu Liang
Zhizhou Sha
Zhenmei Shi
Zhao-quan Song
95
18
0
14 Oct 2024
ELF-Gym: Evaluating Large Language Models Generated Features for Tabular
  Prediction
ELF-Gym: Evaluating Large Language Models Generated Features for Tabular Prediction
Yanlin Zhang
Ning Li
Quan Gan
Wenbo Zhang
David Wipf
Minjie Wang
23
0
0
13 Oct 2024
As Simple as Fine-tuning: LLM Alignment via Bidirectional Negative
  Feedback Loss
As Simple as Fine-tuning: LLM Alignment via Bidirectional Negative Feedback Loss
Xin Mao
Feng-Lin Li
Huimin Xu
Wei Zhang
Wang Chen
A. Luu
32
1
0
07 Oct 2024
GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual,
  Cross-lingual and Multi-document News Summarization
GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual, Cross-lingual and Multi-document News Summarization
Yangfan Ye
Xiachong Feng
Xiaocheng Feng
Weitao Ma
Libo Qin
Dongliang Xu
Qing Yang
Hongtao Liu
Bing Qin
34
1
0
05 Oct 2024
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving
  Model Transformation
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation
Aurick Qiao
Z. Yao
Samyam Rajbhandari
Yuxiong He
VLM
37
0
0
04 Oct 2024
MetaMetrics: Calibrating Metrics For Generation Tasks Using Human Preferences
MetaMetrics: Calibrating Metrics For Generation Tasks Using Human Preferences
Genta Indra Winata
David Anugraha
Lucky Susanto
Garry Kuwanto
Derry Wijaya
37
7
0
03 Oct 2024
Are Large Language Models In-Context Personalized Summarizers? Get an
  iCOPERNICUS Test Done!
Are Large Language Models In-Context Personalized Summarizers? Get an iCOPERNICUS Test Done!
Divya Patel
Pathik Patel
Ankush Chander
Sourish Dasgupta
Tanmoy Chakraborty
24
1
0
30 Sep 2024
A Critical Look at Meta-evaluating Summarisation Evaluation Metrics
A Critical Look at Meta-evaluating Summarisation Evaluation Metrics
Xiang Dai
Sarvnaz Karimi
Biaoyan Fang
36
0
0
29 Sep 2024
HelloBench: Evaluating Long Text Generation Capabilities of Large
  Language Models
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Haoran Que
Feiyu Duan
Liqun He
Yutao Mou
Wangchunshu Zhou
...
Ge Zhang
Junran Peng
Zhaoxiang Zhang
Songyang Zhang
Kai Chen
LM&MA
ELM
VLM
51
11
0
24 Sep 2024
Unlocking Memorization in Large Language Models with Dynamic Soft
  Prompting
Unlocking Memorization in Large Language Models with Dynamic Soft Prompting
Zhepeng Wang
Runxue Bao
Yawen Wu
Jackson Taylor
Cao Xiao
Feng Zheng
Weiwen Jiang
Shangqian Gao
Yanfu Zhang
PILM
39
7
0
20 Sep 2024
Human Interest or Conflict? Leveraging LLMs for Automated Framing
  Analysis in TV Shows
Human Interest or Conflict? Leveraging LLMs for Automated Framing Analysis in TV Shows
David Alonso del Barrio
Max Tiel
D. Gática-Pérez
38
3
0
19 Sep 2024
Enriching Datasets with Demographics through Large Language Models:
  What's in a Name?
Enriching Datasets with Demographics through Large Language Models: What's in a Name?
Khaled AlNuaimi
Gautier Marti
Mathieu Ravaut
Abdulla Alketbi
Andreas Henschel
Raed Jaradat
31
1
0
17 Sep 2024
From Experts to the Public: Governing Multimodal Language Models in
  Politically Sensitive Video Analysis
From Experts to the Public: Governing Multimodal Language Models in Politically Sensitive Video Analysis
Tanusree Sharma
Yujin Potter
Zachary Kilhoffer
Yun Huang
Dawn Song
Yang Wang
56
3
0
15 Sep 2024
NovAScore: A New Automated Metric for Evaluating Document Level Novelty
NovAScore: A New Automated Metric for Evaluating Document Level Novelty
Lin Ai
Ziwei Gong
Harshsaiprasad Deshpande
Alexander Johnson
Emmy Phung
Ahmad Emami
Julia Hirschberg
18
1
0
14 Sep 2024
Synthetic continued pretraining
Synthetic continued pretraining
Zitong Yang
Neil Band
Shuangping Li
Emmanuel Candès
Tatsunori Hashimoto
CLL
SyDa
38
11
0
11 Sep 2024
MarsCode Agent: AI-native Automated Bug Fixing
MarsCode Agent: AI-native Automated Bug Fixing
Y. Liu
Pengfei Gao
Xinchen Wang
Jie Liu
Yexuan Shi
Zhao Zhang
Chao Peng
LLMAG
36
21
0
02 Sep 2024
ProteinGPT: Multimodal LLM for Protein Property Prediction and Structure Understanding
ProteinGPT: Multimodal LLM for Protein Property Prediction and Structure Understanding
Yijia Xiao
Edward Sun
Yiqiao Jin
Qifan Wang
Wei Wang
47
10
0
21 Aug 2024
Benchmarking Large Language Models for Math Reasoning Tasks
Benchmarking Large Language Models for Math Reasoning Tasks
Kathrin Seßler
Yao Rong
Emek Gözlüklü
Enkelejda Kasneci
LRM
30
3
0
20 Aug 2024
MegaFake: A Theory-Driven Dataset of Fake News Generated by Large
  Language Models
MegaFake: A Theory-Driven Dataset of Fake News Generated by Large Language Models
Lionel Z. Wang
Yiming Ma
Renfei Gao
Beichen Guo
Han Zhu
Wenqi Fan
Zexin Lu
Ka Chung Ng
SyDa
28
2
0
19 Aug 2024
Confidence-weighted integration of human and machine judgments for superior decision-making
Confidence-weighted integration of human and machine judgments for superior decision-making
Felipe Yánez
Xiaoliang Luo
Omar Valerio Minero
Bradley C. Love
19
2
0
15 Aug 2024
Using generative AI to support standardization work -- the case of 3GPP
Using generative AI to support standardization work -- the case of 3GPP
M. Staron
Jonathan Strom
Albin Karlsson
Wilhelm Meding
21
2
0
08 Aug 2024
Zero-shot Factual Consistency Evaluation Across Domains
Zero-shot Factual Consistency Evaluation Across Domains
Raunak Agarwal
HILM
44
0
0
07 Aug 2024
Leveraging Entailment Judgements in Cross-Lingual Summarisation
Leveraging Entailment Judgements in Cross-Lingual Summarisation
Huajian Zhang
Laura Perez-Beltrachini
HILM
41
0
0
01 Aug 2024
Improving Faithfulness of Large Language Models in Summarization via
  Sliding Generation and Self-Consistency
Improving Faithfulness of Large Language Models in Summarization via Sliding Generation and Self-Consistency
Taiji Li
Zhi Li
Yin Zhang
HILM
38
6
0
31 Jul 2024
Interpreting and Mitigating Hallucination in MLLMs through Multi-agent
  Debate
Interpreting and Mitigating Hallucination in MLLMs through Multi-agent Debate
Zheng Lin
Zhenxing Niu
Zhibin Wang
Yinghui Xu
39
4
0
30 Jul 2024
ThinK: Thinner Key Cache by Query-Driven Pruning
ThinK: Thinner Key Cache by Query-Driven Pruning
Yuhui Xu
Zhanming Jie
Hanze Dong
Lei Wang
Xudong Lu
Aojun Zhou
Amrita Saha
Caiming Xiong
Doyen Sahoo
72
14
0
30 Jul 2024
An Efficient Inference Framework for Early-exit Large Language Models
An Efficient Inference Framework for Early-exit Large Language Models
Ruijie Miao
Yihan Yan
Xinshuo Yao
Tong Yang
29
0
0
25 Jul 2024
Know Your Limits: A Survey of Abstention in Large Language Models
Know Your Limits: A Survey of Abstention in Large Language Models
Bingbing Wen
Jihan Yao
Shangbin Feng
Chenjun Xu
Yulia Tsvetkov
Bill Howe
Lucy Lu Wang
59
5
0
25 Jul 2024
RedAgent: Red Teaming Large Language Models with Context-aware
  Autonomous Language Agent
RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent
Huiyu Xu
Wenhui Zhang
Zhibo Wang
Feng Xiao
Rui Zheng
Yunhe Feng
Zhongjie Ba
Kui Ren
AAML
LLMAG
34
11
0
23 Jul 2024
Enhancing LLM's Cognition via Structurization
Enhancing LLM's Cognition via Structurization
Kai-Chun Liu
Zhihang Fu
Chao Chen
Wei Zhang
Rongxin Jiang
Fan Zhou
Yao-Shen Chen
Yue-bo Wu
Jieping Ye
55
1
0
23 Jul 2024
UniMEL: A Unified Framework for Multimodal Entity Linking with Large
  Language Models
UniMEL: A Unified Framework for Multimodal Entity Linking with Large Language Models
Liu Qi
Yongyi He
Lian Defu
Zhi Zheng
Tong Xu
Liu Che
Chen Enhong
MLLM
33
1
0
23 Jul 2024
Operationalizing a Threat Model for Red-Teaming Large Language Models
  (LLMs)
Operationalizing a Threat Model for Red-Teaming Large Language Models (LLMs)
Apurv Verma
Satyapriya Krishna
Sebastian Gehrmann
Madhavan Seshadri
Anu Pradhan
Tom Ault
Leslie Barrett
David Rabinowitz
John Doucette
Nhathai Phan
54
10
0
20 Jul 2024
Previous
123456
Next