BERTScore: Evaluating Text Generation with BERT

21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
arXiv:1904.09675

Papers citing "BERTScore: Evaluating Text Generation with BERT"

50 / 3,520 papers shown
StyleBART: Decorate Pretrained Model with Style Adapters for Unsupervised Stylistic Headline Generation
Hanqing Wang
Yajing Luo
Boya Xiong
Guanhua Chen
Yun-Nung Chen
53
0
0
26 Oct 2023
Cultural Adaptation of Recipes
Yong Cao
Yova Kementchedjhieva
Ruixiang Cui
Antonia Karamolegkou
Li Zhou
Megan Dare
Lucia Donatelli
Daniel Hershcovich
95
6
0
26 Oct 2023
Automatic Logical Forms improve fidelity in Table-to-Text generation
Iñigo Alonso
Eneko Agirre
LMTD
66
3
0
26 Oct 2023
JudgeLM: Fine-tuned Large Language Models are Scalable Judges
Lianghui Zhu
Xinggang Wang
Xinlong Wang
ELM, ALM
184
143
0
26 Oct 2023
BOOST: Harnessing Black-Box Control to Boost Commonsense in LMs' Generation
Yufei Tian
Felix Zhang
Nanyun Peng
63
0
0
25 Oct 2023
Follow-on Question Suggestion via Voice Hints for Voice Assistants
B. Fetahu
Pedro Faustini
Giuseppe Castellucci
Anjie Fang
Oleg Rokhlenko
S. Malmasi
51
2
0
25 Oct 2023
Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text Generation
Mateusz Lango
Ondrej Dusek
60
8
0
25 Oct 2023
HANSEN: Human and AI Spoken Text Benchmark for Authorship Analysis
Nafis Irtiza Tripto
Adaku Uchendu
Thai V. Le
Mattia Setzu
F. Giannotti
Dongwon Lee
DeLMO
58
7
0
25 Oct 2023
Diversity Enhanced Narrative Question Generation for Storybooks
Hokeun Yoon
Jinyeong Bak
94
8
0
25 Oct 2023
RCAgent: Cloud Root Cause Analysis by Autonomous Agents with Tool-Augmented Large Language Models
Zefan Wang
Zichuan Liu
Yingying Zhang
Aoxiao Zhong
Lunting Fan
Lingfei Wu
Qingsong Wen
93
32
0
25 Oct 2023
Can GPT models Follow Human Summarization Guidelines? A Study for Targeted Communication Goals
Yongxin Zhou
Fabien Ringeval
François Portet
ELM, ALM
65
0
0
25 Oct 2023
Background Summarization of Event Timelines
Adithya Pratapa
Kevin Small
Markus Dreyer
120
2
0
24 Oct 2023
Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model System for Answering Medical Questions using Scientific Literature
Alejandro Lozano
Scott L. Fleming
Chia-Chun Chiang
Nigam Shah
ELM, RALM
99
41
0
24 Oct 2023
BLESS: Benchmarking Large Language Models on Sentence Simplification
Tannon Kew
Alison Chi
Laura Vásquez-Rodríguez
Sweta Agrawal
Dennis Aumiller
Fernando Alva-Manchego
Teven Le Scao
95
26
0
24 Oct 2023
Enhancing Biomedical Lay Summarisation with External Knowledge Graphs
Tomas Goldsack
Zhihao Zhang
Chen Tang
Carolina Scarton
Chenghua Lin
58
10
0
24 Oct 2023
Creating a silver standard for patent simplification
Silvia Casola
A. Lavelli
Horacio Saggion
AILaw
63
3
0
24 Oct 2023
Improving Biomedical Abstractive Summarisation with Knowledge Aggregation from Citation Papers
Chen Tang
Shunyu Wang
Tomas Goldsack
Chenghua Lin
62
18
0
24 Oct 2023
Fighting Fire with Fire: The Dual Role of LLMs in Crafting and Detecting Elusive Disinformation
Jason Samuel Lucas
Adaku Uchendu
Michiharu Yamashita
Jooyoung Lee
Shaurya Rohatgi
Dongwon Lee
96
48
0
24 Oct 2023
GPT-4 as an Effective Zero-Shot Evaluator for Scientific Figure Captions
Ting-Yao Hsu
Chieh-Yang Huang
Ryan Rossi
Sungchul Kim
C. Lee Giles
‘Kenneth’ Huang
125
13
0
23 Oct 2023
Exploring the Potential of Large Language Models in Generating Code-Tracing Questions for Introductory Programming Courses
Aysa Xuemo Fan
Ranran Haoran Zhang
Luc Paquette
Rui Zhang
AI4Ed
44
3
0
23 Oct 2023
Reference Free Domain Adaptation for Translation of Noisy Questions with Question Specific Rewards
Baban Gain
Ramakrishna Appicharla
Soumya Chennabasavaraj
Nikesh Garera
Asif Ekbal
M. Chelliah
72
0
0
23 Oct 2023
Location-Aware Visual Question Generation with Lightweight Models
Nicholas Collin Suwono
Justin Chih-Yao Chen
Tun-Min Hung
T. Huang
I-Bin Liao
Yung-Hui Li
Lun-Wei Ku
Shao-Hua Sun
55
4
0
23 Oct 2023
Paraphrase Types for Generation and Detection
Jan Philip Wahle
Bela Gipp
Terry Ruas
70
4
0
23 Oct 2023
Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review
Banghao Chen
Zhaofeng Zhang
Nicolas Langrené
Shengxin Zhu
LLMAG
112
13
0
23 Oct 2023
Which Prompts Make The Difference? Data Prioritization For Efficient Human LLM Evaluation
M. Boubdir
Edward Kim
Beyza Ermis
Marzieh Fadaee
Sara Hooker
ALM
88
19
0
22 Oct 2023
Vision Language Models in Autonomous Driving: A Survey and Outlook
Xingcheng Zhou
Mingyu Liu
Ekim Yurtsever
B. L. Žagar
Walter Zimmer
Hu Cao
Alois C. Knoll
VLM
111
60
0
22 Oct 2023
Evaluating Subjective Cognitive Appraisals of Emotions from Large Language Models
Hongli Zhan
Desmond C. Ong
Junyi Jessy Li
148
7
0
22 Oct 2023
From Chaos to Clarity: Claim Normalization to Empower Fact-Checking
Megha Sundriyal
Tanmoy Chakraborty
Preslav Nakov
62
14
0
22 Oct 2023
Chainpoll: A high efficacy method for LLM hallucination detection
Robert Friel
Atindriyo Sanyal
LRM, HILM
80
28
0
22 Oct 2023
Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial Applications
Manuel Faysse
Gautier Viaud
Céline Hudelot
Pierre Colombo
84
11
0
21 Oct 2023
Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain
Marcus J. Min
Yangruibo Ding
Luca Buratti
Saurabh Pujar
Gail E. Kaiser
Suman Jana
Baishakhi Ray
LRM, HILM
84
21
0
21 Oct 2023
Toward Stronger Textual Attack Detectors
Pierre Colombo
Marine Picot
Nathan Noiry
Guillaume Staerman
Pablo Piantanida
563
5
0
21 Oct 2023
AITA Generating Moral Judgements of the Crowd with Reasoning
Osama Bsher
Ameer Sabri
53
0
0
21 Oct 2023
Optimizing Retrieval-augmented Reader Models via Token Elimination
Moshe Berchansky
Peter Izsak
Avi Caciularu
Ido Dagan
Moshe Wasserblat
RALM
95
12
0
20 Oct 2023
Retrieval-Augmented Neural Response Generation Using Logical Reasoning and Relevance Scoring
Nicholas Walker
Stefan Ultes
Pierre Lison
RALM, LRM
81
2
0
20 Oct 2023
Tuna: Instruction Tuning using Feedback from Large Language Models
Haoran Li
Yiran Liu
Xingxing Zhang
Wei Lu
Furu Wei
ALM
83
3
0
20 Oct 2023
An LLM can Fool Itself: A Prompt-Based Adversarial Attack
Xilie Xu
Keyi Kong
Ning Liu
Li-zhen Cui
Di Wang
Jingfeng Zhang
Mohan Kankanhalli
AAML, SILM
129
88
0
20 Oct 2023
NameGuess: Column Name Expansion for Tabular Data
Jiani Zhang
Zhengyuan Shen
Balasubramaniam Srinivasan
Shen Wang
Huzefa Rangwala
George Karypis
49
6
0
19 Oct 2023
Fast and Accurate Factual Inconsistency Detection Over Long Documents
B. Lattimer
Patrick Chen
Xinyuan Zhang
Yi Yang
HILM
102
19
0
19 Oct 2023
Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries
Yiqiao Jin
Mohit Chandra
Gaurav Verma
Yibo Hu
Munmun De Choudhury
Srijan Kumar
LM&MA, ELM
159
76
0
19 Oct 2023
CLAIR: Evaluating Image Captions with Large Language Models
David M. Chan
Suzanne Petryk
Joseph E. Gonzalez
Trevor Darrell
John F. Canny
94
21
0
19 Oct 2023
Probing LLMs for hate speech detection: strengths and vulnerabilities
Sarthak Roy
Ashish Harshavardhan
Animesh Mukherjee
Punyajoy Saha
132
36
0
19 Oct 2023
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions
Siru Ouyang
Shuohang Wang
Yang Liu
Ming Zhong
Yizhu Jiao
Dan Iter
Reid Pryzant
Chenguang Zhu
Heng Ji
Jiawei Han
98
32
0
19 Oct 2023
REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models
Ruisi Zhang
Shehzeen Samarah Hussain
Paarth Neekhara
F. Koushanfar
73
36
0
18 Oct 2023
DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning
Abhaysinh Zala
Han Lin
Jaemin Cho
Mohit Bansal
91
16
0
18 Oct 2023
A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation
Giuseppe Attanasio
Flor Miriam Plaza del Arco
Debora Nozza
Anne Lauscher
72
19
0
18 Oct 2023
InfoDiffusion: Information Entropy Aware Diffusion Process for Non-Autoregressive Text Generation
Renzhi Wang
Jing Li
Piji Li
DiffM
79
3
0
18 Oct 2023
Zero-shot Faithfulness Evaluation for Text Summarization with Foundation Language Model
Qi Jia
Siyu Ren
Yizhu Liu
Kenny Q. Zhu
ALM, HILM
91
17
0
18 Oct 2023
Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting
Melanie Sclar
Yejin Choi
Yulia Tsvetkov
Alane Suhr
108
361
0
17 Oct 2023
Medical Text Simplification: Optimizing for Readability with Unlikelihood Training and Reranked Beam Search Decoding
Lorenzo Jaime Yu Flores
Heyuan Huang
Kejian Shi
Sophie Chheang
Arman Cohan
MedIm
68
7
0
17 Oct 2023