Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.09675
Cited By
v1
v2
v3 (latest)
BERTScore: Evaluating Text Generation with BERT
21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERTScore: Evaluating Text Generation with BERT"
50 / 3,520 papers shown
Title
Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations
Deren Lei
Yaxi Li
Mengya Hu
Mingyu Wang
Vincent Yun
Emily Ching
Eslam Kamal
HILM
LRM
59
40
0
06 Oct 2023
Automatic and Human-AI Interactive Text Generation
Yao Dou
Philippe Laban
Claire Gardent
Wei Xu
82
4
0
05 Oct 2023
Learning Personalized Alignment for Evaluating Open-ended Text Generation
Danqing Wang
Kevin Kaichuang Yang
Hanlin Zhu
Xiaomeng Yang
Andrew Cohen
Lei Li
Yuandong Tian
ALM
LM&MA
87
11
0
05 Oct 2023
Retrieval-augmented Generation to Improve Math Question-Answering: Trade-offs Between Groundedness and Human Preference
Zachary Levonian
Chenglu Li
Wangda Zhu
Anoushka Gade
Owen Henkel
Millie-Ellen Postle
Wanli Xing
AI4Ed
RALM
101
34
0
04 Oct 2023
T
3
^3
3
Bench: Benchmarking Current Progress in Text-to-3D Generation
Yuze He
Yushi Bai
Matthieu Lin
Wang Zhao
Yubin Hu
Jenny Sheng
Ran Yi
Juanzi Li
Yong Liu
130
33
0
04 Oct 2023
Low Resource Summarization using Pre-trained Language Models
Mubashir Munaf
Hammad Afzal
N. Iltaf
Khawir Mahmood
37
7
0
04 Oct 2023
Integrating UMLS Knowledge into Large Language Models for Medical Question Answering
Rui Yang
Edison Marrese-Taylor
Yuhe Ke
Lechao Cheng
Qingyu Chen
Irene Li
ELM
AI4MH
LM&MA
85
16
0
04 Oct 2023
LC-Score: Reference-less estimation of Text Comprehension Difficulty
Paul Tardy
Charlotte Roze
Paul Poupet
35
0
0
04 Oct 2023
Improving Automatic VQA Evaluation Using Large Language Models
Oscar Manas
Benno Krojer
Aishwarya Agrawal
95
25
0
04 Oct 2023
TWIZ-v2: The Wizard of Multimodal Conversational-Stimulus
Rafael Ferreira
Diogo Tavares
Diogo Glória-Silva
Rodrigo Valerio
João Bordalo
Ines Simoes
Vasco Ramos
David Semedo
João Magalhães
45
4
0
03 Oct 2023
Jury: A Comprehensive Evaluation Toolkit
Devrim Cavusoglu
Secil Sen
Ulas Sert
S. Altinuc
ELM
16
2
0
03 Oct 2023
LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model
Muhammad Ahmed Shah
Roshan S. Sharma
Hira Dhamyal
R. Olivier
Ankit Shah
...
Massa Baali
Soham Deshmukh
Michael Kuhlmann
Bhiksha Raj
Rita Singh
AAML
67
21
0
02 Oct 2023
Human Mobility Question Answering (Vision Paper)
Hao Xue
Flora D. Salim
51
0
0
02 Oct 2023
Defending Against Authorship Identification Attacks
Haining Wang
58
2
0
02 Oct 2023
Fusing Models with Complementary Expertise
Hongyi Wang
Felipe Maia Polo
Yuekai Sun
Souvik Kundu
Eric Xing
Mikhail Yurochkin
FedML
MoMe
94
33
0
02 Oct 2023
It's MBR All the Way Down: Modern Generation Techniques Through the Lens of Minimum Bayes Risk
Amanda Bertsch
Alex Xie
Graham Neubig
Matthew R. Gormley
86
36
0
02 Oct 2023
Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
Yiyang Zhou
Chenhang Cui
Jaehong Yoon
Linjun Zhang
Zhun Deng
Chelsea Finn
Mohit Bansal
Huaxiu Yao
MLLM
167
186
0
01 Oct 2023
TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks
Dongfu Jiang
Yishan Li
Ge Zhang
Wenhao Huang
Bill Yuchen Lin
Wenhu Chen
ALM
111
69
0
01 Oct 2023
Overview of the BioLaySumm 2023 Shared Task on Lay Summarization of Biomedical Research Articles
Tomas Goldsack
Jiancheng Yang
Qianqian Xie
Carolina Scarton
Matthew Shardlow
Sophia Ananiadou
Chenghua Lin
72
16
0
29 Sep 2023
STRONG -- Structure Controllable Legal Opinion Summary Generation
Yang Zhong
Diane Litman
ELM
AILaw
60
3
0
29 Sep 2023
LatticeGen: A Cooperative Framework which Hides Generated Text in a Lattice for Privacy-Aware Generation on Cloud
Mengke Zhang
Tianxing He
Tianle Wang
Lu Mi
Fatemehsadat Mireshghallah
Binyi Chen
Hao Wang
Yulia Tsvetkov
75
0
0
29 Sep 2023
Benchmarking Cognitive Biases in Large Language Models as Evaluators
Ryan Koo
Minhwa Lee
Vipul Raheja
Jong Inn Park
Zae Myung Kim
Dongyeop Kang
ALM
114
87
0
29 Sep 2023
Hallucination Reduction in Long Input Text Summarization
Gregor Lenz
Ronit Mandal
Abhishek Agarwal
Debarshi Kumar Sanyal
HILM
59
9
0
28 Sep 2023
TPE: Towards Better Compositional Reasoning over Conceptual Tools with Multi-persona Collaboration
Hongru Wang
Huimin Wang
Lingzhi Wang
Minda Hu
Rui Wang
Boyang Xue
Hongyuan Lu
Fei Mi
Kam-Fai Wong
LRM
KELM
LLMAG
91
13
0
28 Sep 2023
Large Language Model Routing with Benchmark Datasets
Tal Shnitzer
Anthony Ou
Mírian Silva
Kate Soule
Yuekai Sun
Justin Solomon
Neil Thompson
Mikhail Yurochkin
RALM
83
71
0
27 Sep 2023
Question answering using deep learning in low resource Indian language Marathi
Dhiraj Amin
S. Govilkar
Sagar Kulkarni
46
3
0
27 Sep 2023
Identifying and Mitigating Privacy Risks Stemming from Language Models: A Survey
Victoria Smith
Ali Shahin Shamsabadi
Carolyn Ashurst
Adrian Weller
PILM
108
27
0
27 Sep 2023
Legal Question-Answering in the Indian Context: Efficacy, Challenges, and Potential of Modern AI Models
S. Nigam
Shubham Kumar Mishra
Ayush Kumar Mishra
Noel Shallum
Arnab Bhattacharya
AILaw
ELM
63
9
0
26 Sep 2023
Are Human-generated Demonstrations Necessary for In-context Learning?
Rui Li
Guoyin Wang
Jiwei Li
LRM
51
14
0
26 Sep 2023
Ragas: Automated Evaluation of Retrieval Augmented Generation
ES Shahul
Jithin James
Luis Espinosa-Anke
Steven Schockaert
145
205
0
26 Sep 2023
ALLURE: Auditing and Improving LLM-based Evaluation of Text using Iterative In-Context-Learning
Hosein Hasanbeig
Hiteshi Sharma
Leo Betthauser
Felipe Vieira Frujeri
Ida Momennejad
114
16
0
24 Sep 2023
Calibrating LLM-Based Evaluator
Yuxuan Liu
Tianchi Yang
Shaohan Huang
Zihan Zhang
Haizhen Huang
Furu Wei
Weiwei Deng
Feng Sun
Qi Zhang
117
33
0
23 Sep 2023
Diversifying Question Generation over Knowledge Base via External Natural Questions
Shasha Guo
Jing Zhang
Xirui Ke
Cuiping Li
Hong Chen
126
5
0
23 Sep 2023
Investigating Large Language Models and Control Mechanisms to Improve Text Readability of Biomedical Abstracts
Z. Li
Samuel Belkadi
Nicolo Micheletti
Lifeng Han
Matthew Shardlow
Goran Nenadic
98
5
0
22 Sep 2023
Effective Distillation of Table-based Reasoning Ability from LLMs
Bohao Yang
Chen Tang
Kangning Zhao
Chenghao Xiao
Chenghua Lin
LRM
65
27
0
22 Sep 2023
ReConcile: Round-Table Conference Improves Reasoning via Consensus among Diverse LLMs
Justin Chih-Yao Chen
Swarnadeep Saha
Joey Tianyi Zhou
LLMAG
LRM
103
143
0
22 Sep 2023
Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts
Emad A. Alghamdi
Jezia Zakraoui
Fares A. Abanmy
78
1
0
22 Sep 2023
Semantic similarity prediction is better than other semantic similarity measures
Steffen Herbold
28
4
0
22 Sep 2023
Unlocking Model Insights: A Dataset for Automated Model Card Generation
Shruti Singh
Hitesh Lodwal
Husain Malwat
Rakesh Thakur
Mayank Singh
SyDa
54
3
0
22 Sep 2023
Is it Possible to Modify Text to a Target Readability Level? An Initial Investigation Using Zero-Shot Large Language Models
Asma Farajidizaji
Vatsal Raina
Mark Gales
64
2
0
22 Sep 2023
LongDocFACTScore: Evaluating the Factuality of Long Document Abstractive Summarisation
Jennifer A Bishop
Qianqian Xie
Sophia Ananiadou
HILM
82
12
0
21 Sep 2023
Foundation Metrics for Evaluating Effectiveness of Healthcare Conversations Powered by Generative AI
Mahyar Abbasian
Elahe Khatibi
Iman Azimi
David Oniani
Zahra Shakeri Hossein Abad
...
Bryant Lin
Olivier Gevaert
Li-Jia Li
Ramesh C. Jain
Amir M. Rahmani
LM&MA
ELM
AI4MH
139
78
0
21 Sep 2023
Reranking for Natural Language Generation from Logical Forms: A Study based on Large Language Models
Levon Haroutunian
Zhuang Li
Lucian Galescu
Philip R. Cohen
Raj Tumuluri
Gholamreza Haffari
LRM
89
1
0
21 Sep 2023
SQUARE: Automatic Question Answering Evaluation using Multiple Positive and Negative References
Matteo Gabburo
Siddhant Garg
Rik Koncel-Kedziorski
Alessandro Moschitti
77
1
0
21 Sep 2023
Towards Answering Health-related Questions from Medical Videos: Datasets and Approaches
Deepak Gupta
Kush Attal
Dina Demner-Fushman
LM&MA
54
1
0
21 Sep 2023
Striking Gold in Advertising: Standardization and Exploration of Ad Text Generation
Masato Mita
Soichiro Murakami
Akihiko Kato
Peinan Zhang
117
8
0
21 Sep 2023
Controlled Generation with Prompt Insertion for Natural Language Explanations in Grammatical Error Correction
Masahiro Kaneko
Naoaki Okazaki
LRM
99
5
0
20 Sep 2023
A Family of Pretrained Transformer Language Models for Russian
Dmitry Zmitrovich
Alexander Abramov
Andrey Kalmykov
Maria Tikhonova
Ekaterina Taktasheva
...
Vitalii Kadulin
Sergey Markov
Tatiana Shavrina
Vladislav Mikhailov
Alena Fenogenova
109
26
0
19 Sep 2023
Toward Unified Controllable Text Generation via Regular Expression Instruction
Xin Zheng
Hongyu Lin
Xianpei Han
Le Sun
101
5
0
19 Sep 2023
Prompt, Condition, and Generate: Classification of Unsupported Claims with In-Context Learning
Peter Ebert Christensen
Srishti Yadav
Serge J. Belongie
52
1
0
19 Sep 2023
Previous
1
2
3
...
39
40
41
...
69
70
71
Next