v1v2v3 (latest)

BERTScore: Evaluating Text Generation with BERT

21 April 2019

Papers citing "BERTScore: Evaluating Text Generation with BERT"

50 / 3,520 papers shown

Title
Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations Deren Lei Yaxi Li Mengya Hu Mingyu Wang Vincent Yun Emily Ching Eslam Kamal HILM LRM 59 40 0 06 Oct 2023
Automatic and Human-AI Interactive Text Generation Yao Dou Philippe Laban Claire Gardent Wei Xu 82 4 0 05 Oct 2023
Learning Personalized Alignment for Evaluating Open-ended Text Generation Danqing Wang Kevin Kaichuang Yang Hanlin Zhu Xiaomeng Yang Andrew Cohen Lei Li Yuandong Tian ALM LM&MA 87 11 0 05 Oct 2023
Retrieval-augmented Generation to Improve Math Question-Answering: Trade-offs Between Groundedness and Human Preference Zachary Levonian Chenglu Li Wangda Zhu Anoushka Gade Owen Henkel Millie-Ellen Postle Wanli Xing AI4Ed RALM 101 34 0 04 Oct 2023
T $^3$ Bench: Benchmarking Current Progress in Text-to-3D Generation Yuze He Yushi Bai Matthieu Lin Wang Zhao Yubin Hu Jenny Sheng Ran Yi Juanzi Li Yong Liu 130 33 0 04 Oct 2023
Low Resource Summarization using Pre-trained Language Models Mubashir Munaf Hammad Afzal N. Iltaf Khawir Mahmood 37 7 0 04 Oct 2023
Integrating UMLS Knowledge into Large Language Models for Medical Question Answering Rui Yang Edison Marrese-Taylor Yuhe Ke Lechao Cheng Qingyu Chen Irene Li ELM AI4MH LM&MA 85 16 0 04 Oct 2023
LC-Score: Reference-less estimation of Text Comprehension Difficulty Paul Tardy Charlotte Roze Paul Poupet 35 0 0 04 Oct 2023
Improving Automatic VQA Evaluation Using Large Language Models Oscar Manas Benno Krojer Aishwarya Agrawal 95 25 0 04 Oct 2023
TWIZ-v2: The Wizard of Multimodal Conversational-Stimulus Rafael Ferreira Diogo Tavares Diogo Glória-Silva Rodrigo Valerio João Bordalo Ines Simoes Vasco Ramos David Semedo João Magalhães 45 4 0 03 Oct 2023
Jury: A Comprehensive Evaluation Toolkit Devrim Cavusoglu Secil Sen Ulas Sert S. Altinuc ELM 16 2 0 03 Oct 2023
LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model Muhammad Ahmed Shah Roshan S. Sharma Hira Dhamyal R. Olivier Ankit Shah ... Massa Baali Soham Deshmukh Michael Kuhlmann Bhiksha Raj Rita Singh AAML 67 21 0 02 Oct 2023
Human Mobility Question Answering (Vision Paper) Hao Xue Flora D. Salim 51 0 0 02 Oct 2023
Defending Against Authorship Identification Attacks Haining Wang 58 2 0 02 Oct 2023
Fusing Models with Complementary Expertise Hongyi Wang Felipe Maia Polo Yuekai Sun Souvik Kundu Eric Xing Mikhail Yurochkin FedML MoMe 94 33 0 02 Oct 2023
It's MBR All the Way Down: Modern Generation Techniques Through the Lens of Minimum Bayes Risk Amanda Bertsch Alex Xie Graham Neubig Matthew R. Gormley 86 36 0 02 Oct 2023
Analyzing and Mitigating Object Hallucination in Large Vision-Language Models Yiyang Zhou Chenhang Cui Jaehong Yoon Linjun Zhang Zhun Deng Chelsea Finn Mohit Bansal Huaxiu Yao MLLM 167 186 0 01 Oct 2023
TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks Dongfu Jiang Yishan Li Ge Zhang Wenhao Huang Bill Yuchen Lin Wenhu Chen ALM 111 69 0 01 Oct 2023
Overview of the BioLaySumm 2023 Shared Task on Lay Summarization of Biomedical Research Articles Tomas Goldsack Jiancheng Yang Qianqian Xie Carolina Scarton Matthew Shardlow Sophia Ananiadou Chenghua Lin 72 16 0 29 Sep 2023
STRONG -- Structure Controllable Legal Opinion Summary Generation Yang Zhong Diane Litman ELM AILaw 60 3 0 29 Sep 2023
LatticeGen: A Cooperative Framework which Hides Generated Text in a Lattice for Privacy-Aware Generation on Cloud Mengke Zhang Tianxing He Tianle Wang Lu Mi Fatemehsadat Mireshghallah Binyi Chen Hao Wang Yulia Tsvetkov 75 0 0 29 Sep 2023
Benchmarking Cognitive Biases in Large Language Models as Evaluators Ryan Koo Minhwa Lee Vipul Raheja Jong Inn Park Zae Myung Kim Dongyeop Kang ALM 114 87 0 29 Sep 2023
Hallucination Reduction in Long Input Text Summarization Gregor Lenz Ronit Mandal Abhishek Agarwal Debarshi Kumar Sanyal HILM 59 9 0 28 Sep 2023
TPE: Towards Better Compositional Reasoning over Conceptual Tools with Multi-persona Collaboration Hongru Wang Huimin Wang Lingzhi Wang Minda Hu Rui Wang Boyang Xue Hongyuan Lu Fei Mi Kam-Fai Wong LRM KELM LLMAG 91 13 0 28 Sep 2023
Large Language Model Routing with Benchmark Datasets Tal Shnitzer Anthony Ou Mírian Silva Kate Soule Yuekai Sun Justin Solomon Neil Thompson Mikhail Yurochkin RALM 83 71 0 27 Sep 2023
Question answering using deep learning in low resource Indian language Marathi Dhiraj Amin S. Govilkar Sagar Kulkarni 46 3 0 27 Sep 2023
Identifying and Mitigating Privacy Risks Stemming from Language Models: A Survey Victoria Smith Ali Shahin Shamsabadi Carolyn Ashurst Adrian Weller PILM 108 27 0 27 Sep 2023
Legal Question-Answering in the Indian Context: Efficacy, Challenges, and Potential of Modern AI Models S. Nigam Shubham Kumar Mishra Ayush Kumar Mishra Noel Shallum Arnab Bhattacharya AILaw ELM 63 9 0 26 Sep 2023
Are Human-generated Demonstrations Necessary for In-context Learning? Rui Li Guoyin Wang Jiwei Li LRM 51 14 0 26 Sep 2023
Ragas: Automated Evaluation of Retrieval Augmented Generation ES Shahul Jithin James Luis Espinosa-Anke Steven Schockaert 145 205 0 26 Sep 2023
ALLURE: Auditing and Improving LLM-based Evaluation of Text using Iterative In-Context-Learning Hosein Hasanbeig Hiteshi Sharma Leo Betthauser Felipe Vieira Frujeri Ida Momennejad 114 16 0 24 Sep 2023
Calibrating LLM-Based Evaluator Yuxuan Liu Tianchi Yang Shaohan Huang Zihan Zhang Haizhen Huang Furu Wei Weiwei Deng Feng Sun Qi Zhang 117 33 0 23 Sep 2023
Diversifying Question Generation over Knowledge Base via External Natural Questions Shasha Guo Jing Zhang Xirui Ke Cuiping Li Hong Chen 126 5 0 23 Sep 2023
Investigating Large Language Models and Control Mechanisms to Improve Text Readability of Biomedical Abstracts Z. Li Samuel Belkadi Nicolo Micheletti Lifeng Han Matthew Shardlow Goran Nenadic 98 5 0 22 Sep 2023
Effective Distillation of Table-based Reasoning Ability from LLMs Bohao Yang Chen Tang Kangning Zhao Chenghao Xiao Chenghua Lin LRM 65 27 0 22 Sep 2023
ReConcile: Round-Table Conference Improves Reasoning via Consensus among Diverse LLMs Justin Chih-Yao Chen Swarnadeep Saha Joey Tianyi Zhou LLMAG LRM 103 143 0 22 Sep 2023
Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts Emad A. Alghamdi Jezia Zakraoui Fares A. Abanmy 78 1 0 22 Sep 2023
Semantic similarity prediction is better than other semantic similarity measures Steffen Herbold 28 4 0 22 Sep 2023
Unlocking Model Insights: A Dataset for Automated Model Card Generation Shruti Singh Hitesh Lodwal Husain Malwat Rakesh Thakur Mayank Singh SyDa 54 3 0 22 Sep 2023
Is it Possible to Modify Text to a Target Readability Level? An Initial Investigation Using Zero-Shot Large Language Models Asma Farajidizaji Vatsal Raina Mark Gales 64 2 0 22 Sep 2023
LongDocFACTScore: Evaluating the Factuality of Long Document Abstractive Summarisation Jennifer A Bishop Qianqian Xie Sophia Ananiadou HILM 82 12 0 21 Sep 2023
Foundation Metrics for Evaluating Effectiveness of Healthcare Conversations Powered by Generative AI Mahyar Abbasian Elahe Khatibi Iman Azimi David Oniani Zahra Shakeri Hossein Abad ... Bryant Lin Olivier Gevaert Li-Jia Li Ramesh C. Jain Amir M. Rahmani LM&MA ELM AI4MH 139 78 0 21 Sep 2023
Reranking for Natural Language Generation from Logical Forms: A Study based on Large Language Models Levon Haroutunian Zhuang Li Lucian Galescu Philip R. Cohen Raj Tumuluri Gholamreza Haffari LRM 89 1 0 21 Sep 2023
SQUARE: Automatic Question Answering Evaluation using Multiple Positive and Negative References Matteo Gabburo Siddhant Garg Rik Koncel-Kedziorski Alessandro Moschitti 77 1 0 21 Sep 2023
Towards Answering Health-related Questions from Medical Videos: Datasets and Approaches Deepak Gupta Kush Attal Dina Demner-Fushman LM&MA 54 1 0 21 Sep 2023
Striking Gold in Advertising: Standardization and Exploration of Ad Text Generation Masato Mita Soichiro Murakami Akihiko Kato Peinan Zhang 117 8 0 21 Sep 2023
Controlled Generation with Prompt Insertion for Natural Language Explanations in Grammatical Error Correction Masahiro Kaneko Naoaki Okazaki LRM 99 5 0 20 Sep 2023
A Family of Pretrained Transformer Language Models for Russian Dmitry Zmitrovich Alexander Abramov Andrey Kalmykov Maria Tikhonova Ekaterina Taktasheva ... Vitalii Kadulin Sergey Markov Tatiana Shavrina Vladislav Mikhailov Alena Fenogenova 109 26 0 19 Sep 2023
Toward Unified Controllable Text Generation via Regular Expression Instruction Xin Zheng Hongyu Lin Xianpei Han Le Sun 101 5 0 19 Sep 2023
Prompt, Condition, and Generate: Classification of Unsupported Claims with In-Context Learning Peter Ebert Christensen Srishti Yadav Serge J. Belongie 52 1 0 19 Sep 2023