Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.09675
Cited By
v1
v2
v3 (latest)
BERTScore: Evaluating Text Generation with BERT
21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERTScore: Evaluating Text Generation with BERT"
50 / 3,520 papers shown
Title
StyleBART: Decorate Pretrained Model with Style Adapters for Unsupervised Stylistic Headline Generation
Hanqing Wang
Yajing Luo
Boya Xiong
Guanhua Chen
Yun-Nung Chen
53
0
0
26 Oct 2023
Cultural Adaptation of Recipes
Yong Cao
Yova Kementchedjhieva
Ruixiang Cui
Antonia Karamolegkou
Li Zhou
Megan Dare
Lucia Donatelli
Daniel Hershcovich
95
6
0
26 Oct 2023
Automatic Logical Forms improve fidelity in Table-to-Text generation
Iñigo Alonso
Eneko Agirre
LMTD
66
3
0
26 Oct 2023
JudgeLM: Fine-tuned Large Language Models are Scalable Judges
Lianghui Zhu
Xinggang Wang
Xinlong Wang
ELM
ALM
184
143
0
26 Oct 2023
BOOST: Harnessing Black-Box Control to Boost Commonsense in LMs' Generation
Yufei Tian
Felix Zhang
Nanyun Peng
63
0
0
25 Oct 2023
Follow-on Question Suggestion via Voice Hints for Voice Assistants
B. Fetahu
Pedro Faustini
Giuseppe Castellucci
Anjie Fang
Oleg Rokhlenko
S. Malmasi
51
2
0
25 Oct 2023
Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text Generation
Mateusz Lango
Ondrej Dusek
60
8
0
25 Oct 2023
HANSEN: Human and AI Spoken Text Benchmark for Authorship Analysis
Nafis Irtiza Tripto
Adaku Uchendu
Thai V. Le
Mattia Setzu
F. Giannotti
Dongwon Lee
DeLMO
58
7
0
25 Oct 2023
Diversity Enhanced Narrative Question Generation for Storybooks
Hokeun Yoon
Jinyeong Bak
94
8
0
25 Oct 2023
RCAgent: Cloud Root Cause Analysis by Autonomous Agents with Tool-Augmented Large Language Models
Zefan Wang
Zichuan Liu
Yingying Zhang
Aoxiao Zhong
Lunting Fan
Lingfei Wu
Qingsong Wen
93
32
0
25 Oct 2023
Can GPT models Follow Human Summarization Guidelines? A Study for Targeted Communication Goals
Yongxin Zhou
Fabien Ringeval
Franccois Portet
ELM
ALM
65
0
0
25 Oct 2023
Background Summarization of Event Timelines
Adithya Pratapa
Kevin Small
Markus Dreyer
120
2
0
24 Oct 2023
Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model System for Answering Medical Questions using Scientific Literature
Alejandro Lozano
Scott L. Fleming
Chia-Chun Chiang
Nigam Shah
ELM
RALM
99
41
0
24 Oct 2023
BLESS: Benchmarking Large Language Models on Sentence Simplification
Tannon Kew
Alison Chi
Laura Vásquez-Rodríguez
Sweta Agrawal
Dennis Aumiller
Fernando Alva-Manchego
Teven Le Scao
95
26
0
24 Oct 2023
Enhancing Biomedical Lay Summarisation with External Knowledge Graphs
Tomas Goldsack
Zhihao Zhang
Chen Tang
Carolina Scarton
Chenghua Lin
58
10
0
24 Oct 2023
Creating a silver standard for patent simplification
Silvia Casola
A. Lavelli
Horacio Saggion
AILaw
63
3
0
24 Oct 2023
Improving Biomedical Abstractive Summarisation with Knowledge Aggregation from Citation Papers
Chen Tang
Shunyu Wang
Tomas Goldsack
Chenghua Lin
62
18
0
24 Oct 2023
Fighting Fire with Fire: The Dual Role of LLMs in Crafting and Detecting Elusive Disinformation
Jason Samuel Lucas
Adaku Uchendu
Michiharu Yamashita
Jooyoung Lee
Shaurya Rohatgi
Dongwon Lee
96
48
0
24 Oct 2023
GPT-4 as an Effective Zero-Shot Evaluator for Scientific Figure Captions
Ting-Yao Hsu
Chieh-Yang Huang
Ryan Rossi
Sungchul Kim
C. Lee Giles
‘Kenneth’ Huang
125
13
0
23 Oct 2023
Exploring the Potential of Large Language Models in Generating Code-Tracing Questions for Introductory Programming Courses
Aysa Xuemo Fan
Ranran Haoran Zhang
Luc Paquette
Rui Zhang
AI4Ed
44
3
0
23 Oct 2023
Reference Free Domain Adaptation for Translation of Noisy Questions with Question Specific Rewards
Baban Gain
Ramakrishna Appicharla
Soumya Chennabasavaraj
Nikesh Garera
Asif Ekbal
M. Chelliah
72
0
0
23 Oct 2023
Location-Aware Visual Question Generation with Lightweight Models
Nicholas Collin Suwono
Justin Chih-Yao Chen
Tun-Min Hung
T. Huang
I-Bin Liao
Yung-Hui Li
Lun-Wei Ku
Shao-Hua Sun
55
4
0
23 Oct 2023
Paraphrase Types for Generation and Detection
Jan Philip Wahle
Bela Gipp
Terry Ruas
70
4
0
23 Oct 2023
Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review
Banghao Chen
Zhaofeng Zhang
Nicolas Langrené
Shengxin Zhu
LLMAG
112
13
0
23 Oct 2023
Which Prompts Make The Difference? Data Prioritization For Efficient Human LLM Evaluation
M. Boubdir
Edward Kim
Beyza Ermis
Marzieh Fadaee
Sara Hooker
ALM
88
19
0
22 Oct 2023
Vision Language Models in Autonomous Driving: A Survey and Outlook
Xingcheng Zhou
Mingyu Liu
Ekim Yurtsever
B. L. Žagar
Walter Zimmer
Hu Cao
Alois C. Knoll
VLM
111
60
0
22 Oct 2023
Evaluating Subjective Cognitive Appraisals of Emotions from Large Language Models
Hongli Zhan
Desmond C. Ong
Junyi Jessy Li
148
7
0
22 Oct 2023
From Chaos to Clarity: Claim Normalization to Empower Fact-Checking
Megha Sundriyal
Tanmoy Chakraborty
Preslav Nakov
62
14
0
22 Oct 2023
Chainpoll: A high efficacy method for LLM hallucination detection
Robert Friel
Atindriyo Sanyal
LRM
HILM
80
28
0
22 Oct 2023
Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial Applications
Manuel Faysse
Gautier Viaud
C´eline Hudelot
Pierre Colombo
84
11
0
21 Oct 2023
Beyond Accuracy: Evaluating Self-Consistency of Code Large Language Models with IdentityChain
Marcus J. Min
Yangruibo Ding
Luca Buratti
Saurabh Pujar
Gail E. Kaiser
Suman Jana
Baishakhi Ray
LRM
HILM
84
21
0
21 Oct 2023
Toward Stronger Textual Attack Detectors
Pierre Colombo
Marine Picot
Nathan Noiry
Guillaume Staerman
Pablo Piantanida
563
5
0
21 Oct 2023
AITA Generating Moral Judgements of the Crowd with Reasoning
Osama Bsher
Ameer Sabri
53
0
0
21 Oct 2023
Optimizing Retrieval-augmented Reader Models via Token Elimination
Moshe Berchansky
Peter Izsak
Avi Caciularu
Ido Dagan
Moshe Wasserblat
RALM
95
12
0
20 Oct 2023
Retrieval-Augmented Neural Response Generation Using Logical Reasoning and Relevance Scoring
Nicholas Walker
Stefan Ultes
Pierre Lison
RALM
LRM
81
2
0
20 Oct 2023
Tuna: Instruction Tuning using Feedback from Large Language Models
Haoran Li
Yiran Liu
Xingxing Zhang
Wei Lu
Furu Wei
ALM
83
3
0
20 Oct 2023
An LLM can Fool Itself: A Prompt-Based Adversarial Attack
Xilie Xu
Keyi Kong
Ning Liu
Li-zhen Cui
Di Wang
Jingfeng Zhang
Mohan Kankanhalli
AAML
SILM
129
88
0
20 Oct 2023
NameGuess: Column Name Expansion for Tabular Data
Jiani Zhang
Zhengyuan Shen
Balasubramaniam Srinivasan
Shen Wang
Huzefa Rangwala
George Karypis
49
6
0
19 Oct 2023
Fast and Accurate Factual Inconsistency Detection Over Long Documents
B. Lattimer
Patrick Chen
Xinyuan Zhang
Yi Yang
HILM
102
19
0
19 Oct 2023
Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries
Yiqiao Jin
Mohit Chandra
Gaurav Verma
Yibo Hu
Munmun De Choudhury
Srijan Kumar
LM&MA
ELM
159
76
0
19 Oct 2023
CLAIR: Evaluating Image Captions with Large Language Models
David M. Chan
Suzanne Petryk
Joseph E. Gonzalez
Trevor Darrell
John F. Canny
94
21
0
19 Oct 2023
Probing LLMs for hate speech detection: strengths and vulnerabilities
Sarthak Roy
Ashish Harshavardhan
Animesh Mukherjee
Punyajoy Saha
132
36
0
19 Oct 2023
The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions
Siru Ouyang
Shuohang Wang
Yang Liu
Ming Zhong
Yizhu Jiao
Dan Iter
Reid Pryzant
Chenguang Zhu
Heng Ji
Jiawei Han
98
32
0
19 Oct 2023
REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models
Ruisi Zhang
Shehzeen Samarah Hussain
Paarth Neekhara
F. Koushanfar
73
36
0
18 Oct 2023
DiagrammerGPT: Generating Open-Domain, Open-Platform Diagrams via LLM Planning
Abhaysinh Zala
Han Lin
Jaemin Cho
Mohit Bansal
91
16
0
18 Oct 2023
A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation
Giuseppe Attanasio
Flor Miriam Plaza del Arco
Debora Nozza
Anne Lauscher
72
19
0
18 Oct 2023
InfoDiffusion: Information Entropy Aware Diffusion Process for Non-Autoregressive Text Generation
Renzhi Wang
Jing Li
Piji Li
DiffM
79
3
0
18 Oct 2023
Zero-shot Faithfulness Evaluation for Text Summarization with Foundation Language Model
Qi Jia
Siyu Ren
Yizhu Liu
Kenny Q. Zhu
ALM
HILM
91
17
0
18 Oct 2023
Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting
Melanie Sclar
Yejin Choi
Yulia Tsvetkov
Alane Suhr
108
361
0
17 Oct 2023
Medical Text Simplification: Optimizing for Readability with Unlikelihood Training and Reranked Beam Search Decoding
Lorenzo Jaime Yu Flores
Heyuan Huang
Kejian Shi
Sophie Chheang
Arman Cohan
MedIm
68
7
0
17 Oct 2023
Previous
1
2
3
...
37
38
39
...
69
70
71
Next