Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1904.09675
Cited By
v1
v2
v3 (latest)
BERTScore: Evaluating Text Generation with BERT
21 April 2019
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"BERTScore: Evaluating Text Generation with BERT"
50 / 3,520 papers shown
Title
Fine-tuning Large Language Models with Sequential Instructions
Hanxu Hu
Simon Yu
Pinzhen Chen
Edoardo Ponti
ALM
LRM
137
15
0
12 Mar 2024
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models
Yu Yang
Siddhartha Mishra
Jeffrey N Chiang
Baharan Mirzasoleiman
98
24
0
12 Mar 2024
Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning
Zijian Zhou
Miaojing Shi
Meng Wei
Oluwatosin O. Alabi
Zijie Yue
Tom Vercauteren
LM&MA
84
7
0
11 Mar 2024
MedKP: Medical Dialogue with Knowledge Enhancement and Clinical Pathway Encoding
Jiageng Wu
Xian Wu
Yefeng Zheng
Jie Yang
MedIm
LM&MA
77
3
0
11 Mar 2024
From Instructions to Constraints: Language Model Alignment with Automatic Constraint Verification
Fei Wang
Chao Shang
Sarthak Jain
Shuai Wang
Qiang Ning
Bonan Min
Vittorio Castelli
Yassine Benajiba
Dan Roth
ALM
55
8
0
10 Mar 2024
Transformer based Multitask Learning for Image Captioning and Object Detection
Debolena Basak
P. K. Srijith
M. Desarkar
74
2
0
10 Mar 2024
Measuring Bias in a Ranked List using Term-based Representations
Amin Abolghasemi
Leif Azzopardi
Arian Askari
Maarten de Rijke
Suzan Verberne
82
7
0
09 Mar 2024
KG-Rank: Enhancing Large Language Models for Medical QA with Knowledge Graphs and Ranking Techniques
Rui Yang
Haoran Liu
Edison Marrese-Taylor
Qingcheng Zeng
Yu He Ke
...
Lechao Cheng
Qingyu Chen
James Caverlee
Yutaka Matsuo
Irene Li
LM&MA
AI4MH
102
34
0
09 Mar 2024
ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications
Sotaro Takeshita
Tommaso Green
Ines Reinig
Kai Eckert
Simone Paolo Ponzetto
69
12
0
08 Mar 2024
ERBench: An Entity-Relationship based Automatically Verifiable Hallucination Benchmark for Large Language Models
Jio Oh
Soyeon Kim
Junseok Seo
Jindong Wang
Ruochen Xu
Xing Xie
Steven Euijong Whang
76
4
0
08 Mar 2024
ROUGE-K: Do Your Summaries Have Keywords?
Sotaro Takeshita
Simone Paolo Ponzetto
Kai Eckert
73
1
0
08 Mar 2024
Know Your Audience: The benefits and pitfalls of generating plain language summaries beyond the "general" audience
Tal August
Kyle Lo
Noah A. Smith
Katharina Reinecke
92
14
0
08 Mar 2024
Can Your Model Tell a Negation from an Implicature? Unravelling Challenges With Intent Encoders
Yuwei Zhang
Siffi Singh
Sailik Sengupta
Igor Shalyminov
Hang Su
Hwanjun Song
Saab Mansour
81
2
0
07 Mar 2024
Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation
Seunghee Han
Se Jin Park
Chae Won Kim
Y. Ro
99
1
0
07 Mar 2024
On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models
Xinpeng Wang
Shitong Duan
Xiaoyuan Yi
Jing Yao
Shanlin Zhou
Zhihua Wei
Peng Zhang
Dongkuan Xu
Maosong Sun
Xing Xie
OffRL
120
17
0
07 Mar 2024
Exploring LLM-based Agents for Root Cause Analysis
Devjeet Roy
Xuchao Zhang
Rashi Bhave
Chetan Bansal
P. Las-Casas
Rodrigo Fonseca
Saravan Rajmohan
115
32
0
07 Mar 2024
MEIT: Multimodal Electrocardiogram Instruction Tuning on Large Language Models for Report Generation
Zhongwei Wan
Che Liu
Xin Wang
Chaofan Tao
Hui Shen
Zhenwu Peng
Jie Fu
Rossella Arcucci
Huaxiu Yao
108
10
0
07 Mar 2024
Semi-Supervised Dialogue Abstractive Summarization via High-Quality Pseudolabel Selection
Jianfeng He
Hang Su
Jason (Jinglun) Cai
Igor Shalyminov
Hwanjun Song
Saab Mansour
76
4
0
06 Mar 2024
FaaF: Facts as a Function for the evaluation of generated text
Vasileios Katranidis
Gabor Barany
HILM
RALM
74
4
0
06 Mar 2024
A Modular Approach for Multimodal Summarization of TV Shows
Louis Mahon
Mirella Lapata
86
10
0
06 Mar 2024
BiVert: Bidirectional Vocabulary Evaluation using Relations for Machine Translation
Carinne Cherf
Yuval Pinter
37
1
0
06 Mar 2024
Book2Dial: Generating Teacher-Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots
Junling Wang
Jakub Macina
Nico Daheim
Sankalan Pal Chowdhury
Mrinmaya Sachan
71
10
0
05 Mar 2024
Data Augmentation using Large Language Models: Data Perspectives, Learning Paradigms and Challenges
Bosheng Ding
Chengwei Qin
Ruochen Zhao
Tianze Luo
Xinze Li
Guizhen Chen
Wenhan Xia
Junjie Hu
Anh Tuan Luu
Shafiq Joty
107
20
0
05 Mar 2024
Benchmarking the Text-to-SQL Capability of Large Language Models: A Comprehensive Evaluation
Bin Zhang
Yuxiao Ye
Guoqing Du
Xiaoru Hu
Zhishuai Li
Sun Yang
Chi Harold Liu
Rui Zhao
Ziyue Li
Hangyu Mao
LMTD
97
34
0
05 Mar 2024
A Second Look on BASS -- Boosting Abstractive Summarization with Unified Semantic Graphs -- A Replication Study
Osman Alperen Koras
Jorg Schlotterer
Christin Seifert
100
1
0
05 Mar 2024
Revisiting Meta-evaluation for Grammatical Error Correction
Masamune Kobayashi
Masato Mita
Mamoru Komachi
68
0
0
05 Mar 2024
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods
Hanlei Jin
Yang Zhang
Dan Meng
Jun Wang
Jinghua Tan
249
96
0
05 Mar 2024
SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models
Xiang Gao
Jiaxin Zhang
Lalla Mouatadid
Kamalika Das
83
14
0
04 Mar 2024
Vision-Language Models for Medical Report Generation and Visual Question Answering: A Review
Iryna Hartsock
Ghulam Rasool
102
82
0
04 Mar 2024
RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Models
Saeed Najafi
Alona Fyshe
78
2
0
04 Mar 2024
FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction
Alessandro Sciré
Karim Ghonim
Roberto Navigli
HILM
56
11
0
04 Mar 2024
Enhancing Multi-Domain Automatic Short Answer Grading through an Explainable Neuro-Symbolic Pipeline
Felix Künnecke
Anna Filighera
Colin Leong
Tim Steuer
83
1
0
04 Mar 2024
Can LLMs Generate Architectural Design Decisions? -An Exploratory Empirical study
Rudra Dhar
Karthik Vaidhyanathan
Vasudeva Varma
46
16
0
04 Mar 2024
DECIDER: A Dual-System Rule-Controllable Decoding Framework for Language Generation
Chen Xu
Tian Lan
Changlong Yu
Wei Wang
Jun Gao
...
Qunxi Dong
Kun Qian
Piji Li
Wei Bi
Bin Hu
82
1
0
04 Mar 2024
Ever-Evolving Memory by Blending and Refining the Past
Seo Hyun Kim
Keummin Ka
Yohan Jo
Seung-won Hwang
Dongha Lee
Jinyoung Yeo
KELM
59
1
0
03 Mar 2024
SyllabusQA: A Course Logistics Question Answering Dataset
Nigel Fernandez
Alexander Scarlatos
Andrew Lan
44
6
0
03 Mar 2024
Improving the Validity of Automatically Generated Feedback via Reinforcement Learning
Alexander Scarlatos
Digory Smith
Simon Woodhead
Andrew Lan
OffRL
80
12
0
02 Mar 2024
Reading Subtext: Evaluating Large Language Models on Short Story Summarization with Writers
Melanie Subbiah
Sean Zhang
Lydia B. Chilton
Kathleen McKeown
111
15
0
02 Mar 2024
Attribute Structuring Improves LLM-Based Evaluation of Clinical Text Summaries
Zelalem Gero
Chandan Singh
Yiqing Xie
Sheng Zhang
Tristan Naumann
Jianfeng Gao
Hoifung Poon
ELM
ALM
61
4
0
01 Mar 2024
DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models
Kedi Chen
Qin Chen
Jie Zhou
Yishen He
Liang He
HILM
78
2
0
01 Mar 2024
Cross-Lingual Learning vs. Low-Resource Fine-Tuning: A Case Study with Fact-Checking in Turkish
R. Çekinel
Pinar Senkul
Çağrı Çöltekin
83
2
0
01 Mar 2024
CASIMIR: A Corpus of Scientific Articles enhanced with Multiple Author-Integrated Revisions
Léane Jourdan
Florian Boudin
Nicolas Hernandez
Richard Dufour
80
8
0
01 Mar 2024
Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models
Lei Li
Yuqi Wang
Runxin Xu
Peiyi Wang
Xiachong Feng
Lingpeng Kong
Qi Liu
129
58
0
01 Mar 2024
Improving Socratic Question Generation using Data Augmentation and Preference Optimization
Nischal Ashok Kumar
Andrew Lan
113
9
0
01 Mar 2024
Artwork Explanation in Large-scale Vision Language Models
Kazuki Hayashi
Yusuke Sakai
Hidetaka Kamigaito
Katsuhiko Hayashi
Taro Watanabe
30
0
0
29 Feb 2024
Query-OPT: Optimizing Inference of Large Language Models via Multi-Query Instructions in Meeting Summarization
Md Tahmid Rahman Laskar
Elena Khasanova
Xue-Yong Fu
Cheng-Hsiung Chen
TN ShashiBhushan
58
2
0
29 Feb 2024
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
Tsai-Shien Chen
Aliaksandr Siarohin
Willi Menapace
Ekaterina Deyneka
Hsiang-wei Chao
...
Yuwei Fang
Hsin-Ying Lee
Jian Ren
Ming-Hsuan Yang
Sergey Tulyakov
VGen
166
211
0
29 Feb 2024
Exploring the Efficacy of Large Language Models in Summarizing Mental Health Counseling Sessions: A Benchmark Study
Prottay Kumar Adhikary
Aseem Srivastava
Shivani Kumar
Salam Michael Singh
Puneet Manuja
Jini K. Gopinath
Vijay Krishnan
Swati Kedia
K. Deb
Tanmoy Chakraborty
AI4MH
102
10
0
29 Feb 2024
Reducing Hallucinations in Entity Abstract Summarization with Facts-Template Decomposition
Fangwei Zhu
Peiyi Wang
Zhifang Sui
HILM
47
2
0
29 Feb 2024
EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation
Yuqiao Wen
Behzad Shayegh
Chenyang Huang
Yanshuai Cao
Lili Mou
141
5
0
29 Feb 2024
Previous
1
2
3
...
29
30
31
...
69
70
71
Next