Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2404.09785
Cited By
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations
15 April 2024
David Nadeau
Mike Kroutikov
Karen McNeil
Simon Baribeau
HILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations"
2 / 2 papers shown
Title
Bi-Factorial Preference Optimization: Balancing Safety-Helpfulness in Language Models
Wenxuan Zhang
Philip Torr
Mohamed Elhoseiny
Adel Bibi
91
10
0
27 Aug 2024
Understanding Factuality in Abstractive Summarization with FRANK: A Benchmark for Factuality Metrics
Artidoro Pagnoni
Vidhisha Balachandran
Yulia Tsvetkov
HILM
233
306
0
27 Apr 2021
1