Climate Finance Bench

Climate Finance Bench introduces an open benchmark that targets question-answering over corporate climate disclosures using Large Language Models. We curate 33 recent sustainability reports in English drawn from companies across all 11 GICS sectors and annotate 330 expert-validated question-answer pairs that span pure extraction, numerical reasoning, and logical reasoning. Building on this dataset, we propose a comparison of RAG (retrieval-augmented generation) approaches. We show that the retriever's ability to locate passages that actually contain the answer is the chief performance bottleneck. We further argue for transparent carbon reporting in AI-for-climate applications, highlighting advantages of techniques such as Weight Quantization.
View on arXiv@article{mankour2025_2505.22752, title={ Climate Finance Bench }, author={ Rafik Mankour and Yassine Chafai and Hamada Saleh and Ghassen Ben Hassine and Thibaud Barreau and Peter Tankov }, journal={arXiv preprint arXiv:2505.22752}, year={ 2025 } }