Neighbor communities
0 / 0 papers shown
Title |
|---|
Top Contributors
| Name | # Papers | # Citations |
|---|---|---|
Social Events
| Date | Location | Event |
|---|---|---|
Title |
|---|
| Name | # Papers | # Citations |
|---|---|---|
| Date | Location | Event |
|---|---|---|
Focuses on research that actively explores methods and strategies to ensure language models' outputs align with human values, ethics, and intentions, constituting a significant portion of the paper's content.
Title |
|---|
Title | |||
|---|---|---|---|
![]() The ORCA Benchmark: Evaluating Real-World Calculation Accuracy in Large Language Models Claudia Herambourg Dawid Siuda Anna Szczepanek Julia Kopczyńska Joao R. L. Santos Wojciech Sas Joanna Śmietańska-Nowak | |||
![]() The Ouroboros of Benchmarking: Reasoning Evaluation in an Era of Saturation İbrahim Ethem Deveci Duygu Ataman | |||
![]() CodeClash: Benchmarking Goal-Oriented Software Engineering John Yang Kilian Lieret Joyce Yang Carlos E. Jimenez Ofir Press Ludwig Schmidt Diyi Yang | |||
![]() IF-CRITIC: Towards a Fine-Grained LLM Critic for Instruction-Following Evaluation Bosi Wen Yilin Niu Cunxiang Wang Pei Ke Xiaoying Ling Ying Zhang Aohan Zeng Hongning Wang Minlie Huang | |||
![]() Efficiency vs. Alignment: Investigating Safety and Fairness Risks in Parameter-Efficient Fine-Tuning of LLMs Mina Taraghi Yann Pequignot Amin Nikanjam Mohamed Amine Merzouk Foutse Khomh | |||
![]() CodeAlignBench: Assessing Code Generation Models on Developer-Preferred Code Adjustments Forough Mehralian Ryan Shar James R. Rae Alireza Hashemi | |||
| Name (-) |
|---|
| Name (-) |
|---|
| Name (-) |
|---|
| Date | Location | Event | |
|---|---|---|---|
| No social events available | |||