2306.02561
LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion
5 June 2023
Dongfu Jiang
Xiang Ren
Bill Yuchen Lin
ELM
Papers citing
"LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion"
50 / 240 papers shown
Title
SelectLLM: Query-Aware Efficient Selection Algorithm for Large Language Models
Kaushal Kumar Maurya
KV Aditya Srivatsa
Ekaterina Kochmar
105
2
0
16 Aug 2024
FuseChat: Knowledge Fusion of Chat Models
Fanqi Wan
Longguang Zhong
Ziyi Yang
Ruijun Chen
Xiaojun Quan
ALM
KELM
MoMe
87
29
0
15 Aug 2024
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
Karel D'Oosterlinck
Winnie Xu
Chris Develder
Thomas Demeester
A. Singh
Christopher Potts
Douwe Kiela
Shikib Mehri
80
17
0
12 Aug 2024
ProFuser: Progressive Fusion of Large Language Models
Tianyuan Shi
Fanqi Wan
Canbin Huang
Xiaojun Quan
Chenliang Li
Ming Yan
Ji Zhang
MoMe
67
3
0
09 Aug 2024
Cool-Fusion: Fuse Large Language Models without Training
Cong Liu
Xiaojun Quan
Yan Pan
Liangzhi Li
Weigang Wu
Xu Chen
MoMe
VLM
135
5
0
29 Jul 2024
Right Now, Wrong Then: Non-Stationary Direct Preference Optimization under Preference Drift
Seongho Son
William Bankes
Sayak Ray Chowdhury
Brooks Paige
Ilija Bogunovic
124
4
0
26 Jul 2024
Exploring Domain Robust Lightweight Reward Models based on Router Mechanism
Hyuk Namgoong
Jeesu Jung
Sangkeun Jung
Yoonhyung Roh
74
1
0
24 Jul 2024
MetaLLM: A High-performant and Cost-efficient Dynamic Framework for Wrapping LLMs
Quang H. Nguyen
Duy C. Hoang
Juliette Decugis
Saurav Manchanda
Nitesh Chawla
Khoa D. Doan
238
11
0
15 Jul 2024
LIONs: An Empirically Optimized Approach to Align Language Models
Xiao Yu
Qingyang Wu
Yu Li
Zhou Yu
ALM
95
6
0
09 Jul 2024
Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models
Jinliang Lu
Ziliang Pang
Min Xiao
Yaochen Zhu
Rui Xia
Jiajun Zhang
MoMe
119
27
0
08 Jul 2024
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Zhaorun Chen
Yichao Du
Zichen Wen
Yiyang Zhou
Chenhang Cui
...
Jiawei Zhou
Zhuokai Zhao
Rafael Rafailov
Chelsea Finn
Huaxiu Yao
EGVM
MLLM
117
35
0
05 Jul 2024
Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning
Yifang Chen
Shuohang Wang
Ziyi Yang
Hiteshi Sharma
Nikos Karampatziakis
Donghan Yu
Kevin Jamieson
Simon Shaolei Du
Yelong Shen
OffRL
102
5
0
02 Jul 2024
Decoding-Time Language Model Alignment with Multiple Objectives
Ruizhe Shi
Yifang Chen
Yushi Hu
Alisa Liu
Hannaneh Hajishirzi
Noah A. Smith
Simon Du
140
43
0
27 Jun 2024
RouteLLM: Learning to Route LLMs with Preference Data
Isaac Ong
Amjad Almahairi
Vincent Wu
Wei-Lin Chiang
Tianhao Wu
Joseph E. Gonzalez
M. W. Kadous
Ion Stoica
170
105
0
26 Jun 2024
LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
A. Bavaresco
Raffaella Bernardi
Leonardo Bertolazzi
Desmond Elliott
Raquel Fernández
...
David Schlangen
Alessandro Suglia
Aditya K Surikuchi
Ece Takmaz
A. Testoni
ALM
ELM
179
88
0
26 Jun 2024
Human-AI collectives produce the most accurate differential diagnoses
N. Zöller
Julian Berger
Irving Lin
Nathan Fu
Jayanth S Komarneni
...
Benjamin Harack
Eugene A. Chu
V. Trianni
Ralf H. J. M. Kurvers
Stefan M. Herzog
LM&MA
56
1
0
21 Jun 2024
Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for Ensembling
Yao-Ching Yu
Chun-Chih Kuo
Ziqi Ye
Yu-Cheng Chang
Yueh-Se Li
88
12
0
18 Jun 2024
Adaptive Selection for Homogeneous Tools: An Instantiation in the RAG Scenario
Feiteng Mu
Yong Jiang
Liwen Zhang
Chu Liu
Wenjie Li
Pengjun Xie
Fei Huang
44
2
0
18 Jun 2024
WPO: Enhancing RLHF with Weighted Preference Optimization
Wenxuan Zhou
Ravi Agrawal
Shujian Zhang
Sathish Indurthi
Sanqiang Zhao
Kaiqiang Song
Silei Xu
Chenguang Zhu
105
20
0
17 Jun 2024
Iterative Utility Judgment Framework via LLMs Inspired by Relevance in Philosophy
Hengran Zhang
Keping Bi
Jiafeng Guo
Xueqi Cheng
101
1
0
17 Jun 2024
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs
Rui Yang
Ruomeng Ding
Yong Lin
Huan Zhang
Tong Zhang
122
62
0
14 Jun 2024
Multi-Agent Collaboration via Cross-Team Orchestration
Zhuoyun Du
Chen Qian
Wei Liu
Zihao Xie
Yifei Wang
...
Weize Chen
Cheng Yang
Ye Tian
Xuantang Xiong
Lei Han
LLMAG
103
21
0
13 Jun 2024
Multivariate Stochastic Dominance via Optimal Transport and Applications to Models Benchmarking
Gabriel Rioux
Apoorva Nitsure
Mattia Rigotti
Kristjan Greenewald
Youssef Mroueh
93
1
0
10 Jun 2024
CERET: Cost-Effective Extrinsic Refinement for Text Generation
Jason (Jinglun) Cai
Hang Su
Monica Sunkara
Igor Shalyminov
Saab Mansour
82
1
0
08 Jun 2024
CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation
I-Hung Hsu
Zifeng Wang
Long T. Le
Lesly Miculicich
Nanyun Peng
Chen-Yu Lee
Tomas Pfister
HILM
112
4
0
08 Jun 2024
Mixture-of-Agents Enhances Large Language Model Capabilities
Junlin Wang
Jue Wang
Ben Athiwaratkun
Ce Zhang
James Zou
LLMAG
AIFin
104
138
0
07 Jun 2024
Brainstorming Brings Power to Large Language Models of Knowledge Reasoning
Zining Qin
Chenhao Wang
Huiling Qin
Weijia Jia
LRM
73
1
0
02 Jun 2024
Inverse Constitutional AI: Compressing Preferences into Principles
Arduin Findeis
Timo Kaufmann
Eyke Hüllermeier
Samuel Albanie
Robert Mullins
SyDa
120
12
0
02 Jun 2024
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
Shenao Zhang
Donghan Yu
Hiteshi Sharma
Ziyi Yang
Shuohang Wang
Hany Hassan
Zhaoran Wang
LRM
101
38
0
29 May 2024
Are You Sure? Rank Them Again: Repeated Ranking For Better Preference Datasets
Peter Devine
ALM
87
3
0
29 May 2024
An Empirical Analysis on Large Language Models in Debate Evaluation
Xinyi Liu
Pinxin Liu
Hangfeng He
ELM
72
7
0
28 May 2024
SimPO: Simple Preference Optimization with a Reference-Free Reward
Yu Meng
Mengzhou Xia
Danqi Chen
185
492
0
23 May 2024
Annotation-Efficient Preference Optimization for Language Model Alignment
Yuu Jinnai
Ukyo Honda
65
0
0
22 May 2024
Risks and Opportunities of Open-Source Generative AI
Francisco Eiras
Aleksander Petrov
Bertie Vidgen
Christian Schroeder
Fabio Pizzati
...
Matthew Jackson
Phillip H. S. Torr
Trevor Darrell
Y. Lee
Jakob N. Foerster
96
19
0
14 May 2024
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts
R. Prabhakar
R. Sivaramakrishnan
Darshan Gandhi
Yun Du
Mingran Wang
...
Urmish Thakker
Dawei Huang
Sumti Jairath
Kevin J. Brown
K. Olukotun
MoE
77
15
0
13 May 2024
LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play
Li-Chun Lu
Shou-Jen Chen
Tsung-Min Pai
Chan-Hung Yu
Hung-yi Lee
Shao-Hua Sun
LLMAG
98
50
0
10 May 2024
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
Seungone Kim
Juyoung Suk
Shayne Longpre
Bill Yuchen Lin
Jamin Shin
Sean Welleck
Graham Neubig
Moontae Lee
Kyungjae Lee
Minjoon Seo
MoMe
ALM
ELM
147
205
0
02 May 2024
Self-Play Preference Optimization for Language Model Alignment
Yue Wu
Zhiqing Sun
Huizhuo Yuan
Kaixuan Ji
Yiming Yang
Quanquan Gu
147
145
0
01 May 2024
Harnessing the Power of Multiple Minds: Lessons Learned from LLM Routing
KV Aditya Srivatsa
Kaushal Kumar Maurya
Ekaterina Kochmar
105
17
0
01 May 2024
Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing
Dujian Ding
Ankur Mallick
Chi Wang
Robert Sim
Subhabrata Mukherjee
Victor Rühle
L. Lakshmanan
Ahmed Hassan Awadallah
168
107
0
22 Apr 2024
Ensemble Learning for Heterogeneous Large Language Models with Deep Parallel Collaboration
Yi-Chong Huang
Xiaocheng Feng
Baohang Li
Yang Xiang
Hui Wang
Bing Qin
Ting Liu
FedML
97
30
0
19 Apr 2024
REQUAL-LM: Reliability and Equity through Aggregation in Large Language Models
Sana Ebrahimi
N. Shahbazi
Abolfazl Asudeh
57
1
0
17 Apr 2024
Disentangling Instructive Information from Ranked Multiple Candidates for Multi-Document Scientific Summarization
Pancheng Wang
Shasha Li
Dong Li
Kehan Long
Jintao Tang
Ting Wang
67
2
0
16 Apr 2024
Improving Recall of Large Language Models: A Model Collaboration Approach for Relational Triple Extraction
Zepeng Ding
Wenhao Huang
Jiaqing Liang
Deqing Yang
Yanghua Xiao
KELM
74
6
0
15 Apr 2024
Post-Hoc Reversal: Are We Selecting Models Prematurely?
Rishabh Ranjan
Saurabh Garg
Mrigank Raman
Carlos Guestrin
Zachary Chase Lipton
72
0
0
11 Apr 2024
SLPL SHROOM at SemEval2024 Task 06: A comprehensive study on models ability to detect hallucination
Pouya Fallah
S. Gooran
Mohammad Jafarinasab
Pouya Sadeghi
Reza Farnia
Amirreza Tarabkhah
Zainab Sadat Taghavi
Hossein Sameti
HILM
77
3
0
07 Apr 2024
Advancing LLM Reasoning Generalists with Preference Trees
Lifan Yuan
Ganqu Cui
Hanbin Wang
Ning Ding
Xingyao Wang
...
Zhenghao Liu
Bowen Zhou
Hao Peng
Zhiyuan Liu
Maosong Sun
LRM
138
123
0
02 Apr 2024
An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Ziwei Chai
Guoyin Wang
Jing Su
Tianjie Zhang
Xuanwen Huang
...
Jingjing Xu
Jianbo Yuan
Hongxia Yang
Leilei Gan
Yang Yang
89
7
0
25 Mar 2024
RouterBench: A Benchmark for Multi-LLM Routing System
Qitian Jason Hu
Jacob Bieker
Xiuyu Li
Nan Jiang
Benjamin Keigwin
Gaurav Ranganath
Kurt Keutzer
Shriyash Kaustubh Upadhyay
113
54
0
18 Mar 2024
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models
Ning Ding
Yulin Chen
Ganqu Cui
Xingtai Lv
Weilin Zhao
Ruobing Xie
Bowen Zhou
Zhiyuan Liu
Maosong Sun
ALM
MoMe
AI4CE
150
7
0
13 Mar 2024