ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.02561
  4. Cited By
LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and
  Generative Fusion
v1v2v3 (latest)

LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion

5 June 2023
Dongfu Jiang
Xiang Ren
Bill Yuchen Lin
    ELM
ArXiv (abs)PDFHTML

Papers citing "LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion"

50 / 240 papers shown
Title
Copyright-Protected Language Generation via Adaptive Model Fusion
Copyright-Protected Language Generation via Adaptive Model Fusion
Javier Abad
Konstantin Donhauser
Francesco Pinto
Fanny Yang
130
2
0
09 Dec 2024
Bench-CoE: a Framework for Collaboration of Experts from Benchmark
Bench-CoE: a Framework for Collaboration of Experts from Benchmark
Yuanshuai Wang
Xingjian Zhang
Jinkun Zhao
Siwei Wen
Peilin Feng
Shuhao Liao
Lei Huang
Wenjun Wu
MoEALM
183
2
0
05 Dec 2024
Enhancing Perception Capabilities of Multimodal LLMs with Training-Free
  Fusion
Enhancing Perception Capabilities of Multimodal LLMs with Training-Free Fusion
Zhuokun Chen
Jinwu Hu
Zeshuai Deng
Yufeng Wang
Bohan Zhuang
Mingkui Tan
146
0
0
02 Dec 2024
Simple and Provable Scaling Laws for the Test-Time Compute of Large Language Models
Simple and Provable Scaling Laws for the Test-Time Compute of Large Language Models
Yanxi Chen
Xuchen Pan
Yaliang Li
Bolin Ding
Jingren Zhou
LRM
158
1
0
29 Nov 2024
$H^3$Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs
H3H^3H3Fusion: Helpful, Harmless, Honest Fusion of Aligned LLMs
Selim Furkan Tekin
Fatih Ilhan
Tiansheng Huang
Sihao Hu
Zachary Yahn
Ling Liu
MoMe
136
3
0
26 Nov 2024
SparrowVQE: Visual Question Explanation for Course Content Understanding
SparrowVQE: Visual Question Explanation for Course Content Understanding
Jialu Li
Manish Kumar Thota
Ruslan Gokhman
Radek Holik
Youshan Zhang
103
1
0
12 Nov 2024
DELIFT: Data Efficient Language model Instruction Fine Tuning
DELIFT: Data Efficient Language model Instruction Fine Tuning
Ishika Agarwal
Krishnateja Killamsetty
Lucian Popa
Marina Danilevksy
ALMVLM
138
4
0
07 Nov 2024
Towards Optimizing SQL Generation via LLM Routing
Towards Optimizing SQL Generation via LLM Routing
Mohammadhossein Malekpour
Nour Shaheen
Foutse Khomh
Amine Mhedhbi
AI4TS
76
3
0
06 Nov 2024
From Medprompt to o1: Exploration of Run-Time Strategies for Medical
  Challenge Problems and Beyond
From Medprompt to o1: Exploration of Run-Time Strategies for Medical Challenge Problems and Beyond
Harsha Nori
Naoto Usuyama
Nicholas King
S. McKinney
Xavier Fernandes
Sheng Zhang
Eric Horvitz
LRMLM&MAELMVLM
107
13
0
06 Nov 2024
Scaling LLM Inference with Optimized Sample Compute Allocation
Scaling LLM Inference with Optimized Sample Compute Allocation
Kexun Zhang
Shang Zhou
Danqing Wang
William Yang Wang
Lei Li
86
11
0
29 Oct 2024
Are LLM-Judges Robust to Expressions of Uncertainty? Investigating the effect of Epistemic Markers on LLM-based Evaluation
Are LLM-Judges Robust to Expressions of Uncertainty? Investigating the effect of Epistemic Markers on LLM-based Evaluation
Dongryeol Lee
Yerin Hwang
Yongil Kim
Joonsuk Park
Kyomin Jung
ELM
155
10
0
28 Oct 2024
Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs
Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs
Chris Yuhao Liu
Liang Zeng
Qingbin Liu
Rui Yan
Jujie He
Chaojie Wang
Shuicheng Yan
Yang Liu
Yahui Zhou
AI4TS
113
116
0
24 Oct 2024
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment
Mingzhi Wang
Chengdong Ma
Qizhi Chen
Linjian Meng
Yang Han
Jiancong Xiao
Zhaowei Zhang
Jing Huo
Weijie Su
Yaodong Yang
137
9
0
22 Oct 2024
An Electoral Approach to Diversify LLM-based Multi-Agent Collective
  Decision-Making
An Electoral Approach to Diversify LLM-based Multi-Agent Collective Decision-Making
Xiutian Zhao
Ke Wang
Wei Peng
98
4
0
19 Oct 2024
Sensitivity of Generative VLMs to Semantically and Lexically Altered
  Prompts
Sensitivity of Generative VLMs to Semantically and Lexically Altered Prompts
Sri Harsha Dumpala
Aman Jaiswal
Chandramouli Shama Sastry
E. Milios
Sageev Oore
Hassan Sajjad
VLM
72
2
0
16 Oct 2024
LLM Chain Ensembles for Scalable and Accurate Data Annotation
LLM Chain Ensembles for Scalable and Accurate Data Annotation
David Farr
Nico Manzonelli
Iain Cruickshank
Kate Starbird
Jevin West
58
5
0
16 Oct 2024
Exploring Model Kinship for Merging Large Language Models
Exploring Model Kinship for Merging Large Language Models
Yedi Hu
Yunzhi Yao
N. Zhang
Shumin Deng
Ningyu Zhang
MoMe
152
1
0
16 Oct 2024
Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence
Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence
Shangbin Feng
Zifeng Wang
Yike Wang
Sayna Ebrahimi
Hamid Palangi
...
Nathalie Rauschmayr
Yejin Choi
Yulia Tsvetkov
Chen-Yu Lee
Tomas Pfister
MoMe
103
9
0
15 Oct 2024
G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks
G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks
Guibin Zhang
Xinfeng Li
Xiangguo Sun
Guancheng Wan
Miao Yu
Sihang Li
Kun Wang
Dawei Cheng
Dawei Cheng
AAMLAI4CE
201
20
0
15 Oct 2024
Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
Noam Razin
Sadhika Malladi
Adithya Bhaskar
Danqi Chen
Sanjeev Arora
Boris Hanin
216
35
0
11 Oct 2024
Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles
Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles
Buu Phan
Brandon Amos
Itai Gat
Marton Havasi
Matthew Muckley
Karen Ullrich
130
2
0
11 Oct 2024
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
Shenao Zhang
Zhihan Liu
Boyi Liu
Yanzhe Zhang
Yingxiang Yang
Yunxing Liu
Liyu Chen
Tao Sun
Ziyi Wang
189
3
0
10 Oct 2024
Self-Boosting Large Language Models with Synthetic Preference Data
Self-Boosting Large Language Models with Synthetic Preference Data
Qingxiu Dong
Li Dong
Xingxing Zhang
Zhifang Sui
Furu Wei
SyDa
92
12
0
09 Oct 2024
Efficient Few-shot Learning for Multi-label Classification of Scientific
  Documents with Many Classes
Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes
Tim Schopf
Alexander Blatzheim
Nektarios Machner
Florian Matthes
VLM
73
2
0
08 Oct 2024
As Simple as Fine-tuning: LLM Alignment via Bidirectional Negative
  Feedback Loss
As Simple as Fine-tuning: LLM Alignment via Bidirectional Negative Feedback Loss
Xin Mao
Feng-Lin Li
Huimin Xu
Wei Zhang
Wang Chen
Anh Tuan Luu
81
2
0
07 Oct 2024
Enhancing Future Link Prediction in Quantum Computing Semantic Networks
  through LLM-Initiated Node Features
Enhancing Future Link Prediction in Quantum Computing Semantic Networks through LLM-Initiated Node Features
Gilchan Park
Paul Baity
Byung-Jun Yoon
Adolfy Hoisie
108
0
0
05 Oct 2024
RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
Hanyang Zhao
Genta Indra Winata
Anirban Das
Shi-Xiong Zhang
D. Yao
Wenpin Tang
Sambit Sahu
109
9
0
05 Oct 2024
Cut the Crap: An Economical Communication Pipeline for LLM-based
  Multi-Agent Systems
Cut the Crap: An Economical Communication Pipeline for LLM-based Multi-Agent Systems
Guibin Zhang
Xinfeng Li
Zhixun Li
Sukwon Yun
Guancheng Wan
Kun Wang
Dawei Cheng
Jeffrey Xu Yu
Tianlong Chen
92
20
0
03 Oct 2024
Efficiently Deploying LLMs with Controlled Risk
Efficiently Deploying LLMs with Controlled Risk
Michael J. Zellinger
Matt Thomson
97
1
0
03 Oct 2024
Determine-Then-Ensemble: Necessity of Top-k Union for Large Language Model Ensembling
Determine-Then-Ensemble: Necessity of Top-k Union for Large Language Model Ensembling
Yuxuan Yao
Han Wu
Mingyang Liu
Sichun Luo
Xiongwei Han
Jie Liu
Zhijiang Guo
Linqi Song
107
7
0
03 Oct 2024
Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment
Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment
Yifan Zhang
Ge Zhang
Yue Wu
Kangping Xu
Quanquan Gu
105
2
0
03 Oct 2024
Seeing Eye to AI: Human Alignment via Gaze-Based Response Rewards for Large Language Models
Seeing Eye to AI: Human Alignment via Gaze-Based Response Rewards for Large Language Models
Angela Lopez-Cardona
Carlos Segura
Alexandros Karatzoglou
Sergi Abadal
Ioannis Arapakis
ALM
171
4
0
02 Oct 2024
Beyond Scalar Reward Model: Learning Generative Judge from Preference
  Data
Beyond Scalar Reward Model: Learning Generative Judge from Preference Data
Ziyi Ye
Xiangsheng Li
Qiuchi Li
Qingyao Ai
Yujia Zhou
Wei Shen
Dong Yan
Yiqun Liu
123
17
0
01 Oct 2024
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling
  Large Language Models
RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
Shuhao Chen
Weisen Jiang
Baijiong Lin
James T. Kwok
Yu Zhang
RALMMQ
113
13
0
30 Sep 2024
Efficient Pointwise-Pairwise Learning-to-Rank for News Recommendation
Efficient Pointwise-Pairwise Learning-to-Rank for News Recommendation
Nithish Kannen
Yao Ma
Gerrit J. J. van den Burg
Jean Baptiste Faddoul
122
1
0
26 Sep 2024
Eagle: Efficient Training-Free Router for Multi-LLM Inference
Eagle: Efficient Training-Free Router for Multi-LLM Inference
Zesen Zhao
Shuowei Jin
Z. Morley Mao
64
5
0
23 Sep 2024
Direct Judgement Preference Optimization
Direct Judgement Preference Optimization
Peifeng Wang
Austin Xu
Yilun Zhou
Caiming Xiong
Shafiq Joty
ELM
109
13
0
23 Sep 2024
RRM: Robust Reward Model Training Mitigates Reward Hacking
RRM: Robust Reward Model Training Mitigates Reward Hacking
Tianqi Liu
Wei Xiong
Jie Jessie Ren
Lichang Chen
Junru Wu
...
Yuan Liu
Bilal Piot
Abe Ittycheriah
Aviral Kumar
Mohammad Saleh
AAML
93
23
0
20 Sep 2024
Aligning Language Models Using Follow-up Likelihood as Reward Signal
Aligning Language Models Using Follow-up Likelihood as Reward Signal
Chen Zhang
Dading Chong
Feng Jiang
Chengguang Tang
Anningzhe Gao
Guohua Tang
Haizhou Li
ALM
105
2
0
20 Sep 2024
AIPO: Improving Training Objective for Iterative Preference Optimization
AIPO: Improving Training Objective for Iterative Preference Optimization
Yaojie Shen
Xinyao Wang
Yulei Niu
Ying Zhou
Lexin Tang
Libo Zhang
Fan Chen
Longyin Wen
89
2
0
13 Sep 2024
Semi-Supervised Reward Modeling via Iterative Self-Training
Semi-Supervised Reward Modeling via Iterative Self-Training
Yifei He
Haoxiang Wang
Ziyan Jiang
Alexandros Papangelis
Han Zhao
OffRL
102
4
0
10 Sep 2024
What is the Role of Small Models in the LLM Era: A Survey
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
248
32
0
10 Sep 2024
LLM-based multi-agent poetry generation in non-cooperative environments
LLM-based multi-agent poetry generation in non-cooperative environments
Ran Zhang
Steffen Eger
LLMAG
90
7
0
05 Sep 2024
Towards a Unified View of Preference Learning for Large Language Models:
  A Survey
Towards a Unified View of Preference Learning for Large Language Models: A Survey
Bofei Gao
Feifan Song
Yibo Miao
Zefan Cai
Zhiyong Yang
...
Houfeng Wang
Zhifang Sui
Peiyi Wang
Baobao Chang
Baobao Chang
155
14
0
04 Sep 2024
Broadening Access to Simulations for End-Users via Large Language
  Models: Challenges and Opportunities
Broadening Access to Simulations for End-Users via Large Language Models: Challenges and Opportunities
Philippe J. Giabbanelli
Jose J. Padilla
Ameeta Agrawal
80
2
0
03 Sep 2024
User-Driven Value Alignment: Understanding Users' Perceptions and
  Strategies for Addressing Biased and Discriminatory Statements in AI
  Companions
User-Driven Value Alignment: Understanding Users' Perceptions and Strategies for Addressing Biased and Discriminatory Statements in AI Companions
Xianzhe Fan
Qing Xiao
Xuhui Zhou
Jiaxin Pei
Maarten Sap
Zhicong Lu
Hong Shen
135
8
0
01 Sep 2024
Critique-out-Loud Reward Models
Critique-out-Loud Reward Models
Zachary Ankner
Mansheej Paul
Brandon Cui
Jonathan D. Chang
Prithviraj Ammanabrolu
ALMLRM
108
38
0
21 Aug 2024
Plug, Play, and Fuse: Zero-Shot Joint Decoding via Word-Level Re-ranking
  Across Diverse Vocabularies
Plug, Play, and Fuse: Zero-Shot Joint Decoding via Word-Level Re-ranking Across Diverse Vocabularies
Sai Koneru
Matthias Huck
M. Exel
Jan Niehues
65
0
0
21 Aug 2024
Automatic Metrics in Natural Language Generation: A Survey of Current
  Evaluation Practices
Automatic Metrics in Natural Language Generation: A Survey of Current Evaluation Practices
Patrícia Schmidtová
Saad Mahamood
Simone Balloccu
Ondřej Dušek
Albert Gatt
Dimitra Gkatzia
David M. Howcroft
Ondřej Plátek
Adarsa Sivaprasad
80
5
0
17 Aug 2024
PEDAL: Enhancing Greedy Decoding with Large Language Models using
  Diverse Exemplars
PEDAL: Enhancing Greedy Decoding with Large Language Models using Diverse Exemplars
Sumanth Prabhu
89
1
0
16 Aug 2024
Previous
12345
Next