ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.02561
  4. Cited By
LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and
  Generative Fusion
v1v2v3 (latest)

LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion

5 June 2023
Dongfu Jiang
Xiang Ren
Bill Yuchen Lin
    ELM
ArXiv (abs)PDFHTML

Papers citing "LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion"

50 / 240 papers shown
Title
A Short Survey on Small Reasoning Models: Training, Inference, Applications and Research Directions
A Short Survey on Small Reasoning Models: Training, Inference, Applications and Research Directions
Chengyu Wang
Taolin Zhang
Richang Hong
Jun Huang
ReLMLRM
105
2
0
12 Apr 2025
SEE: Continual Fine-tuning with Sequential Ensemble of Experts
SEE: Continual Fine-tuning with Sequential Ensemble of Experts
Zhilin Wang
Yafu Li
Xiaoye Qu
Yu Cheng
CLLKELM
126
0
0
09 Apr 2025
Learning Lie Group Generators from Trajectories
Learning Lie Group Generators from Trajectories
Lifan Hu
151
9
0
04 Apr 2025
Inference-Time Scaling for Generalist Reward Modeling
Inference-Time Scaling for Generalist Reward Modeling
Zijun Liu
P. Wang
Ran Xu
Shirong Ma
Chong Ruan
Ziwei Sun
Yang Liu
Y. Wu
OffRLLRM
201
54
0
03 Apr 2025
HERA: Hybrid Edge-cloud Resource Allocation for Cost-Efficient AI Agents
HERA: Hybrid Edge-cloud Resource Allocation for Cost-Efficient AI Agents
Shiyi Liu
Haiying Shen
Shuai Che
Mahdi Ghandi
Mingqin Li
LLMAG
176
0
0
01 Apr 2025
Enhancing Image Resolution of Solar Magnetograms: A Latent Diffusion Model Approach
Enhancing Image Resolution of Solar Magnetograms: A Latent Diffusion Model Approach
Francesco P. Ramunno
Paolo Massa
Vitaliy Kinakh
Brandon Panos
A. Csillaghy
Slava Voloshynovskiy
DiffM
107
0
0
31 Mar 2025
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
Wei Shen
Guanlin Liu
Zheng Wu
Ruofei Zhu
Qingping Yang
Chao Xin
Yu Yue
Lin Yan
156
14
0
28 Mar 2025
LeForecast: Enterprise Hybrid Forecast by Time Series Intelligence
LeForecast: Enterprise Hybrid Forecast by Time Series Intelligence
Zheng Tan
Yiwen Nie
Wenfa Wu
Guanyu Zhang
Yanze Liu
...
Chao Yang
Jiaxuan Fan
Yuan He
Hongsheng Qi
Yangzhou Du
AI4TS
89
0
0
27 Mar 2025
LLM-based Agent Simulation for Maternal Health Interventions: Uncertainty Estimation and Decision-focused Evaluation
LLM-based Agent Simulation for Maternal Health Interventions: Uncertainty Estimation and Decision-focused Evaluation
Sarah Martinson
Lingkai Kong
Cheol Woo Kim
Aparna Taneja
Milind Tambe
69
0
0
25 Mar 2025
Improving the End-to-End Efficiency of Offline Inference for Multi-LLM Applications Based on Sampling and Simulation
Improving the End-to-End Efficiency of Offline Inference for Multi-LLM Applications Based on Sampling and Simulation
Jingzhi Fang
Yanyan Shen
Yijiao Wang
Lei Chen
75
2
0
21 Mar 2025
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
Boyu Chen
Zhengrong Yue
Siran Chen
Zehua Wang
Yang Liu
Ziwei Sun
Yansen Wang
VLM
477
2
0
13 Mar 2025
Ensemble Learning for Large Language Models in Text and Code Generation: A Survey
Ensemble Learning for Large Language Models in Text and Code Generation: A Survey
Mari Ashiga
Wei Jie
Fan Wu
Vardan K. Voskanyan
Fateme Dinmohammadi
P. Brookes
Jingzhi Gong
Zheng Wang
102
0
0
13 Mar 2025
SCE: Scalable Consistency Ensembles Make Blackbox Large Language Model Generation More Reliable
Jiaxin Zhang
Zechao Li
Wendi Cui
Kamalika Das
Bradley Malin
Sricharan Kumar
111
0
0
13 Mar 2025
Fine-Tuning Diffusion Generative Models via Rich Preference Optimization
Fine-Tuning Diffusion Generative Models via Rich Preference Optimization
Hanyang Zhao
Haoxian Chen
Yucheng Guo
Genta Indra Winata
Tingting Ou
Ziyu Huang
D. Yao
Wenpin Tang
136
0
0
13 Mar 2025
G-Boost: Boosting Private SLMs with General LLMs
Yijiang Fan
Yuren Mao
Longbin Lai
Ying Zhang
Zhengping Qian
Yunjun Gao
70
0
0
13 Mar 2025
LuciBot: Automated Robot Policy Learning from Generated Videos
Xiaowen Qiu
Yian Wang
Jiting Cai
Zhehuan Chen
Chunru Lin
Tsun-Hsuan Wang
Chuang Gan
LM&RoVGen
127
1
0
12 Mar 2025
Life-Cycle Routing Vulnerabilities of LLM Router
Qiqi Lin
Xiaoyang Ji
Shengfang Zhai
Qingni Shen
Zhi-Li Zhang
Yuejian Fang
Yansong Gao
AAML
90
1
0
09 Mar 2025
RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs
RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs
Zhongzhan Huang
Guoming Ling
Vincent S. Liang
Yupei Lin
Yandong Chen
Shanshan Zhong
Hefeng Wu
LRM
226
7
0
08 Mar 2025
SHAPE : Self-Improved Visual Preference Alignment by Iteratively Generating Holistic Winner
Kejia Chen
Jiawen Zhang
Jiacong Hu
Jiazhen Yang
Jian Lou
Zunlei Feng
Mingli Song
134
0
0
06 Mar 2025
CoServe: Efficient Collaboration-of-Experts (CoE) Model Inference with Limited Memory
CoServe: Efficient Collaboration-of-Experts (CoE) Model Inference with Limited Memory
Jiashun Suo
Xiaojian Liao
Limin Xiao
Li Ruan
Jinquan Wang
Xiao Su
Zhisheng Huo
114
0
0
04 Mar 2025
Distributionally Robust Reinforcement Learning with Human Feedback
Debmalya Mandal
Paulius Sasnauskas
Goran Radanović
108
3
0
01 Mar 2025
Digital Player: Evaluating Large Language Models based Human-like Agent in Games
Digital Player: Evaluating Large Language Models based Human-like Agent in Games
Jinqiao Wang
Kai Wang
Shaojie Lin
Runze Wu
Bihan Xu
...
Zhipeng Hu
Z. Fan
Le Li
Tangjie Lyu
Changjie Fan
LLMAGELMAI4CE
137
1
0
28 Feb 2025
Can Textual Gradient Work in Federated Learning?
Can Textual Gradient Work in Federated Learning?
Minghui Chen
Ruinan Jin
Wenlong Deng
Yuanyuan Chen
Zhi Huang
Han Yu
Xiaoxiao Li
FedML
179
6
0
27 Feb 2025
Weaker LLMs' Opinions Also Matter: Mixture of Opinions Enhances LLM's Mathematical Reasoning
Weaker LLMs' Opinions Also Matter: Mixture of Opinions Enhances LLM's Mathematical Reasoning
Yanan Chen
Ali Pesaranghader
Tanmana Sadhu
LRM
120
0
0
26 Feb 2025
Multi-LLM Collaborative Search for Complex Problem Solving
Multi-LLM Collaborative Search for Complex Problem Solving
Sen Yang
Yafu Li
Wai Lam
Yu Cheng
LLMAGLRM
115
2
0
26 Feb 2025
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
Hao Peng
Yunjia Qi
Xiaozhi Wang
Zijun Yao
Bin Xu
Lei Hou
Juanzi Li
ALMLRM
101
7
0
26 Feb 2025
Harnessing Multiple Large Language Models: A Survey on LLM Ensemble
Harnessing Multiple Large Language Models: A Survey on LLM Ensemble
Zhijun Chen
Jingzheng Li
Pengpeng Chen
Zhuoran Li
Kai Sun
Yuankai Luo
Qianren Mao
Dingqi Yang
Hailong Sun
Philip S. Yu
ELM
134
20
0
25 Feb 2025
Improving LLM General Preference Alignment via Optimistic Online Mirror Descent
Improving LLM General Preference Alignment via Optimistic Online Mirror Descent
Yuheng Zhang
Dian Yu
Tao Ge
Linfeng Song
Zhichen Zeng
Haitao Mi
Nan Jiang
Dong Yu
134
4
0
24 Feb 2025
Uncertainty-Aware Fusion: An Ensemble Framework for Mitigating Hallucinations in Large Language Models
Prasenjit Dey
Srujana Merugu
Sivaramakrishnan Kaveri
HILM
48
0
0
22 Feb 2025
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters
Teng Xiao
Yige Yuan
Ziyang Chen
Mingxiao Li
Shangsong Liang
Zhaochun Ren
V. Honavar
264
11
0
21 Feb 2025
Faster WIND: Accelerating Iterative Best-of-$N$ Distillation for LLM Alignment
Faster WIND: Accelerating Iterative Best-of-NNN Distillation for LLM Alignment
Tong Yang
Jincheng Mei
H. Dai
Zixin Wen
Shicong Cen
Dale Schuurmans
Yuejie Chi
Bo Dai
120
4
0
20 Feb 2025
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
Shicong Cen
Jincheng Mei
Katayoon Goshvadi
Hanjun Dai
Tong Yang
Sherry Yang
Dale Schuurmans
Yuejie Chi
Bo Dai
OffRL
152
37
0
20 Feb 2025
Rethinking Diverse Human Preference Learning through Principal Component Analysis
Rethinking Diverse Human Preference Learning through Principal Component Analysis
Feng Luo
Rui Yang
Hao Sun
Chunyuan Deng
Jiarui Yao
Jingyan Shen
Huan Zhang
Hanjie Chen
29
1
0
18 Feb 2025
Atom of Thoughts for Markov LLM Test-Time Scaling
Atom of Thoughts for Markov LLM Test-Time Scaling
Fengwei Teng
Zhaoyang Yu
Quan Shi
Jiayi Zhang
Chenglin Wu
Yuyu Luo
MULRM
134
23
0
17 Feb 2025
EvoFlow: Evolving Diverse Agentic Workflows On The Fly
EvoFlow: Evolving Diverse Agentic Workflows On The Fly
Guibin Zhang
Kaijie Chen
Guancheng Wan
Heng Chang
Hong Cheng
Kaidi Wang
Shuyue Hu
Lei Bai
253
6
0
11 Feb 2025
KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems
KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems
Jusheng Zhang
Zimeng Huang
Yijia Fan
Ningyuan Liu
Mingyan Li
Zhuojie Yang
Jiawei Yao
Jian Wang
Keze Wang
62
1
0
11 Feb 2025
Multi-agent Architecture Search via Agentic Supernet
Multi-agent Architecture Search via Agentic Supernet
Guibin Zhang
Luyang Niu
Sihang Li
Kaidi Wang
Lei Bai
Xinyu Wang
216
16
0
06 Feb 2025
COSMosFL: Ensemble of Small Language Models for Fault Localisation
COSMosFL: Ensemble of Small Language Models for Fault Localisation
Hyunjoon Cho
Sungmin Kang
Gabin An
S. Yoo
98
1
0
05 Feb 2025
Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?
Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?
Wenzhe Li
Yong Lin
Mengzhou Xia
Chi Jin
MoE
148
4
0
02 Feb 2025
Ensembles of Low-Rank Expert Adapters
Ensembles of Low-Rank Expert Adapters
Yinghao Li
Vianne Gao
Chao Zhang
MohamadAli Torkamani
169
0
0
31 Jan 2025
DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance
DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance
Seffi Cohen
Niv Goldshlager
Nurit Cohen-Inger
Bracha Shapira
Lior Rokach
150
1
0
29 Jan 2025
Faster Machine Translation Ensembling with Reinforcement Learning and Competitive Correction
Kritarth Prasad
Mohammadi Zaki
Pratik Rakesh Singh
Pankaj Wasnik
61
1
0
28 Jan 2025
From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning
From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning
Yafu Li
Zhilin Wang
Tingchen Fu
Ganqu Cui
Sen Yang
Yu Cheng
93
4
0
21 Jan 2025
Multi-Agent Collaboration Mechanisms: A Survey of LLMs
Multi-Agent Collaboration Mechanisms: A Survey of LLMs
Khanh-Tung Tran
Dung Dao
Minh-Duong Nguyen
Quoc-Viet Pham
Barry O'Sullivan
Hoang D. Nguyen
LLMAG
151
56
0
10 Jan 2025
Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model
Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model
Yueqin Yin
Shentao Yang
Yujia Xie
Ziyi Yang
Yuting Sun
Hany Awadalla
Weizhu Chen
Mingyuan Zhou
119
2
0
07 Jan 2025
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Bradley Brown
Jordan Juravsky
Ryan Ehrlich
Ronald Clark
Quoc V. Le
Christopher Ré
Azalia Mirhoseini
ALMLRM
304
331
0
03 Jan 2025
AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models
AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models
Mansi
Pranshu Pandya
Mahek Bhavesh Vora
Soumya Bharadwaj
Ashish Anand
79
0
0
31 Dec 2024
Ensembling Large Language Models with Process Reward-Guided Tree Search
  for Better Complex Reasoning
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
Sungjin Park
Xiao Liu
Yeyun Gong
Edward Choi
LRM
99
10
0
20 Dec 2024
Dipper: Diversity in Prompts for Producing Large Language Model
  Ensembles in Reasoning tasks
Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks
Gregory Kang Ruey Lau
Wenyang Hu
Diwen Liu
Jizhuo Chen
Szu Hui Ng
Bryan Kian Hsiang Low
LRMAI4CE
125
8
0
12 Dec 2024
PickLLM: Context-Aware RL-Assisted Large Language Model Routing
PickLLM: Context-Aware RL-Assisted Large Language Model Routing
Dimitrios Sikeridis
Dennis Ramdass
Pranay Pareek
152
3
0
12 Dec 2024
Previous
12345
Next