Preference Construction: A Bayesian Interactive Preference Elicitation Framework Based on Monte Carlo Tree Search

We present a novel preference learning framework to capture participant preferences efficiently within limited interaction rounds. It involves three main contributions. First, we develop a variational Bayesian approach to infer the participant's preference model by estimating posterior distributions and managing uncertainty from limited information. Second, we propose an adaptive questioning policy that maximizes cumulative uncertainty reduction, formulating questioning as a finite Markov decision process and using Monte Carlo Tree Search to prioritize promising question trajectories. By considering long-term effects and leveraging the efficiency of the Bayesian approach, the policy avoids shortsightedness. Third, we apply the framework to Multiple Criteria Decision Aiding, with pairwise comparison as the preference information and an additive value function as the preference model. We integrate the reparameterization trick to address high-variance issues, enhancing robustness and efficiency. Computational studies on real-world and synthetic datasets demonstrate the framework's practical usability, outperforming baselines in capturing preferences and achieving superior uncertainty reduction within limited interactions.
View on arXiv@article{wang2025_2503.15150, title={ Preference Construction: A Bayesian Interactive Preference Elicitation Framework Based on Monte Carlo Tree Search }, author={ Yan Wang and Jiapeng Liu and Milosz Kadziński and Xiuwu Liao }, journal={arXiv preprint arXiv:2503.15150}, year={ 2025 } }