Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets

Although Generative Flow Networks (GFlowNets) are designed to capture multiple modes of a reward function, they often suffer from mode collapse in practice, getting trapped in early discovered modes and requiring prolonged training to find diverse solutions. Existing exploration techniques may rely on heuristic novelty signals. We propose Loss-Guided GFlowNets (LGGFN), a novel approach where an auxiliary GFlowNet's exploration is directly driven by the main GFlowNet's training loss. By prioritizing trajectories where the main model exhibits high loss, LGGFN focuses sampling on poorly understood regions of the state space. This targeted exploration significantly accelerates the discovery of diverse, high-reward samples. Empirically, across various benchmarks including grid environments, structured sequence generation, and Bayesian structure learning, LGGFN consistently enhances exploration efficiency and sample diversity compared to baselines. For instance, on a challenging sequence generation task, it discovered over 40 times more unique valid modes while simultaneously reducing the exploration error metric by approximately 99\%.
View on arXiv@article{malek2025_2505.15251, title={ Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets }, author={ Idriss Malek and Abhijit Sharma and Salem Lahlou }, journal={arXiv preprint arXiv:2505.15251}, year={ 2025 } }