
Investigating the effectiveness of multimodal data in forecasting SARS-COV-2 case surges

Main: 19 Pages
10 Figures
1 Table
Abstract

The COVID-19 pandemic response relied heavily on statistical and machine learning models to predict key outcomes such as case prevalence and fatality rates. These predictions were instrumental in enabling timely public health interventions that helped break transmission cycles. While most existing models are grounded in traditional epidemiological data, the potential of alternative datasets, such as those derived from genomic information and human behavior, remains underexplored. In the current study, we investigated the usefulness of diverse modalities of feature sets in predicting case surges. Our results highlight the relative effectiveness of biological (e.g., mutations), public health (e.g., case counts, policy interventions) and human behavioral features (e.g., mobility and social media conversations) in predicting country-level case surges. Importantly, we uncover considerable heterogeneity in predictive performance across countries and feature modalities, suggesting that surge prediction models may need to be tailored to specific national contexts and pandemic phases. Overall, our work highlights the value of integrating alternative data sources into existing disease surveillance frameworks to enhance the prediction of pandemic dynamics.

@article{raghuvamsi2025_2505.22688,
  title={Investigating the effectiveness of multimodal data in forecasting SARS-COV-2 case surges},
  author={Palur Venkata Raghuvamsi and Siyuan Brandon Loh and Prasanta Bhattacharya and Joses Ho and Raphael Lee Tze Chuen and Alvin X. Han and Sebastian Maurer-Stroh},
  journal={arXiv preprint arXiv:2505.22688},
  year={2025}
}