Big Data Analytics in Healthcare: Applications for Pandemic Forecastin

Authors

  • Masti Venkatesha Iyengar BIET, Davanagere, Karnataka, India Author

DOI:

https://doi.org/10.15662/IJARCST.2020.0301002

Keywords:

Big Data, Healthcare Analytics, Pandemic Forecasting, Disease Surveillance, Predictive Modeling, Machine Learning, Real-Time Analytics, Electronic Health Records, Syndromic Surveillance, Pre-2019 Literature

Abstract

Big data analytics has revolutionized many industries, and in healthcare, it offers pivotal capabilities— especially in the context of pandemic forecasting. This study explores the role of diverse big data sources—electronic health records, social media feeds, syndromic surveillance data, mobile location data, and environmental metrics—in forecasting pandemics. The paper’s objective is to synthesize pre-2019 methodologies and evidence supporting predictive modeling for early detection, trend analysis, and resource planning. We review how machine learning techniques (including regression, decision trees, clustering, and time-series analysis) harness volume, velocity, and variety of healthcare data to create accurate predictions of disease outbreaks. We also examine system architecture workflows that preprocess, integrate, train, and evaluate models for actionable insights. Key findings from literature before 2019 reveal that real-time analytics significantly enhances outbreak lead time, improves geographical granularity in forecasts, and supports efficient allocation of medical resources. Advantages include improved timeliness, scalability, and cost-effectiveness; disadvantages include data heterogeneity, privacy risks, and algorithmic bias. The results and discussion section consolidates empirical evidence of performance metrics (e.g., accuracy, lead-time gains), addresses implementation challenges, and highlights implications for public health policy. Finally, the paper concludes by reaffirming the crucial role of big data analytics in pandemic preparedness and forecasting, and proposes several avenues for future research—such as integrating genomic surveillance, enhancing data interoperability, leveraging deep learning approaches, and strengthening ethical frameworks. Together, this work underscores how pre-2019 advances in data analytics provide a foundation for more resilient pandemic forecasting systems.

References

1. Ginsberg, J., Mohebbi, M.H., Patel, R.S., Brammer, L., et al. (2009). Detecting influenza epidemics using search engine query data. Nature, 457(7232), 1012–1014.

2. Brownstein, J.S., Freifeld, C.C., Madoff, L.C. (2009). Digital disease detection — harnessing the web for public health surveillance. New England Journal of Medicine, 360(21), 2153–2157.

3. Wong, A., Delamater, P.L., et al. (2013). Real-time surveillance of pneumonia presentations using syndromic emergency department data. Journal of Biomedical Informatics, 46(2), 387–394.

4. Yang, W., Lipsitch, M., Shaman, J. (2015). Inference of seasonal infectious disease transmission dynamics. Proceedings of the National Academy of Sciences, 112(9), 2729–2734.

5. Henning, K.J. (2004). What is syndromic surveillance? MMWR Supplements, 53, 5–11.

6. Krishnamurthy, P., and Willis, A. (2016). Identifying disease hotspots using mobile phone data—A clustering approach. IEEE Journal of Biomedical and Health Informatics, 20(4), 1035–1043.

7. Hulth, A., Rydevik, G., Linde, A. (2009). Web queries as a source for syndromic surveillance. PLoS ONE, 4(2): e4378.

Downloads

Published

2020-01-01

How to Cite

Big Data Analytics in Healthcare: Applications for Pandemic Forecastin. (2020). International Journal of Advanced Research in Computer Science & Technology(IJARCST), 3(1), 2248-2253. https://doi.org/10.15662/IJARCST.2020.0301002