Big Data Analytics in Healthcare: Applications for Pandemic Forecastin
DOI:
https://doi.org/10.15662/IJARCST.2020.0301002Keywords:
Big Data, Healthcare Analytics, Pandemic Forecasting, Disease Surveillance, Predictive Modeling, Machine Learning, Real-Time Analytics, Electronic Health Records, Syndromic Surveillance, Pre-2019 LiteratureAbstract
Big data analytics has revolutionized many industries, and in healthcare, it offers pivotal capabilities— especially in the context of pandemic forecasting. This study explores the role of diverse big data sources—electronic health records, social media feeds, syndromic surveillance data, mobile location data, and environmental metrics—in forecasting pandemics. The paper’s objective is to synthesize pre-2019 methodologies and evidence supporting predictive modeling for early detection, trend analysis, and resource planning. We review how machine learning techniques (including regression, decision trees, clustering, and time-series analysis) harness volume, velocity, and variety of healthcare data to create accurate predictions of disease outbreaks. We also examine system architecture workflows that preprocess, integrate, train, and evaluate models for actionable insights. Key findings from literature before 2019 reveal that real-time analytics significantly enhances outbreak lead time, improves geographical granularity in forecasts, and supports efficient allocation of medical resources. Advantages include improved timeliness, scalability, and cost-effectiveness; disadvantages include data heterogeneity, privacy risks, and algorithmic bias. The results and discussion section consolidates empirical evidence of performance metrics (e.g., accuracy, lead-time gains), addresses implementation challenges, and highlights implications for public health policy. Finally, the paper concludes by reaffirming the crucial role of big data analytics in pandemic preparedness and forecasting, and proposes several avenues for future research—such as integrating genomic surveillance, enhancing data interoperability, leveraging deep learning approaches, and strengthening ethical frameworks. Together, this work underscores how pre-2019 advances in data analytics provide a foundation for more resilient pandemic forecasting systems.
References
1. Ginsberg, J., Mohebbi, M. H., Patel, R. S., Brammer, L., et al. (2009). Detecting influenza epidemics using search engine query data. Nature, 457(7232), 1012–1014.
2. Brownstein, J. S., Freifeld, C. C., & Madoff, L. C. (2009). Digital disease detection—harnessing the web for public health surveillance. New England Journal of Medicine, 360(21), 2153–2157.
3. Henning, K. J. (2004). What is syndromic surveillance? MMWR Supplements, 53, 5–11.
4. Jagadeesh, S., & Sugumar, R. (2017). Optimal knowledge extraction system based on GSA and AANN. International Journal of Control Theory and Applications, 10(12), 153–162.
5. Murugeshwari, B., & Sujatha, R. (2014). Preservation of Privacy for Multiparty Computation System with Homomorphic Encryption. International Journal of Emerging Technology and Advanced Engineering, 4(3), 530–535.
6. Vimal Raja, G. (2021). Mining Customer Sentiments from Financial Feedback and Reviews using Data Mining Algorithms. International Journal of Innovative Research in Computer and Communication Engineering, 9(12), 14705–14710.
7. Krishnamurthy, P., & Willis, A. (2016). Identifying disease hotspots using mobile phone data—A clustering approach. IEEE Journal of Biomedical and Health Informatics, 20(4), 1035–1043.
8. Potel, R. (2019). A Real-Time Analytics Architecture for Enterprise Order Lifecycle Visibility and Backlog Management. International Journal of Research and Applied Innovations, 2(6), 2460–2469.
9. Kumar, J. (2013). Preservation of the Privacy for Multiple Custodian Systems with Rule Sharing. Journal of Computer Science.
10. Pushparathi, V. G., Sudha, M., David, D. J., Anbazhagan, K., & Vethamani, S. E. (2020). A Continuous Decision Based Multi Kernel Median Filter for Noise Removal on Brain MRI Images. Advanced Imaging, 1(3), 5.
11. Garg, V. K., Soundappan, S. J., & Kaur, E. M. (2020). Enhancement in intrusion detection system for WLAN using genetic algorithms. South Asian Research Journal of Engineering and Technology, 2(6), 62–64. https://doi.org/10.36346/sarjet.2020.v02i06.003
12. Ranjith Rajasekharan. (2019). Hybrid cloud architecture for enterprise database system. International Journal of Science, Research and Technology (IJSRAT), 2(6), 2513–251.
13. Santhoshini, G., & Anbazhagan, K. (2014, February). An object based software tool for software measurement. In International Conference on Information Communication and Embedded Systems (ICICES2014) (pp. 1–5). IEEE.
14. Deivendran, P., Anbazhagan, K., Sailaja, P., Sujatha, E., Babu, M. R., & Sudhakar, S. (2020). Scalability service in data center persistent storage allocation using virtual machines. International Journal of Scientific & Technology Research, 9(02), 2135–2139.
15. Watham, S. D., & Vimal, V. R. (2013). Design and Implementation of Data Sanitization Technique For Effective Filtering With Enhanced Medical Support System in Cloud Architecture Diagram. International Journal of Emerging Technology and Advanced Engineering, 3(12), 471–473.
16. Rajurkar, P. (2018). Process integration strategies for reducing hazardous waste in membrane-based chlor-alkali production. International Journal of Innovative Research in Science, Engineering and Technology, 7(3), 3001–3009.
17. Murugeshwari, B., Amirthavalli, R., Sri, C. B., & Pari, S. N. (2023). Hybrid key authentication scheme for privacy over adhoc communication. arXiv preprint arXiv:2304.14652.
18. Hulth, A., Rydevik, G., & Linde, A. (2009). Web queries as a source for syndromic surveillance. PLoS ONE, 4(2), e4378.
19. Jayaraman, S., Rajendran, S., & P, S. P. (2019). Fuzzy c-means clustering and elliptic curve cryptography using privacy preserving in cloud. International Journal of Business Intelligence and Data Mining, 15(3), 273–287.
20. Wong, A., Delamater, P. L., et al. (2013). Real-time surveillance of pneumonia presentations using syndromic emergency department data. Journal of Biomedical Informatics, 46(2), 387–394.
21. Yang, W., Lipsitch, M., & Shaman, J. (2015). Inference of seasonal infectious disease transmission dynamics. Proceedings of the National Academy of Sciences, 112(9), 2729–2734.


