Multiscale Geographically and Temporally Weighted Regression with LASSO and Adaptive LASSO for Tuberculosis Incidence Mapping in West Java

Authors

  • Muhammad Yusuf Al Habsy Department of Mathematics, School of Mathematics and Science, Indonesia Defence University, Bogor 16810, Indonesia
  • Ro'fah Nur Rachmawati Statistics Department, School of Computer Science, Bina Nusantara University, Jakarta 11480, Indonesia
  • Purnomo Husnul Khotimah Research Center for Data and Information Sciences, National Research and Innovation Agency, Bandung 40135, Indonesia
  • Rifani Bhakti Natari Regional Research and Development Agency of Jambi, Jambi 36122, Indonesia
  • Dianadewi Riswantini Research Center for Data and Information Sciences, National Research and Innovation Agency, Bandung 40135, Indonesia
  • Devi Munandar Research Center for Data and Information Sciences, National Research and Innovation Agency, Bandung 40135, Indonesia
  • Muh. Hafizh Izzaturrahim Research Center for Data and Information Sciences, National Research and Innovation Agency, Bandung 40135, Indonesia

DOI:

https://doi.org/10.5614/cbms.2025.8.1.6

Keywords:

disease mapping, spatio-temporal, variabel selection, spatial statistics, kernel function

Abstract

Tuberculosis (TB) is a global health issue caused by Mycobacterium tuberculosis and can affect any organ of the body, especially the lungs. The trend of TB cases varies between regions, and analytic assessment is required to identify the predictor variables. The purpose of this research is to compare the Multiscale Geographically and Temporally Weighted Regression (MGTWR) and the Geographically and Temporally Weighted Regression (GTWR) method, which both use Gaussian, Exponential, Uniform, and Bi-Square kernel functions, to identify significant variables in each region annually. The MGTWR method has the advantage of using a flexible bandwidth for each observation, that results in more accurate coefficient estimates. The sample used was 27 districts and cities in West Java Province, involving 36 variables divided into 5 dimensions, namely global climate, health, demography, population, and government policy, with a time span of 2019?2022. To overcome the problem of multicollinearity, the approach was carried out using the Least Absolute Shrinkage Selection Operator (LASSO) and Adaptive LASSO methods. In determining the best model, the prioritized criteria are to achieve the highest R2, which indicates the optimal level of model fit, as well as the smallest AIC, which indicates the most efficient model goodness of fit. The best model is MGTWR with LASSO variable selection on the Bi-Square kernel. This model has an R2 of 91.25% and the smallest AIC of 139.868. From the best model, each region emerged with a cluster structure affected by various variables from 2019 to 2022, providing an in-depth understanding of TB mapping that can assist in formulating more effective intervention measures.

Author Biographies

Purnomo Husnul Khotimah, Research Center for Data and Information Sciences, National Research and Innovation Agency, Bandung 40135, Indonesia

Researcher at Research Center for Data and Information Sciences, National Research and Innovation Agency (BRIN)

Rifani Bhakti Natari, Regional Research and Development Agency of Jambi, Jambi 36122, Indonesia

Researcher at Regional Research and Development Agency of Jambi, Indonesia

Dianadewi Riswantini, Research Center for Data and Information Sciences, National Research and Innovation Agency, Bandung 40135, Indonesia

Researcher at Research Center for Data and Information Sciences, National Research and Innovation Agency (BRIN)

 

Devi Munandar, Research Center for Data and Information Sciences, National Research and Innovation Agency, Bandung 40135, Indonesia

Researcher at Research Center for Data and Information Sciences, National Research and Innovation Agency (BRIN)

Muh. Hafizh Izzaturrahim, Research Center for Data and Information Sciences, National Research and Innovation Agency, Bandung 40135, Indonesia

Researcher at Research Center for Data and Information Sciences, National Research and Innovation Agency (BRIN)

References

Abdul Rasam, A.R., Jumali, W.N.S., Abdul Jalil, I. and Muhamad Jaelani, L., Susceptibility risk index mapping of population at tuberculosis epidemic risk, Journal of ASIAN Behavioural Studies, 8(24), pp. 53-65, 2023.

Adrianto, S., Balqis, I.H.N., Soetanto, C.Z.N. and Ohyver, M., Cochrane Orcutt method to overcome autocorrelation in modeling factors affecting the number of hotel visitors in Indonesia, Procedia Computer Science, 216, pp. 630-638, 2023.

Al-Hasani, G., Asaduzzaman, M. and Soliman, A.-H., Geographically weighted Poisson regression models with different kernels: Application to road traffic accident data, Communications in Statistics: Case Studies, Data Analysis and Applications, 7(2), pp. 166-181, 2021.

Alene, K.A., Viney, K., Gray, D.J., McBryde, E.S., Wagnew, M. and Clements, A.C.A., Mapping tuberculosis treatment outcomes in Ethiopia, BMC Infectious Diseases, 19(1), 2019.

Araveeporn, A., The higher-order of adaptive lasso and elastic net methods for classification on high dimensional data, Mathematics, 9(10), p. 1091, 2021.

Arghir, I.A., Trenchea, M., Calboreanu, C.L., Ion, I., Fildan, A.P. and Otelea, M.R., The obstructive phenotype of chronic posttuberculosis disease ? is there a role for the occupational exposure?, Romanian Journal of Occupational Medicine, 74(1), pp. 36-40, 2023.

Atmadi, A., Rahayu, T., Wardani, I.K., Elsadi, R.M.C., Milasari, A.E. and Witadi, M.R.A., Insights into latent tuberculosis biomarkers from differential gene expression analysis in CD8 memory cells using secondary data in silico approach, International Journal of Applied Mathematics, Sciences, and Technology for National Defense, 2(3), pp. 133-144, 2024.

Biswas, M.H.A., Samad, S.A., Parvin, T., Islam, M.T. and Supriatna, A.K., Optimal control strategy to reduce the infection of pandemic HIV associated with tuberculosis, Communication in Biomathematical Sciences, 5(1), pp. 20-39, 2022.

Byun, H.G., Lee, N. and Hwang, S.S., A systematic review of spatial and spatio-temporal analyses in public health research in Korea, Journal of Preventive Medicine and Public Health, 54(5), pp. 301-308, 2021.

Chen, X., Emam, M., Zhang, L., Rifhat, R., Zhang, L. and Zheng, Y., Analysis of spatial characteristics and geographic weighted regression of tuberculosis prevalence in Kashgar, China, Preventive Medicine Reports, 35, p. 102362, 2023.

Christina, A., Rachmawati, R.N. and Pusponegoro, N.H., Climate impact on blue economy index: Bayesian spatio-temporal regression with statistical downscaling in Sumatera, Journal of Geospatial Science and Analytics, 1(1), pp. 13-34, 2025.

Comber, A., Hyper-local geographically weighted regression: Extending GWR through local model selection and local bandwidth optimization, Journal of Spatial Information Science, 17, pp. 63-84, 2018.

Courtois, Tubert-Bitter, P. and Ahmed, I., New adaptive lasso approaches for variable selection in automated pharmacovigilance signal detection, BMC Medical Research Methodology, 21(217), pp. 1-17, 2021.

Djuraidah, A., Rachmawati, R.N., Wigena, A.H. and Mangku, I.W., Extreme data analysis using spatio-temporal Bayes regression with INLA in statistical downscaling model, International Journal of Innovative Computing, Information and Control, 17(1), pp. 259-273, 2021.

Fotheringham, A.S., Crespo, R. and Yao, J., Geographical and temporal weighted regression (GTWR): Geographical and temporal weighted regression, Geographical Analysis, 47(4), pp. 431-452, 2015.

Hassan, M.M., Hassan, M.M., Yasmin, F., Khan, M.A.R., Zaman, S., Galibuzzaman, Islam, K.K. and Bairagi, A.K., A comparative assessment of machine learning algorithms with the least absolute shrinkage and selection operator for breast cancer detection and prediction, Decision Analytics Journal, 7, p. 100245, 2023.

He, X., Mai, X. and Shen, G., Poverty and physical geographic factors: An empirical analysis of Sichuan Province using the GWR model, Sustainability, 13(1), p. 100, 2020.

Henderi, H., Comparison of min-max normalization and z-score normalization in the k-nearest neighbor (KNN) algorithm to test the accuracy of types of breast cancer, IJIIS: International Journal of Informatics and Information Systems, 4(1), pp. 13-20, 2021.

Huang, Y., Tibbe, T., Tang, A. and Montoya, A., Lasso and group lasso with categorical predictors: Impact of coding strategy on variable selection and prediction, Journal of Behavioral Data Science, 3(2), pp. 15-42, 2024.

Humayun, M., Chirenda, J., Ye, W., Mukeredzi, I., Mujuru, H.A. and Yang, Z., Effect of gender on clinical presentation of tuberculosis (TB) and age-specific risk of TB, and TB-human immunodeficiency virus coinfection, Open Forum Infectious Diseases, 9(10), p. ofac512, 2022.

Iddrisu, A.-K., Amikiya, E.A. and Bukari, F.K., Spatio-temporal characteristics of tuberculosis in Ghana, F1000Research, 11, p. 200, 2022.

Jiang, H., Chen, X., Lv, J., Dai, B., Liu, Q., Ding, X., Pan, J., Ding, H., Lu, W., Zhu, L. and Lu, P., Prospective cohort study on tuberculosis incidence and risk factors in the elderly population of Eastern China, Heliyon, 10(3), p. e24507, 2024.

Liu, N., Zou, B., Li, S., Zhang, H. and Qin, K., Prediction of PM2.5 concentrations at unsampled points using multiscale geographically and temporally weighted regression, Environmental Pollution, 284, p. 117116, 2021.

Liu, Y. and Dong, F., Using geographically temporally weighted regression to assess the contribution of corruption governance to global PM2.5, Environmental Science and Pollution Research, 28(11), pp. 13536-13551, 2020.

Maison, D.P., Tuberculosis pathophysiology and anti-VEGF intervention, Journal of Clinical Tuberculosis and Other Mycobacterial Diseases, 27, p. 100300, 2022.

Moya, E.M.D., Pietrus, A. and Oliva, S.M., Mathematical model with fractional order derivatives for tuberculosis taking into account its relationship with HIV/AIDS and diabetes, Jambura Journal of Biomathematics (JJBM), 2(2), pp. 80-95, 2021.

Oyedele, O., Determining the optimal number of folds to use in a k-fold cross-validation: A neural network classification experiment, Research in Mathematics, 10(1), 2023.

Pan, Q., Zhu, R., Qiu, J. and Cai, G., Construction of an anthropometric discriminant model for identification of elite swimmers: An adaptive lasso approach, PeerJ, 11, p. e14635, 2023.

Passos, M., Mariano, A.B.R., Andreska da Silva, D. and Oliveira de Sousa, A.B., Performance of sensors for quality analysis of irrigation water, Revista Brasileira de Engenharia de Biossistemas, 16, 2023.

Peer, V., Schwartz, N. and Green, M.S., Gender differences in tuberculosis incidence rates?a pooled analysis of data from seven high-income countries by age group and time period, Frontiers in Public Health, 10, p. 997025, 2023.

Pereira, T.V., Nogueira, M.C. and Campos, E.M.S., Spatial analysis of tuberculosis and its relationship with socioeconomic indicators in a medium-sized city in Minas Gerais, Revista Brasileira de Epidemiologia, 24(suppl 1), p. e210021, 2021.

Pusponegoro, N.H. and Rachmawati, R.N., Spatial empirical best linear unbiased prediction in small area estimation of poverty, Procedia Computer Science, 135, pp. 712-718, 2018.

Putra, R., Wahyuning Tyas, S. and Fadhlurrahman, M.G., Geographically weighted regression with the best kernel function on open unemployment rate data in East Java Province, Enthusiastic: International Journal of Applied Statistics and Data Science, pp. 26-36, 2022.

Rachmawati, R.N., Rahman, J.Y., Pusponegoro, N.H., et al., Bayesian spatial hierarchical mixture models for excess zeros data: Review and application to female lymphatic filariasis cases, Commun. Math. Biol. Neurosci., 2024, pp. 1-19, 2024.

Ratnasari, D., Rahmawati, M., Khaerunnisa, A. and Kamilah, A.F., The effect of problem based learning (PBL) model on students' Development, 2(2), pp. 79-86, 2022.

Ighovie, E.S., Evelyn, E.E. and Correlation, A.O.D., Correlation of rainfall on tuberculosis distribution in South-Southern Nigeria, Journal of Management and Social Science Research, 4(1), pp. 1-7, 2023.

Shrestha, N., Detecting multicollinearity in regression analysis, American Journal of Applied Mathematics and Statistics, 8(2), pp. 39-42, 2020.

Sifriyani, S., Rasjid, M., Rosadi, D., Anwar, S., Wahyuni, R.D. and Jalaluddin, S., Spatial-temporal epidemiology of COVID-19 using a geographically and temporally weighted regression model, Symmetry, 14(4), p. 742, 2022.

Soeroto, A.Y., Pratiwi, C., Santoso, P. and Lestari, B.W., Factors affecting outcome of longer regimen multidrug-resistant tuberculosis treatment in West Java Indonesia: A retrospective cohort study, PLOS ONE, 16(2), p. e0246284, 2021.

Sunarsih, E., Zulkarnain, M., Hanum, L., Flora, R. and Damiri, N., Spatial pattern analysis of malaria cases in Muara Enim Regency using Moran Index and local indicator spatial autocorrelation, Open Access Macedonian Journal of Medical Sciences, 9(E), pp. 695-701, 2021.

Takano, Y. and Miyashiro, R., Best subset selection via cross-validation criterion, TOP, 28(2), pp. 475-488, 2020.

Tyas, S.W., Gunardi and Puspitasari, L.A., Geographically weighted generalized Poisson regression model with the best kernel function in the case of the number of postpartum maternal mortality in East Java, MethodsX, 10, p. 102002, 2023.

Wang, D., Yang, Y., Qiu, A., Kang, X., Han, J. and Chai, Z., A CUDA-based parallel geographically weighted regression for large-scale geographic data, ISPRS International Journal of Geo-Information, 9(11), p. 653, 2020.

Wu, C., Ren, F., Hu, W. and Du, Q., Multiscale geographically and temporally weighted regression: Exploring the spatiotemporal determinants of housing prices, International Journal of Geographical Information Science, 33(3), pp. 489-511, 2018.

Xiao, F., Wang, J., Xiong, M. and Mo, H., Does spatiotemporal heterogeneity matter? Air transport and the rise of high-tech industry in China, Applied Geography, 162, p. 103148, 2024.

Xu, M., Li, Y., Liu, B., Chen, R., Sheng, L., Yan, S., Chen, H., Hou, J., Yuan, L., Ke, L., Fan, M. and Hu, P., Temperature and humidity associated with increases in tuberculosis notifications: A time-series study in Hong Kong, Epidemiology and Infection, 149, p. e8, 2020.

Yacim, J.A. and Boshoff, D.G.B., A comparison of bandwidth and kernel function selection in geographically weighted regression for house valuation, International Journal of Technology, 10(1), p. 58, 2019.

Yu, H., Yang, J., Yan, Y., Zhang, H., Chen, Q. and Sun, L., Factors affecting the incidence of pulmonary tuberculosis based on the GTWR model in China, 2004?2021, Epidemiology and Infection, 152, p.e65, 2024.

Zhang, R., Zhao, T., Lu, Y. and Xu, X., Relaxed adaptive lasso and its asymptotic results, Symmetry, 14(7), p. 1422, 2022.

Downloads

Published

2025-07-15

How to Cite

Habsy, M. Y. A., Rachmawati, R. N., Khotimah, P. H. ., Natari, R. B., Riswantini, D., Munandar, D. ., & Izzaturrahim, M. H. (2025). Multiscale Geographically and Temporally Weighted Regression with LASSO and Adaptive LASSO for Tuberculosis Incidence Mapping in West Java. Communication in Biomathematical Sciences, 8(1), 79-92. https://doi.org/10.5614/cbms.2025.8.1.6

Issue

Section

Articles