Loading...
Visual Analytics of Multicollinearity Mitigation Techniques Using Correlation, VIF and Regularization Based Feature Selection


Journal of Computer and Communication Networks

Received On : 26 October 2025

Revised On : 02 January 2026

Accepted On : 26 January 2026

Published On : 30 January 2026

Volume 02, 2026

Pages : 025-036


Abstract

One of the worst issues with high-dimensional data is that it leads to multicollinearity, which results in unstable model coefficients, reduced interpretability, and weak predictive diagnostics. In this paper, a feature optimization framework will be introduced, wherein methods sensitive to multicollinearity are applied to combine statistical filtering, iterative elimination, and regularization in a single pipeline. Synthetic data in a systematic designed way has manipulated factors of correlation of features, redundancy and noise such that mitigation methods can be tested in controlled conditions. The suggested design will consist of preprocessing where the data will undergo Z-score normalization and a missing data processing step and correlation-based filtering which will eliminate very correlated pairs of features. The feature space is then further reduced by dropping redundant variables beyond specified thresholds by repeatedly using Variance Inflation Factor (VIF). Ridge (L2) and Lasso (L1) regularization techniques are provided to enforce more strength which in fact stabilizes coefficients and helps in sparsity. A refined feature set is then subjected to an orthogonal space by way of Principal Component Analysis (PCA) in order to minimize redundancy as much as possible. The overall experimental study demonstrates that the proposed solution will assist in reducing the degree of multicollinearity to a minimum, since the value of VIF will reduce over 85 percent and the stability of the model can also be improved. It can be seen via comparative analysis that, the RMSE has been reduced by approximately 18 – 25% and the R2 has been increased up to 12% in comparison to those of the base models where no feature optimization has taken place. By incorporating various mitigation tactics, interpretability and predictive accuracy are guaranteed, and as such, the framework can be used in real-world contexts in which high-dimensional data are frequently applicable.

Keywords

Multicollinearity, Feature Selection, Variance Inflation Factor (VIF), Regularization, Principal Component Analysis (PCA), Regression Analysis.

  1. J. Y.-L. Chan et al., “Mitigating the Multicollinearity Problem and Its Machine Learning Approach: A Review,” Mathematics, vol. 10, no. 8, p. 1283, Apr. 2022, doi: 10.3390/math10081283.
  2. T. Kyriazos and M. Poga, “Dealing with Multicollinearity in Factor Analysis: The Problem, Detections, and Solutions,” Open Journal of Statistics, vol. 13, no. 03, pp. 404–424, 2023, doi: 10.4236/ojs.2023.133020.
  3. C. García García, R. Salmerón Gómez, and J. García Pérez, “A review of ridge parameter selection: minimization of the mean squared error vs. mitigation of multicollinearity,” Communications in Statistics - Simulation and Computation, vol. 53, no. 8, pp. 3686–3698, Aug. 2022, doi: 10.1080/03610918.2022.2110594.
  4. N. Akhtar, M. F. Alharthi, and M. S. Khan, “Mitigating Multicollinearity in Regression: A Study on Improved Ridge Estimators,” Mathematics, vol. 12, no. 19, p. 3027, Sep. 2024, doi: 10.3390/math12193027.
  5. H. I. Dertli, D. B. Hayes, and T. G. Zorn, “Effects of multicollinearity and data granularity on regression models of stream temperature,” Journal of Hydrology, vol. 639, p. 131572, Aug. 2024, doi: 10.1016/j.jhydrol.2024.131572.
  6. K Enwere, E Nduka, U Ogoke, “Comparative Analysis of Ridge, Bridge and Lasso Regression Models in the Presence of Multicollinearity,” IPS Intelligentsia Multidisciplinary Journal, vol. 3, no. 1, pp. 1–8, Jan. 2023, doi: 10.54117/iimj.v 3i1.5.
  7. J. Cheng, J. Sun, K. Yao, M. Xu, and Y. Cao, “A variable selection method based on mutual information and variance inflation factor,” Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy, vol. 268, p. 120652, Mar. 2022, doi: 10.1016/j.saa.2021.120652.
  8. M. M. Abdelwahab, M. R. Abonazel, A. T. Hammad, and A. M. El-Masry, “Modified Two-Parameter Liu Estimator for Addressing Multicollinearity in the Poisson Regression Model,” Axioms, vol. 13, no. 1, p. 46, Jan. 2024, doi: 10.3390/axioms13010046.
  9. V. K. S., B. B. S., J. Vellingiri, and D. R. Ramani, “Enhancing Mobile Edge Computing customer reviews analysis with Ensemble Sparse Support Vector L1 Regularization Based Crossover Discrete Mycorrhized Algorithm,” Biomedical Signal Processing and Control, vol. 109, p. 108019, Nov. 2025, doi: 10.1016/j.bspc.2025.108019.
  10. Z. Hasan et al., “NEV-NCD: Negative Learning, Entropy, and Variance Regularization Based Novel Action Categories Discovery,” 2023 IEEE International Conference on Image Processing (ICIP), pp. 2720–2724, Oct. 2023, doi: 10.1109/icip49359.2023.10222474.
  11. H. Wang and Y. Han, “An S-transform regularization-based denoising method for acoustic signals in photoacoustics,” Signal, Image and Video Processing, vol. 20, no. 1, Jan. 2026, doi: 10.1007/s11760-025-05043-1.
  12. Q. U. Ain, A. Javed, K. M. Malik, and A. Irtaza, “Regularized forensic efficient net: a game theory based generalized approach for video deepfakes detection,” Multimedia Tools and Applications, vol. 84, no. 25, pp. 29621–29664, Oct. 2024, doi: 10.1007/s11042-024-20268-x.
  13. J. Zhao, Y. Liu, L. Cheng, M. Zhang, M. Jiang, and J. Miao, “Sparse regularization-based feature selection: A survey,” Procedia Computer Science, vol. 221, pp. 1386–1393, 2023, doi: 10.1016/j.procs.2023.08.129.
  14. T. Chen, Y. Zeng, H. Yuan, G. Zhong, L. L. Lai, and Y. Y. Tang, “Multi-level regularization-based unsupervised multi-view feature selection with adaptive graph learning,” International Journal of Machine Learning and Cybernetics, vol. 14, no. 5, pp. 1695–1709, Nov. 2022, doi: 10.1007/s13042-022-01721-5.
  15. M. Khan, A. Ahmad, and M. Sarfraz, “L1 Regularization based Fetal Health Analysis Using ML Techniques,” 2024 15th International Conference on Computing Communication and Networking Technologies (ICCCNT), pp. 1–6, Jun. 2024, doi: 10.1109/icccnt61001.2024.10724506.
CRediT Author Statement

We would like to thank Reviewers for taking the time and effort necessary to review the manuscript. We sincerely appreciate all valuable comments and suggestions, which helped us to improve the quality of the manuscript.

Acknowledgements

The authors would like to thank to the reviewers for nice comments on the manuscript.

Funding

No funding was received to assist with the preparation of this manuscript.

Ethics Declarations

Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.

Availability of Data and Materials

Data sharing is not applicable to this article as no new data were created or analysed in this study.

Author Information

Contributions

All authors have equal contribution in the paper and all authors have read and agreed to the published version of the manuscript.

Corresponding Author



Rights and permissions

This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0).
https://creativecommons.org/licenses/by/4.0/.

This license permits unrestricted use, sharing, distribution, reproduction, and adaptation in any medium or format, including for commercial purposes, provided that appropriate credit is given to the original author(s) and the source, a link to the license is provided, and any changes made are indicated.

Copyright

© 2026 Aisling Yue Irwing. The author(s) retain copyright of the work. The author(s) grant the Journal of Computer and Communication Networks (JCCN) and its publisher, Ansis Publications, the right of first publication and the right to identify itself as the original publisher of the article.

Cite this Article

Aisling Yue Irwing, “Visual Analytics of Multicollinearity Mitigation Techniques Using Correlation, VIF and Regularization Based Feature Selection”, Journal of Computer and Communication Networks, pp. 025-036, 2026, doi: 10.64026/JCCN/2026003.