In multiple linear regression analysis, linear dependencies in the regressor variables lead to ill-conditioning known as multicollinearity. Multicollinearity inflates variance of the estimates as well as causes changes in direction of signs of the coefficient estimates leading to unreliable, and many times erroneous inference. Principal components regression and ridge or shrinkage approach have not provided completely satisfactory results in dealing with the multicollinearity. There are host of issues in ridge regression like choosing bias k and stability or consistency of the variances which still remain unresolved. In this paper, a partial ridge regression estimation is proposed, which involves selectively adjusting the ridge constants associated with highly collinear variables to control instability in the variances of coefficient estimates. Results based on synthetic data from simulations, and a real-world data set from the manufacturing industry show that the proposed method outperforms the existing solutions in terms of bias, mean square error, and relative efficiency of the estimated parameters.

Partial ridge regression under multicollinearity

GALLO, Michele
2016-01-01

Abstract

In multiple linear regression analysis, linear dependencies in the regressor variables lead to ill-conditioning known as multicollinearity. Multicollinearity inflates variance of the estimates as well as causes changes in direction of signs of the coefficient estimates leading to unreliable, and many times erroneous inference. Principal components regression and ridge or shrinkage approach have not provided completely satisfactory results in dealing with the multicollinearity. There are host of issues in ridge regression like choosing bias k and stability or consistency of the variances which still remain unresolved. In this paper, a partial ridge regression estimation is proposed, which involves selectively adjusting the ridge constants associated with highly collinear variables to control instability in the variances of coefficient estimates. Results based on synthetic data from simulations, and a real-world data set from the manufacturing industry show that the proposed method outperforms the existing solutions in terms of bias, mean square error, and relative efficiency of the estimated parameters.
File in questo prodotto:
File Dimensione Formato  
JAS2016_Partial.pdf

accesso solo dalla rete interna

Tipologia: Documento in Post-print
Licenza: PUBBLICO - Pubblico con Copyright
Dimensione 1.52 MB
Formato Adobe PDF
1.52 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11574/170488
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact