

Regardless of your criterion for what constitutes a high VIF, there are at least three situations in which a high VIF is not a problem and can be safely ignored:ġ. Personally, I tend to get concerned when a VIF is greater than 2.50, which corresponds to an R 2 of. Authorities differ on how high the VIF has to be to constitute a problem.

The VIF has a lower bound of 1 but no upper bound. Thus, a VIF of 1.8 tells us that the variance (the square of the standard error) of a particular coefficient is 80% larger than it would be if that predictor was completely uncorrelated with all the other predictors. It’s called the variance inflation factor because it estimates how much the variance of a coefficient is “inflated” because of linear dependence with other predictors. LEARN MORE IN A SEMINAR WITH PAUL ALLISON The VIF may be calculated for each predictor by doing a linear regression of that predictor on all the other predictors, and then obtaining the R 2 from that regression. But many do not realize that there are several situations in which multicollinearity can be safely ignored.īefore examining those situations, let’s first consider the most widely-used diagnostic for multicollinearity, the variance inflation factor (VIF).

Most data analysts know that multicollinearity is not a good thing. It occurs when there are high correlations among predictor variables, leading to unreliable and unstable estimates of regression coefficients. Multicollinearity is a common problem when estimating linear or generalized linear models, including logistic regression and Cox regression. Guilford publications.When Can You Safely Ignore Multicollinearity? SeptemBy Paul Allison Principles and practice of structural equation modeling. Effects of sample size, estimation method, and model specification on structural equation modeling fit indexes. A beginner’s guide to structural equation modeling, Second edition. The reliability coefficient for maximum likelihood factor analysis. Structural equation modeling with EQS and EQS/Windows. The table below also summarizes these indices used in Confirmatory Factor Analysis (CFA) and Structural Equation Modelling (SEM) Analysis Values closer to 0 represent a good fit.įor structural equation models (SEM), Kline (2015) suggests that at a minimum the following indices should be reported: The model chi-square, the RMSEA, the CFI and the SRMR. RMSEA: The Root Mean Square Error of Approximation is a parsimony-adjusted index. Not very sensitive to sample size (Fan, Thompson, & Wang, 1999).Ĭompares the fit of a target model to the fit of an independent, or null, model. CFI: The Comparative Fit Index is a revised form of NFI. An NFI of 0.95, indicates the model of interest improves the fit by 95\ NNFI (also called the Tucker Lewis index TLI) is preferable for smaller samples.

NFI/NNFI/TLI: The (Non) Normed Fit Index. GFI/AGFI: The (Adjusted) Goodness of Fit is the proportion of variance accounted for by the estimated population covariance.Īnalogous to R2. However, it is quite sensitive to sample size. 05 (i.e., the hypothesis of a perfect fit cannot be rejected). These are discussed below: Chisq: The model Chi-squared assesses overall fit and the discrepancy between the sample and fitted covariance matrices. Present a discussion of the meaning of each fit index, its use and the threshold required In this tutorial, I present a comprehensive tutorial on the fit indices reported in the Confirmatory Factor Analysis (CFA) and Structural Equation Modelling (SEM) analysis, to test the fitness of the model and variable constructs.
