
Multicollinearity
Multicollinearity is the occurrence of high intercorrelations among two or more independent variables in a multiple regression model. It can also happen if an independent variable is computed from other variables in the data set or if two independent variables provide similar and repetitive results. Statistical analysts use multiple regression models to predict the value of a specified dependent variable based on the values of two or more independent variables. Instead, the market analysis must be based on markedly different independent variables to ensure that they analyze the market from different independent analytical viewpoints. Multicollinearity can lead to skewed or misleading results when a researcher or analyst attempts to determine how well each independent variable can be used most effectively to predict or understand the dependent variable in a statistical model.

What Is Multicollinearity?
Multicollinearity is the occurrence of high intercorrelations among two or more independent variables in a multiple regression model. Multicollinearity can lead to skewed or misleading results when a researcher or analyst attempts to determine how well each independent variable can be used most effectively to predict or understand the dependent variable in a statistical model.
In general, multicollinearity can lead to wider confidence intervals that produce less reliable probabilities in terms of the effect of independent variables in a model. That is, the statistical inferences from a model with multicollinearity may not be dependable.



Understanding Multicollinearity
Statistical analysts use multiple regression models to predict the value of a specified dependent variable based on the values of two or more independent variables. The dependent variable is sometimes referred to as the outcome, target, or criterion variable.
An example is a multivariate regression model that attempts to anticipate stock returns based on items such as price-to-earnings ratios (P/E ratios), market capitalization, past performance, or other data. The stock return is the dependent variable and the various bits of financial data are the independent variables.
Multicollinearity in a multiple regression model indicates that collinear independent variables are related in some fashion, although the relationship may or may not be casual. For example, past performance might be related to market capitalization, as stocks that have performed well in the past will have increasing market values.
In other words, multicollinearity can exist when two independent variables are highly correlated. It can also happen if an independent variable is computed from other variables in the data set or if two independent variables provide similar and repetitive results.
One of the most common ways of eliminating the problem of multicollinearity is to first identify collinear independent variables and then remove all but one. It is also possible to eliminate multicollinearity by combining two or more collinear variables into a single variable. Statistical analysis can then be conducted to study the relationship between the specified dependent variable and only a single independent variable.
Example of Multicollinearity
For investing, multicollinearity is a common consideration when performing technical analysis to predict probable future price movements of a security, such as a stock or a commodity future.
Market analysts want to avoid using technical indicators that are collinear in that they are based on very similar or related inputs; they tend to reveal similar predictions regarding the dependent variable of price movement. Instead, the market analysis must be based on markedly different independent variables to ensure that they analyze the market from different independent analytical viewpoints.
An example of a potential multicollinearity problem is performing technical analysis only using several similar indicators.
Noted technical analyst John Bollinger, creator of the Bollinger Bands indicator, notes that "a cardinal rule for the successful use of technical analysis requires avoiding multicollinearity amid indicators." To solve the problem, analysts avoid using two or more technical indicators of the same type. Instead, they analyze a security using one type of indicator, such as a momentum indicator, and then do a separate analysis using a different type of indicator, such as a trend indicator.
For example, stochastics, the relative strength index (RSI), and Williams %R are all momentum indicators that rely on similar inputs and are likely to produce similar results. In this case, it is better to remove all but one of the indicators or find a way to merge several of them into just one indicator, while also adding a trend indicator that is not likely to be highly correlated with the momentum indicator.
Related terms:
Investment Analyst
An investment analyst is an expert at evaluating financial information, typically for the purpose of making buy, sell, and hold recommendations for securities. read more
Bollinger Band® (Technical Analysis)
A Bollinger Band® is a momentum indicator used in technical analysis that depicts two standard deviations above and below a simple moving average. read more
Commodity Futures Contract
A commodity futures contract is an agreement to buy or sell a commodity at a set price and time in the future. Read how to invest in commodity futures. read more
Confidence Interval
A confidence interval, in statistics, refers to the probability that a population parameter will fall between two set values. read more
Error Term
An error term is a variable in a statistical model when the model doesn't represent the actual relationship between the independent and dependent variables. read more
Market Capitalization
Market capitalization is the total dollar market value of all of a company's outstanding shares. read more
Multiple Linear Regression (MLR)
Multiple linear regression (MLR) is a statistical technique that uses several explanatory variables to predict the outcome of a response variable. read more
Multivariate Model
The multivariate model is a popular statistical tool that uses multiple variables to forecast possible investment outcomes. read more
Price-to-Earnings (P/E) Ratio
The price-to-earnings (P/E) ratio is the ratio for valuing a company that measures its current share price relative to its per-share earnings. read more