Covariance
Definition
Covariance is a measure that indicates the extent to which two random variables change in tandem. It's a statistical concept that gives a sense of the degree to which two variables move together. When the covariance is positive, the variables tend to show similar behavior. They increase together or decrease together. If the covariance is negative, the variables tend to show opposite behavior. One variable may decrease as the other increases.
Mathematical Representation
Covariance can be mathematically represented as:
Cov(X, Y) = Σ E((X-μX)(Y-μY)) / n-1
where: - X is a random variable - E(X) = μX is the expected value (mean) of the random variable X - Y is another random variable - E(Y) = μY is the expected value (mean) of the random variable Y - n is the number of data points
Calculation
The calculation of covariance involves the following steps:
1. Compute the mean of the variables X and Y. 2. Subtract the mean of X from every X data point (X - μX) and do the same for Y (Y - μY). 3. Multiply the results obtained from step 2 for each data point. 4. Sum up the values obtained in step 3. 5. Divide the result by n-1.
Interpretation
The sign of the covariance can be interpreted as follows:
- Positive covariance: Indicates that two variables tend to move in the same direction. - Negative covariance: Reveals that two variables tend to move in inverse directions.
The magnitude of the covariance is not easy to interpret because it is not normalized and hence depends on the units of the variables.
Covariance vs Correlation
While both covariance and correlation measure the degree to which two variables move in relation to each other, they are not the same. Correlation is a scaled version of covariance that takes values between -1 and +1, and is dimensionless. Covariance, on the other hand, is expressed in units derived from the input variables.
Covariance Matrix
A covariance matrix is a square matrix giving the covariance between each pair of elements of a given random vector. In the matrix diagonal there are variances, i.e., the covariance of each element with itself.
Applications
Covariance is widely used in statistics and probability theory. Its applications include:
- In portfolio theory, covariance is used to determine the correlation coefficient between the rates of return of assets. - In linear regression, the covariance matrix of the slope and intercept estimates provides insight into the reliability of the estimates. - In pattern recognition, covariance matrices are used to differentiate between different classes of patterns.
Limitations
Covariance has its limitations. It is sensitive to the changes in scale. If all the values of the given variable are multiplied by a constant, then the covariance is also multiplied by the square of that constant. This makes it difficult to measure the relationship between variables with different scales.
See Also
- Correlation - Variance - Standard Deviation - Probability Theory - Linear Regression - Portfolio Theory