How we change what others think, feel, believe and do
Pearson devised a very common way of measuring correlation, often called the Pearson Product-Moment Correlation. It is is used when both variables are at least at interval level and data is parametric.
It is calculated by dividing the covariance of the two variables by the product of their standard deviations.
r = SUM((xi - xbar)(y - ybar)) / ((n - 1) * sx
r may also be considered as being:
r2 = explained variation / total variation
In other words, it is the proportion of variation that can be explained. A high explained proportion is good, and a value of one is perfect correlation. For example an r of 0.8 explains 64% of the variance.
When calculated from a population, Pearson's coefficient is denoted with the Greek letter 'rho' (ρ). When calculated from a sample, it is denoted with 'r'.
The Coefficient of Determination is calculated as r2.
Pearson r = sum((xi - xbar)(y - ybar)) / ((n - 1) * sx * sy)
This is quite high, showing a moderately good correlation between the sets of numbers.
Pearson is a parametric statistic and assumes:
The coefficient of determination, r2, represents the percent of the variance in the dependent variable explained by the dependent variable.
Correlation explains a certain amount of variance, but not all. This works on a square law, so a correlation of 0.5 indicates that the independent variable explains 25% of the variance of the dependent variable, and a correlation of 0.9 accounts for 81% of the of the variance.
This means that the unexplained variance is indicated by (1-r2). This i typically due to random factors.
Pearson's Correlation is also known as the Pearson Product-Moment Correlation or Sample Correlation Coefficient. 'r' is also known as 'Pearson's r'.
And the big