How to calcualte pearson correlation

(Comments)

Analysis of correlation and significance of parameters

Correlation

The study of the significance of the impact of input parameters on output parameters should begin with the analysis of the correlation of individual parameters. Three basic dependencies can be checked:

  • monotonic linear
  • monotonic non-linear
  • square

Pearson's correlation coefficient (monotonic linear relationship)

The most basic measure determining whether there is a linear correlation between parametersxi i yiis the Pearson correlation coefficient:

rp=i=1n(xix¯)(yiy¯)i=1n(xix¯)2i=1n(yiy¯)2

wherex¯andy¯mean the mean values ​​of the relevant parameters.

This formula can be simplified to

rp=cov(x,y)var(x)var(y)

wherex=[x1,x2,...],y=[y1,y2,...]

Spearman's correlation coefficient (monotonic non-linear relationship)

Spearman's rank correlation coefficient is more universal because it allows to determine the strength of monotonic correlation, which may be non-linear and is expressed by the relation:

rs=i=1n(RiR¯)(SiS¯)i=1n(RiR¯)2i=1n(SiS¯)2

whereRiis the rank of the observationxi, Si is the rank of the observationyiandR¯ i S¯are the mean values ​​of the respective ranksRi andSi.

Interpretation of the correlation coefficient value

Correlation type:

  • rs> 0 positive correlation – when the value of X increases, so does Y
  • rs= 0 no correlation – when X increases, Y sometimes increases and sometimes decreases
  • rs< 0 negative correlation – when X increases, Y decreases

Correlation strength:

  • |rs|<0.2– no linear relationship
  • 0.2|rs|<0.4- weak dependence
  • 0.4|rs|<0.7– moderate dependency
  • 0.7|rs|<0.9- quite a strong relationship
  • |rs|0.9- very strong dependence

Quadratic correlation coefficient

The quadratic correlation coefficient is determined on the basis of regression analysis.

Error sum of squaresSSEis designated as

SSE=i=1n(yiy^i)2

After performing the approximation with a polynomial of the second degree (i.e. determining the coefficientsa2,a1,a0) y^i is determined by substitutionxito the formula of the approximating function

y^i=a2xi2+a1xi+a0

total sum of squaresSST to

SST=i=1n(yiy¯)2

The correlation coefficient is determined from the relationship

rq=1SSESST

Statistical testing of the significance of the correlation coefficient

To determine whether the determined correlation coefficient is statistically significant, it is necessary to make a null hypothesis

H0:δ=0

meaning that there is no correlation between the parameters. The alternative hypothesis has the form

H1:δ0

It is assumed that the statistic takes the Student's t-distribution o k=n2degrees of freedom and hence, for example, for the Pearson correlation coefficient, the value of the statistics is

t=rpn21rp2

The value of the test statistic cannot be determined whenrp=1 therp=1or whenn<3.

In other cases, the value determined on its basisp (read from the Student's t-distribution) is compared with the assumed significance levelα

  • ifpαwe reject itH0accepting H1
  • ifp>αthere is no reason to reject itH0

Typically, a significance level is selectedα=0.05, agreeing that in 5% of situations we will reject the null hypothesis when it is true.

The same is done for the other correlation coefficients insteadrpsubstitutingrstherq.

Currently unrated

Comments

Riddles

22nd Jul- 2020, by: Editor in Chief
524 Shares 4 Comments
Generic placeholder image
20 Oct- 2019, by: Editor in Chief
524 Shares 4 Comments
Generic placeholder image
20Aug- 2019, by: Editor in Chief
524 Shares 4 Comments
10Aug- 2019, by: Editor in Chief
424 Shares 4 Comments
Generic placeholder image
10Aug- 2015, by: Editor in Chief
424 Shares 4 Comments

More News  »

Fixing the issue in assumption of OLS step by step or one by one

Recent news

Hi, I want to raise the issue related to know whether your OLS is ok or not. 

read more
2 days ago

Meaning of 45 degree in economics chart

Recent news

The **45-degree line** in economics and geometry refers to a line where the values on the x-axis and y-axis are equal at every point. It typically has a slope of 1, meaning that for every unit increase along the horizontal axis (x), there is an equal unit increase along the vertical axis (y). Here are a couple of contexts where the 45-degree line is significant:

read more
1 month ago

hyperinflation in hungary

Recent news

The **hyperinflation in Hungary** in the aftermath of World War II (1945–1946) is considered the worst case of hyperinflation in recorded history. The reasons behind this extreme economic event are numerous, involving a combination of war-related devastation, political instability, massive fiscal imbalances, and mismanagement of monetary policy. Here's an in-depth look at the primary causes:

read more
1 month, 1 week ago

what is neutrailty of money

Recent news

**Neutrality of money** is a concept in economics that suggests changes in the **money supply** only affect **nominal variables** (like prices, wages, and exchange rates) and have **no effect on real variables** (like real GDP, employment, or real consumption) in the **long run**.

read more
1 month, 1 week ago

Japan deflationary phenomenon

Recent news

Deflation in Japan, which has persisted over several decades since the early 1990s, is a complex economic phenomenon. It has been influenced by a combination of structural, demographic, monetary, and fiscal factors. Here are the key reasons why deflation occurred and persisted in Japan:

read more
1 month, 1 week ago

What the tips against inflation

Recent news

Hedging against inflation involves taking financial or investment actions designed to protect the purchasing power of money in the face of rising prices. Inflation erodes the value of currency over time, so investors seek assets or strategies that tend to increase in value or generate returns that outpace inflation. Below are several ways to hedge against inflation:

read more
1 month, 1 week ago

Long and short run philip curve

Recent news

The **Phillips Curve** illustrates the relationship between inflation and unemployment, and how this relationship differs in the **short run** and the **long run**. Over time, economists have modified the original Phillips Curve framework to reflect more nuanced understandings of inflation and unemployment dynamics.

read more
1 month, 1 week ago

How the government deal with inflation (monetary and fiscal) policies

Recent news

Dealing with inflation requires a combination of **fiscal and monetary policy** tools. Policymakers adjust these tools depending on the nature of inflation—whether it's **demand-pull** (inflation caused by excessive demand in the economy) or **cost-push** (inflation caused by rising production costs). Below are key approaches to controlling inflation through fiscal and monetary policy.

read more
1 month, 1 week ago

More News »

Generic placeholder image

Collaboratively administrate empowered markets via plug-and-play networks. Dynamically procrastinate B2C users after installed base benefits. Dramatically visualize customer directed convergence without