• Written By Simran Simran
  • Last Modified 18-12-2024

Angle Between Two Lines of Regression: Point of Intersection, Formula

img-icon

Angle Between Two Lines of Regression: Forecasting or prediction is a crucial part of human life. We always try to stay ahead of things that we do, we plan for the future, and for doing this, we always look back at how things took place. They say, “History repeats itself.” Although vague, this is the simplified version of the concept of mathematical regression.

Regression means going back or taking a step back. Regression analysis is a mathematical way of establishing an average relationship between the independent and dependent variables. We can then use this relationship to predict future results or study past data. In this article, let us learn about lines of regression and the angle between them.

Curve of Regression

A scatter diagram is a graph of dots plotted with the values of the dependent and independent variables. If the two variables are related, the points on the scatter diagram will look more or less concentrated around a curve called the curve of regression.

Curve of Regression

What are Lines of Regression?

If this curve is a straight line, it is a case of linear regression, and the straight line is called the line of regression. Regression lines are the lines of best fit, so they try to cover as many points from the scatter diagram as possible. These lines express the average relationship between variables. In linear regression, we find the lines of regression by the concept curve fitting using the principle of least squares.

Lines of Regression

Why are there Two Lines of Regression?

The method of least squares minimises the sum of the squares of the errors from a specified value. Depending upon which variable we select as the dependent variable from x and y, there can be two lines of regression.

Line of Regression of x on y

We use this line to find the value of the dependent variable x for a given value of the independent variable y. We use the least squares method to minimise the squares of the errors of the available values of y from the mean value of y i.e. y¯

Line of Regression of y on x

We use this line to find the value of the dependent variable y for a given value of the independent variable x. We use the method of least squares to minimise the squares of the errors of the available values of x from the mean value of x i.e. x¯

Formula for Lines of Regression

The line of regression of x on y is given by the equation (xx¯)=rσxσy(yy¯).

The line of regression of y on x is given by the equation (yy¯)=rσyσx(xx¯).

Here,

x¯= Mean of the values of x

y¯= Mean of the values of y

σx= Standard deviation of the values of x from x¯

σy= Standard deviation of the values of y from y¯

r= Correlation coefficient

According to these formulas, we can see that both these lines pass through the point (x¯,y¯). Thus, this is the point of intersection of these two lines.

Angle Between the Two Lines of Regression

We have seen that the two lines of regression intersect at the point (x¯,y¯). Therefore, for a correlation that is not perfect, the two lines will be at an angle (θ) to each other.

The line of regression of x on y is given by the equation (xx¯)=rσxσy(yy¯)

Let the slope of this line be m1.

We can rewrite the line equation as (yy¯)=σyrσx(xx¯)

When we compare this with the standard line equation, y=mx+c, we get

m1=σyrσx

Similarly, the line of regression of y on x is given by the equation

(yy¯)=rσyσx(xx¯)

Here, let the slope of this line be m2.

When we compare this with the standard line equation, y=mx+c, we get

m2=rσyσx

For two intersecting lines with slopes m1 and m2, the angle between these two lines, θ, can be found by the formula,

tanθ=|m1m21+m1m2|

tanθ=|σyrσxrσyσx1+σyrσx×rσyσx|

tanθ=|σyσx(1rr)1+σ2σx2|

tanθ=|σyσx(1r2r)×σx2σx2+σy2|

tanθ=|(1r2r)σxσyσx2+σy2|

θ=tan1|(1r2r)σxσyσx2+σy2|

PRACTICE EXAM QUESTIONS AT EMBIBE

Type of Correlation and Angle of Regression

Following are the different types of Correlation and Angle of Regression

1. If we substitute r=0 in the equation of θ we get,

θ=tan1|(1020)σxσyσx2+σy2|

θ=tan1 (not defined)

θ=π2

It means that when r=0, the angle between the lines of regression is θ=π2.

r=0 means the variables x and y have no correlation.

Thus, for uncorrelated variables x and y, the lines of regression are perpendicular to each other.

2. If we substitute r=±1 in the equation of θ we get,

θ=tan1|[1(±1)2±1]σxσyσx2+σy2|

θ=tan1(0)

θ=0 or θ=π

It means that when r=±1, the angle between the lines of regression is θ=0 or θ=π.

r=±1 means the variables have a perfect positive or perfect negative correlation.

Thus, for perfectly correlated (positive or negative) variables x and y, the lines of regression are coincident.

3. The modulus(||) in the formula of θ indicates that it can take two values.

For a single value of r>0,

  • tan1(1r2r)σxσyσx2+σy2 will give the acute angle between the two lines of regression, i.e., 0<θ<π2.
  • tan1(r21r)σxσyσx2+σy2 will give the obtuse angle between the two lines of regression, i.e., π2<θ<π.

4. As the angle between the two lines of regression decreases from π2 to 0, the correlation between the variables increases from 0 to 1.

5. As the angle between the two lines of regression increases from π2 to π, the correlation between the variables increases from 0 to 1.

Solved Examples – Angle Between Two Lines of Regression

Below are a few solved examples that can help in getting a better idea.

Q.1. For two variables, x and y, the correlation coefficient is 0.5. The acute angle between the two lines of regression is tan135. Show that σx=12σy.
Ans: Correlation coefficient, r=0.5

Angle between the two lines of regression, θ=tan135

Substituting these values in the equation of θ,

θ=tan1|(1r2r)σxσyσx2+σy2|

Since, the angle given is acute, we can write

θ=tan1(1r2r)σxσyσx2+σy2

tan135=tan1(10.520.5)σxσyσx2+σy2

35=(10.520.5)σxσyσx2+σy2

35=(10.250.5)σxσyσx2+σy2

35=(0.750.5)σxσyσx2+σy2

35=(32)σxσyσx2+σy2

2(σx2+σy2)=5(σxσy)

2σx25σxσy+2σy2=0

2σx24σxσyσxσy+2σy2=0

2σx(σx2σy)σy(σx2σy)=0

(σx2σy)(2σxσy)=0

(σx2σy)=0 or (2σxσy)=0

(2σxσy)=0

σx=12σy

Q.2. For two variables, x and y, regression coefficients are bxy=0.4 and byx=1.6. If θ is the angle between the two regression lines, then find the value of θ.
Ans: We know that,

tanθ=|(1r2r)σxσyσx2+σy2|

tanθ=|(1r2r)1σx2+σy2σxσy|

tanθ=|(1r2r)1σx2σxσy+σy2σxσy|

tanθ=|(1r2r)1σxσy+σyσx|

tanθ=|1r2rσxσy+rσyσx|

Now,

rσxσy=bxy

rσyσx=byx

r2=bxybyx

tanθ=|1bxybyxbxy+byx|

This is one more formula for the angle between two lines of regression.

tanθ=|10.4×1.60.4+1.6|

tanθ=|10.642.0|

tanθ=|0.362.0|

tanθ=|0.18|

Q.3. If the standard deviation of y is twice the standard deviation of x, find the tangent of the acute angle between the two lines of regression if the correlation coefficient is 0.25.
Ans: Standard deviation of y=2×(Standarddeviationofx)

σy=2σx

Correlation coefficient, r=0.25

We know that for the acute angle between the two lines of regression,

tanθ=(1r2r)σxσyσx2+σy2

tanθ=(10.2520.25)σx(2σx)σx2+(2σx)2

tanθ=(10.06250.25)2σx2σx2+4σx2

tanθ=0.93750.25×25

tanθ=1.5

Q.4. The regression lines of x on y and y on x are 5xy=6 and 2x3y=8, respectively. Find the angle between these lines.
Ans: Line of regression of x on y

5xy=6

5x=y+6

x=y+65

x=0.2y+1.2

bxy=0.2

Line of regression of y on x

2x3y=8

3y=2x8

y=2x83

y=23x+83

byx=23

Angle between the two lines of regression,

θ=tan1|1(0.2)(23)0.2+23|

θ=tan1|30.430.6+23|

θ=tan1|30.40.6+2|

θ=tan1|2.62.6|

θ=tan1|1|

θ=π4

This is the acute angle between the two lines of regression.

Q.5. The two lines of regression are 6x+15y=27 and 6x+3y=15. Calculate the angle between the two lines of regression.
Ans: We do not know which of these lines is the line of regression of x on y and the line of regression of y on x.

Assume that the line of regression of x on y is 6x+15y=27, and the line of regression of y on x is 6x+3y=15.

We now represent these equations in proper form.

Line of regression of x on y

6x+15y=27

6x=15y+27

x=15y+276

x=1.5y+4.5

bxy=1.5

Line of regression of y on x

6x+3y=15

3y=6x+15

y=6x+153

y=2x+5

byx=2

According to this assumption, bxy=1.5 and byx=2 

We know that,

r=bxybyx

r=1.5×2

r=3

This assumption gives a value of r>1, which is not possible. It means our assumption is incorrect.

Thus,

6x+15y=27 – Line of regression of y on x and 6x+3y=15 – Line of regression of x on y

Line of regression of y on x

6x+15y=27

15y=6x+27

y=6x+2715

y=0.4x+1.8

byx=0.4

Line of regression of x on y

6x+3y=15

6x=3y+15

x=3y+156

x=0.5y+2.5

bxy=0.5

Angle between the two lines of regression,

θ=tan1|1bxybyxbxy+byx|

θ=tan1|1(0.5)(0.4)(0.5)+(0.4)|

θ=tan1|10.20.9|

θ=tan1|0.80.9|

θ=tan1|89|

Summary

There are two lines of regression, each trying to minimise the deviations of x and y from their means by the method of least squares. The two lines of regression intersect at the point whose coordinates give the means of both the variables, i.e. (x¯,y¯). The formula for the angle between the two lines of regression is θ=tan1|(1r2r)σxσyσx2+σy2|. When the angle between the two lines of regression is 0 or π then there is a perfect correlation between the two variables. When the angle between the two lines of regression is π2 then there is no correlation between the two variables. Thus, the angle between the two lines of regression is a connecting link between regression and correlation.

FAQs on Angle Between Two Lines of Regression

Students might be having many questions with respect to the Angle Between Two Lines of Regression. Here are a few commonly asked questions and answers.

Q.1. How do you find the angle between two regression lines?
Ans:
You can find the angle between the two lines of regression by using the formula
θ=tan1|(1r2r)σxσyσx2+σy2|
Here,
σx= Standard deviation of the values of x from x¯
σy= Standard deviation of the values of y from y¯
r= Correlation coefficient

Q.2. What are the two lines of regression?
Ans:
The two lines of regressions are a line of regression of x on y and a line of regression of y on x.

  • Line of regression of x on y – It establishes a relationship between x and y. It is used to find the unknown value of the dependent variable x for a known value of the independent variable y.
  • Line of regression of y on x – It establishes a relationship between y and x. It is used to find the unknown value of the dependent variable y for a known value of the independent variable x.

Q.3. Under what condition will the angle between two regression lines become zero?
Ans:
The angle between the two regression lines becomes zero when the two variables are in perfect correlation, either positive or negative.

Q.4. When the correlation coefficient increases from 0 to 1, how does the angle between the regression lines diminish?
Ans:
When the correlation coefficient increases from 0 to 1, the angle between the regression lines diminishes from π2 to 0.

Q.5. Why there are two regression lines? Write the properties of regression lines.
Ans:
There are two lines of regression each trying to minimise the squares of deviations of x and y from their means by the method of least squares.
Line of regression of x on y – It minimises the deviations of squares of values of x from its mean x¯ parallel to the X-axis.
Line of regression of y on x – It minimises the deviations of squares of values of y from its mean y¯ parallel to the Y-axis.

ATTEMPT MOCK TESTS ON EMBIBE

We hope this information about the Angle Between Two Lines of Regression has been helpful. If you have any doubts, comment in the section below, and we will get back to you soon.

Achieve Your Best With 3D Learning, Book Practice, Tests & Doubt Resolutions at Embibe