Written By Simran Simran
Last Modified 18-12-2024

Angle Between Two Lines of Regression: Point of Intersection, Formula

Angle Between Two Lines of Regression: Forecasting or prediction is a crucial part of human life. We try to stay ahead of events and plan for the future, and to do this, we look back at how things took place. They say, “History repeats itself.” Although vague, this is a simplified version of the idea behind mathematical regression.

Regression means going back or taking a step back. Regression analysis is a mathematical way of establishing an average relationship between the independent and dependent variables. We can then use this relationship to predict future results or study past data. In this article, let us learn about lines of regression and the angle between them.

Curve of Regression

A scatter diagram is a graph of dots plotted with the values of the dependent and independent variables. If the two variables are related, the points on the scatter diagram will look more or less concentrated around a curve called the curve of regression.

What are Lines of Regression?

If this curve is a straight line, it is a case of linear regression, and the straight line is called the line of regression. Regression lines are the lines of best fit, so they pass as close as possible to the points of the scatter diagram. These lines express the average relationship between the variables. In linear regression, we find the lines of regression by curve fitting using the principle of least squares.

Why are there Two Lines of Regression?

The method of least squares minimises the sum of the squares of the errors from a specified value. Depending upon which variable we select as the dependent variable from $x$ and $y$ , there can be two lines of regression.

Line of Regression of $x$ on $y$

We use this line to find the value of the dependent variable $x$ for a given value of the independent variable $y$ . We use the least squares method to minimise the sum of the squares of the errors in $x$ , i.e. the differences between the observed values of $x$ and the values of $x$ given by the line, measured parallel to the $X$ -axis.

Line of Regression of $y$ on $x$

We use this line to find the value of the dependent variable $y$ for a given value of the independent variable $x$ . We use the method of least squares to minimise the sum of the squares of the errors in $y$ , i.e. the differences between the observed values of $y$ and the values of $y$ given by the line, measured parallel to the $Y$ -axis.
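To see why the two choices of dependent variable give two different lines, here is a minimal sketch in Python. The data set is hypothetical and chosen only for illustration, and the slopes are computed with the usual closed-form least-squares expressions (sum of products of deviations divided by the sum of squared deviations of the independent variable). Each line is then scored on both kinds of error.

```python
from statistics import fmean

# Hypothetical paired data, chosen only for illustration.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

x_bar, y_bar = fmean(xs), fmean(ys)
s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
s_xx = sum((x - x_bar) ** 2 for x in xs)
s_yy = sum((y - y_bar) ** 2 for y in ys)

b_yx = s_xy / s_xx  # least-squares slope when y is the dependent variable
b_xy = s_xy / s_yy  # least-squares slope when x is the dependent variable

def vertical_sse(slope):
    """Sum of squared errors in y for the line y - y_bar = slope * (x - x_bar)."""
    return sum((y - (y_bar + slope * (x - x_bar))) ** 2 for x, y in zip(xs, ys))

def horizontal_sse(slope):
    """Sum of squared errors in x for the line x - x_bar = slope * (y - y_bar)."""
    return sum((x - (x_bar + slope * (y - y_bar))) ** 2 for x, y in zip(xs, ys))

# Score each line on both criteria (a line x = b*y + c has slope 1/b as y = ...).
y_on_x_vertical = vertical_sse(b_yx)
x_on_y_vertical = vertical_sse(1 / b_xy)
x_on_y_horizontal = horizontal_sse(b_xy)
y_on_x_horizontal = horizontal_sse(1 / b_yx)

print(y_on_x_vertical, x_on_y_vertical)      # y on x wins on errors in y
print(x_on_y_horizontal, y_on_x_horizontal)  # x on y wins on errors in x
```

For this data, each line has the smaller error only in the direction it was fitted for, which is why neither line can replace the other.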

Formula for Lines of Regression

The line of regression of $x$ on $y$ is given by the equation $(x - \bar{x}) = \frac{r σ_{x}}{σ_{y}} (y - \bar{y})$ .

The line of regression of $y$ on $x$ is given by the equation $(y - \bar{y}) = \frac{r σ_{y}}{σ_{x}} (x - \bar{x})$ .

Here,

$\bar{x} =$ Mean of the values of $x$

$\bar{y} =$ Mean of the values of $y$

$σ_{x} =$ Standard deviation of the values of $x$ from $\bar{x}$

$σ_{y} =$ Standard deviation of the values of $y$ from $\bar{y}$

$r =$ Correlation coefficient

According to these formulas, we can see that both these lines pass through the point $(\bar{x}, \bar{y})$ . Thus, this is the point of intersection of these two lines.
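The formulas above can be sketched in Python as follows. The function name `regression_lines`, the helper names `predict_y` and `predict_x`, and the data values are assumptions made for this illustration; population standard deviations are used so that they match a covariance divided by $n$.

```python
from statistics import fmean, pstdev

def regression_lines(xs, ys):
    """Return (x_bar, y_bar, r, b_yx, b_xy) for paired samples xs, ys."""
    n = len(xs)
    x_bar, y_bar = fmean(xs), fmean(ys)
    s_x, s_y = pstdev(xs), pstdev(ys)  # population standard deviations
    cov = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / n
    r = cov / (s_x * s_y)              # correlation coefficient
    b_yx = r * s_y / s_x               # slope in (y - y_bar) = b_yx (x - x_bar)
    b_xy = r * s_x / s_y               # slope in (x - x_bar) = b_xy (y - y_bar)
    return x_bar, y_bar, r, b_yx, b_xy

# Hypothetical data, chosen only for illustration.
xs = [2, 4, 6, 8]
ys = [1, 3, 4, 8]
x_bar, y_bar, r, b_yx, b_xy = regression_lines(xs, ys)

def predict_y(x):
    """Line of regression of y on x."""
    return y_bar + b_yx * (x - x_bar)

def predict_x(y):
    """Line of regression of x on y."""
    return x_bar + b_xy * (y - y_bar)

# Both lines pass through the point of means.
print(predict_y(x_bar) == y_bar, predict_x(y_bar) == x_bar)
```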

Angle Between the Two Lines of Regression

We have seen that the two lines of regression intersect at the point $(\bar{x}, \bar{y})$ . Therefore, for a correlation that is not perfect, the two lines will be at an angle $(θ)$ to each other.

The line of regression of $x$ on $y$ is given by the equation $(x - \bar{x}) = \frac{r σ_{x}}{σ_{y}} (y - \bar{y})$

Let the slope of this line be $m_{1}$ .

We can rewrite the line equation as $(y - \bar{y}) = \frac{σ_{y}}{r σ_{x}} (x - \bar{x})$

When we compare this with the standard line equation, $y = m x + c$ , we get

$m_{1} = \frac{σ_{y}}{r σ_{x}}$

Similarly, the line of regression of $y$ on $x$ is given by the equation

$(y - \bar{y}) = \frac{r σ_{y}}{σ_{x}} (x - \bar{x})$

Here, let the slope of this line be $m_{2}$ .

When we compare this with the standard line equation, $y = m x + c$ , we get

$m_{2} = \frac{r σ_{y}}{σ_{x}}$

For two intersecting lines with slopes $m_{1}$ and $m_{2}$ , the angle between these two lines, $θ$ , can be found by the formula,

$\tan θ = | \frac{m_{1} - m_{2}}{1 + m_{1} m_{2}} |$

$\Rightarrow \tan θ = | \frac{\frac{σ_{y}}{r σ_{x}} - \frac{r σ_{y}}{σ_{x}}}{1 + \frac{σ_{y}}{r σ_{x}} \times \frac{r σ_{y}}{σ_{x}}} |$

$\Rightarrow \tan θ = | \frac{\frac{σ_{y}}{σ_{x}} (\frac{1}{r} - r)}{1 + \frac{σ_{y}^{2}}{σ_{x}^{2}}} |$

$\Rightarrow \tan θ = | \frac{σ_{y}}{σ_{x}} (\frac{1 - r^{2}}{r}) \times \frac{σ_{x}^{2}}{σ_{x}^{2} + σ_{y}^{2}} |$

$∴ \tan θ = | (\frac{1 - r^{2}}{r}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}} |$

$\Rightarrow θ = \tan^{- 1} | (\frac{1 - r^{2}}{r}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}} |$
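As a rough sketch, the final formula translates directly into a small Python function. The function name and the sample values are assumptions for illustration. The case $r = 0$ is handled separately because the formula divides by $r$; in that case the two lines reduce to $y = \bar{y}$ and $x = \bar{x}$ , which are perpendicular.

```python
import math

def angle_between_regression_lines(r, s_x, s_y):
    """Acute angle (in radians) between the two lines of regression."""
    if r == 0:
        # The lines are y = y_bar and x = x_bar, which are perpendicular.
        return math.pi / 2
    t = abs((1 - r * r) / r) * (s_x * s_y) / (s_x ** 2 + s_y ** 2)
    return math.atan(t)

# Illustrative values: r = 0.5 with the standard deviation of y twice that of x.
theta = angle_between_regression_lines(0.5, 1.0, 2.0)
print(math.tan(theta), math.degrees(theta))
```

Only the ratio of the two standard deviations affects the result, so scaling both by the same factor leaves the angle unchanged.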


Type of Correlation and Angle of Regression

The angle between the lines of regression depends on the type of correlation as follows:

1. If we substitute $r = 0$ in the equation of $θ$ we get,

$θ = \tan^{- 1} | (\frac{1 - 0^{2}}{0}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}} |$

$\Rightarrow \tan θ$ is not defined, because the denominator $r$ is zero (equivalently, $\tan θ \rightarrow \infty$ as $r \rightarrow 0$ )

$∴ θ = \frac{π}{2}$

It means that when $r = 0$ , the angle between the lines of regression is $θ = \frac{π}{2}$ .

$r = 0$ means the variables $x$ and $y$ have no correlation.

Thus, for uncorrelated variables $x$ and $y$ , the lines of regression are perpendicular to each other.

2. If we substitute $r = \pm 1$ in the equation of $θ$ we get,

$θ = \tan^{- 1} | [\frac{1 - {(\pm 1)}^{2}}{\pm 1}] \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}} |$

$\Rightarrow θ = \tan^{- 1} (0)$

$∴ θ = 0$ or $θ = π$

It means that when $r = \pm 1$ , the angle between the lines of regression is $θ = 0$ or $θ = π$ .

$r = \pm 1$ means the variables have a perfect positive or perfect negative correlation.

Thus, for perfectly correlated (positive or negative) variables $x$ and $y$ , the lines of regression are coincident.

3. The angle between two intersecting lines can be taken as either the acute angle or its supplement, the obtuse angle. The modulus $(| |)$ in the formula of $θ$ selects the acute one.

For a single value of $r > 0$ ,

$\tan^{- 1} [(\frac{1 - r^{2}}{r}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}}]$ gives the acute angle between the two lines of regression, i.e., $0 < θ < \frac{π}{2}$ .
$π - \tan^{- 1} [(\frac{1 - r^{2}}{r}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}}]$ gives the obtuse angle between the two lines of regression, i.e., $\frac{π}{2} < θ < π$ ; its tangent is $(\frac{r^{2} - 1}{r}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}}$ .

4. As the angle between the two lines of regression decreases from $\frac{π}{2}$ to $0$ , the correlation between the variables increases from $0$ to $1$ .

5. As the angle between the two lines of regression increases from $\frac{π}{2}$ to $π$ , the correlation between the variables decreases from $0$ to $- 1$ .
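The trend in the points above can be checked numerically. The sketch below is only an illustration: the function name is hypothetical, and both standard deviations default to $1$ purely for convenience. It evaluates the acute angle for a few values of $r$ .

```python
import math

def acute_angle_deg(r, s_x=1.0, s_y=1.0):
    """Acute angle in degrees between the lines of regression."""
    if r == 0:
        return 90.0  # uncorrelated variables: perpendicular lines
    t = abs((1 - r * r) / r) * (s_x * s_y) / (s_x ** 2 + s_y ** 2)
    return math.degrees(math.atan(t))

r_values = (0.0, 0.25, 0.5, 0.75, 1.0)
angles = [acute_angle_deg(r) for r in r_values]
for r, angle in zip(r_values, angles):
    print(r, round(angle, 2))
```

The acute angle falls steadily from a right angle at $r = 0$ to zero at $r = 1$ . Because of the modulus, a negative $r$ of the same magnitude gives the same acute angle.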

Solved Examples – Angle Between Two Lines of Regression

Below are a few solved examples that can help in getting a better idea.

Q.1. For two variables, $x$ and $y$ , the correlation coefficient is $0.5$ . The acute angle between the two lines of regression is $\tan^{- 1} \frac{3}{5}$ . Show that $σ_{x} = \frac{1}{2} σ_{y}$ .
Ans: Correlation coefficient, $r = 0.5$

Angle between the two lines of regression, $θ = \tan^{- 1} \frac{3}{5}$

Substituting these values in the equation of $θ$ ,

$θ = \tan^{- 1} | (\frac{1 - r^{2}}{r}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}} |$

Since the angle given is acute, we can write

$θ = \tan^{- 1} [(\frac{1 - r^{2}}{r}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}}]$

$∴ \tan^{- 1} \frac{3}{5} = \tan^{- 1} [(\frac{1 - {0.5}^{2}}{0.5}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}}]$

$\Rightarrow \frac{3}{5} = (\frac{1 - {0.5}^{2}}{0.5}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}}$

$\Rightarrow \frac{3}{5} = (\frac{1 - 0.25}{0.5}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}}$

$\Rightarrow \frac{3}{5} = (\frac{0.75}{0.5}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}}$

$\Rightarrow \frac{3}{5} = (\frac{3}{2}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}}$

$\Rightarrow 2 (σ_{x}^{2} + σ_{y}^{2}) = 5 (σ_{x} σ_{y})$

$\Rightarrow 2 σ_{x}^{2} - 5 σ_{x} σ_{y} + 2 σ_{y}^{2} = 0$

$\Rightarrow 2 {σ_{x}}^{2} - 4 σ_{x} σ_{y} - σ_{x} σ_{y} + 2 {σ_{y}}^{2} = 0$

$\Rightarrow 2 σ_{x} (σ_{x} - 2 σ_{y}) - σ_{y} (σ_{x} - 2 σ_{y}) = 0$

$\Rightarrow (σ_{x} - 2 σ_{y}) (2 σ_{x} - σ_{y}) = 0$

$\Rightarrow (σ_{x} - 2 σ_{y}) = 0$ or $(2 σ_{x} - σ_{y}) = 0$

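Both roots satisfy the given conditions, because the formula is symmetric in $σ_{x}$ and $σ_{y}$ . Taking the second root,

$\Rightarrow (2 σ_{x} - σ_{y}) = 0$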

$∴ σ_{x} = \frac{1}{2} σ_{y}$
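The quadratic in this solution can be checked with a short Python sketch. Writing $k = \frac{σ_{x}}{σ_{y}}$ (a symbol introduced here only for illustration), the factor $\frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}}$ becomes $\frac{k}{k^{2} + 1}$ and the equation becomes $2 k^{2} - 5 k + 2 = 0$ .

```python
import math

def tan_theta(r, k):
    """tan of the acute angle, where k is the ratio s_x / s_y."""
    return abs((1 - r * r) / r) * k / (k * k + 1)

# Roots of 2k^2 - 5k + 2 = 0 by the quadratic formula.
a, b, c = 2.0, -5.0, 2.0
disc = math.sqrt(b * b - 4 * a * c)
roots = sorted([(-b - disc) / (2 * a), (-b + disc) / (2 * a)])

for k in roots:
    print(k, tan_theta(0.5, k))
```

Both roots, $k = \frac{1}{2}$ and $k = 2$ , reproduce $\tan θ = \frac{3}{5}$ .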

Q.2. For two variables, $x$ and $y$ , regression coefficients are $b_{x y} = 0.4$ and $b_{y x} = 1.6$ . If $θ$ is the angle between the two regression lines, then find the value of $θ$ .
Ans: We know that,

$\tan θ = | (\frac{1 - r^{2}}{r}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}} |$

$\Rightarrow \tan θ = | (\frac{1 - r^{2}}{r}) \frac{1}{\frac{σ_{x}^{2} + σ_{y}^{2}}{σ_{x} σ_{y}}} |$

$\Rightarrow \tan θ = | (\frac{1 - r^{2}}{r}) \frac{1}{\frac{σ_{x}^{2}}{σ_{x} σ_{y}} + \frac{σ_{y}^{2}}{σ_{x} σ_{y}}} |$

$\Rightarrow \tan θ = | (\frac{1 - r^{2}}{r}) \frac{1}{\frac{σ_{x}}{σ_{y}} + \frac{σ_{y}}{σ_{x}}} |$

$∴ \tan θ = | \frac{1 - r^{2}}{r \frac{σ_{x}}{σ_{y}} + r \frac{σ_{y}}{σ_{x}}} |$

Now,

$r \frac{σ_{x}}{σ_{y}} = b_{x y}$

$r \frac{σ_{y}}{σ_{x}} = b_{y x}$

$r^{2} = b_{x y} b_{y x}$

$∴ \tan θ = | \frac{1 - b_{x y} b_{y x}}{b_{x y} + b_{y x}} |$

This is one more formula for the angle between two lines of regression.

$∴ \tan θ = | \frac{1 - 0.4 \times 1.6}{0.4 + 1.6} |$

$\Rightarrow \tan θ = | \frac{1 - 0.64}{2.0} |$

$\Rightarrow \tan θ = | \frac{0.36}{2.0} |$

$∴ \tan θ = 0.18$

$\Rightarrow θ = \tan^{- 1} (0.18) \approx 10.2^{\circ}$
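The formula in terms of the regression coefficients is convenient to evaluate directly. Here is a minimal Python sketch; the function name is an assumption made for this illustration.

```python
import math

def angle_from_coefficients(b_xy, b_yx):
    """Acute angle (in radians) from the two regression coefficients."""
    if b_xy + b_yx == 0:
        # Both coefficients share the sign of r, so a zero sum means r = 0.
        return math.pi / 2
    return math.atan(abs((1 - b_xy * b_yx) / (b_xy + b_yx)))

theta = angle_from_coefficients(0.4, 1.6)
print(math.tan(theta), math.degrees(theta))
```

With $b_{x y} = 0.4$ and $b_{y x} = 1.6$ , this gives $\tan θ = 0.18$ , an angle of roughly $10.2^{\circ}$ .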

Q.3. If the standard deviation of $y$ is twice the standard deviation of $x$ , find the tangent of the acute angle between the two lines of regression if the correlation coefficient is $0.25$ .
Ans: Standard deviation of $y = 2 \times$ (Standard deviation of $x$ )

$∴ σ_{y} = 2 σ_{x}$

Correlation coefficient, $r = 0.25$

We know that for the acute angle between the two lines of regression,

$\tan θ = (\frac{1 - r^{2}}{r}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}}$

$\Rightarrow \tan θ = (\frac{1 - {0.25}^{2}}{0.25}) \frac{σ_{x} (2 σ_{x})}{σ_{x}^{2} + {(2 σ_{x})}^{2}}$

$\Rightarrow \tan θ = (\frac{1 - 0.0625}{0.25}) \frac{2 σ_{x}^{2}}{σ_{x}^{2} + 4 σ_{x}^{2}}$

$\Rightarrow \tan θ = \frac{0.9375}{0.25} \times \frac{2}{5}$

$∴ \tan θ = 1.5$

Q.4. The regression lines of $x$ on $y$ and $y$ on $x$ are $5 x - y = 6$ and $2 x - 3 y = - 8$ , respectively. Find the angle between these lines.
Ans: Line of regression of $x$ on $y$

$5 x - y = 6$

$\Rightarrow 5 x = y + 6$

$\Rightarrow x = \frac{y + 6}{5}$

$\Rightarrow x = 0.2 y + 1.2$

$∴ b_{x y} = 0.2$

Line of regression of $y$ on $x$

$2 x - 3 y = - 8$

$\Rightarrow - 3 y = - 2 x - 8$

$\Rightarrow y = \frac{- 2 x - 8}{- 3}$

$\Rightarrow y = \frac{2}{3} x + \frac{8}{3}$

$∴ b_{y x} = \frac{2}{3}$

Angle between the two lines of regression,

$θ = \tan^{- 1} | \frac{1 - (0.2) (\frac{2}{3})}{0.2 + \frac{2}{3}} |$

$\Rightarrow θ = \tan^{- 1} | \frac{\frac{3 - 0.4}{3}}{\frac{0.6 + 2}{3}} |$

$\Rightarrow θ = \tan^{- 1} | \frac{3 - 0.4}{0.6 + 2} |$

$\Rightarrow θ = \tan^{- 1} | \frac{2.6}{2.6} |$

$\Rightarrow θ = \tan^{- 1} | 1 |$

$∴ θ = \frac{π}{4}$

This is the acute angle between the two lines of regression.
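As a cross-check, the same angle can be obtained from the slopes of the two lines with the general formula $\tan θ = | \frac{m_{1} - m_{2}}{1 + m_{1} m_{2}} |$ . The sketch below assumes neither line is vertical; the function name is hypothetical.

```python
import math

def angle_from_slopes(m1, m2):
    """Acute angle (in radians) between two lines with slopes m1 and m2."""
    if 1 + m1 * m2 == 0:
        return math.pi / 2  # perpendicular lines
    return math.atan(abs((m1 - m2) / (1 + m1 * m2)))

m1 = 5.0        # 5x - y = 6   rewritten as y = 5x - 6
m2 = 2.0 / 3.0  # 2x - 3y = -8 rewritten as y = (2/3)x + 8/3
theta = angle_from_slopes(m1, m2)
print(math.degrees(theta))
```

Note that the slope of the line of regression of $x$ on $y$ is $\frac{1}{b_{x y}} = 5$ , not $b_{x y}$ itself.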

Q.5. The two lines of regression are $6 x + 15 y = 27$ and $6 x + 3 y = 15$ . Calculate the angle between the two lines of regression.
Ans: We do not know which of these lines is the line of regression of $x$ on $y$ and which is the line of regression of $y$ on $x$ .

Assume that the line of regression of $x$ on $y$ is $6 x + 15 y = 27$ , and the line of regression of $y$ on $x$ is $6 x + 3 y = 15$ .

We now represent these equations in proper form.

Line of regression of $x$ on $y$

$6 x + 15 y = 27$

$\Rightarrow 6 x = - 15 y + 27$

$\Rightarrow x = \frac{- 15 y + 27}{6}$

$\Rightarrow x = - 2.5 y + 4.5$

$∴ b_{x y} = - 2.5$

Line of regression of $y$ on $x$

$6 x + 3 y = 15$

$\Rightarrow 3 y = - 6 x + 15$

$\Rightarrow y = \frac{- 6 x + 15}{3}$

$\Rightarrow y = - 2 x + 5$

$∴ b_{y x} = - 2$

According to this assumption, $b_{x y} = - 2.5$ and $b_{y x} = - 2$

We know that,

$r^{2} = b_{x y} b_{y x}$

$∴ r^{2} = (- 2.5) \times (- 2) = 5$

$∴ | r | = \sqrt{5}$

This assumption gives a value of $| r | > 1$ , which is not possible. It means our assumption is incorrect.

Thus,

$6 x + 15 y = 27$ – Line of regression of $y$ on $x$ and $6 x + 3 y = 15$ – Line of regression of $x$ on $y$

Line of regression of $y$ on $x$

$6 x + 15 y = 27$

$∴ 15 y = - 6 x + 27$

$∴ y = \frac{- 6 x + 27}{15}$

$∴ y = - 0.4 x + 1.8$

$∴ b_{y x} = - 0.4$

Line of regression of $x$ on $y$

$6 x + 3 y = 15$

$∴ 6 x = - 3 y + 15$

$∴ x = \frac{- 3 y + 15}{6}$

$∴ x = - 0.5 y + 2.5$

$∴ b_{x y} = - 0.5$

Angle between the two lines of regression,

$θ = \tan^{- 1} | \frac{1 - b_{x y} b_{y x}}{b_{x y} + b_{y x}} |$

$∴ θ = \tan^{- 1} | \frac{1 - (- 0.5) (- 0.4)}{(- 0.5) + (- 0.4)} |$

$∴ θ = \tan^{- 1} | \frac{1 - 0.2}{- 0.9} |$

$∴ θ = \tan^{- 1} | \frac{0.8}{- 0.9} |$

$∴ θ = \tan^{- 1} | \frac{8}{- 9} | = \tan^{- 1} \frac{8}{9} \approx 41.6^{\circ}$
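The trial-and-check step in this solution can be automated. The following Python sketch tries both assignments and keeps the one for which $b_{x y} b_{y x}$ lies between $0$ and $1$ . The function name and the tuple representation $(a, b, c)$ for $a x + b y = c$ are assumptions for illustration, and both $a$ and $b$ are assumed non-zero.

```python
def assign_regression_lines(line_a, line_b):
    """Decide which line is y on x and which is x on y.

    Each line is a tuple (a, b, c) standing for a*x + b*y = c, with a and b
    both non-zero (an assumption of this sketch). Returns (b_yx, b_xy).
    """
    for y_on_x, x_on_y in ((line_a, line_b), (line_b, line_a)):
        b_yx = -y_on_x[0] / y_on_x[1]  # solve a*x + b*y = c for y
        b_xy = -x_on_y[1] / x_on_y[0]  # solve a*x + b*y = c for x
        if 0 <= b_yx * b_xy <= 1:      # r^2 must lie in [0, 1]
            return b_yx, b_xy
    raise ValueError("neither assignment gives 0 <= r^2 <= 1")

b_yx, b_xy = assign_regression_lines((6, 15, 27), (6, 3, 15))
tan_theta = abs((1 - b_xy * b_yx) / (b_xy + b_yx))
print(b_yx, b_xy, tan_theta)
```

The two candidate products are reciprocals of each other, so at most one assignment can satisfy the condition unless both products equal $1$ .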

Summary

There are two lines of regression, one minimising the squared errors in $x$ and the other the squared errors in $y$ , each by the method of least squares. The two lines of regression intersect at the point whose coordinates give the means of both the variables, i.e. $(\bar{x}, \bar{y})$ . The formula for the angle between the two lines of regression is $θ = \tan^{- 1} | (\frac{1 - r^{2}}{r}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}} |$ . When the angle between the two lines of regression is $0$ or $π$ , there is a perfect correlation between the two variables. When the angle between the two lines of regression is $\frac{π}{2}$ , there is no correlation between the two variables. Thus, the angle between the two lines of regression is a connecting link between regression and correlation.

FAQs on Angle Between Two Lines of Regression

Students may have many questions about the angle between two lines of regression. Here are a few commonly asked questions and answers.

Q.1. How do you find the angle between two regression lines?
Ans: You can find the angle between the two lines of regression by using the formula
$θ = \tan^{- 1} | (\frac{1 - r^{2}}{r}) \frac{σ_{x} σ_{y}}{σ_{x}^{2} + σ_{y}^{2}} |$
Here,
$σ_{x} =$ Standard deviation of the values of $x$ from $\bar{x}$
$σ_{y} =$ Standard deviation of the values of $y$ from $\bar{y}$
$r =$ Correlation coefficient

Q.2. What are the two lines of regression?
Ans: The two lines of regression are the line of regression of $x$ on $y$ and the line of regression of $y$ on $x$ .

Line of regression of $x$ on $y$ – It establishes a relationship between $x$ and $y$ . It is used to find the unknown value of the dependent variable $x$ for a known value of the independent variable $y$ .
Line of regression of $y$ on $x$ – It establishes a relationship between $y$ and $x$ . It is used to find the unknown value of the dependent variable $y$ for a known value of the independent variable $x$ .

Q.3. Under what condition will the angle between two regression lines become zero?
Ans: The angle between the two regression lines becomes zero when the two variables are in perfect correlation, either positive or negative.

Q.4. When the correlation coefficient increases from $0$ to $1$ , how does the angle between the regression lines diminish?
Ans: When the correlation coefficient increases from $0$ to $1$ , the angle between the regression lines diminishes from $\frac{π}{2}$ to $0$ .

Q.5. Why are there two regression lines? Write the properties of regression lines.
Ans: There are two lines of regression because either variable can be treated as the dependent one, and the method of least squares minimises a different set of errors in each case.
Line of regression of $x$ on $y$ – It minimises the sum of the squares of the deviations of the observed values of $x$ from the line, measured parallel to the $X$ -axis.
Line of regression of $y$ on $x$ – It minimises the sum of the squares of the deviations of the observed values of $y$ from the line, measured parallel to the $Y$ -axis.

