ABy Admin
May 25'23

Exercise

You are given the following summary statistics:

[[math]] \begin{aligned} \overline{x} &= 3.500 \\ \overline{y} &= 2.840 \\ \sum (x_i - \overline{x})^2 &= 10.820 \\ \sum (x_i - \overline{x} ) (y_i - \overline{y}) &= 2.677 \\ \sum (y_i - \overline{y})^2 &= 1.125. \end{aligned} [[/math]]

Determine the equation of the regression line, using the least squares method.

  • [math]y=1.97 + 0.25x [/math]
  • [math]y =0.78 + 0.59x [/math]
  • [math] y = 0.57 + 0.65 xy 0.39 + 0.70 x [/math]
  • [math]y = 0.39 + 0.70x [/math]
  • The correct answer is not given by (A), (B), (C), or (D).

Copyright 2023. The Society of Actuaries, Schaumburg, Illinois. Reproduced with permission.

ABy Admin
May 26'23

Key: B

The first split is [math]X_1 \lt t_1[/math]. This requires a horizontal line at [math]t_1[/math] on the vertical axis. Graphs B, C, and E have such a line.

The second split is the case where the first split is true. That means all further action is below the line just described. All three graphs do that. The second split is [math]x_2 \lt t_2.[/math] This requires a vertical line at [math]t_2[/math] on the horizontal axis with the line only going up to [math]t_1.[/math] Again, all three graphs have this.

The third split is when the second split is true. That means all further action is to the left of the line just described. That rules out graph C. The only difference between graphs B and E is which part relates to node C and which to node D. The third split indicates that node C is the case when [math]x_1 \lt t_3.[/math] Only graph B has this region marked as C.

Copyright 2023. The Society of Actuaries, Schaumburg, Illinois. Reproduced with permission.

00