guide:75f4dbb8dc: Difference between revisions

Latest revision as of 02:39, 30 June 2023

Loss reserving refers to the process of calculating, at a particular time, an amount which represents liabilities associated with a portfolio of insurance policies. Loss reserves are useful to various stakeholders such as internal management, investors and regulators.

Loss Development

Each portfolio of policies will trigger a number of claims and every such claim has a life of its own from the initial reporting of the claim to its closure/settlement. The insurer will carefully record expenses and payments for claims associated with a portfolio of policies. The actuary can then use historical claims data to establish reserves for various stakeholders. We give a brief overview of some of the key terminology relevant to historical claims development data:

Term	Description
Accident Year	Refers to the year when a claim triggering incident took place.
Case outstanding	Represents an amount, established by a claims professional (typically a claims adjuster), representing the dollar value of the liabilities to the insurer associated with a group of claims.
Reported claims	The case outstanding plus the payments made to the insureds for a group of claims.
Ultimate claims	The cumulative sum of all payments made to insureds for a group of settled claims i.e., the total cost to the insurer to settle a group of claims.
IBNR	The difference between ultimate claims and reported claims.

The Development Triangle

A development triangle is a common way to organize and display historical claims data for the purposes of actuarial analysis. The most common data sets displayed in a development triangle are case outstanding, reported claims, paid claims and reported claim count.

A development triangle organizes claims data as a table or triangular matrix [math]C_{i,j}[/math] with [math]i = 0,\ldots,I[/math] and [math]j = 0,\ldots,I - i [/math]. The rows of the triangle represent accident years and the columns represent valuation dates. For example, in the table below, 43,169,009 represents loss amounts related to claims occurring in accident year 0, valued as of two years.

Reported claims
Accident Year	0	1	2	3	4
0	37,017,487	43,169,009	45,568,919	46,784,558	47,337,318
1	38,954,484	46,045,718	48,882,924	50,219,672
2	41,155,776	49,371,478	52,358,476
3	42,394,069	50,584,112
4	44,755,243

We would like to develop methods to estimate the non-observables [math]C_{i,j}[/math] where [math] j \gt I -i[/math] especially the unknown ultimate number [math]C_{i,J}[/math].

Chain Ladder Technique

The chain ladder technique, also known as the developmental technique, assumes that historical claims development is indicative or predictive of future claims development. Chain ladder methods rely on historical data found in the #development triangle to predict future claims development.

Development Factors

We are primarily interested in the progression of the [math]C_{i,j}[/math] as [math]j[/math] increases. Since [math]C_{i,j}[/math] typically represents cumulative quantities such as cumulative reported or paid claims, we expect [math]C_{i,j}[/math] to be increasing in [math]j[/math].

The development factors are the observed ratios [math]C_{i,j+1}/C_{i,j}[/math]. They represent a measure of growth of claims development associated with a single accident year. If [math]I[/math] is large enough, we should expect the development factors to tend to 1 as all claims associated with an accident year are permanently closed. We calculate the development factors corresponding to the development triangle of reported claims presented above:

Development factors
Accident year	0	1	2	3	To ult
0	1.166	1.056	1.027	1.012	1.00
1	1.182	1.062	1.027
2	1.200	1.061
3	1.193

Selecting Development Factors

The chain ladder technique assumes that historical deveopment factors are predictive of future development factors; consequently, one is expected to use such historical factors to select development factors that can be used to project future claims development for any given accident year. There are different methods used to select development factors. The simplest approach is to use a particular average of the historical development factors. To exemplify this simple approach, we calculate various averages associated with the table of historical development factors above:

Averages
Averaging Method	0	1	2	3	To ult
Simple average last 3 years	1.192	1.06	1.027	1.012	1.000
Volume weighted last 3 years	1.186	1.059	1.027	1.012	1.000
Selected	1.192	1.06	1.027	1.012	1.000

In this particular case, the development factors used for future development is the average development factor for the last 3 observable years. For instance, the selected development factor for development year 0 is 1.192, the average of 1.193, 1.2 and 1.182.

In the table above, the entry corresponding to row selected and column to ult is a tail factor -- a factor used to deduct the ultimate reported claims value from the latest observable reported claims value. Ideally the value of [math]I[/math] in the development triangle is large enough that the tail factor equals 1. In the example above, the tail factor is 1: the insurer expects all claims' development to cease 5 years after the accident year. It's important to note that the tail factor can exceed 1 in so called long tailed lines such as worker's compensation.

Project Ultimate Claims

The key to projecting ultimate claims is to calculate cumulative claim development factors (CDF) which are defined as follows:

[[math]] \begin{equation} \label{cdf} \hat{F_i} = \prod_{j = I- i}^{J-1}\hat{f_j} \end{equation} [[/math]]

with [math]\hat{f}_j[/math] denoting the selected development factors. The projection for ultimate claims for accident year [math]i[/math] equals

[[math]] \begin{equation} \label{puc} C_{i,I-i}\hat{F_i}. \end{equation} [[/math]]

Using formula \ref{puc}, we can calculate the projected ultimate reported claims for the example above (with a selected tail factor of 1):

Estimation of ultimate claims
Accident year	Reported claims	CDF (3 year avg / Mack-Method)	Projected ultimate claims (3 year avg / Mack-Method)
0	47,337,318	1.000 / 1.000	47,337,318 / 47,337,318
1	50,219,672	1.012 / 1.012	50,822,308 / 50,822,308
2	52,358,476	1.039 / 1.039	54,400,457 / 54,400,457
3	50,584,112	1.102 / 1.101	55,743,691 / 55,693,107
4	44,755,243	1.313 / 1.305	58,763,634 / 50,595,802
Total	245,659,807		267,472,394 / 259,253,978

Here is a summary on how to project ultimate claims:

Project Ultimate Claims Using the Chain Ladder Method

Create development triangle
Select development factors using historical development factors (averages, Mack-method, etc.)
Select tail factor
Calculate CDFs using formula \ref{cdf}
Calculate ultimate claims using formula \ref{puc}

The Claim Ratio Method

The claim ratio method, or loss ratio method, is one of the simplest methods to project ultimate unpaid claims. The method projects ultimate claims for an accident year by multiplying a predetermined exposure level by a selected claim ratio representing claims per unit of exposure:

Projection for ultimate claims = Exposure * Selected claim ratio.

The claim ratio method's accuracy depends on the ability to establish a proper exposure level associated with an accident year and then establishing an estimate for the ultimate claim per unit of exposure. The projection for ultimate claims associated with the chain ladder method is derived by multiplying the reported claims for an accident year by the selected CDF whereas the claim ratio method multiplies a selected exposure level associated with an accident year by a selected claim ratio. In other words, the chain ladder technique depends on the initial claim experience of an accident year whereas the claims ratio method depends on an exposure level which is a much more natural measure of size for a portfolio of policies.

A common type of exposure is earned premium. Earned premium is the portion of an insurance premium which is considered "earned" by the insurer, based on the part of the policy period that the insurance has been in effect, and during which the insurer has been exposed to loss. For instance, if a 365-day policy with a full premium payment at the beginning of the term has been in effect for 120 days, 120/365 of the premium is considered earned. Earned premium will not be returned to the insured if the policy is cancelled.

Example

Claim ratio method
Accident year	Reported claims at 12/31/08	Initial selected ultimate claims	On-level earned premium	Trend factor at 07/01/2008	Trended adjusted ultimate claims	Trended adjusted claim ratio
2000	10,000,000	10,012,500	24,000,000	2.954	29,576,925	83.0%
2001	8,000,000	8,220,000	18,000,000	2.580	21,207,600	79.0%
2002	9,400,000	9,591,000	19,000,000	2.253	21,608,523	76.0%
2003	15,600,000	13,845,000	23,000,000	1.968	27,246,960	79.0%
2004	16,500,000	19,700,000	32,000,000	1.719	33,864,300	79.0%
2005	18,500,000	25,700,000	47,000,000	1.501	38,575,700	82.0%
2006	16,500,000	29,850,000	50,000,000	1.311	39,133,350	78.0%
2007	14,000,000	42,800,000	57,000,000	1.145	49,006,000	86.0%
2008	8,700,000	51,150,000	62,000,000	1.000	51,150,000	83.0%

Suppose one wishes to project ultimate claims for accident year 2008 using the claim ratio method applied to the information contained in the table above. The first step is to select the exposure level associated with accident year 2008 which, in this scenario, is given by 62,000,000, the last entry in column labelled on-level earned premium. The second step is to select a claims ratio based on the observed historical claims ratios represented by the last column in the table above. Here we use a simple approach: the claims ratio is set to 79%, the average of the historical claims ratios for accident years 2002 to 2006. We finally obtain the projected ultimate claims for accident year 2008:

Accident year	Reported claims at 12/31/08	On-level earned premium	Selected claim ratio (avg 2002-2006)	Projected ultimate claims
2008	8,700,000	62,000,000	79.0%	48,980,000

A few comments about the table are in order. The exposure type selected here is on-level earned premium which represents earned premium adjusted to current rate levels -- since we're interested in projecting ultimate claims for accident year 2008, the on-level earned premium represents an adjustment to 2008 rate levels. The trend adjusted ultimate claims, second to last column, equals the selected ultimate claims multiplied by a trend factor (fifth column) which adjusts historical claim costs to mid 2008 cost levels. In summary, adjustments are made to both historical earned premium and historical claim costs to obtain claim ratios which represent a more realistic sample for the selection of a claim ratio for accident year 2008.

The Bornhuetter-Ferguson method

Unlike the chain ladder method, the Bornhuetter-Ferguson method doesn't rely exclusively on historical claims development to project ultimate claims. The projected ultimate claims under this method is equal to the current reported claims plus the expected unreported claims:

Ultimate claims = Actual Reported Claims + Expected Unreported Claims

The expected unreported claims is calculated by multiplying the expected claims multiplied by an estimate of the % yet to be reported:

Ultimate claims = Actual Reported Claims + Expected Claims * % Unreported

The estimate for the percentage yet to be reported for accident year [math]i[/math] is based on the developmental method and is set to [math]1 - \hat{F_i}^{-1} [/math]. Mathematically the projected ultimate claims for accident year [math]i[/math] equals

[[math]] \begin{equation} \label{bf-method-main} C_{i,I-i} + (1 - \hat{F_{i}}^{-1})\mu_i = Z_i \hat{F_i}C_{i,I-i} + (1 - Z_i)\mu_i \end{equation} [[/math]]

with [math]\mu_i [/math] the expected ultimate claims for accident year [math]i[/math] and [math]Z_i = \hat{F_i}^{-1} [/math]. Examining \ref{bf-method-main}, we notice that [math]\hat{F_i}C_{i,I-i}[/math] is the projection for ultimate claims given by the developmental technique; consequently, we can view the projection for ultimate claims using the Bornhuetter-Ferguson method as a kind of credibility estimator with credibility weight [math]Z_i[/math].

The Bornhuetter-Ferguson method as MMSE

If we impose specific conditions on the variables [math]C_{i,j} [/math] (the developmental triangle), then the Bornhuetter-Ferguson projection for ultimate claims is a minimum mean square estimator.

Suppose the following holds:

If [math]C_{i,J}[/math] denotes the ultimate claims, then [math]\mu_i = \operatorname{E}[C_{i,J}] [/math] is known for all [math] 1 \leq i \leq I [/math]
[math]C_{i,J} - C_{i,j} [/math] is independent of [math]C_{i,j}[/math] for any [math]1 \leq j \leq I [/math]
[math]F_{i,j} = \mu_i / \operatorname{E}[C_{i,j}] [/math] are known
[math]\operatorname{Var}[C_{i,j}F_{i,j}] = F_{i,j}\operatorname{Var}[C_{i,J}][/math]

Then the the Bornhuetter-Ferguson estimator for [math]C_{i,J} [/math] equals

[[math]] \hat{C}_{i,J} = Z_i F_iC_{i,I-i} + (1 - Z_i)\mu_i. [[/math]]

with the shorthand [math]F_i = F_{i,J}[/math]. It is clear from the assumptions above that [math]\hat{C}_{i,J}[/math] is an unbiased estimator of [math]C_{i,J}[/math]. Furthermore, the estimator minimizes variance among all weighted averages of [math]F_iC_{i,I-i} [/math] and [math]\mu_i[/math]:

Proposition (Minimum variance property of the Bornhuetter-Ferguson estimator)

[[math]] \hat{C}_{i,J} = \underset{X \in S_i}{\operatorname{argmin}} \operatorname{E}[ (C_{i,J} - X)^2 ],\, S_i = \{ ZF_iC_{i,I-i} + (1-Z)\mu_i \}. [[/math]]

Show Proof

We can rewrite the minimization problem as

[[math]] \begin{equation} \label{bf-argmin} \underset{Z}{\operatorname{argmin}} \operatorname{Var}[ Z(C_{i,I-i}F_i - C_{i,J}) + (1-Z)(\mu_i - C_{i,J}) ] \end{equation} [[/math]]

It is first shown that [math]C_{i,I-i}F_i - C_{i,J} [/math] and [math]\mu_i - C_{i,J} [/math] are uncorrelated:

[[math]] \begin{align*} \operatorname{Cov}[C_{i,I-i}F_i - C_{i,J},\mu_i - C_{i,J}] &= F_i\operatorname{Cov}[C_{i,I-i},C_{i,J}] - \operatorname{Var}[C_{i,J}] \\ &= F_i \operatorname{Var}[C_{i,I-i}] - \operatorname{Var}[C_{i,J}] \\ &= 0. \end{align*} [[/math]]

It follows, from the proof of the variance minimizing property of the Mack-method, that the solution to \ref{bf-argmin} is given by setting the weight [math]Z[/math] proportional to the inverse of the variance of [math]C_{i,I-i}F_i - C_{i,J}[/math]:

[[math]] \begin{equation} \label{bf-minvar-sol} Z = \frac{\operatorname{Var}[C_{i,I-i}F_i - C_{i,J}]^{-1}}{\operatorname{Var}[C_{i,I-i}F_i - C_{i,J}]^{-1} + \operatorname{Var}[C_{i,J} ]^{-1}}. \end{equation} [[/math]]

Using the fact that

[[math]] \begin{align*} \operatorname{Var}[C_{i,I-i}F_i - C_{i,J}] &= F_i^2 \operatorname{Var}[C_{i,I-i}] + \operatorname{Var}[C_{i,J}] - 2 F_i \operatorname{Cov}[C_{i,J},C_{i,I-i}] \\ &= \operatorname{Var}[C_{i,J}](F_i - 1) \end{align*} [[/math]]

in equation \ref{bf-minvar-sol} shows that [math]Z[/math] equals [math]F_i^{-1}[/math].

■

Frequency-Severity Models

The underlying premise of frequency-severity models is that claim projections can be improved by projecting ultimate claim count and ultimate cost per claim separately. We define the following triangles:

[math]C_{i,j}[/math] represents the developmental triangle for total aggregate claim costs
[math]N_{i,j}[/math] represents the developmental triangle for aggregate claim count
[math]S_{i,j}[/math] represents the developmental triangle for the average claim cost

Claim Frequency

There are generally two types of claim frequencies, [math]N_{i,j}[/math], available: reported claim counts or settled claim counts. Reported claim frequencies typically have different developmental patterns compared to settled claim counts: claims for an accident tend to be reported quickly while settling claims might take much longer. In general, settled claim counts will have longer tails than reported claims and claim counts will be low for early development years. Because of this phenomenon, it is often preferable to work with incremental claim counts when dealing with settled claim frequencies:

Projecting settled claim frequency

Build the incremental developmental triangle [math]X_{i,j} = N_{i,j+1}-N_{i,j} [/math] for [math]j\gt1[/math] and [math]X_{i,0} = N_{i,0} [/math], where [math]N_{i,j}[/math] is the settled claim count triangle.
Build the developmental triangle [math]N'_{i,j}[/math] for the reported claim counts.
Estimate the proportion of the ultimate settled claim frequency for each accident year that is settled in developmental year [math]j[/math] and denote it by [math]\hat{\gamma_j}[/math].
Use the developmental technique on the triangle [math]N'_{i,j} [/math] to estimate [math]N'_{i,j}[/math] for [math]i+j \gt I [/math].
The projection [math]\hat{X}_{i,j} [/math] equals [math]\hat{\gamma}_{j} \hat{N'}_{i,J}[/math] and the projection [math]\hat{N}_{i,j}[/math] equals [math]\hat{\beta}_j \hat{N'}_{i,J} [/math] where [math]\hat{\beta}_j \ = \sum_{i=1}^j\hat{\gamma}_i[/math].

Claim Severity

The simplest approach to projecting average claim cost is to divide the claims developmental triangle [math]C_{i,j}[/math] by the [math]N_{i,j}[/math] , where [math]N_{i,j}[/math] corresponds to the developmental triangle for the settled claim frequency, to obtain the triangle [math]S_{i,j}[/math] and then use the developmental technique to project forward:

Projecting avg claim cost: basic method

Build the claims developmental triangle [math]C_{i,j}[/math].
Build the settled claim count developmental triangle [math]N_{i,j}[/math].
Build the avg claim cost developmental triangle [math]S_{i,j} = C_{i,j}/N_{i,j}[/math].
Use the developmental technique to project [math]S_{i,j}[/math] for [math]i+j \gt I [/math].

One of the weaknesses of this simple method is that it doesn't take into account that, often times, the avg claim cost is correlated with the developmental year: claims settled much later than the accident year tend to have larger settlement costs. In fact, this is almost a tautology: an insurer is more likely to settle small claims in a shorter amount of time while delaying settlement for potentially large complex settlements. This correlation then motivates the second approach to analyzing avg claim costs:

Projecting avg claim cost: developmental year method

Build the incremental claims developmental triangle [math]X_{i,j} = C_{i,j+1}-C_{i,j}[/math].
Build the incremental settled claim count developmental triangle [math]Y_{i,j} = N_{i,j+1}-N_{i,j}[/math].
Build the avg settlement cost developmental triangle [math]S_{i,j} = X_{i,j}/Y_{i,j}[/math].
The projection [math]\hat{S}_{i,j} [/math], [math]i+j\gt I [/math], is set to the average of [math]S_{i,j}[/math] over the accident year [math]i[/math]: [math]\hat{S}_{i,j} = \frac{1}{I-j}\sum_{i=0}^{I-j}S_{i,j}[/math].

References

"ADVANCED SHORT-TERM ACTUARIAL MATHEMATICS STUDY NOTE: OUTSTANDING CLAIMS RESERVES" (PDF). Society of Actuaries. Retrieved 18 February 2023.

Wikipedia References

Friedland, Jacqueline. "Estimating Unpaid Claims Using Basic Techniques" (PDF). Casualty Actuarial Society. Retrieved 28 August 2019.
Wikipedia contributors. "Chain-ladder method". Wikipedia. Wikipedia. Retrieved 28 August 2019.
Wikipedia contributors. "Bornhuetter–Ferguson method". Wikipedia. Wikipedia. Retrieved 28 August 2019.

@@ Line 84: / Line 84: @@
 ===Selecting Development Factors ===
-The chain ladder technique assumes that historical age-to-age factors are predictive of future age-to-age factors; consequently, one is expected to use such historical factors to select age-to-age factors that can be used to project future claims development for any given accident year. There are different methods used to select age-to-age factors. The simplest approach is to use a particular average of the age-to-age factors. To exemplify this simple approach, we calculate various averages associated with the table of age-to-age factors above:
+The chain ladder technique assumes that historical deveopment factors are predictive of future development factors; consequently, one is expected to use such historical factors to select development factors that can be used to project future claims development for any given accident year. There are different methods used to select development factors. The simplest approach is to use a particular average of the historical development factors. To exemplify this simple approach, we calculate various averages associated with the table of  historical development factors above:
 {| class="table"
@@ Line 264: / Line 264: @@
 with the shorthand <math>F_i = F_{i,J}</math>. It is clear from the assumptions above that <math>\hat{C}_{i,J}</math> is an unbiased estimator of <math>C_{i,J}</math>. Furthermore, the estimator minimizes variance among all weighted averages of <math>F_iC_{i,I-i} </math> and <math>\mu_i</math>:
+<div class="card mb-4"><div class="card-header"> Proposition (Minimum variance property of the Bornhuetter-Ferguson estimator)</div><div class="card-body">
+<p class="card-text">
 <math display="block">
@@ Line 270: / Line 272: @@
 </math>
-<div class="text-right">
+</p>
+<span class="mw-customtoggle-theo.lassoPiecewiseLinear btn btn-primary" >Show Proof</span><div class="mw-collapsible mw-collapsed" id="mw-customcollapsible-theo.lassoPiecewiseLinear"><div class="mw-collapsible-content p-3">
-<proofs page="guide_proofs:75f4dbb8dc" section="bf-minvar" label="Bornhuetter-Ferguson Minimizes Variance" />
+We can rewrite the minimization problem as
-</div>
+<math display="block">
+\begin{equation}
+\label{bf-argmin}
+\underset{Z}{\operatorname{argmin}} \operatorname{Var}[ Z(C_{i,I-i}F_i - C_{i,J}) + (1-Z)(\mu_i - C_{i,J}) ]
+\end{equation}
+</math>
+It is first shown that <math>C_{i,I-i}F_i - C_{i,J} </math> and <math>\mu_i - C_{i,J} </math> are uncorrelated:
+<math display="block">
+\begin{align*}
+\operatorname{Cov}[C_{i,I-i}F_i - C_{i,J},\mu_i - C_{i,J}] &= F_i\operatorname{Cov}[C_{i,I-i},C_{i,J}] - \operatorname{Var}[C_{i,J}] \\
+&=  F_i \operatorname{Var}[C_{i,I-i}] - \operatorname{Var}[C_{i,J}]  \\
+&= 0.
+\end{align*}
+</math>
+It follows, from the proof of the variance minimizing property of the Mack-method, that the solution to \ref{bf-argmin} is given by setting the weight <math>Z</math> proportional to the inverse of the variance of <math>C_{i,I-i}F_i - C_{i,J}</math>:
+<math display="block">
+\begin{equation}
+\label{bf-minvar-sol}
+Z = \frac{\operatorname{Var}[C_{i,I-i}F_i - C_{i,J}]^{-1}}{\operatorname{Var}[C_{i,I-i}F_i - C_{i,J}]^{-1} + \operatorname{Var}[C_{i,J} ]^{-1}}.
+\end{equation}
+</math>
+Using the fact that
+<math display="block">
+\begin{align*}
+\operatorname{Var}[C_{i,I-i}F_i - C_{i,J}] &= F_i^2 \operatorname{Var}[C_{i,I-i}] + \operatorname{Var}[C_{i,J}] - 2 F_i \operatorname{Cov}[C_{i,J},C_{i,I-i}] \\
+&= \operatorname{Var}[C_{i,J}](F_i - 1)
+\end{align*}
+</math>
+in equation \ref{bf-minvar-sol} shows that <math>Z</math> equals <math>F_i^{-1}</math>.
+<div class="text-end"> &#x25A0;</div></div></div></div></div>
 ==Frequency-Severity Models==
@@ Line 307: / Line 351: @@
 #Use the developmental technique to project <math>S_{i,j}</math> for <math>i+j > I </math>.
 </proc>
 One of the weaknesses of this simple method is that it doesn't take into account that, often times, the avg claim cost is correlated with the developmental year: claims settled much later than the accident year tend to have larger settlement costs. In fact, this is almost a tautology: an insurer is more likely to settle small claims in a shorter amount of time while delaying settlement for potentially large complex settlements. This correlation then motivates the second approach to analyzing avg claim costs: