guide:75f4dbb8dc: Difference between revisions

From Stochiki
mNo edit summary
mNo edit summary
 
(2 intermediate revisions by the same user not shown)
Line 84: Line 84:
===Selecting Development Factors ===
===Selecting Development Factors ===


The chain ladder technique assumes that historical age-to-age factors are predictive of future age-to-age factors; consequently, one is expected to use such historical factors to select age-to-age factors that can be used to project future claims development for any given accident year. There are different methods used to select age-to-age factors. The simplest approach is to use a particular average of the age-to-age factors. To exemplify this simple approach, we calculate various averages associated with the table of age-to-age factors above:
The chain ladder technique assumes that historical deveopment factors are predictive of future development factors; consequently, one is expected to use such historical factors to select development factors that can be used to project future claims development for any given accident year. There are different methods used to select development factors. The simplest approach is to use a particular average of the historical development factors. To exemplify this simple approach, we calculate various averages associated with the table of historical development factors above:


{| class="table"
{| class="table"
Line 264: Line 264:
with the shorthand <math>F_i = F_{i,J}</math>. It is clear from the assumptions above that <math>\hat{C}_{i,J}</math> is an unbiased estimator of <math>C_{i,J}</math>. Furthermore, the estimator minimizes variance among all weighted averages of <math>F_iC_{i,I-i} </math> and <math>\mu_i</math>:
with the shorthand <math>F_i = F_{i,J}</math>. It is clear from the assumptions above that <math>\hat{C}_{i,J}</math> is an unbiased estimator of <math>C_{i,J}</math>. Furthermore, the estimator minimizes variance among all weighted averages of <math>F_iC_{i,I-i} </math> and <math>\mu_i</math>:


<div class="card mb-4"><div class="card-header"> Proposition (Minimum variance property of the Bornhuetter-Ferguson estimator)</div><div class="card-body">
<p class="card-text">
<math display="block">
<math display="block">


Line 270: Line 272:
</math>
</math>


<div class="text-right">
</p>
<span class="mw-customtoggle-theo.lassoPiecewiseLinear btn btn-primary" >Show Proof</span><div class="mw-collapsible mw-collapsed" id="mw-customcollapsible-theo.lassoPiecewiseLinear"><div class="mw-collapsible-content p-3">


<proofs page="guide_proofs:75f4dbb8dc" section="bf-minvar" label="Bornhuetter-Ferguson Minimizes Variance" />
We can rewrite the minimization problem as


</div>
<math display="block">
\begin{equation}
\label{bf-argmin}
\underset{Z}{\operatorname{argmin}} \operatorname{Var}[ Z(C_{i,I-i}F_i - C_{i,J}) + (1-Z)(\mu_i - C_{i,J}) ]
\end{equation}
</math>
 
It is first shown that <math>C_{i,I-i}F_i - C_{i,J} </math> and <math>\mu_i - C_{i,J} </math> are uncorrelated:
 
<math display="block">
\begin{align*}
\operatorname{Cov}[C_{i,I-i}F_i - C_{i,J},\mu_i - C_{i,J}] &= F_i\operatorname{Cov}[C_{i,I-i},C_{i,J}] - \operatorname{Var}[C_{i,J}] \\
 
&=  F_i \operatorname{Var}[C_{i,I-i}] - \operatorname{Var}[C_{i,J}]  \\
 
&= 0.
 
\end{align*}
</math>
 
It follows, from the proof of the variance minimizing property of the Mack-method, that the solution to \ref{bf-argmin} is given by setting the weight <math>Z</math> proportional to the inverse of the variance of <math>C_{i,I-i}F_i - C_{i,J}</math>:
 
<math display="block">
\begin{equation}
\label{bf-minvar-sol}
Z = \frac{\operatorname{Var}[C_{i,I-i}F_i - C_{i,J}]^{-1}}{\operatorname{Var}[C_{i,I-i}F_i - C_{i,J}]^{-1} + \operatorname{Var}[C_{i,J} ]^{-1}}.
\end{equation}
</math>
 
Using the fact that 
 
<math display="block">
\begin{align*}
\operatorname{Var}[C_{i,I-i}F_i - C_{i,J}] &= F_i^2 \operatorname{Var}[C_{i,I-i}] + \operatorname{Var}[C_{i,J}] - 2 F_i \operatorname{Cov}[C_{i,J},C_{i,I-i}] \\
&= \operatorname{Var}[C_{i,J}](F_i - 1)
 
\end{align*}
</math>
in equation \ref{bf-minvar-sol} shows that <math>Z</math> equals <math>F_i^{-1}</math>.
 
 
<div class="text-end"> &#x25A0;</div></div></div></div></div>


==Frequency-Severity Models==
==Frequency-Severity Models==
Line 307: Line 351:
#Use the developmental technique to project <math>S_{i,j}</math> for <math>i+j > I </math>.
#Use the developmental technique to project <math>S_{i,j}</math> for <math>i+j > I </math>.
</proc>
</proc>


One of the weaknesses of this simple method is that it doesn't take into account that, often times, the avg claim cost is correlated with the developmental year: claims settled much later than the accident year tend to have larger settlement costs. In fact, this is almost a tautology: an insurer is more likely to settle small claims in a shorter amount of time while delaying settlement for potentially large complex settlements. This correlation then motivates the second approach to analyzing avg claim costs:  
One of the weaknesses of this simple method is that it doesn't take into account that, often times, the avg claim cost is correlated with the developmental year: claims settled much later than the accident year tend to have larger settlement costs. In fact, this is almost a tautology: an insurer is more likely to settle small claims in a shorter amount of time while delaying settlement for potentially large complex settlements. This correlation then motivates the second approach to analyzing avg claim costs:  

Latest revision as of 01:39, 30 June 2023

Loss reserving refers to the process of calculating, at a particular time, an amount which represents liabilities associated with a portfolio of insurance policies. Loss reserves are useful to various stakeholders such as internal management, investors and regulators.

Loss Development

Each portfolio of policies will trigger a number of claims and every such claim has a life of its own from the initial reporting of the claim to its closure/settlement. The insurer will carefully record expenses and payments for claims associated with a portfolio of policies. The actuary can then use historical claims data to establish reserves for various stakeholders. We give a brief overview of some of the key terminology relevant to historical claims development data:

Term Description
Accident Year Refers to the year when a claim triggering incident took place.
Case outstanding Represents an amount, established by a claims professional (typically a claims adjuster), representing the dollar value of the liabilities to the insurer associated with a group of claims.
Reported claims The case outstanding plus the payments made to the insureds for a group of claims.
Ultimate claims The cumulative sum of all payments made to insureds for a group of settled claims i.e., the total cost to the insurer to settle a group of claims.
IBNR The difference between ultimate claims and reported claims.

The Development Triangle

A development triangle is a common way to organize and display historical claims data for the purposes of actuarial analysis. The most common data sets displayed in a development triangle are case outstanding, reported claims, paid claims and reported claim count.

A development triangle organizes claims data as a table or triangular matrix [math]C_{i,j}[/math] with [math]i = 0,\ldots,I[/math] and [math]j = 0,\ldots,I - i [/math]. The rows of the triangle represent accident years and the columns represent valuation dates. For example, in the table below, 43,169,009 represents loss amounts related to claims occurring in accident year 0, valued as of two years.

Reported claims
Accident Year 0 1 2 3 4
0 37,017,487 43,169,009 45,568,919 46,784,558 47,337,318
1 38,954,484 46,045,718 48,882,924 50,219,672
2 41,155,776 49,371,478 52,358,476
3 42,394,069 50,584,112
4 44,755,243

We would like to develop methods to estimate the non-observables [math]C_{i,j}[/math] where [math] j \gt I -i[/math] especially the unknown ultimate number [math]C_{i,J}[/math].

Chain Ladder Technique

The chain ladder technique, also known as the developmental technique, assumes that historical claims development is indicative or predictive of future claims development. Chain ladder methods rely on historical data found in the #development triangle to predict future claims development.

Development Factors

We are primarily interested in the progression of the [math]C_{i,j}[/math] as [math]j[/math] increases. Since [math]C_{i,j}[/math] typically represents cumulative quantities such as cumulative reported or paid claims, we expect [math]C_{i,j}[/math] to be increasing in [math]j[/math].

The development factors are the observed ratios [math]C_{i,j+1}/C_{i,j}[/math]. They represent a measure of growth of claims development associated with a single accident year. If [math]I[/math] is large enough, we should expect the development factors to tend to 1 as all claims associated with an accident year are permanently closed. We calculate the development factors corresponding to the development triangle of reported claims presented above:

Development factors
Accident year 0 1 2 3 To ult
0 1.166 1.056 1.027 1.012 1.00
1 1.182 1.062 1.027
2 1.200 1.061
3 1.193

Selecting Development Factors

The chain ladder technique assumes that historical deveopment factors are predictive of future development factors; consequently, one is expected to use such historical factors to select development factors that can be used to project future claims development for any given accident year. There are different methods used to select development factors. The simplest approach is to use a particular average of the historical development factors. To exemplify this simple approach, we calculate various averages associated with the table of historical development factors above:

Averages
Averaging Method 0 1 2 3 To ult
Simple average last 3 years 1.192 1.06 1.027 1.012 1.000
Volume weighted last 3 years 1.186 1.059 1.027 1.012 1.000
Selected 1.192 1.06 1.027 1.012 1.000

In this particular case, the development factors used for future development is the average development factor for the last 3 observable years. For instance, the selected development factor for development year 0 is 1.192, the average of 1.193, 1.2 and 1.182.

In the table above, the entry corresponding to row selected and column to ult is a tail factor -- a factor used to deduct the ultimate reported claims value from the latest observable reported claims value. Ideally the value of [math]I[/math] in the development triangle is large enough that the tail factor equals 1. In the example above, the tail factor is 1: the insurer expects all claims' development to cease 5 years after the accident year. It's important to note that the tail factor can exceed 1 in so called long tailed lines such as worker's compensation.

Project Ultimate Claims

The key to projecting ultimate claims is to calculate cumulative claim development factors (CDF) which are defined as follows:

[[math]] \begin{equation} \label{cdf} \hat{F_i} = \prod_{j = I- i}^{J-1}\hat{f_j} \end{equation} [[/math]]

with [math]\hat{f}_j[/math] denoting the selected development factors. The projection for ultimate claims for accident year [math]i[/math] equals

[[math]] \begin{equation} \label{puc} C_{i,I-i}\hat{F_i}. \end{equation} [[/math]]

Using formula \ref{puc}, we can calculate the projected ultimate reported claims for the example above (with a selected tail factor of 1):

Estimation of ultimate claims
Accident year Reported claims CDF (3 year avg / Mack-Method) Projected ultimate claims (3 year avg / Mack-Method)
0 47,337,318 1.000 / 1.000 47,337,318 / 47,337,318
1 50,219,672 1.012 / 1.012 50,822,308 / 50,822,308
2 52,358,476 1.039 / 1.039 54,400,457 / 54,400,457
3 50,584,112 1.102 / 1.101 55,743,691 / 55,693,107
4 44,755,243 1.313 / 1.305 58,763,634 / 50,595,802
Total 245,659,807 267,472,394 / 259,253,978

Here is a summary on how to project ultimate claims:

Project Ultimate Claims Using the Chain Ladder Method
  1. Create development triangle
  2. Select development factors using historical development factors (averages, Mack-method, etc.)
  3. Select tail factor
  4. Calculate CDFs using formula \ref{cdf}
  5. Calculate ultimate claims using formula \ref{puc}

The Claim Ratio Method

The claim ratio method, or loss ratio method, is one of the simplest methods to project ultimate unpaid claims. The method projects ultimate claims for an accident year by multiplying a predetermined exposure level by a selected claim ratio representing claims per unit of exposure:

Projection for ultimate claims = Exposure * Selected claim ratio.

The claim ratio method's accuracy depends on the ability to establish a proper exposure level associated with an accident year and then establishing an estimate for the ultimate claim per unit of exposure. The projection for ultimate claims associated with the chain ladder method is derived by multiplying the reported claims for an accident year by the selected CDF whereas the claim ratio method multiplies a selected exposure level associated with an accident year by a selected claim ratio. In other words, the chain ladder technique depends on the initial claim experience of an accident year whereas the claims ratio method depends on an exposure level which is a much more natural measure of size for a portfolio of policies.

A common type of exposure is earned premium. Earned premium is the portion of an insurance premium which is considered "earned" by the insurer, based on the part of the policy period that the insurance has been in effect, and during which the insurer has been exposed to loss. For instance, if a 365-day policy with a full premium payment at the beginning of the term has been in effect for 120 days, 120/365 of the premium is considered earned. Earned premium will not be returned to the insured if the policy is cancelled.

Example

Claim ratio method
Accident year Reported claims at 12/31/08 Initial selected ultimate claims On-level earned premium Trend factor at 07/01/2008 Trended adjusted ultimate claims Trended adjusted claim ratio
2000 10,000,000 10,012,500 24,000,000 2.954 29,576,925 83.0%
2001 8,000,000 8,220,000 18,000,000 2.580 21,207,600 79.0%
2002 9,400,000 9,591,000 19,000,000 2.253 21,608,523 76.0%
2003 15,600,000 13,845,000 23,000,000 1.968 27,246,960 79.0%
2004 16,500,000 19,700,000 32,000,000 1.719 33,864,300 79.0%
2005 18,500,000 25,700,000 47,000,000 1.501 38,575,700 82.0%
2006 16,500,000 29,850,000 50,000,000 1.311 39,133,350 78.0%
2007 14,000,000 42,800,000 57,000,000 1.145 49,006,000 86.0%
2008 8,700,000 51,150,000 62,000,000 1.000 51,150,000 83.0%

Suppose one wishes to project ultimate claims for accident year 2008 using the claim ratio method applied to the information contained in the table above. The first step is to select the exposure level associated with accident year 2008 which, in this scenario, is given by 62,000,000, the last entry in column labelled on-level earned premium. The second step is to select a claims ratio based on the observed historical claims ratios represented by the last column in the table above. Here we use a simple approach: the claims ratio is set to 79%, the average of the historical claims ratios for accident years 2002 to 2006. We finally obtain the projected ultimate claims for accident year 2008:

Accident year Reported claims at 12/31/08 On-level earned premium Selected claim ratio (avg 2002-2006) Projected ultimate claims
2008 8,700,000 62,000,000 79.0% 48,980,000

A few comments about the table are in order. The exposure type selected here is on-level earned premium which represents earned premium adjusted to current rate levels -- since we're interested in projecting ultimate claims for accident year 2008, the on-level earned premium represents an adjustment to 2008 rate levels. The trend adjusted ultimate claims, second to last column, equals the selected ultimate claims multiplied by a trend factor (fifth column) which adjusts historical claim costs to mid 2008 cost levels. In summary, adjustments are made to both historical earned premium and historical claim costs to obtain claim ratios which represent a more realistic sample for the selection of a claim ratio for accident year 2008.

The Bornhuetter-Ferguson method

Unlike the chain ladder method, the Bornhuetter-Ferguson method doesn't rely exclusively on historical claims development to project ultimate claims. The projected ultimate claims under this method is equal to the current reported claims plus the expected unreported claims:

Ultimate claims = Actual Reported Claims + Expected Unreported Claims

The expected unreported claims is calculated by multiplying the expected claims multiplied by an estimate of the % yet to be reported:

Ultimate claims = Actual Reported Claims + Expected Claims * % Unreported

The estimate for the percentage yet to be reported for accident year [math]i[/math] is based on the developmental method and is set to [math]1 - \hat{F_i}^{-1} [/math]. Mathematically the projected ultimate claims for accident year [math]i[/math] equals

[[math]] \begin{equation} \label{bf-method-main} C_{i,I-i} + (1 - \hat{F_{i}}^{-1})\mu_i = Z_i \hat{F_i}C_{i,I-i} + (1 - Z_i)\mu_i \end{equation} [[/math]]

with [math]\mu_i [/math] the expected ultimate claims for accident year [math]i[/math] and [math]Z_i = \hat{F_i}^{-1} [/math]. Examining \ref{bf-method-main}, we notice that [math]\hat{F_i}C_{i,I-i}[/math] is the projection for ultimate claims given by the developmental technique; consequently, we can view the projection for ultimate claims using the Bornhuetter-Ferguson method as a kind of credibility estimator with credibility weight [math]Z_i[/math].

The Bornhuetter-Ferguson method as MMSE

If we impose specific conditions on the variables [math]C_{i,j} [/math] (the developmental triangle), then the Bornhuetter-Ferguson projection for ultimate claims is a minimum mean square estimator.

Suppose the following holds:

  • If [math]C_{i,J}[/math] denotes the ultimate claims, then [math]\mu_i = \operatorname{E}[C_{i,J}] [/math] is known for all [math] 1 \leq i \leq I [/math]
  • [math]C_{i,J} - C_{i,j} [/math] is independent of [math]C_{i,j}[/math] for any [math]1 \leq j \leq I [/math]
  • [math]F_{i,j} = \mu_i / \operatorname{E}[C_{i,j}] [/math] are known
  • [math]\operatorname{Var}[C_{i,j}F_{i,j}] = F_{i,j}\operatorname{Var}[C_{i,J}][/math]


Then the the Bornhuetter-Ferguson estimator for [math]C_{i,J} [/math] equals

[[math]] \hat{C}_{i,J} = Z_i F_iC_{i,I-i} + (1 - Z_i)\mu_i. [[/math]]

with the shorthand [math]F_i = F_{i,J}[/math]. It is clear from the assumptions above that [math]\hat{C}_{i,J}[/math] is an unbiased estimator of [math]C_{i,J}[/math]. Furthermore, the estimator minimizes variance among all weighted averages of [math]F_iC_{i,I-i} [/math] and [math]\mu_i[/math]:

Proposition (Minimum variance property of the Bornhuetter-Ferguson estimator)

[[math]] \hat{C}_{i,J} = \underset{X \in S_i}{\operatorname{argmin}} \operatorname{E}[ (C_{i,J} - X)^2 ],\, S_i = \{ ZF_iC_{i,I-i} + (1-Z)\mu_i \}. [[/math]]

Show Proof

We can rewrite the minimization problem as

[[math]] \begin{equation} \label{bf-argmin} \underset{Z}{\operatorname{argmin}} \operatorname{Var}[ Z(C_{i,I-i}F_i - C_{i,J}) + (1-Z)(\mu_i - C_{i,J}) ] \end{equation} [[/math]]

It is first shown that [math]C_{i,I-i}F_i - C_{i,J} [/math] and [math]\mu_i - C_{i,J} [/math] are uncorrelated:

[[math]] \begin{align*} \operatorname{Cov}[C_{i,I-i}F_i - C_{i,J},\mu_i - C_{i,J}] &= F_i\operatorname{Cov}[C_{i,I-i},C_{i,J}] - \operatorname{Var}[C_{i,J}] \\ &= F_i \operatorname{Var}[C_{i,I-i}] - \operatorname{Var}[C_{i,J}] \\ &= 0. \end{align*} [[/math]]

It follows, from the proof of the variance minimizing property of the Mack-method, that the solution to \ref{bf-argmin} is given by setting the weight [math]Z[/math] proportional to the inverse of the variance of [math]C_{i,I-i}F_i - C_{i,J}[/math]:

[[math]] \begin{equation} \label{bf-minvar-sol} Z = \frac{\operatorname{Var}[C_{i,I-i}F_i - C_{i,J}]^{-1}}{\operatorname{Var}[C_{i,I-i}F_i - C_{i,J}]^{-1} + \operatorname{Var}[C_{i,J} ]^{-1}}. \end{equation} [[/math]]

Using the fact that

[[math]] \begin{align*} \operatorname{Var}[C_{i,I-i}F_i - C_{i,J}] &= F_i^2 \operatorname{Var}[C_{i,I-i}] + \operatorname{Var}[C_{i,J}] - 2 F_i \operatorname{Cov}[C_{i,J},C_{i,I-i}] \\ &= \operatorname{Var}[C_{i,J}](F_i - 1) \end{align*} [[/math]]
in equation \ref{bf-minvar-sol} shows that [math]Z[/math] equals [math]F_i^{-1}[/math].


Frequency-Severity Models

The underlying premise of frequency-severity models is that claim projections can be improved by projecting ultimate claim count and ultimate cost per claim separately. We define the following triangles:

  • [math]C_{i,j}[/math] represents the developmental triangle for total aggregate claim costs
  • [math]N_{i,j}[/math] represents the developmental triangle for aggregate claim count
  • [math]S_{i,j}[/math] represents the developmental triangle for the average claim cost

Claim Frequency

There are generally two types of claim frequencies, [math]N_{i,j}[/math], available: reported claim counts or settled claim counts. Reported claim frequencies typically have different developmental patterns compared to settled claim counts: claims for an accident tend to be reported quickly while settling claims might take much longer. In general, settled claim counts will have longer tails than reported claims and claim counts will be low for early development years. Because of this phenomenon, it is often preferable to work with incremental claim counts when dealing with settled claim frequencies:

Projecting settled claim frequency
  1. Build the incremental developmental triangle [math]X_{i,j} = N_{i,j+1}-N_{i,j} [/math] for [math]j\gt1[/math] and [math]X_{i,0} = N_{i,0} [/math], where [math]N_{i,j}[/math] is the settled claim count triangle.
  2. Build the developmental triangle [math]N'_{i,j}[/math] for the reported claim counts.
  3. Estimate the proportion of the ultimate settled claim frequency for each accident year that is settled in developmental year [math]j[/math] and denote it by [math]\hat{\gamma_j}[/math].
  4. Use the developmental technique on the triangle [math]N'_{i,j} [/math] to estimate [math]N'_{i,j}[/math] for [math]i+j \gt I [/math].
  5. The projection [math]\hat{X}_{i,j} [/math] equals [math]\hat{\gamma}_{j} \hat{N'}_{i,J}[/math] and the projection [math]\hat{N}_{i,j}[/math] equals [math]\hat{\beta}_j \hat{N'}_{i,J} [/math] where [math]\hat{\beta}_j \ = \sum_{i=1}^j\hat{\gamma}_i[/math].

Claim Severity

The simplest approach to projecting average claim cost is to divide the claims developmental triangle [math]C_{i,j}[/math] by the [math]N_{i,j}[/math] , where [math]N_{i,j}[/math] corresponds to the developmental triangle for the settled claim frequency, to obtain the triangle [math]S_{i,j}[/math] and then use the developmental technique to project forward:

Projecting avg claim cost: basic method
  1. Build the claims developmental triangle [math]C_{i,j}[/math].
  2. Build the settled claim count developmental triangle [math]N_{i,j}[/math].
  3. Build the avg claim cost developmental triangle [math]S_{i,j} = C_{i,j}/N_{i,j}[/math].
  4. Use the developmental technique to project [math]S_{i,j}[/math] for [math]i+j \gt I [/math].


One of the weaknesses of this simple method is that it doesn't take into account that, often times, the avg claim cost is correlated with the developmental year: claims settled much later than the accident year tend to have larger settlement costs. In fact, this is almost a tautology: an insurer is more likely to settle small claims in a shorter amount of time while delaying settlement for potentially large complex settlements. This correlation then motivates the second approach to analyzing avg claim costs:

Projecting avg claim cost: developmental year method
  1. Build the incremental claims developmental triangle [math]X_{i,j} = C_{i,j+1}-C_{i,j}[/math].
  2. Build the incremental settled claim count developmental triangle [math]Y_{i,j} = N_{i,j+1}-N_{i,j}[/math].
  3. Build the avg settlement cost developmental triangle [math]S_{i,j} = X_{i,j}/Y_{i,j}[/math].
  4. The projection [math]\hat{S}_{i,j} [/math], [math]i+j\gt I [/math], is set to the average of [math]S_{i,j}[/math] over the accident year [math]i[/math]: [math]\hat{S}_{i,j} = \frac{1}{I-j}\sum_{i=0}^{I-j}S_{i,j}[/math].

References

  • "ADVANCED SHORT-TERM ACTUARIAL MATHEMATICS STUDY NOTE: OUTSTANDING CLAIMS RESERVES" (PDF). Society of Actuaries. Retrieved 18 February 2023.

Wikipedia References