# Gram-Charlier Processes and Applications to Option Pricing.

1. Introduction

Gram-Charlier series are expansions of the form

[mathematical expression not reproducible] (1)

where

[mathematical expression not reproducible] (2)

is the usual normal density and [He.sub.k] is the Hermite polynomial of order k. The expression within square brackets in (1) is an orthogonal polynomial expansion for the ratio f(x)/[phi](x); given an arbitrary function the expansion may or may not converge to the true value of f(x). In this paper we focus on the properties of the Gram-Charlier distributions, obtained by truncating the series after a finite number of terms. What is obtained is a family of distributions parametrized by a, b, [c.sub.0] , ..., [c.sub.N], as is explained in detail below.

This paper has three main goals: (1) define and study the properties of the family of Gram-Charlier distributions; (2) define a Gram-Charlier process and derive its basic properties; (3) apply those to European options. The formulas we give for European option prices and Greeks apply to Gram-Charlier distributions of any order, and we use four- and six-parameter Gram-Charlier distributions in our examples. Two numerical illustrations show how option prices are affected by the skewness and kurtosis of returns. This paper can a reference for those using Gram-Charlier distributions in option pricing but also in statistics.

Most previous applications to option pricing have assumed that [c.sub.1] = [c.sub.2] = 0. We believe this restriction is not necessary in a general theory of Gram-Charlier distributions, as there may well be situations where the extra degrees of freedom given by [c.sub.1] and [c.sub.2] will be useful. We discuss it in detail at the end of Section 2.

It has been observed that option prices have nonconstant implied volatilities, meaning that log returns do not have a normal distribution under the risk-neutral measure. There is a wide literature on modelling log returns to fit observed option prices, the main alternatives to Brownian motion being stochastic volatility models (where the parameter a in Black-Scholes is replaced with a continuous-time stochastic process), GARCH time series, and Levy processes. Gram-Charlier distributions are mathematically simpler than the models just mentioned, while allowing a better fit to data than the normal distribution. Several authors have used Gram-Charlier distributions in option pricing, as a model for log returns, among others [1-10]. The majority of previous authors assumed a density of the form

[mathematical expression not reproducible] (3)

for the normalized log return. (In our notation this is a GC(0, 1; 0, 0, [bar.s]/6, [bar.k]/24) distribution; see Section 2.) The notation emphasizes that in this case the coefficient of [He.sub.3](x) turns out to be Pearson's skewness coefficient s divided by 6, and the coefficient of [He.sub.4](x) the excess kurtosis coefficient [bar.k] divided by 24. The distribution of log returns is then a four-parameter Gram-Charlier distribution (since there are two other parameters, the mean and variance, besides [bar.s] and [bar.k]). This distribution allows nonzero skewness and excess kurtosis, unlike the normal distribution found in Black-Scholes. In (3) the parameters (s, k) are restricted to a specific region (see Figure 3), because outside that region the function in (3) becomes negative for some values of x (see Section 2.1).

Jurczenko et al.  specify the martingale restriction

[S.sub.0] = [e.sup.-rT][E.sup.Q] ([S.sub.T]) (4)

that the four-parameter Gram-Charlier density must satisfy in pricing options (previous authors had not taken it into account). The martingale condition for general Gram-Charlier distributions is described in Section 5. Our Gram-Charlier distribution with parameters a, b, [c.sub.1], ..., [c.sub.N], denoted by GC(a, b; [c.sub.1], ..., [c.sub.N]), has density

[phi] (a, b, x) x [1 + [c.sub.1][He.sub.1](x - a/b) + ... + [c.sub.N][He.sub.N] (x - a/b)]. (5)

As stated above we do not set [c.sub.1] = [c.sub.2] = 0, and N can be any even positive integer. The question whether (5) is nonnegative for all x is an important one. Several authors have disregarded this issue and, in fact, some have come up with parameters that do not yield a true probability density function. One might argue that this is the price to pay for truncating an infinite Gram-Charlier series. However, if the same log return distribution is used to price many options, the natrue probability density function is the only safe choice, because otherwise there might be inconsistencies among option prices. For instance, if the function used as density is negative over the interval ([alpha], [beta]), then a digital option that pays off only when the log return is in that interval will have a negative price. In this paper we consider Gram-Charlier distributions, not expansions, and insist that the densities integrate to one and be nonnegative. Our goal is to define a family of proper probability distributions; nevertheless, our formulas do apply to truncated Gram-Charlier expansions as well.

The paper by Leon et al.  presents an alternative to the general Gram-Charlier distributions we study in this paper. Those authors consider the subclass of Gram-Charlier distributions consisting of densities

[f.sub.X] (x) = [phi] (a, b, x) p (x), (6)

where the polynomial p(x) is the square of another polynomial, p(x) = q[(x).sup.2]. This has the obvious advantage that the nonnegativity restriction on [f.sub.X](*) is automatically satisfied. We discuss that subclass of "squared" Gram-Charlier distributions in the Conclusion.

Almost all previous authors have used Gram-Charlier distributed log returns over a single time period. This has an obvious downside, in that it becomes tricky, if not impossible, to preserve consistency between the prices of options with different maturities. Section 3 shows that a Leovy process with Gram-Charlier increments does not exist; however, it also shows that the sum of independent Gram-Charlier distributed variables also has a Gram-Charlier distribution. This opens the way for multiperiod Gram-Charlier option pricing, using a discrete-time random walk model for which the log return over any period has a Gram-Charlier distribution. The Gram-Charlier distribution of the multiperiod return has a larger number of parameters, though the model is still simpler than almost any (if not all) alternative stochastic volatility models. There is no problem computationally, since we give explicit formulas for options under Gram-Charlier distributions with an arbitrary number of parameters.

The layout of the paper is as follows. In Section 2, we extend the study of Gram-Charlier distributions to all possible polynomials p(x) and derive their properties (moments, cumulants, moment determinacy, properties of the set of valid parameters, tail, and so on). Some of the formulas and properties in Theorem 2 appear to be new. In Section 3 we show that there is no Levy process with Gram-Charlier distributed increments, apart from Brownian motion, and we define a discrete-time process with independent Gram-Charlier increments that is suitable for option pricing. In Section 4 we show that the log Gram-Charlier distribution is not determined by its moments, just like the lognormal. Next, Section 5 gives formulas for European call and put prices when the log-price returns of the underlying have a general Gram-Charlier distribution; the previous literature mostly considered the GC(a, b; 0, 0, [c.sub.3], [c.sub.4]) family and the squared Gram-Charlier distributions (an exception is , where it is assumed that [c.sub.1] = [c.sub.2] = 0). In particular, we derive a change of measure formula that extends the Cameron-Martin formula for the normal distribution; the latter is used in pricing European options in the Black-Scholes model. We also derive formulas for the sensitivities (Greeks) of those option prices with respect to all parameters. A technique for simulating Gram-Charlier distributions is described. Parts (c), (d), (i), (j) (k), and (l) of Theorem 2 and part (b) of Theorem 3 and Theorems 6, 7, 9, and 11 appear to be new. Theorem 3(a) is for the first time formulated for general Gram-Charlier distributions. Theorems 3(a), 4, and 8 are given, more or less explicitly, for the subclass of squared Gram-Charlier distributions in , and some of the Greeks in Theorem 9 had been calculated for the GC(a, b; 0,0, [c.sub.3], [c.sub.4]) distribution by previous authors.

In Section 6 we give two applications that show how option prices depend on skewness and kurtosis of the log returns (this of course cannot be done in the classical Black-Scholes setting, while it appears quite complicated to do so in stochastic volatility or Levy driven models). The first example is equity indexed annuities (EIAs in the sequel) premium options. The pricing of EIAs has been studied by many authors, including Hardy , Gaillardetz and Lin , and Boyle and Tian . The second example is lookback options, which illustrates the use of the simulation method in Section 2.5. Lookback options have been studied by numerous authors, but there is no closed form formula for the price of discretely monitored lookback options within the Black-Scholes model. We refer the reader to Kou . In those examples all parameters are estimated by maximum likelihood, with the range of the parameters restricted to where they correspond to a true probability distribution. Moment estimation is of course possible when [c.sub.1] = [c.sub.2] = 0, but it is not trustworthy, because of the range restriction on the parameters [c.sub.3], [c.sub.4], ...; for instance, the empirical skewness and excess kurtosis have a positive probability of falling outside the feasible region (see Section 6). Theorem 2(f) shows that the first k moments of the Gram-Charlier distribution depend on more than k parameters, which probably rules out moment estimation, unless one a priori fixes two of the parameters (which again is not trustworthy). We apply maximum likelihood to the six-parameter GC(a, b; [c.sub.1], [c.sub.2], [c.sub.3], [c.sub.4]) in Section 6, but we do realize that maximum likelihood estimation for higher order Gram-Charlier distributions poses computational problems, which are an interesting avenue for further research. (A referee pointed out that fixing a and b would make estimation easier; our guess is that if one wishes to simplify maximum likelihood estimation then fixing a and b might be a better idea than fixing [c.sub.1] and [c.sub.2], though we have not looked at this in any depth. A two-step process might be imagined, whereby the data first give information about a and b, and then maximum likelihood is applied using that preliminary information. This would help by constraining the optimization to a smaller region, while agreeing with the intuitive idea that a and b are location and scale parameters, resp.)

Notation 1. The normal density function is denoted as

[mathematical expression not reproducible] (7)

and its distribution function is

[PHI](x) = [[integral].sup.x.sub.[-infinity]] [phi](y) dy. (8)

Two equivalent versions of the Hermite polynomials may be found in the literature: for n = 0, 1, 2...,

[mathematical expression not reproducible] (9)

The first one is common in mathematics and physics, but in probability and statistics there is an obvious advantage in using the second one. (The conversion formula is [He.sub.n](x) = [2.sup.-n/2][H.sub.n](x/ [square root of 2]).) The first few Hermite polynomials are

[mathematical expression not reproducible] (10)

2. Gram-Charlier Distributions

For a fixed N, consider the class of distributions that have a pdf of the form

[mathematical expression not reproducible] (11)

with [c.sub.N] [not equal to] 0. Noting that the leading term of [He.sub.k](x) is [x.sup.k], we conclude that N must necessarily be even, because if N were odd then the polynomial that multiplies [phi](x) would take negative values for some x. For the same reason [c.sub.N] cannot be negative.

Definition 1. Let a [member of] R, b > 0, [c.sub.k] [member of] R, [c.sub.0] = 1, and N [member of] {0, 2, 4, ...}. We write Y ~ GC(a, b; [c.sub.1], ..., [c.sub.N]) (or Y ~ GC(a, b; c)) if the variable (Y - a)/b has probability density function

[mathematical expression not reproducible] (12)

This will be called a Gram-Charlier distribution with parameters a, b, [??], with [??] = ([c.sub.1], ..., [c.sub.N]). The largest N such that [c.sub.N] > 0 is called the order of the Gram-Charlier distribution. The normal distribution with mean a and variance [b..sup.2] is a GC(a, b; (0, ... 0)) (or GC(a, b; -)) with order 0.

The class of Gram-Charlier distributions just defined includes all distributions with density

[1/b] [phi] (y - a/b) p(y), (13)

where p(y) is a polynomial of degree N, since p(y) can be rewritten as a combination of

[He.sub.k] (y - a/b), k = 0, 1, ..., N. (14)

The condition [c.sub.0] = 1 ensures that function (12) integrates to one, since

[mathematical expression not reproducible] (15)

There are no simple conditions that ensure that a polynomial remains nonnegative everywhere, though in some cases precise conditions on c are known; see below. If a vector c leads to a true Gram-Charlier pdf, then we will say that c is valid.

Generating functions are convenient when dealing with orthogonal polynomials. One is

[mathematical expression not reproducible] (16)

Another one is

[mathematical expression not reproducible] (17)

Letting y = 0 leads to

[mathematical expression not reproducible] (18)

This proves the orthogonality of the Hermite polynomials and gives us the value of

[mathematical expression not reproducible] (19)

which is essential in deriving Gram-Charlier series. Since [He.sub.0] = 1 this also implies

[mathematical expression not reproducible] (20)

Another formula is the Laplace transform of [phi](x)[He.sub.n](x), which may be found by integrating by parts n times:

[mathematical expression not reproducible] (21)

Let us calculate the moments of a distribution with density (11). First consider

[mathematical expression not reproducible] (22)

Integrating by parts repeatedly yields

[mathematical expression not reproducible] (23)

Hence, the nth moment of the distribution in (11) is

[mathematical expression not reproducible] (24)

This says in particular that the parameter [c.sub.k] only affects the moments of order k and higher of the GC(a, b; c) distribution (see part (i) of Theorem 2). This is confirmed by the moment generating function, which maybe found from (21):

[mathematical expression not reproducible] (25)

It can be checked that differentiating this expression n times and setting t = 0 give the same expression found for the nth moment of (11).

Theorem 2. Suppose Y ~ GC(a, b; [??]), [??] [member of] [R.sup.N] with b > 0, [c.sub.0] = 1, and [c.sub.N] > 0. The order N of the distribution is necessarily even.

(a)

[mathematical expression not reproducible] (26)

(b)

[mathematical expression not reproducible] (27)

(c) The representation of the GC(a, b; [??]) distribution in terms of the parameters a, b, and [??] is unique.

(d) All Gram-Charlier distributions are determined by their moments.

(e) The set of valid c in [R.sup.N] includes the origin, is not reduced to a single point, and is convex.

(f) The first six moments of the GC(a, b; [??]) distribution are

[mathematical expression not reproducible] (28)

(g) The first six cumulants of the GC(a, b; [??]) distribution are

[mathematical expression not reproducible] (29)

(h) The following hold for any GC(a, b; c) distribution:

[mathematical expression not reproducible] (30)

(i) Suppose X ~ GC(a, b; [[??].sup.X]), Y ~ GC(a, b; [[??].sup.Y]). Then the first K moments of X and are Y the same; that is,

E[X.sup.j] = E[Y.sup.j], j = 1, ..., K, (31)

if and only if [c.sup.X.sub.j] = [c.sup.Y.sub.j], j = 1, ..., K. This implies that if [c.sub.1] up to [c.sub.4] are equal to 0 then the distribution has zero skewness and excess kurtosis; hence, this shows how to construct an infinite number of distributions that share this property with the normal.

(j) Suppose X ~ GC(a, b; [c.sup.X]). Then

[mathematical expression not reproducible] (32)

When [c.sup.X.sub.1] = [c.sup.X.sub.2] = 0 the skewness and excess kurtosis coefficients of X are 6[c.sup.X.sub.3] and 24[c.sup.X.sub.4], respectively, for any N = 0,2,4, ....

(k) If X ~ GC(a, b; [c.sub.1], ..., [c.sub.N]) and q is a constant then Y = qX ~ GC(a, b; [c.sub.1], ..., [c.sub.N]), where [c'.sub.k] = [(sign(q)).sup.k][c.sub.k], k [greater than or equal to] 1. In particular, -X ~ GC(-a, b; -[c.sub.1], ..., [(-1).sup.N-1] [c.sub.N-1], [c.sub.N]).

(l) The law of the square of X ~ GC(0, 1; [c.sub.1], ..., [c.sub.N]) is a combination of chi-square distributions with 1,3, ..., N + 1 degrees of freedom that has density

[mathematical expression not reproducible] (33)

Proof. Parts (a) and (b) were proved above, and (c) follows directly from (b). To prove (d) it is sufficient to note the existence of [Ee.sup.ty] for t in an open neighbourhood of s = 0. For (e), if [mathematical expression not reproducible] then the distribution is the standard normal, and this is a Gram-Charlier distribution. Since [c.sub.N] > 0 and N is even, the polynomial

[mathematical expression not reproducible] (34)

tends to [infinity] when [absolute value of x] tends to [infinity]. Hence, there is [[epsilon].sub.0] > 0 such that

[mathematical expression not reproducible] (35)

(recall that [c.sub.0][He.sub.0](x) = 1 for all Gram-Charlier distributions). The set ofvalidvectorsO thus includes ([epsilon], ..., [epsilon]) [member of] [R.sup.N] for each 0 [less than or equal to] [epsilon] [less than or equal to] [[epsilon].sub.0]. If [c.sup.(j)], j = 1,2, are valid then [pc.sup.(1)] + (1 - p)[c.sup.(2)] is also valid, for any p [member of] (0, 1).

Part (f) is found by expanding the mgf in (b) as a series in s around the origin. Part (g) is found by expanding

[mathematical expression not reproducible] (36)

The formulas in (h) follow (f) and (g).

For part (i), it is sufficient to consider the case a = 0, b = 1 only. Write

[mathematical expression not reproducible] (37)

and calculate the moments of X and Y by successively differentiating the mgf's of X and Y (note that [c.sup.X.sub.0] = [q.sub.X](0) = [c.sup.Y.sub.0] = [q.sub.Y](0) = 1). The first K moments of X and Y are the same if and only if

[mathematical expression not reproducible] (38)

Suppose that [c.sup.X.sub.j] = [c.sup.Y.sub.j], j = 1, ..., K. Then [q.sup.(j).sub.X](0) = [q.sup.(j).sub.Y](0), j = 1, ..., K, and thus the first K moments of X and Y are the same. Conversely, suppose E[X.sup.j] = E[Y.sup.j], j = 1, ..., K. Then the first identity above implies that [c.sup.X.sub.1] = [q'.sub.X](0) = [q'.sub.Y](0) = [c.sup.Y.sub.1], since [psi](0) is not zero. The second identity implies that [c.sup.X.sub.2] = [c.sup.Y.sub.2], and so on, up to [c.sup.X.sub.K] = [c.sup.Y.sub.K].

Turning to (j), the first equivalence

a = EX [??] [c.sup.X.sub.1] = 0 (39)

is an immediate consequence of property (i) with K = 1. To prove the second equivalence, suppose X ~ GC(0, 1; c), and let Y ~ GC(0, 1; -) = N(0, 1). Then

[mathematical expression not reproducible] (40)

Here [q.sub.X](0) = [q.sub.Y](0) = 1 and [psi]'(0) = 0; hence, E[X.sup.2] = E[Y.sup.2] if and only if [q".sub.X](0) = [q".sub.Y](0). The last equality is [c.sup.X.sub.2] = [c.sup.Y.sub.2]. For X - GC(a, b; [c.sup.X]), with a and b being arbitrary, this means that

E (x - a/b) = 1 [??] [c.sup.X.sub.2] = 0. (41)

For (k), observe that if q [not equal to] 0 then the mgf of Y is

[mathematical expression not reproducible] (42)

Finally, turn to (l). Routine calculations show that the density of the square is

[mathematical expression not reproducible] (43)

The Hermite polynomials of odd order are odd functions and so disappear from that expression, while the even order Hermite polynomials are even functions.

When N = 0 the GC(a, b; c) distribution is the normal distribution with mean a and variance [b.sup.2]. However, part (b) of the theorem says that when N > 0 the parameters a and [b.sup.2] are not necessarily the mean and variance of the distribution. Simple calculations lead to the following result.

Theorem 3. (a) If X ~ GC(0, 1; [c.sub.1], ..., [c.sub.N]) then

[mathematical expression not reproducible] (44)

(b) The tails of the GC(a, b; [c.sub.1], ..., [c.sub.N]) distribution are

[mathematical expression not reproducible] (45)

The tails of the Gram-Charlier distributions are thicker than those of the normal distribution but are still "thin" because they are in the limit smaller than any exponential function exp(-[alpha]x).

2.1. The GC(a, b; 0, 0, [c.sub.3], [c.sub.4]) Family. Here the exact region for the ([c.sub.3], [c.sub.4]) that lead to a true probability distribution has been found. This goes back to Barton and Dennis , but a more detailed explanation is given in Jondeau and Rockinger . The region is shown in Figure 3 (use the correspondence [bar.s] = 6[c.sub.3], [bar.k] = 24[c.sub.4] from Theorem 2(h)). An important fact about this region is that it is not rectangular; the possible excess kurtosis values depend on skewness, and conversely.

2.2. GC(a, b; c) Distributions of Order 4 and Higher. The GC(a, b; [c.sub.1], [c.sub.2], [c.sub.3], [c.sub.4]) family has six parameters, rather than four, and thus has more degrees of freedom in fitting data; to the authors' knowledge the general six-parameter Gram-Charlier distribution has been used in financial applications by Leon et al.  only (those authors use the subfamily consisting of polynomials p(x) that are squares of some second-degree polynomial). Schlogl  fits the six- and eight-parameter families GC(a, b; 0, 0, [c.sub.3], [c.sub.4], [c.sub.5], [c.sub.6]) and GC(a, b; 0,0, [c.sub.3], [c.sub.4], [c.sub.5], [c.sub.6], [c.sub.7], [c.sub.8]) to data.

The set of ([c.sub.1], [c.sub.2], [c.sub.3], [c.sub.4]) that yield true probability distributions has not been identified, but it is possible to fit the six parameters a, b, [c.sub.1], [c.sub.2], [c.sub.3], [c.sub.4] by maximum likelihood.

2.3. Why We Do Not Assume That [c.sub.1] = [c.sub.2] = 0. Almost all previous authors have assumed that [c.sub.1] = [c.sub.2] = 0, because they used normalized data:

Y = X - EX/[square root of (Var X)] (46)

(see part (j) of Theorem 2) as explained below. In this section we explain why it is important not to restrict Gram-Charlier distributions or series in that way. The first reason for not doing so is that enlarging the parameter space can only be a good thing. The second one is that in fitting those distributions to data there may be very real advantages in letting [c.sub.1] and [c.sub.2] be different from zero. The only downside of letting [c.sub.1] and [c.sub.2] take nonzero values, and it is of no real importance, is that [c.sub.3] and [c.sub.4] lose their simple relationship with skewness and excess kurtosis (see part (h) of Theorem 2). A third reason is that after an exponential change of measure a Gram-Charlier distribution will rarely have [c.sub.1] = [c.sub.2] = 0; see Section 2.4.

We now show, using an example that can be worked out explicitly, that it is not always best to use normalized data when fitting Gram-Charlier distributions, because choosing another affine transformation of the data may well yield a much better fit.

The "standard" Gram-Charlier expansion for a function f(x) is

[mathematical expression not reproducible] (47)

where

[mathematical expression not reproducible] (48)

A classical result about Hermite series, proved by Crameor , is that sufficient conditions for the Gram-Charlier expansion (47) to converge to

1/2 (f(x-) + f(x+)) (49)

for all x [member of] R are that (i) f(x) has finite variation in every bounded interval, and (ii) satisfies

[mathematical expression not reproducible] (50)

If X has density f(x), then the last condition is [mathematical expression not reproducible]. This condition cannot be improved upon, in the sense that there are cases where the Gram-Charlier series defined above diverges, although [mathematical expression not reproducible] for all [rho] < 1/4 (it will be shown below that one such case is the normal distribution with mean 0 and variance 2).

Let us first calculate

[h.sub.q,k] = E[He.sub.k] (X), X ~ N (0, [q.sup.2]). (51)

Using the generating function (16) we find

[mathematical expression not reproducible] (52)

The Gram-Charlier expansion corresponding to the N(0, [q.sup.2]) density is thus

[mathematical expression not reproducible] (53)

It is possible to determine whether this converges or not when x = 0; the Gram-Charlier series for f(0) is

[mathematical expression not reproducible] (54)

From Stirling's formula,

(2n)!/[2.sup.2n][(n!).sup.2] ~ 1/[square root of [pi]n] (55)

as n tends to infinity, which implies that there are three possibilities: expansion (54) (i) converges absolutely if 0 < [q.sup.2] < 2, (ii) converges simply if [q.sup.2] = 2, and diverges if [q.sup.2] > 2. (The preceding calculations are from Cramer .)

Let us now consider a random variable [X.sub.p], 0 [less than or equal to] p [less than or equal to] 1, with density

[g.sub.p] (x) = p[phi](x) + (1 - p) 1/p [phi](x/p). (56)

In words, [X.sub.p] has a N(0, 1) distribution with probability p and a N(0, [p.sup.2]) distribution with probability 1 - p. Define the normalized variable

[mathematical expression not reproducible] (57)

The coefficients [c.sub.p,k] of the Gram-Charlier expansion for [Y.sub.p] are

[mathematical expression not reproducible] (58)

We have shown that (53) diverges if [q.sup.2] > 2. Here we see that [c.sub.p,k] involves [h.sub.q,k] with q = 1/[[sigma].sub.p] and q = p/[[sigma].sub.p]; in the second case there is no problem, as [(p/[[sigma].sub.p]).sup.2] < 2 for all p [member of] (0, 1]. In the first case,

1/[[sigma].sup.2.sub.p] > 2 [??] p + [p.sup.2] - [p.sup.3] < 1/2. (59)

This condition is satisfied for p smaller than [p.sup.*] [approximately equal to] 0.403, and so the Gram-Charlier expansion for [Y.sub.p] diverges for all p < [p.sup.*]. This is a sad state of affair: a series designed to work for distributions that are "close to the normal" that fails for combinations of two normal densities!

There is an easy solution: use a different scaling for [X.sub.p]. Rather than multiplying [X.sub.p] by 1/[[sigma].sub.p], use a factor r such that the series converges. In this example we know that the series converges if and only if r < [square root of 2]. Say we choose p = .3; this implies

1/Var [X.sub.p] = 1/0.3 + [0.3.sup.2] - [0.3.sup.3] = 2.75482 > 2. (60)

Figure 1 shows how the true density for [X.sub.0.3] is approximated by order 10 and 20 Gram-Charlier truncated series if r = 1/[[sigma].sub.0.3] = 1.66 is chosen; the graph is not surprising, the infinite Gram-Charlier expansion diverges. Figure 2 shows the same except that r is set to 1.4. The latter is smaller than [square root of 2], so the infinite expansion converges. In this example the density to be approximated is symmetrical about 0, so [c.sub.1] is always 0, while

q = 1/[[sigma].sub.0.3] [??] [c.sub.2] = 0; q = 1.4 [??] [c.sub.2] = -0.14426. (61)

2.4. Exponential Change of Measure. The one-dimensional Cameron-Martin formula may be stated as follows: if X ~ N([mu], [[sigma].sup.2]) then for q [member of] R and f [greater than or equal to] 0,

[mathematical expression not reproducible] (62)

The same property may be expressed in terms of a change of measure. If X [??] N([mu], [[sigma].sup.2]) and a change of measure is defined by

P' = [e.sup.qX]/[E.sup.P][e.sup.qX] x P, (63)

then X [??] N([mu] + [[sigma].sup.2]q, [[sigma].sup.2]). The next result is an extension of the Cameron-Martin formula for the normal distribution to the Gram-Charlier distributions. There are different though equivalent formulas in Schlogl  (for the special case [c.sub.1] = [c.sub.2] = 0).

Theorem 4. Suppose that X [??] GC(a, b; [c.sub.1], ..., [c.sub.N] and that P' is defined by (63) for q [member of] R. Then X [??] GC(a + [b.sup.2] q, b; [c'.sub.1], ..., [c'.sub.N]), where [c'.sub.1], ..., [c'.sub.N] are found from

[mathematical expression not reproducible] (64)

or, more precisely,

[mathematical expression not reproducible] (65)

Proof. It is sufficient to calculate the mgf of X under P':

[mathematical expression not reproducible] (66)

Example 5. If X [??] GC(a, b; 0, 0, [c.sub.3], [c.sub.4]) and q = 1 the change of measure (63) leads to

[mathematical expression not reproducible] (67)

Hence, assuming [c.sub.1] = [c.sub.2] = 0 does not imply that [c'.sub.1] and [c.sub.2] are also zero. In other words, the family GC(a, b; 0,0, [c.sub.3], [c.sub.4]) is not closed under a change of measure that occurs naturally in option pricing (see Section 5).

2.5. Simulating Gram-Charlier Distributions. Simulation is required for many kinds of options, and it turns out that the Gram-Charlier distributions are very easy to generate, as we now show; there is no need to invert their distribution functions. When estimating some quantity m = Eg([X.sub.1], ..., [X.sub.s]) by simulation, one generates n independent vectors

([X.sup.(j).sub.1]), ..., [X.sup.(j).sub.s]), j = 1, ..., n, (68)

with the same distribution. Suppose all the X's are independent and have the same Gram-Charlier distribution with density

[f.sub.X] (x) = [phi](a, b, x) p(x), (69)

where the polynomial p(x) is given in (5). Then

[mathematical expression not reproducible] (70)

where under the measure Q the X's have a normal distribution N(a, [b.sup.2]). Hence, estimating Eg([X.sub.1], ..., [X.sub.s]) by simulation can be performed by generating ordinary normal random variables ([X.sup.(j).sub.1], ..., [X.sup.(j).sub.s]), j = 1, ..., n, and then using the estimator

[mathematical expression not reproducible] (71)

This is an application of the likelihood ratio method.

3. Convolution of Gram-Charlier Distributions; Gram-Charlier Processes

The simplest way to find the distribution of the sum of two independent Gram-Charlier distributed variables is to multiply their moment generating functions. Suppose [X.sub.j] ~ GC([a.sub.j], [b.sub.j]; [c.sup.(j).sub.1], ..., [c.sup.(j).sub.N]), j = 1, 2 are independent. Then

[mathematical expression not reproducible] (72)

Expanding the product, this says that

[mathematical expression not reproducible] (73)

It is then possible to know the explicit distribution of

[Z.sub.n] = [X.sub.1] + ... + [X.sub.n], (74)

where the X's are independent and have a Gram-Charlier distribution, constituting a discrete-time Gram-Charlier process. If the X's have the same GC(a, b; [c.sub.1], ..., [c.sub.N]) distribution then {[Z.sub.n], n [greater than or equal to] 0} is a random walk. The derivation of the distribution of [Z.sub.n] can be done recursively, using (73), or it can be done by finding the Taylor expansion of

[mathematical expression not reproducible] (75)

and thus

[mathematical expression not reproducible] (76)

These computations are simple using symbolic mathematics software. An example is given at the end of Section 6.1.

The above raises the question of whether there is a continuous-time process that has Gram-Charlier distributed increments. There is such a process with normal increments (Brownian motion), and it is moreover a Levy process.

Theorem 6. The only Levy process with Gram-Charlier distributed increments is Brownian motion.

Proof. It is sufficient to show that, besides the normal distribution, any Gram-Charlier distribution cannot be infinitely divisible. If X has a Gram-Charlier distribution then its moment generating function is of the form

[mathematical expression not reproducible] (77)

where [pi](x) is a polynomial. Suppose [X.sub.n, 1], ..., [X.sub.n,n] are independent, have the same distribution, and add up to X (in law). Fubini's theorem implies that [mathematical expression not reproducible] is finite for all real t, and thus

[mathematical expression not reproducible] (78)

is an analytic function of z in the whole complex plane. We then have the identity

[M.sub.n] [(z).sup.n] = M(z), z [member of] C, (79)

for every n= 1, 2, 3, .... This means that the function [pi][(z).sup.1/n], which is well defined for all z that is not a zero of [pi], has an analytic continuation over the whole of C for each n [greater than or equal to] 1, which is impossible, for instance, take n larger than the degree of [pi].

More precisely, this says that if we exclude Brownian motion, no increment of any Levy process can have a Gram-Charlier distribution. Any Gram-Charlier process with independent increments must be discrete-time.

4. The Log Gram-Charlier Distribution

The distribution of the exponential of a Gram-Charlier distributed variable will naturally be called "log Gram-Charlier", as we do for the lognormal: if Y ~ GC(a, b; [??]), then L = exp(Y) ~ LogGC(a, b; [??]). The density of L is

[f.sub.L] (z) = [1/x] [f.sub.Y] (log z), z > 0. (80)

This distribution has all moments finite, and they are given by Theorem 2(b). The log Gram-Charlier distribution shares one property with the lognormal, it is "moment indeterminate."

Theorem 7. The log Gram-Charlier distribution is not determined by its moments. More precisely, there is a noncountable number of other distributions that have the same moments as any particular log Gram-Charlier distribution.

Proof. There is a well-known way to construct a family of distributions that have the same moments as the lognormal (Feller , p. 227); the trick works for arbitrary parameters but, for simplicity, let L ~ LogN(0, 1); that is,

[mathematical expression not reproducible] (81)

All the functions

[f.sub.[alpha]] (z) = [f.sub.L] (z)(1 + [alpha] sin (2[pi] log z)), -1 [less than or equal to] [alpha] [less than or equal to] 1, (82)

are nonnegative, integrate to 1, and have the same moments as L (as a result of the symmetry of the standard normal distribution, after an obvious change of variable). Now, suppose L ~ LogGC(0, 1;[??]) (once again the same arguments work for other values of a and b), and consider

[g.sub.[alpha]](z) = [f.sub.L] (z) + [alpha][phi] (z) sin (2[pi] log z). (83)

By the same change of variables used for the lognormal one immediately finds that [g.sub.a] integrates to one and has the same moments as L. The only difference with the case of the lognormal is that [g.sub.a] is not necessarily nonnegative for -1 [less than or equal to] [alpha] [less than or equal to] 1. Two cases may arise. The first one is that the polynomial

[mathematical expression not reproducible] (84)

has no real zero. In that case its infimum is strictly greater than zero, and one can find a nonempty interval I = (-[epsilon], [epsilon]) such that [g.sub.a] is nonnegative for all [alpha] [member of] I. In the second case p(x) has one or more zeros and the previous argument breaks down but can be modified to yield the same conclusion, if the function sin(2[pi]y) is replaced with

s(y) = sin (2[pi]y) [1.sub.{y[not member of]J}], (85)

where J is a collection of intervals that include the zeros of p(x), so defined that s(x) is not identically zero and satisfies

s(y + 2k[pi]) = s (y), s (-y) = -s(y). (86)

(These two conditions are sufficient for

[[integral].sup.[infinity].sub.-[infinity]] [e.sup.ky] [phi](y) s(y) dy = 0 (87)

to hold for k = 1, 2, ....) The details are omitted. Another way to prove that the log Gram-Charlier distribution is moment indeterminate when p(y) has no zero (and thus the density [f.sub.L](z) does not take the value zero for any z > 0) is to use a Krein condition (Stoyanov , p. 941), which says that a continuous distribution on [R.sub.+] with positive density f(x) is not determined by its moments if

[mathematical expression not reproducible] (88)

5. Option Pricing Formulas

The formulas below hold for any vector ([c.sub.1], ..., [c.sub.N]) and thus extend those that have been derived by previous authors for the case where the log return has a GC(a, b; 0, 0, [c.sub.3], [c.sub.4]) distribution or a "squared" Gram-Charlier distribution (Leon et al. ). Schlogl  derives a formula equivalent to (90) but for densities expressed as an infinite Gram-Charlier series with [c.sub.1] = [c.sub.2] = 0.

As previous authors have done, we consider a market with a risky security S and a risk-free security with annual return r and suppose that the log return for period [0, T] (denoted as [X.sub.T]) has a Gram-Charlier distribution under the risk-neutral (or "pricing") measure, which we denote as Q. The physical measure (usually denoted as P) is not specified (nothing says that the log return has or does not have a Gram-Charlier distribution under the physical measure).

The market model may have one or more periods, but since we consider the pricing of ordinary European puts and calls only the distribution of the log return for the whole period [0, T] is required. For other types of options, in particular the applications presented in the next section, it maybe necessary to use the one-period returns separately, as is done in Theorem 11.

The risky security has price [S.sub.0] at time 0, and [mathematical expression not reproducible].

Theorem 8. Suppose that a risky security pays dividends at a constant rate [delta] over [0, T] and that the risk-free rate of interest is r. Suppose also that under the risk-neutral measure Q the log return of the risky security over [0, T] is [X.sub.T] [??] GC(a, b; [c.sub.1], ..., [c.sub.N]), which satisfies the martingale condition

[mathematical expression not reproducible] (89)

Then the time-0 price of a European call option with maturity T and strike price K is

[mathematical expression not reproducible] (90)

where [d.sub.2], [d.sub.1], and {[c.sup.*.sub.k]} are given by (97), (101), and (105). The price of the corresponding European put is

[mathematical expression not reproducible] (91)

If N = 4, then the above simplify to

[mathematical expression not reproducible] (92)

Proof. Absence of arbitrage implies that [E.sup.Q][S.sub.T] = [e.sup.(-r- [delta])T][S.sub.0], or

[mathematical expression not reproducible] (93)

This justifies (89). The price of the call is

[mathematical expression not reproducible] (94)

The second part is easier to deal with

[C.sup.-.sub.0] = [Ke.sup.-rT]Q ([X.sub.T] > log (K/[S.sub.0])). (95)

The probability of the event [X.sub.T] > log(K/[S.sub.0]) can be calculated explicitly by recalling that ([X.sub.T] - a)/b [??] GC(0, 1; [c.sub.1, ..., [c.sub.N]) and using Theorem 3:

[mathematical expression not reproducible] (96)

where

[d.sub.2] = [1/b] (log ([S.sub.0]/K) + a). (97)

To calculate [C.sup.+.sub.0] use the exponential change of measure formula, defining

[mathematical expression not reproducible] (98)

Then

[mathematical expression not reproducible] (99)

Since [X.sub.T] [??] GC(a + [b.sup.2], b; [c'.sub.1], ..., [c'.sub.N]),

[mathematical expression not reproducible] (100)

where

[mathematical expression not reproducible] (101)

Hence,

[mathematical expression not reproducible] (102)

Using

[mathematical expression not reproducible] (103)

(a consequence of (89) and (101)), it is possible to write

[mathematical expression not reproducible] (104)

where

[mathematical expression not reproducible] (105)

This proves (90).

The price of a European put can be found from the put-call parity identity

[P.sub.0] = [C.sub.0] - [S.sub.0] + [Ke.sup.-rT]. (106)

Finally, if N = 4 then the summation in (90) becomes

[mathematical expression not reproducible] (107)

This ends the proof.

The option price formulas are of the form "Black-Scholes plus correction term." Observe, however, that the values of [d.sub.1] and [d.sub.2] are different from what they would be in the Black-Sholes formula. More precisely, in the Black-Scholes model (where [c.sub.k] = 0 for all k [greater than or equal to] 1) we have a = [E.sup.Q][X.sub.T] and [b.sup.2] = [Var.sup.Q][X.sub.T], but this does not happen with Gram-Charlier distributed log returns, first because of the martingale condition (89) and second because of the result in part (j) of Theorem 2.

5.1. Sensitivities ("Greeks"). The previous literature includes formulas for sensitivities of Gram-Charlier option prices, but only in the case of the four-parameter GC(a, b; 0, 0, [c.sub.3], [c.sub.4]) Gram-Charlier distributions; see Jurczenko et al. , Rouah and Vainberg , and Chateau . Below we give sensitivities of the option prices calculated above with respect to all the parameters for a general Gram-Charlier distribution, taking the martingale condition (89) into account. This means that a is a function of [delta], b, {[c.sub.k]}, r and T (we might have written a = a([delta], b, {[c.sub.k]}, r, T)). However, a is not a function of [S.sub.0].

Theorem 9. Let [C.sub.0] be the price of the European call option described in Theorem 8. Then

[mathematical expression not reproducible] (108)

In the formula for [kappa] the symbols [c'.sub.N+1] and [c'.sub.N+2] are equal to zero.

Proof. The following lemma is obtained by elementary calculations.

Lemma 10. For any integrable random variable U and any constant K,

[mathematical expression not reproducible] (109)

If U has a continuous density [f.sub.U], then

[[partial derivative].sup.2]/[partial derivative][s.sup.2] E[(sU - K) .sub.+] = [K.sup.2]/[s.sup.3] [f.sub.U] (K/s). (110)

From [mathematical expression not reproducible] and (100),

[mathematical expression not reproducible] (111)

From the lemma,

[mathematical expression not reproducible] (112)

where [X.sub.T] [??] GC(a, b; [c.sub.1], ..., [c.sub.N]). Now the density of U may be expressed in terms of the density of G = ([X.sub.T] - a)/b [??] GC(0, 1; [c.sub.1], ..., [c.sub.N]):

[mathematical expression not reproducible] (113)

Setting u = K/[S.sub.0] and then using (103), we obtain

[mathematical expression not reproducible] (114)

Next,

[mathematical expression not reproducible] (115)

where G [??] GC(0, 1, c). For fixed G, the function g(r) = h(G, r) is absolutely continuous and thus

[mathematical expression not reproducible] (116)

Now, from (89),

[mathematical expression not reproducible] (117)

does not depend on r, so

[mathematical expression not reproducible] (118)

The same line of reasoning shows that the sensitivity to b is

[mathematical expression not reproducible] (119)

Define

[mathematical expression not reproducible] (120)

In order to shorten the formulas we will write (from (89))

[mathematical expression not reproducible] (121)

and thus find an expression for

[mathematical expression not reproducible] (122)

For arbitrary y [member of] R,

[mathematical expression not reproducible] (123)

To calculate

[mathematical expression not reproducible] (124)

we need (see (105))

[mathematical expression not reproducible] (125)

Then, noting that m'(b) = [c.sup.*.sub.1] = [c'.sub.1]m(b) and ([phi][He.sub.k- 1])' = -[phi][He.sub.k],

[mathematical expression not reproducible] (126)

Setting y = [d.sub.2] and putting the pieces together yield the result. When c = 0 this reduces to [S.sub.0][e.sup.-[delta]T][phi]([d.sub.1]), which is the sensitivity of the Black-Scholes call price to b = [sigma][square root of T].

Finally, turn to the sensitivities with respect to [c.sub.j], j [greater than or equal to] 1. First, let us derive [partial derivative]a/[partial derivative][c.sub.j]; from (89),

[mathematical expression not reproducible] (127)

Next,

[mathematical expression not reproducible] (128)

The first integral reduces to [S.sub.0][DELTA][partial derivative]a/[partial derivative][c.sub.j] (see the derivation of [DELTA] above). The second integral is evaluated by repeated partial integration: if g(x) is continuously differentiable k times and does not grow too fast,

[mathematical expression not reproducible] (129)

Hence the second integral in (128) becomes

[mathematical expression not reproducible] (130)

Putting all this together (using (103)) we get the formula for [mathematical expression not reproducible].

6. Two Applications

6.1. Equity Indexed Annuities. Hardy  describes the main types of equity indexed annuities (EIAs), point-to-point, annual ratchet and high water mark, in which an embedded European call option has to be priced. We consider compound ratchet EIAs without life-of-contract guarantee to explore the dependence of ratchet premium options on skewness and kurtosis of returns.

A single premium P is paid by the policy-holder, and the benefit under the ratchet premium contract is (, p. 248)

[mathematical expression not reproducible] (131)

Here, n is the term of the contract in years, [alpha] is the participation rate (a number between 0 and 1), and [S.sub.t] is the value of the equity index, usually the S&P500 index.

In , p. 249, the formula

[mathematical expression not reproducible] (132)

for [B.sub.n] is proved for the value of the premium option under a compound annual ratchet contract in the Black-Scholes model. The same reasoning will now be applied when index log returns have a Gram-Charlier distribution. To find a formula for the price of this option, we assume that under the risk-neutral measure the one-year log returns are independent Gram-Charlier distributed random variables

[R.sub.t] [??] GC(a, b; c), t = 1, ..., n. (133)

Independence of the variables across time implies that the value of the ratchet premium option is (if the annual risk-free rate of interest is r)

[mathematical expression not reproducible] (134)

As Hardy notes (, p. 249), each factor in the product is the price of a one-year European call option on [S.sub.t], if initial index price and strike are both equal to one.

Theorem 11. The no-arbitrage price of the EIA ratchet premium option described above is

[V.sub.0] = P [([e.sup.-r] + [alpha][C.sub.0]).sup.n], (135)

where [C.sub.0] is the price of a one-year call (see (90)) with [S.sub.0] = 1 and K = 1. The first-order sensitivities are

[mathematical expression not reproducible] (136)

with [xi] replaced with one of [S.sub.0], r , b, [c.sub.j], and [partial derivative][C.sub.0]/[partial derivative][xi] given in Theorem 9. The second-order sensitivity with respect to [S.sub.0] is

[mathematical expression not reproducible] (137)

We now show the effect of skewness and kurtosis on the ratchet premium option values; this is done using the four-parameter distribution GC(a, b; 0, 0, [c.sub.3], [c.sub.4]), for which this is especially simple. We also compute prices using the six-parameter GC(a, b; [c.sub.1], [c.sub.2], [c.sub.3], [c.sub.4]) distribution.

Our application involves an annual ratchet, and so the option [C.sub.0] in Theorem 11 has a maturity of one year; we thus need the distribution of one-year returns. For pricing that is consistent with the (derivatives) market what is needed is the one-year distribution of returns under the risk-neutral measure, which should be obtained from observed derivative prices; see Leon et al.  or Rompolis and Tzavalis  for more details. For illustrative purposes, we estimated the four parameters of that distribution by maximum likelihood ([6, 22]) to obtain the parameters of the GC(a, b; 0, 0, [c.sub.3], [c.sub.4]):

a = 0.0687, b = 0.1685, [c.sub.3] = -0.1150, [c.sub.4] = 0.03598. (138)

This says that the skewness and excess kurtosis of the fitted distribution are

[bar.s] = 6[c.sub.3] = -0.6898, [bar.k] = 24[c.sub.4] = 0.8634. (139)

This point is on the boundary of the feasible ([bar.s], [bar.k]) region; see Figure 3.

We use the parameters a and b while varying the values of [c.sub.3], [c.sub.4] to show how EIA premiums vary with skewness and excess kurtosis. Figure 4 shows the ratchet premium option values (Theorem 11) of an EIA with participation rate [alpha] = 0.6 and single premium P = 100, as a function of skewness [bar.s] and excess kurtosis [bar.k]. The risk-free rate is 3%, the dividend rate is 2%, and the term is 7 years. The martingale condition (89) must hold and so a is replaced with a so that (89) is satisfied. (N.B. The Black-Scholes model corresponds to [c.sub.3] = [c.sub.4] = 0, in which case a' =(r - [delta])T - [b.sup.2]/2; this is -0.004193 with the values of r, [delta], and b we are using and T = 1.)

The effect of varying ([c.sub.3], [c.sub.4]) is quite significant. The highest and lowest ratchet premium options in the graph are 109.30 to 104.24, which is the range [95%, 100%] as a proportion of the Black-Scholes premium option (\$109.26), which has skewness and excess kurtosis equal to zero. (The maximum value of the premium option is reached at ([c.sub.3], [c.sub.4]) = (0.0230, 0.00332567) and the minimum at (-0.0836,0.163); these points correspond to ([bar.s], [bar.k]) equal to (.138, .0798) and (0.023, 0.003326), resp.) The dependence on ([bar.s], [bar.k]) is nearly linear. This is easily explained. The price of a call is

[mathematical expression not reproducible] (140)

The sensitivity of [C.sub.0] to [c.sub.j] (j = 3,4) is thus

[mathematical expression not reproducible] (141)

The first term above is relatively small, and the second one is almost constant, since a' does not change much with [c.sub.j]. This carries over to the sensitivity of the ratchet premium option [V.sub.0] to changes in [c.sub.j] (see the first formula in Theorem 11).

The parameters fitted by maximum likelihood for the GC(a, b; [c.sub.1], [c.sub.2], [c.sub.3], [c.sub.4]) distribution are

a = 0.1174, b = 0.1595, [c.sub.1] = -0.3053675695201066, [c.sub.2] = 0.09542079373489153, [c.sub.3] = -0.12383971126335243, [c.sub.4] = 0.06120331530131559. (142)

The martingale condition implies a' = 0.0451, and the EIA premium option has value 107.90. The skewness and excess kurtosis of the fitted distribution are [bar.s] = -0.5437 and [bar.k] = 0.5092. (N.B. The feasible region for ([bar.s], [bar.k]) in the GC(a, b; [c.sub.1], [c.sub.2], [c.sub.3], [c.sub.4]) case is naturally larger than the one for the GC(a,b;0,0,[c.sub.3], [c.sub.4]); in this case ([bar.s], [bar.k]) are just outside the region in Figure 3.) It was checked numerically that the parameters [c.sub.1] to [c.sub.4] lead to a nonnegative function.

Table 1 shows ratchet premium option values based on Gram-Charlier with four parameters that are lower than the Black-Scholes based ones. Rows A and F are Black-Scholes cases, row B is our estimation of the four-parameter Gram-Charlier (as explained above), row C is the four-parameter Gram-Charlier with the largest negative skewness possible (see Figure 1), row D is similar but with the largest positive skewness, and row E has maximum kurtosis but no skewness. Rows A to E have the same parameter b, the one we estimated from S&P500 for a four-parameter Gram-Charlier, while row F has the same b as the six-parameter Gram-Charlier we estimated. Skewness and kurtosis are shown, followed by the break-even (or "fair") [alpha] ("B-E [alpha]" in the table) and then the ratchet premium option values for two specific choices for [alpha]. Percentage differences with the values in row A (resp., F) are also displayed ("[DELTA]A%" in the table). It is notable that in all cases the effect of moving away from Black-Scholes is more pronounced for the break-even participation rates than for the option values themselves. The scenario with the greatest impact (compared to Black-Scholes) is the one zero skewness and maximal excess kurtosis (row E). Insurers are unlikely to accept a cost of contract ([V.sub.0]) greater than the received initial premium (\$100); they may thus wish to adjust the participation rate [alpha] down to the break-even rate (to wit, see Lin and Tan ). The Black-Scholes break-even participation rate in row A is 41.9%, but it increases from 44.3% up to 49.3% in rows B to E. These are significant increases, ranging from 5.5% to 17.5%. Rows F and G are based on two quite different risk-neutral probability distributions, but the ratchet premium options are surprisingly similar. The parameters we estimated may not be those that a practitioner would use, but what this example says is that skewness and kurtosis have a very real importance in the pricing of equity indexed annuities and that the fair participation rate is, at least here, more sensitive to nonzero skewness and excess kurtosis than are option values. (See also  regarding the separate effects of skewness and excess kurtosis on option values.)

Let us compare this with the approach in . That paper is about a subclass of the Gram-Charlier distributions described in this paper; the authors call their distributions "seminonparametric" and have density (6), where p(x) is the square of the polynomial q(x) If q(x) is of order m, then it is possible to express p(x) = q[(x).sup.2] as

[mathematical expression not reproducible] (143)

However, most nonnegative polynomials cannot be expressed as squares of another polynomial, so the order-m SNP family is a strict subset of the order-2m Gram-Charlier family. The order 2m SNP family has 2m + 2 free parameters (the coefficients of an order-2m polynomial plus the location and dispersion parameters a, b, subject to density integrating to one, which in effect removes one parameter); by comparison, the GC(a, b; [c.sub.1], ..., [c.sub.2m]) distributions have 2m+2 parameters, but they are not "free" because the resulting density must be nonnegative. It is then not a priori clear which of SNP(2m) or GC(a, b; [c.sub.1], ..., [c.sub.2m]) would do best. We looked at this problem with the S&P500 annual prices, comparing SNP(2) and GC(a, b; 0, 0, [c.sub.3], [c.sub.4]), which both have four parameters. The likelihood function is written the same way in both cases; there are just different restrictions on the parameters. The results for GC(a, b; 0, 0, [c.sub.3],[c.sub.4]) are given in (138), and those for the SNP(2) are

a = -0.2445, b = 0.1940, [c.sub.1] = 1.6140, [c.sub.2] = 1.1733, [c.sub.3] = 0.4253, [c.sub.4] = 0.07320. (144)

This translates into the following moments for the SNP(2) fitted distribution:

mean 0.0687 standard deviation 0.1671 skewness - 0.6289 excess kurtosis 2.5502. (145)

The excess kurtosis is strikingly far from the data's 0.8903. The likelihood function evaluated at that estimate was lower than the likelihood evaluated at the parameters of the GC(a, b; 0, 0, [c.sub.3], [c.sub.4]) distribution. This is a puzzling result that does not mean that GC(a, b; 0, 0, [c.sub.3], [c.sub.4]) will always do better than SNP(2); it does however show that restricting the search to squares of polynomials does have consequences. Leon et al.  derive formulas for European option prices, taking the martingale restriction into account, based on expression (143); in this paper we do the same calculation for an arbitrary Gram-Charlier distribution. It is easy to see that the SNP class is not closed under convolution; that is, the distribution of the sum of two independent SNP variables is in general not an SNP distribution; the general formulas we derive are essential in order to add independent Gram-Charlier distributed variables (including those in the SNP subset), and therefore the general formulas are needed to define Gram-Charlier processes. Leon et al.  are correct in pointing out that there is an advantage in using p(x) = q[(x).sup.2] as far as the nonnegativity constraint is concerned, but we believe that it is no more difficult to estimate the parameters of a general Gram-Charlier distribution, including the nonnegativity constraint in the maximization procedure (rather than limiting the parameter space). A full comparison of the numerical and statistical advantages/disadvantages of our approach versus the one in  is an interesting avenue for further research.

6.2. Lookback Options. Consider a European lookback call option with fixed strike, with payoff ([bar.S] - K)+, where [bar.S] is the maximum of the stock price between time 0 and time T. We let the time points used to determine the maximum be equally spaced. There is no closed form formula for the distribution of the maximum price nor for the price of such a discretely monitored lookback option, so simulation is used. We generated normal random variables and used the technique described in Section 2.5. The assumptions are maturity T = 1, monthly monitoring, annual volatility 0.1685 (where this number comes from is explained in the next subsection). The interest rate is set to 0, initial stock price is \$100, and strike price is also \$100.

Figure 5 shows the option prices for values of skewness and excess kurtosis that can be achieved with the GC(a, b; 0, 0, [c.sub.3], [c.sub.4]) family for monthly rates of return. Roughly speaking, prices decrease when excess kurtosis increases, and also as kurtosis decreases, though the dependence on [c.sub.3] and [c.sub.4] becomes less linear as [bar.k] increases. When [c.sub.3] = [c.sub.4] = 0 (Gaussian case) the option price is 11.277. The price is a little higher (\$11.286) when [bar.s] = 0.75 and [bar.k] = 1.0. The lowest price (\$10.163) occurs when [bar.s] = 0 and [bar.k] = 4. Skewness and kurtosis thus have a significant effect on lookback option prices; the effect will of course vary depending on the parameters. (Observe that all those prices are quite far from the Black-Scholes continuous-monitoring lookback price, \$14.17.)

7. Discussion and Conclusion

This paper tries to provide a framework for applying Gram-Charlier series to option pricing. We have shown that option prices may be significantly affected by varying skewness or kurtosis away from their Gaussian values. The practical consequence is that using the Black-Scholes formula (i.e., zero skewness and excess kurtosis) may distort option prices. Gram-Charlier distributions capture skewness and kurtosis and retain a lot of the tractability of the normal distribution. Estimating parameters remains a challenge for Gram-Charlier distributions involving more than four parameters.

Convergence of Gram-Charlier and other expansions in option pricing and other applied problems is an area for further research. The heuristic derivation of the Gram-Charlier/Edgeworth series of a density g(x) in terms of the normal density and its derivatives (see, e.g., , Section 3.2) may give the impression that the inverted series should converge or at least have some asymptotic property; this is not the case in general. Gram-Charlier (or Edgeworth) expansions do not always converge to the true probability distribution, as Section 2.3 indicates; see , Section 17, or  for more details. In the context of the Central Limit Theorem, the Edgeworth series for the density of the normalized sum or random variables have asymptotic properties, when the number of summands n tends to infinity. Details are given in . Multivariate versions of the Gram-Charlier series exist, for instance, [10, 27, 28], apply such generalizations.

The Gram-Charlier type B series are based on the Poisson distribution, rather than the normal. There are several other expansions that can be used. For instance, series of Laguerre polynomials may be used for densities on the positive half-line (a convergent Laguerre series has some numerical success in pricing Asian options; see ). The Jacobi polynomials  are orthogonal over the interval (-1, 1), but any other finite interval may be chosen by making the obvious change of variable, and this leads to series expansions for densities with compact support, based on the beta distribution. An application of log-transformed shifted Jacobi polynomials to arbitrary approximation of probability distributions on the positive half-line may be found in .

http://dx.doi.org/10.1155/2017/8690491

Competing Interests

The authors declare that they have no competing interests.

Acknowledgments

The authors thank the Committee for Knowledge Extension and Research (CKER) of the Society of Actuaries for partially funding this project.

References

 P. A. Abken, D. B. Madan, and S. Ramamurtie, "Estimation of risk-neutral and statistical densities by Hermite polynomial approximation," Working Paper 96-5, Federal Reserve Bank of Atlanta, 1996.

 D. Backus, S. Foresi, K. Li, and L. Wu, "Accounting for biases in Black-Scholes," Working Paper, Stern School of Business, New York University, New York, NY, USA, 1997.

 J.-P. D. Chateau, "Marking-to-model credit and operational risks of loan commitments: a Basel-2 advanced internal ratings-based approach," International Review of Financial Analysis, vol. 18, no. 5, pp. 260-270, 2009.

 J. D. Chateau, "Valuing european put options under skewness and increasing (excess) kurtosis," Journal of Mathematical Finance, vol. 4, no. 3, pp. 160-177, 2014.

 C. J. Corrado, "The hidden martingale restriction in Gram-Charlier option prices," Journal of Futures Markets, vol. 27, no. 6, pp. 517-534, 2007.

 E. Jondeau and M. Rockinger, "Gram-Charlier densities," Journal of Economic Dynamics and Control, vol. 25, no. 10, pp. 1457-1483, 2001.

 E. Jurczenko, B. Maillet, and B. Negrea, "Skewness and kurtosis implied by option prices: a second comment," Discussion Paper of the LSE-FMG 419, 2002.

 E. Jurczenko, B. Maillet, and B. Negrea, "A note on skewness and kurtosis adjusted option pricing models under the Martingale restriction," Quantitative Finance, vol. 4, no. 5, pp. 479-488, 2004.

 J. L. Knight and S. E. Satchell, Return Distributions in Finance, Butterworth Heineman, 2001.

 E. Schlogl, "Option pricing where the underlying assets follow a Gram/Charlier density of arbitrary order," Journal of Economic Dynamics & Control, vol. 37, no. 3, pp. 611-632, 2013.

 A. Leon, J. Mencia, and E. Sentana, "Parametric properties of semi-nonparametric distributions, with applications to option valuation," Journal of Business & Economic Statistics, vol. 27, no. 2, pp. 176-192, 2009.

 M. Hardy, Investment Guarantees: Modeling and Risk Management for Equity-Linked Life Insurance, John Wiley & Sons, New York, NY, USA, 2003.

 P. Gaillardetz and X. S. Lin, "Valuation of equity-linked insurance and annuity products with binomial models," North American Actuarial Journal, vol. 10, no. 4, pp. 117-144, 2006.

 P. Boyle and W. Tian, "The design of equity-indexed annuities," Insurance: Mathematics & Economics, vol. 43, no. 3, pp. 303-315, 2008.

 S. G. Kou, "Discrete barrier and lookback options," in Handbooks in Operations Research and Management Science, vol. 15, chapter 8, pp. 343-373, 2007.

 D. E. Barton and K. E. Dennis, "The conditions under which Gram-Charlier and Edgeworth curves are positive definite and unimodal," Biometrika, vol. 39, no. 3-4, pp. 425-427, 1952.

 H. Cramer, On Some Classes of Series Used in Mathematical Statistics , Skandinaviske Matematiker Congres, Copenhagen, Denmark, 1925.

 W. Feller, An Introduction to Probability Theory and Its Applications, vol. 2, John Wiley & Sons, New York, NY, USA, 1971.

 J. Stoyanov, "Krein condition in probabilistic moment problems," Bernoulli, vol. 6, no. 5, pp. 939-949, 2000.

 F. D. Rouah and G. Vainberg, Option Pricing Models and Volatility Using Excel-VBA, John Wiley & Sons, New York, NY, USA, 2007.

 L. S. Rompolis and E. Tzavalis, "Retrieving risk neutral densities based on risk neutral moments through a Gram-Charlier series expansion," Mathematical and Computer Modelling, vol. 46, no. 1-2, pp. 225-234, 2007.

 E. B. Del Brio and J. Perote, "Gram-Charlier densities: maximum likelihood versus the method of moments," Insurance: Mathematics and Economics, vol. 51, no. 3, pp. 531-537, 2012.

 X. S. Lin and K. S. Tan, "Valuation of equity-indexed annuities under stochastic interest rates," North American Actuarial Journal, vol. 7, no. 4, pp. 72-91, 2003.

 S. R. Das and R. K. Sundaram, "Of smiles and smirks: a term structure perspective," Journal of Financial and Quantitative Analysis, vol. 34, no. 2, pp. 211-239, 1999.

 J. E. Kolossa, Series Approximation Methods in Statistics, Springer, New York, NY, USA, 3rd edition, 2006.

 H. Cramer, Mathematical Methods of Statistics, Princeton University, Princeton, NJ, USA, 1946.

 E. B. Del Brio, T.-M. Niguez, and J. Perote, "Gram-Charlier densities: a multivariate approach," Quantitative Finance, vol. 9, no. 7, pp. 855-868, 2009.

 E. B. Del Brio, T.-M. Niguez, and J. Perote, "Multivariate semi-nonparametric distributions with dynamic conditional correlations," International Journal of Forecasting, vol. 27, no. 2, pp. 347-364, 2011.

 D. Dufresne, "Laguerre series for Asian and other options," Mathematical Finance, vol. 10, no. 4, pp. 407-428, 2000.

 N. N. Lebedev, Special Functions and Their Applications, Dover, New York, NY, USA, 1972.

 D. Dufresne, "Fitting combinations of exponentials to probability distributions," Applied Stochastic Models in Business and Industry, vol. 23, no. 1, pp. 23-48, 2007.

Jean-Pierre Chateau (1) and Daniel Dufresne (2)

Correspondence should be addressed to Daniel Dufresne; dufresne@unimelb.edu.au

Received 17 August 2016; Accepted 1 November 2016; Published 8 February 2017

Caption: Figure 1: Gram-Charlier approximations to the density [g.sub.0.3] (x) (solid line) when q = 1/[[sigma].sub.0.3] = 1.66, resulting in [c.sub.1] = [c.sub.2] = 0. The curve with long dashes is the order 10 approximation, and the one with short dashes is the order 20 approximation.

Caption: Figure 2: Gram-Charlier approximations to the density [g.sub.0.3] (x) (solid line) when q = 1.4. The curve with long dashes is the order 10 approximation, and the one with short dashes is the order 20 approximation.

Caption: Figure 3: Feasible region for the skewness and excess kurtosis of the GC(a, b; 0, 0, [c.sub.3], [c.sub.4]) distributions and maximum likelihood estimate.

Caption: Figure 4: Ratchet premium option prices as a function of skewness and excess kurtosis, with GC(a, b; 0, 0, [c.sub.3], [c.sub.4]) distributions.

Caption: Figure 5: Lookback option prices as a function of skewness and excess kurtosis.
```Table 1: Comparison of ratchet option values computed with the
Black-Scholes formula and Gram-Charlier distributions. Rows
A to F are based on the four-parameter GC(a, b; 0, 0, [c.sub.3],
[c.sub.4]), while row G is based on a six-parameter
GC(a, b; [c.sub.1], [c.sub.2], [c.sub.3], [c.sub.4]).

a'             b     [c.sub.1]   [c.sub.2]   [c.sub.3]   [c.sub.4]

A -0.0042   0.1685       0           0           0           0
B -0.0035   0.1685       0           0        -0.1150     0.0360
C -0.0034   0.1685       0           0        -0.1749     0.1021
D -0.0051   0.1685       0           0         0.1749     0.1021
E -0.0043   0.1685       0           0           0        0.1667

a'             b     [c.sub.1]   [c.sub.2]   [c.sub.3]   [c.sub.4]

F 0.0027    0.1595       0           0           0          .0
G 0.0451    0.1595    -.3054      .09542     -0.12384     0.06120

[V.sub.0]
[DELTA]   ([alpha] =
a'           [bar.s]   [bar.k]   B-E [alpha]    (A)%       0.6)

A -0.0042       0         0         0.419        0        109.26
B -0.0035    -0.6898   0.8634       0.443       5.5       107.60
C -0.0034    -1.0493   2.4508       0.478       13.9      105.42
D -0.0051     1.0493   2.4508       0.446       6.3       107.39
E -0.0043       0      4.0000       0.493       17.5      104.59

[V.sub.0]
[DELTA]   ([alpha] =
a'           [bar.s]   [bar.k]   B-E [alpha]    (A)%       0.6)

F 0.0027        0         0         0.441        0        107.69
G 0.0451     -0.5437   0.5091       0.438       -0.7      107.90

[V.sub.0]
[DELTA]   ([alpha] =   [DELTA]
a'            (A)%      0.419)       (A)%

A -0.0042      0        100.00        0
B -0.0035     -1.5       98.92       -1.1
C -0.0034     -3.5       97.50       -2.5
D -0.0051     -1.7       98.78       -1.2
E -0.0043     -4.3       96.96       -3.0

[V.sub.0]
[DELTA]   ([alpha] =   [DELTA]
a'            (A)%      0.419)       (A)%

F 0.0027       0        100.00        0
G 0.0451      0.2       100.14       0.14
```
Title Annotation: Printer friendly Cite/link Email Feedback Research Article Chateau, Jean-Pierre; Dufresne, Daniel Journal of Probability and Statistics Report 1USA Jan 1, 2017 11712 A Conditional Fourier-Feynman Transform and Conditional Convolution Product with Change of Scales on a Function Space II. Stochastic Models for the Infectivity Function in an Infinite Population of Susceptible Individuals. Distribution (Probability theory) Pricing Probability distributions