Printer Friendly
The Free Library
14,559,952 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

2 Bayesian prediction, entropy, and option pricing.


Abstract: This paper studies the performance of the Foster-Whiteman (1999) procedure for using a Bayesian predictive distribution for the future price of an asset to compute the price of a European option European Option

An option that can only be exercised at the end of its life.

Notes:
In other words, you must ride the rollercoaster until the maturity date, and only then can you cash in.
 on that asset. A technical contribution of the paper is the description of a sequential importance sampling procedure for implementing an informative prior that reflects and rewards past option-pricing success. The risk-neutralization of the predictive distribution is accomplished by Stutzer's (1996) constrained KLIC-minimizing change of measure. The procedure is used in weekly pricing of July and November options on soybeans on the Chicago Board of Trade Chicago Board of Trade (CBOT)

The second largest futures exchange in the US, and a pioneer in the development of financial futures and options.
 from 1993-1997, and produces option prices that mimic market prices much more closely than those of the Black model or those produced by risk-neutralizing a nonparametric predictive.

Keywords:

BAYESIAN PREDICTION; ENTROPY; OPTION PRICING; COMMODITIES.

1. Introduction

Options pricing techniques have been an important part of finance for some time. Most approaches specify a particular stochastic process stochastic process

In probability theory, a family of random variables indexed to some other set and having the property that for each finite subset of the index set, the collection of random variables indexed to it has a joint probability distribution.
 to represent the price dynamics of the underlying asset and then derive an explicit pricing model. While this may be acceptable for standard financial assets Financial assets

Claims on real assets.
, it may be problematic for commodities. Many commodities have significant seasonalities and require a far more elaborate time-series specification of the price dynamics of the underlying asset. Hence, it becomes difficult at best to derive explicit pricing formulae. Further, with the additional complexity of a rich time-series specification, estimation risk becomes a genuine concern. Finally, not all predictive information need be drawn from the historical time-series, so allowing for an explicit incorporation of nonsample beliefs could be important.

In this paper we suggest an alternative approach. We use numerical Bayes techniques to build a predictive density for the price of the underlying asset (for the example, in this paper we need to predict the soybean soybean, soya bean, or soy pea, leguminous plant (Glycine max, G. soja, or Soja max) of the family Leguminosae (pulse family), native to tropical and warm temperate regions of Asia, where it has been  cash and futures prices Futures price

The price at which parties to a futures contract agree to transact upon the settlement date.
 at the option's expiration). Bayesian techniques allow for two very important additions. First, they enable us to integrate out any estimation risk. Second, they allow us to incorporate properly any non-sample information that we may have through an informative prior. Indeed, our informative prior is built sequentially by rewarding past option-pricing successes. Once the predictive density has been computed, we use a procedure proposed by Stutzer (1996) to translate this density to its risk-neutral form. With the risk-neutral density, pricing European options is very straightforward.

To illustrate this technique, we consider prices of options on soybean futures traded on The Chicago Board of Trade. We start with a simple vector autoregressive specification for the spot and futures prices. We enrich this predictive model to include weather data as well as futures market futures market, a commodity exchange where contracts for the future delivery of grain, livestock, and precious metals are bought and sold. Speculation in futures serves to protect both the developers and the users of the commodities from unfavorable and unpredictable  trading activity as evidenced by trading volume Trading volume

The number of shares transacted every day. As there is a seller for every buyer, one can think of the trading volume as half of the number of shares transacted. That is, if A sells 100 shares to B, the volume is 100 shares.
 and open interest. We compare this procedure with traditional approaches as well as with a non-parametric procedure advocated by Stutzer (1996).

The paper is organized as follows. The next section describes the construction of a predictive distribution for the futures price at the time of expiration of the contract. Section 3 uses a procedure due to Stutzer (1996) to change the probabilities implicit in Adj. 1. implicit in - in the nature of something though not readily apparent; "shortcomings inherent in our approach"; "an underlying meaning"
underlying, inherent
 the numerical predictive distribution in such a way that they can be thought of as 'risk-neutral' probabilities. Using this transformed distribution, options can be priced by computing a partial expectation and discounting at the risk-free rate Risk-free rate

The rate earned on a riskless asset.
. Section 4 describes a study of the performance of the procedure in week-by-week pricing of soybean options from 1993-1997. Section 5 concludes.

2. Building the Predictive

The options to be priced are on soybean futures contracts, so the predictive distribution that will be needed is for the soybean futures price on the day the option contract expires, [F.sub.E], where F denotes the futures price and the E subscript (1) In word processing and scientific notation, a digit or symbol that appears below the line; for example, H2O, the symbol for water. Contrast with superscript.

(2) In programming, a method for referencing data in a table.
 refers to the future's expiration day. To predict the futures price we use a multivariate The use of multiple variables in a forecasting model.  model describing the evolution of the (log of the) spot and futures prices, which are related via the cost of carry relation:

1n([F.su.t]) = 1n([S.sub.t]) + [b.sub.t], (1)

where S denotes the spot price and b is the basis. The basis represents the percentage cost of carrying the spot commodity forward in time to the future's expiration date Expiration Date

The day on which an options or futures contract is no longer valid and, therefore, ceases to exist.

Notes:
The expiration date for all listed stock options in the U.S.
 (which is not the same as the options expiration date).

The model we utilize is a kind of vector autoregression Vector autoregression (VAR) is an econometric model used to capture the evolution and the interdependencies between multiple time series, generalizing the univariate AR models.  (VAR) for the spot and futures prices. This structure is convenient for our study, but it will become clear below that in principle we could use any model that can be simulated to produce a predictive distribution. Let [y.sub.t] denote the 4 x 1 vector containing the futures price, the basis, and open interest ([O.sub.t]) and volume ([V.sub.t]) in the futures market:

[y.sub.t] = (1n([F.sub.t]), [b.sub.t] 1n([O.sub.t]), 1n([V.sub.t]))' (2)

The trading volume and open interest variables and equations are included to pick up any volume / volatility relations. (1) Because reported trading volume gives the summed absolute value of trade sizes (it ignores whether the trade is buyer or seller initiated) we also include open interest. Open interest can be thought of as a signed summation summation n. the final argument of an attorney at the close of a trial in which he/she attempts to convince the judge and/or jury of the virtues of the client's case. (See: closing argument)  of past trading volumes.

In addition to the variables in y, we take as exogenous Exogenous

Describes facts outside the control of the firm. Converse of endogenous.
 a set of [sigma] variables [d.sub.t] which includes seasonal dummy variables and variables describing the weather, including rainfall and temperature data at various locations around the soybeangrowing region of the Midwestern United States United States, officially United States of America, republic (2005 est. pop. 295,734,000), 3,539,227 sq mi (9,166,598 sq km), North America. The United States is the world's third largest country in population and the fourth largest country in area. . The model is a near-VAR because it does not describe the evolution of these variables. The near-VAR can be written

[y.sub.t] = C + [Dd.sub.1] + A(L)[y.sub.t-1] + [v.sub.t] [v.sub.t] ~ iid N(O,[SIGMA]), (3)

where C and D are vectors of constants and A(L) is a vector of [lambda]-degree polynomials in the lag operator In time series analysis, the lag operator or backshift operator operates on an element of a time series to produce the previous element. For example, given some time series

. Henceforth, we shall refer to the parameters in C, D, and A(L) as [mu], and write [theta Theta

A measure of the rate of decline in the value of an option due to the passage of time. Theta can also be referred to as the time decay on the value of an option. If everything is held constant, then the option will lose value as time moves closer to the maturity of the option.
] = ([mu][SIGMA]).

Our approach to prediction is Bayesian. Thus we treat the unknown parameters [theta] as random, and we condition our analysis on the observed data. We first need to describe how observed data modify our subjective views about the unknown parameters through the posterior distribution, and then how the posterior is used to build a predictive distribution for future values of the spot and futures prices. Given the simple structure of our model, this is quite standard, and readers familiar with such derivations may wish to skip to the description of the informative prior case.

2.1 Flat Prior Posterior

Our description of the posterior distribution under a flat prior follows Foster and Whiteman (1999) closely. Let Y denote the Txn matrix with t-th row given by [y'.sub.t] , and let X denote the T x(2+n2) matrix with t-th row given by (1, [d.sub.t]', [y.sub.t-1]'). Using the independence of the [v.sub.t]'s and noting that the Jacobian of the transformation from v to y is unity, the sampling density of Y conditional on [lambda], initial values, is

[MATHEMATICAL EXPRESSION A group of characters or symbols representing a quantity or an operation. See arithmetic expression.  NOT REPRODUCIBLE IN ASCII ASCII or American Standard Code for Information Interchange, a set of codes used to represent letters, numbers, a few symbols, and control characters. Originally designed for teletype operations, it has found wide application in computers. ] (4)

or

P(Y|[theta]) [varies] [[absolute value of [summation]].sup.-(T-[lambda]-1)/2] exp exp
abbr.
1. exponent

2. exponential
[-1/2 tr (Y - XB)'(Y - XB)[[summation].sup.-1]] (5)

where tr denotes the trace operator. That is, the VAR can be seen to be a version of the standard multivariate regression model:

Y = XB + V, (6)

where the (1+[delta]+n[lambda])x n matrix B contains the VAR coefficients, and the rows of V are iid N(0, [summation]).

To begin, we adopt an 'uninformative' prior. There are many interpretations that can be given to the term 'uninformative'; we use the standard 'flat' prior:

P(B, [summation]) [varies [[absolute value of [summation].sup.(n+1)/2] (7)

(see Zellner 1971). The posterior distribution of the parameters is the product of the likelihood and the prior, or

P(B, [summation]|Y, X) [varies] [[absolute value of [summation].sup.-(T-[lambda]+n)/2] exp[-1/2 tr (Y -XB)'(Y-XB)[[summation].sup.-1]]. (8)

Analysis of this expression is simplified by rewriting using the least squares estimate of B, [??], and the sum of squares matrix,

S = (Y-X[??])'(Y-X[??]), (9) yielding

P(B, [summation]|Y, X) [varies] [[absolute value of [summation].sup.-(T-[lambda]+n)/2]exp{-1/2 tr [S+ (B -[??]) 'X'X(B - [??])][[summation].sup.-1]}. (10)

This expression can be factored to reveal that

P(B,[summation]|Y,X) [varies] p(B|[summation],Y,X)P([summation]|Y,X), (11)

where

P(B|[summation],Y,X,) [varies][[absolute value of [summation].sup.-k/2] exp{-1/2 tr(B - [??])'X'X(B - [??])[[summation].sup.-1]}. (12)

is the normal distribution, (k = l+[delta]+n[lambda]), and

P([summation]|Y,X) [varies] [[absolute value of [summation].sup.-v/2]exp{-1/2 trS[[summation].sup.-1]} (13)

is the 'inverse-Wishart' distribution (v = T-[lambda]-k+n+1).

2.2 Predictive Distribution

Suppose the prediction horizon is h, so that E = T+h. Given [theta] and data on [y.sub.t] and [x.sub.t] for t = 1,2, ..., T denoted by [Y.sub.T] and [X.sub.T], standard Gaussian regression arguments yield the sampling distribution for [y.sub.T+h]:

P([y.sub.T+h]|[theta], [Y.sub.T],[X.sub.T]). (14)

Then the joint distribution of the future values and the unknown parameters is

P([y.sub.T+h],[theta]|[Y.sub.T][X.sub.T]) = P([y.sub.T+h]|[theta], [Y.sub.T], [X.sub.T]) P([theta]|[Y.sub.T],[X.sub.T]). (15)

The predictive distribution is obtained by integrating out the uncertain parameters,

P([y.sub.T+h],[theta]|[Y.sub.T][X.sub.T]) = [integral]P([y.sub.T+h]| [theta], [Y.sub.T], [X.sub.T]) P([theta]|[Y.sub.T],[X.sub.T])d[theta]. (16)

Analytical treatment of this distribution is difficult under the best of circumstances; were an informative prior embedded Inserted into. See embedded system.  in the posterior distribution, such analysis is generally not possible. But numerical analysis numerical analysis

Branch of applied mathematics that studies methods for solving complicated equations using arithmetic operations, often so complex that they require a computer, to approximate the processes of analysis (i.e., calculus).
 is straightforward. To generate a random sample from the predictive distribution, one simply generates a random sample from the posterior distribution, {[[theta].sub.1],}, [iota] = 1, ..., N, and for each element of this sample, one generates a sample from P([y.sub.T+h]|[[theta].sub.[iota]],[Y.sub.T], [X.sub.T]). This is straightforward. In particular, to sample from the posterior distribution, simply sample from the appropriate inverse-Wishart, use this drawing of [summation], [[summation].sub.[iota]], to condition the normal, and draw a [B.sub.[iota]]. For each drawing of [B.sub.[iota]], and [[summation].sub.[iota]], a drawing from the predictive is calculated as follows: first make h drawings from the N(0,[[summation].sub.[iota]]) to generate realizations of shocks [v.sub.T+1], [v.sub.T+2], ..., [v.sub.T+h]. Then starting from the last [lambda], sample data points, perform a dynamic simulation Dynamic Simulation is similar to a physics engine, the technology used in many powerful computer graphics software programs, like 3ds Max, Maya, Lightwave, and many others to simulate physical characteristics.  of the VAR using the previously drawn [B.sub.[iota]], and the newly drawn shocks. The only complication here stems from the fact that the future values of the d variables are not known at date T. We make the process fully operational by replacing these future values for the weather variables in d by historical averages using data through 1992.

2.3 Informative Prior

In our application, we also experimented with various informative priors in place of the 'flat' prior expression (7). In particular, in place of (7) we used

P(B, [summation]) [varies] [[absolute value of [summation].sup.-(n+1)/2]f(B) (17)

where f(B) is a density. The density we used embodies the notion that values of B that price options 'well' are more likely; this density will be described once the option pricing procedure is introduced in the next section. To generate a sample under the informative prior, we turned our flat prior posterior distribution into an 'importance sampler' (see Geweke 1989). That is, we sample from the flat prior posterior distribution, and assign each such drawing a 'weight' of

f([B.sub.i])/[[summation].sup.N.sub.i=1]f([B.sub.i]).

Thus, for example, the posterior mean under the flat prior posterior would be calculated by

[N.sup.-1][[summation].sup.N.sub.i=1][B.sub.i],

while the posterior mean under the informative prior would be calculated by

[[summation].sup.N.sub.i=1][f([B.sub.i])/[[summation].sup.N.sub.j=1]f ([B.sub.J])][B.sub.i].

Under mild conditions (see Geweke 1989), estimates computed in this way converge almost surely to the population values. (See DeJong & Whiteman 1994 for an application of this procedure.)

3. The Risk-Neutral Density and Option Prices

Once we have computed the predictive density we need to risk-adjust the probabilities to form the risk-neutral or pricing density. To do this we use a procedure advocated by Stutzer (1996). This procedure uses the maximum entropy principle of information theory to transform the predictive density to its risk-neutral form. This section describes his basic approach.

Using the Monte Carlo Monte Carlo (môNtā` kärlō`), town (1982 pop. 13,150), principality of Monaco, on the Mediterranean Sea and the French Riviera.  sample from the predictive density for [F.sub.E] we compute a futures return factor, [R.sup.i](E-T), for each draw, i=1,2, ..., N:

[F.sup.i.sub.E] = [F.sub.T][R.sup.i](E-T), (18)

where T denotes the end of the sample (the 'current' date) and E is the options expiration date. In the flat-prior case, each of these drawings is assigned an equal weight of 1/N, whereas in the informative prior case the weights will differ according to according to
prep.
1. As stated or indicated by; on the authority of: according to historians.

2. In keeping with: according to instructions.

3.
 (17) and the ensuing en·sue  
intr.v. en·sued, en·su·ing, en·sues
1. To follow as a consequence or result. See Synonyms at follow.

2. To take place subsequently.
 discussion. We now need to transform the Monte Carlo probabilities for each draw, [??](i) so that the resulting estimated risk-adjusted density, [[??].sup.*](i) prices the futures contract Futures Contract

An exchange traded agreement to buy or sell a particular type and grade of commodity for delivery at an agreed upon place and time in the future. Futures contracts are transferable between parties.
 properly. That is, we require the true risk-adjusted density to satisfy the following:

[summation over (i)][[pi].sup.*](i)[R.sup.i](E-T) = 1. (19)

Equivalently, under the probabilities [[pi].sup.*], the expected value Expected value

The weighted average of a probability distribution. Also known as the mean value.
 of the futures price at E is the current futures price. Thus {[[pi].sup.*](i), i = 1, ..., N} is the 'equivalent martingale martingale

a leather strap running from the girth to the reins or the noseband for the purpose of restricting the movements of the horse's head. There are many designs. The common ones are the standing martingale, which is attached to the noseband, and the running martingale, which
 measure' associated with arbitrage-free price system. (See Huang & Litzenberger 1988, chapter 8.).

Of course, there are many choices of the N-component vector [[pi].sup.*] satisfying expression (19). Following Stutzer (1996), we use an estimate, [[??].sup.*], satisfying expression (19) that is chosen to minimize the Kullback-Leibler Information Criterion There are a number of statistics that can act as an information criterion. They include:
  • Akaike's information criterion
  • the Bayesian information criterion, also known as the Schwarz information criterion
  • Hannan-Quinn information criterion
 (KLIC) divergence between the risk-adjusted probabilities and those from the predictive density formed with our numerical Bayes procedure. (2) This optimization is of the form:

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (20)

When the prior is flat and the weights [[??](i)= [N.sup.-1] are equal for all N Monte Carlo draws, the constrained optimization in expression (20) is identical to a constrained maximization of Shannon entropy,

-[summation over ([for all]i)][[pi].sup.*](i)ln([[pi].sup.*](i)).

In the general case, using the Lagrange multiplier method gives the Gibbs canonical distribution:

[[??].sup.*](i) = [[??](i)exp([[lambda].sup.*][R.sup.i](E - T))/[summation over ([for all]i][??](i)exp([[lambda].sup.*][R.sup.i](E - T)), i = 1,2, ..., N, (21)

whose Lagrange multiplier, [[gamma].sup.*], is found by solving:

[MATHEMATICAL EXPRESSION NOT REPRODUCIBLE IN ASCII] (22)

Finally, to price a European option on a future contract, use the risk-neutral density to compute the discounted expected value at the option's expiration. For a call option with a strike price of Xwe have:

C(T, E) = [summation over(i)]max[[F.sub.T][R.sup.i](E - T) - X, 0]/[r.sup.(E - T)][[??].sup.*](i), (23)

where [r.sup.(E-T)] is the risk-free discount factor from the current time, T, to the option expiration date E..

4. Data and Implementation (3)

For our analysis we use the soybean futures contract traded on the Chicago Board of Trade (CBOT See Chicago Board of Trade.

CBOT

See Chicago Board of Trade (CBOT).
) from 1993-1997. Each futures contract entitles the long to receive 5,000 bushels of soybeans. The deliverable grade is No. 2 Yellow at par and there can be delivery substitutions at differentials established by the CBOT (e.g. #1 soybeans at par + $0.06 per bushel bushel: see English units of measurement. .). During the sample period, delivery sites were terminal elevators Terminal Elevator

An agricultural elevator that is considered to be the largest accumulator of the actual.

Notes:
Generally, terminal elevators are located at points where the movement of agricultural products to processing plants is convenient.
 in the Chicago region. (Beginning in 2000, delivery may be made at any point along a 403-mile stretch of the Illinois and Mississippi rivers from Chicago to St. Louis, with location differentials per bushel established by the CBOT.) Prices for the futures are quoted in cents and quarter cents per bushel, with a tick size of $0.0025 ($12.50 per contract). The daily price limit is $0.30 per bushel above or below the prior day's price, with no price limit in the spot month. The contract year starts trading in September and each year there are futures that expire in September, November, January, March, May, July, and August. During the sample period, the last delivery day was the last business day of the delivery month; for 2000 and beyond, this has been changed to the 2nd business day following the last trading day Last Trading Day

The final day that a futures or options contract may trade or be closed out before delivery of the underlying asset must occur.

Notes:
If the buying and selling parties do not arrange an alternate agreement, the physical commodity must be delivered from
.

Our histories of the soybean futures and cash prices extend back to January 1959. Figure 1 shows a plot of two key contract months, July and November. July is the last contract before the new crop is harvested, while November is the first futures contract where the details of the new crop are known. For these reasons we concentrate our efforts on pricing these two contract months. Note that there are a number of occasions where the two series diverge diverge - If a series of approximations to some value get progressively further from it then the series is said to diverge.

The reduction of some term under some evaluation strategy diverges if it does not reach a normal form after a finite number of reductions.
, indicating, in part, traders' beliefs about the quality of the incoming crop. Each series represents a joining of the underlying contract years. In producing this graph we have a simple rollover A graphic element in an application or on a Web page that changes its color or shape when the pointer is moved (rolled) over it. See JavaScript rollover. See also n-key rollover.  policy; use the price from the next year when a contract expires. In the predictive we use a dummy variable and adjust explicitly for the change in the contract year.

[FIGURE 1 OMITTED]

We also examine the cash price and the simple difference between the cash and futures prices (the basis). This is done in figure 2. The cash price series follows the same general pattern as the two futures. The bases have a saw-tooth pattern that is due to the rollover and subsequent decay to expiration as the futures and spot converge. Secondly, the November basis is more variable than the July, supportive of the view that the November basis is more driven by the uncertainty in the growth of the new crop.

[FIGURE 2 OMITTED]

The soybean options on futures contract also trades on the CBOT. The options are American style and upon exercise the option holder (for a call) receives a futures contract for the appropriate month. From the CBOT we get the volume and open interest for each contract and strike price each day. We also get the closing (settlement) put and call prices for each contract and strike price. Because there are many strike prices offered and most trading occurs in the at-the-money contracts, we concentrate on nine strike prices; the at-the-money strike price, the four nearest in-the-money (call) strike prices, and the four nearest out-of-the-money (call) strike prices. For these options the early exercise value of the option is quite small, so our use of European option pricing technique is a less onerous assumption. The expiration day of the futures option Futures option

An option on a futures contract. Related: Options on physicals.


futures option

A put or call option on a futures contract.
 is the last Friday with at least seven business days remaining in the month before the futures contract expires.

In addition to market data we have data on weather in upper Midwest The Upper Midwest is a region of the United States with no universally agreed-upon boundary, but it almost always lies within the US Census Bureau's definition of the Midwest and includes the states of Minnesota and Wisconsin, as well as at least the Upper Peninsula of Michigan.  of the United States. Knowing that Illinois, Indiana, and Iowa are all significant soybean growing states we use the weather in Springfield Illinois, Indianapolis Indiana, and Des Moines Des Moines, city, United States
Des Moines (dĭ moin`), city (1990 pop. 193,187), state capital and seat of Polk co., S central Iowa, at the junction of the Des Moines and Raccoon rivers; inc.
 Iowa to help predict future soybean cash and futures prices.

Figure 3 shows the minimum and maximum daily temperatures averaged across the three centers. As one would expect there is a significant annual seasonality. Note that temperature changes are likely to have a greater impact on soybean prices at certain times of the year (post planting and prior to harvest, for example).

[FIGURE 3 OMITTED]

Another fundamental weather factor is precipitation. In figure 4 we give the maximum number of days since precipitation across the Des Moines, Indianapolis, and Springfield sites. The larger this value, the more drought-like are conditions. This series shows considerable variation and we note that in some years there can be a significant period without rain in at least one of the cities.

[FIGURE 4 OMITTED]

The last information source we use is for US Treasury rates. We use the current 3-, and 6-month bill prices to compute the risk-free rate. For options expiring in less than 20 weeks, we discounted using the 3 month rate; for options 20 weeks or more from expiration, we used the six month rate.

4.1 Preliminary Specification and Estimation

In our implementation, we selected Friday data and estimated a weekly VAR using the most recent 1000 observations through November 6, 1992. We have 4 equations, in the VAR corresponding to the variables given in expression (2); the log futures price, log spot price, log of open interest, and log of volume each Friday. In addition to a constant and three lags of the (log of) futures, basis, open interest, and volume, we included four weather variables and a set of deterministic 1. (probability) deterministic - Describes a system whose time evolution can be predicted exactly.

Contrast probabilistic.
2. (algorithm) deterministic - Describes an algorithm in which the correct next step depends only on the current state.
 variables including eleven monthly dummies and a contract rollover dummy. The weather variables were the average minimum and maximum temperatures over the three sites, and the minimum and maximum number of days since the last measurable precipitation across the sites. Each of the four equations of the near VAR had 29 right-hand-side variables; with 12 distinct variance-covariance parameters, the system comprises a total of 128 parameters. (4)

4.2 Informative Prior

In addition to the flat-prior procedure described above, we implemented an informative prior procedure that embodied a type of 'learning' over the contract year. In our analysis, we valued calls and puts for July and November contracts for 1993-1997. For each contract, we priced the option on 26 consecutive Fridays beginning 27 weeks prior to expiration. In each case, with the passage of each week, we added that week's data to the data set, and updated the posterior distribution. Updating the posterior distribution associated with the flat prior thus incorporates a very crude sort of learning--the additional data is used as it becomes available, but in precisely the same way (i.e. via a flat prior) as the initial sample. A more natural type of learning would respect the success of the option pricing procedure itself. Indeed, our informative prior was built iteratively: in each contract year, we would begin 27 weeks prior to expiration with the basic near VAR specification and the data, but no other prior information. After pricing options in that first week with the flat prior specification, we would also determine the KLIC-closest predictive distribution satisfying the no-arbitrage condition that also priced three options correctly: the at-the-money, and just-in and just-out of the money options. The reweighting of sample values of the parameters implicit in this calculation was used as a prior distribution for the subsequent week's calculation.

Implementing this prior requires special care because of its non-conjugate nature. To see why, consider the situation at week t, somewhere in the midst Adv. 1. in the midst - the middle or central part or point; "in the midst of the forest"; "could he walk out in the midst of his piece?"
midmost
 of the 26-week prediction period for a particular contract. Suppose that the current sample from the posterior is {[[theta].sub.i]} for i = 1, ..., N. Suppose further that the posterior distribution can be represented by a set of probability density probability density
n. Statistics In both senses also called probability distribution.
1. A function whose integral over a given interval gives the probability that the values of a random variable will fall within the interval.
 values [[omega].sub.i,t] = f([[theta].sub.i],t). After constructing the sample from the predictive density and risk-neutralizing (reweighting) via the Stutzer procedure, we also computed a different reweighting that priced the three options correctly. This reweighting can be represented by [??]*(i), which is calculated from [[omega].sub.i,t] in the same manner as [??]*(i) is calculated from [??](i), except that in addition to the no-arbitrage constraint, three additional constraints are imposed to ensure that the three nearest-the-money options are priced correctly in week t. Entering week t+1, the [??]*(i) distribution plays the role of the prior for that week.

The difficulty with this procedure is that the mapping from [[theta].sub.i] to [??]*(i) is unknown. Thus it is not possible to sample from the posterior distribution given by the product of the likelihood for week t+1 and the prior just derived the week before. Fortunately, this difficulty can be finessed because it is possible to construct an importance sample for which it is necessary only to know the weights [??]* (i) themselves rather than the mapping from [[theta].sub.i] to [??]* (i). This in turn requires that the values of the drawings {[theta].sub.i]} not change from week to week. To carry this out, we used the flat prior posterior from the first week of our pricing exercise (for each of the 5 years) as an importance sampler sampler, sample piece of needlework or embroidery, of silk, cotton, or worsted, for the preservation of some pattern or as an example of the ability of a child or a beginner. In museums and private collections there are samplers dating from as early as 1643. . That is, in week 1 of each contract year, we drew a very large number (N=50,000) of [[theta].sub.i]'s. (5) These drawings were then used in each of the subsequent 25 weeks. When used in week t, for example, the drawing [[theta].sub.i] was associated with a 'weight' not, of course, equal to 1/N, but rather equal to the product of the [??]*(i) calculated at the end of week t-1 and an additional weight proportional to the likelihood of [[theta].sub.i] using data up to and including week t. As the weeks passed, the prior weights and likelihood values changed, and the priors and posteriors updated sequentially (6,7).

4.3 Alternative Predictive

As one standard for comparison, we also produced a nonparametric predictive like that used by Stutzer (1996). In particular, at each date t we determined the (nonparametric) empirical distribution of (E-T)-period growth rates Growth Rates

The compounded annualized rate of growth of a company's revenues, earnings, dividends, or other figures.

Notes:
Remember, historically high growth rates don't always mean a high rate of growth looking into the future.
 of futures prices using the most recent 1000 weekly observations. This empirical distribution was used with the then-current futures price to produce a predictive distribution of futures prices at expiration which was then risk-neutralized using the procedure outlined in section 3. The resulting distribution was used to price options using a formula analogous to (23).

5. Pricing Results

We valued calls and puts for July and November contracts for 1993-1997. As noted above, for each contract, we priced the option on 26 consecutive Fridays beginning 27 weeks prior to expiration. In each case, with the passage of each week, we added that week's data to the data set, and updated the posterior distribution. (8) Thus each Friday we computed the price of the option on the ensuing Friday, using the information that would have been available to the actual options traders in real time. Each week we priced every option at every strike price that traded at least once between 1992 and 1998. We used four pricing methods: the Black model (which uses the familiar log normal predictive), the Stutzer model (the risk-neutralized nonparametric predictive), and our flat- and informative-prior numerical Bayes procedures involving the VAR of Section II. In each case, we also computed a 'constrained' option price. For the Black model, this involved using the current at-the-money call price to determine an implied volatility Implied volatility

The expected volatility in a stock's return derived from its option price, maturity date, exercise price, and riskless rate of return, using an option pricing model such as Black-Scholes.
 for use in pricing other options. For the other three procedures, when risk-neutralizing we imposed one additional constraint, that the procedure correctly prices the current at-the-money call. (In the constrained cases, in pricing the option at date t using other information dated t-1 and earlier, we therefore also used the price of one option at date t (the at-the-money call.)

Results are presented for the at-the-money and eight adjacent strike prices in figure 5. The figures graph mean absolute pricing errors for the calls, in cents per bushel, relative to market prices. We include an overall average for all years (1993-97) as well as year-by-year breakdowns for both the July and November option on futures contracts. Errors are generally smaller for the November contract than the July contract, perhaps reflecting greater uncertainty about the evolution of prices as inventories of the old crop dwindle dwin·dle  
v. dwin·dled, dwin·dling, dwin·dles

v.intr.
To become gradually less until little remains.

v.tr.
To cause to dwindle. See Synonyms at decrease.
 during the course of the July contract. In terms of matching market prices, among the unconstrained procedures for pricing the July option, the informative-prior Bayesian approach dominates, followed by the flat prior, the Stutzer model, and the Black model. For the November option, the flat-prior Bayesian procedure is dominated by the Black model, the Stutzer model, and the informative-prior Bayesian procedure. Among the latter three, each one is more accurate than the others for at least one strike price shown. Pricing errors tend to be greater when the time value of the option is at its largest (when the option is nearer to the money).

[FIGURE 5 OMITTED]

An inspection of the graphs in figure 5 shows that adding the constraints improves the pricing accuracy considerably. (That is, using contemporaneous con·tem·po·ra·ne·ous  
adj.
Originating, existing, or happening during the same period of time: the contemporaneous reigns of two monarchs. See Synonyms at contemporary.
 market price information in addition yields option prices for the unconstrained strikes that are closer to market prices for options at those strikes.) Both the Stutzer model and the Bayesian approaches price the options at least as well as the Black model. Given the complexity of the VAR, parameter uncertainty appears to be well managed. The Stutzer model seems to do particularly well when one pricing constraint is added.

Note that the use of the informative prior can improve dramatically the pricing error for the numerical Bayes procedures. This suggests that information in the change of measures from prior weeks that is not part of the sample futures and cash data is helpful in understanding the structure of the option prices. It illustrates the potential importance of using such informative prior calculations, yet the incremental Additional or increased growth, bulk, quantity, number, or value; enlarged.

Incremental cost is additional or increased cost of an item or service apart from its actual cost.
 computation cost is relatively minor.

Options traders often calibrate To adjust or bring into balance. Scanners, CRTs and similar peripherals may require periodic adjustment. Unlike digital devices, the electronic components within these analog devices may change from their original specification. See color calibration and tweak.  models for pricing using implied volatilities. This gives us another method of computing pricing error. We use the Black model to compute the volatility (standard deviation In statistics, the average amount a number varies from the average number in a series of numbers.

(statistics) standard deviation - (SD) A measure of the range of values in a set of numbers.
) of futures price returns implied by the observed market option price and contrast this with the volatilities implied (using the Black model) of each model. This calculation therefore summarizes the same information as in figure 5, but measured in 'volatility units' instead of cents per bushel. The results are reported in figure 6, where we give the overall averages for the July and November contracts. We do not present the year-by-year breakdowns as the results are consistent with those of figure 5. Computing errors in this fashion makes more noticeable the improvement of the informative prior for the Bayesian techniques, in the true one-week-ahead (unconstrained) calculations.

[FIGURE 6 OMITTED]

Figures 7 and 8 provide mean absolute pricing errors weighted by the volume and option interest from the option market each week, respectively. The intent of these figures is to see whether the largest observed errors occur for options that are actively traded. Overall averages for the both the July and November contracts are presented. Not surprisingly, the at- or near-the-money option each week tends to be the most actively traded, and has the largest open interest. Hence, we see that the weighted errors reflect a concentration of weight and absolute pricing error (as depicted in figure 5) for at- or near-the-money options. The relative ranking of the various models does not change noticeably, however the improvement of the Bayesian model with an informative prior becomes more noticeable--the 'success at historical option pricing' prior helps for the most actively traded options.

[FIGURES 7-8 OMITTED]

A final caveat to our implementation is that the options on soybean futures referred to are American-style and we have provided European options prices. Our investigations suggest that for the options that we consider, the additional value of the American early-exercise feature is very modest.

6. Conclusions

In this paper we studied the real-time performance of a procedure for pricing derivative securities Derivative security

A financial security such as an option or future whose value is derived in part from the value and characteristics of another security, the underlying asset.
 when the underlying asset has rich time-series properties. Many commodities are examples of such assets. We use some simple examples to show that relative to the standard Black (1976) model, as well as a non-parametric procedure advocated by Stutzer (1996), a procedure that makes use of numerical Bayes techniques to develop an underlying predictive density holds significant promise. The results are strongest when we adopt a prior that reflects and rewards historical option pricing success--a prior that can only be implemented numerically using an importance sampling procedure. That these techniques work well for complicated time series models (in our case, the model had 128 parameters) with an informative prior distribution is particularly encouraging.

The authors thank Logical Information Machines for their extensive data support. Discussions with Tom Smith and Michael Stutzer substantially improved the paper. Earlier versions have been presented in seminars at APFA APFA American Professional Football Association (now National Football League)
APFA Association of Professional Flight Attendants
APFA Abortion Providers' Federation of Australasia
APFA American Pipe Fittings Association, Inc.
, Iowa State University Academics
ISU is best known for its degree programs in science, engineering, and agriculture. ISU is also home of the world's first electronic digital computing device, the Atanasoff–Berry Computer.
, the Melbourne Business School Melbourne Business School (MBS) is the largest business school in Asia Pacific and a leader in management education and executive development. Over the last 50 years, we have built an outstanding reputation for program excellence and a high quality learning experience. , The University of Melbourne
  • AsiaWeek is now discontinued.
Comments:

In 2006, Times Higher Education Supplement ranked the University of Melbourne 22nd in the world. Because of the drop in ranking, University of Melbourne is currently behind four Asian universities - Beijing University,
, The University of Pennsylvania (body, education) University of Pennsylvania - The home of ENIAC and Machiavelli.

http://upenn.edu/.

Address: Philadelphia, PA, USA.
, Purdue University Purdue University (pərdy`, -d`), main campus at West Lafayette, Ind. , The University of Queensland The University of Queensland (UQ) is the longest-established university in the state of Queensland, Australia, a member of Australia's Group of Eight, and the Sandstone Universities. It is also a founding member of the international Universitas 21 organisation. , and the University of Technology, Sydney; comments received at these sessions are greatly appreciated.

(Date of receipt of final transcript: November 7, 2006. Accepted by Garry Twite twite  
n.
A small songbird (Carduelis flavirostris) of northern Great Britain and Scandinavia that resembles the linnet.



[Imitative of its call.]
, Area Editor.)

References

Barone-Adesi, G. & Whaley, R.E. 1987, 'Efficient analytic approximation of American option American Option

An option that can be exercised anytime during its life. The majority of exchange-traded options are American.

Notes:
Since investors have the freedom to exercise their American options at any point during the life of the contract, they are more valuable
 values', The Journal of Finance, vol. 42, June, pp. 301-20.

Black, F. 1976, 'The pricing of commodity contracts', Journal of Financial Economics, vol. 3 March, pp. 167-79.

Chicago Board of Trade, 1994, Contract Specifications, Chicago, Illinois.

DeJong, D.N. & Whiteman, C.H. 1994, 'Modeling stock prices without knowing how to induce stationarity', Econometric Theory Econometric Theory is an economic journal specialising in econometrics. Its editor is Peter Phillips. According to research in 2003 it is the seventh most important economic journal. Source
  1. Kalaitzidakis, P. T. P. Mamueas and T. Stengos (2003).
, vol. 10, pp. 701-19.

DeJong, D.N., Ingram, B.F. & Whiteman, C.H. 2000, 'A bayesian approach to dynamic macroeconomics', Journal of Econometrics econometrics, technique of economic analysis that expresses economic theory in terms of mathematical relationships and then tests it empirically through statistical research. , vol. 15, May-June, pp. 311-20.

Foster, F.D. & Viswanathan, S. 1993 'The effect of public information and competition on trading volume and price volatility', The Review of Financial Studies, vol. 6, pp. 23-56.

Foster, F.D. & Whiteman, C.H. 1999 'An application of Bayesian option pricing to the soybean market', American Journal of Agricultural Economics Agricultural economics originally applied the principles of economics to the production of crops and livestock - a discipline known as agronomics. Agronomics was a branch of economics that specifically dealt with land usage. , vol. 81, August, pp. 722-27.

Gerber, H. & Shiu, E. 1994 'Option pricing by Esscher transforms', Transactions of the Society of Actuaries Mission Statement
The Society of Actuaries is a professional organization for actuaries based in North America. Its headquarters are located in Schaumburg, Illinois.
, vol. 46, pp. 99-140.

Geweke, J. 1989, 'Bayesian inference in econometric models using Monte Carlo integration', Econometrica, vol. 57, pp. 1317-39.

Huang, C.F. & Litzenberger, R. 1988, Foundations for Financial Economics, Elsevier, New York New York, state, United States
New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of
. Hobson, A. 1971, Concepts in Statistical Mechanics statistical mechanics, quantitative study of systems consisting of a large number of interacting elements, such as the atoms or molecules of a solid, liquid, or gas, or the individual quanta of light (see photon) making up electromagnetic radiation. , Gordon and Breach, New York.

Maasoumi, E. 1993, 'A compendium com·pen·di·um  
n. pl. com·pen·di·ums or com·pen·di·a
1. A short, complete summary; an abstract.

2. A list or collection of various items.
 to information theory in economics and econometrics', Econometric e·con·o·met·rics  
n. (used with a sing. verb)
Application of mathematical and statistical techniques to economics in the study of problems, the analysis of data, and the development and testing of theories and models.
 Reviews, vol. 12, pp. 137-81.

Stutzer, M. 1995, 'A bayesian approach to diagnosis of asset pricing models', Journal of Econometrics, vol. 68, no. 2, August, pp. 367-97.

Stutzer, M. 1996, 'A simple nonparametric approach to derivative security valuation', The Journal of Finance, vol. 51, no. 4, December, pp. 1633-52.

Zellner, A. 1971, An Introduction to Bayesian Inference Bayesian inference is statistical inference in which evidence or observations are used to update or to newly infer the probability that a hypothesis may be true. The name "Bayesian" comes from the frequent use of Bayes' theorem in the inference process.  in Econometrics, Krieger Publishing Company, Malabar, Florida
For other uses of Malabar, see Malabar (disambiguation).


Malabar is a town in Brevard County, Florida, USA. The population was 2,622 at the 2000 census. As of 2005, the population estimated by the U.S.
, (reprint reprint An individually bound copy of an article in a journal or science communication  edition, 1987).

(1.) See for example, Foster and Viswanathan (1993) for an explicit derivation derivation, in grammar: see inflection.  of a volume / volatility relation in the context of a microstructure mi·cro·struc·ture  
n.
The structure of an organism or object as revealed through microscopic examination.


microstructure
Noun

a structure on a microscopic scale, such as that of a metal or a cell
 model.

(2.) See Hobson (1971), Maasoumi (1993), and Stutzer (1995) for a more detailed review of the KLIC. For its use in deriving well-known parametric option pricing models option pricing model

A mathematical formula for determining the price at which an option should trade. The model expresses the value of an option as a function of the value of the underlying asset, length of time until maturity, exercise price, yields on
, see Gerber and Shiu (1994).

(3.) Logical Information Machines (LIM) provided all data other than option prices through their XMIM system. The authors are very grateful to LIM for providing this significant data support. The Chicago Board of Trade provided all option price histories.

(4.) Some of the details of the specification (the number of lags, the set of deterministic variables used, etc.) were chosen after a preliminary specification search involving a smaller two variable system that omitted volume and open interest data and pseudo-real-time pricing experiments on the 1993 data. This effort was primarily aimed at developing a prior distribution for the model parameters that would improve option pricing. None of the prior distributions so developed survived the transition to the larger four variable system. We believe the only important vestige vestige /ves·tige/ (ves´tij) the remnant of a structure that functioned in a previous stage of species or individual development.vestig´ial

ves·tige
n.
 of the initial specification search is that the logarithms of both the spot and futures data enter the VAR directly. This is because an initial system involving the change in the spot rate together with the basis, a natural implementation of a cointegrated VAR specification, was a poor performer in pricing options. Thus unlike Foster and Whiteman (1999), in this paper we do not first difference, and we do not impose the basis relationship.

(5.) We used 50,000 antithetic an·ti·thet·i·cal   also an·ti·thet·ic
adj.
1. Of, relating to, or marked by antithesis.

2. Being in diametrical opposition. See Synonyms at opposite.
 replications. The flat-prior posterior for B is symmetric, so the sample size is inexpensively doubled by reflecting odd-numbered drawings about the mean to obtain even-numbered ones. This antithetic acceleration has the added advantage of making the Monte Carlo estimate of the posterior mean exact. Monte Carlo errors in the figures are about one-fifth of a penny. Calculation of one week's worth of option prices takes about 30 minutes on an 800 MHz (MegaHertZ) One million cycles per second. It is used to measure the transmission speed of electronic devices, including channels, buses and the computer's internal clock. A one-megahertz clock (1 MHz) means some number of bits (16, 32, 64, etc.  Pentium IV See Pentium 4.  personal computer. All calculations were done in RATS.

(6.) Experimentation with other types of more conventional informative (i.e. 'shrinkage') priors in the smaller two-variable system did not produce alternative prior specifications that produced superior option prices.

(7.) It is important in this sort of analysis that the importance density not be too unlike the posterior of interest. Here, the two are quite similar: they differ only in that the posterior at week 26 of our experiment is based on 1026 observations and the entropic transformation associated with correctly pricing past options, whereas the importance density uses the initial 1000 observations and does not reflect any entropic transformation. By the standards of the importance sampling literature, both effects were small in our implementation. See DeJong, Ingrain in·grain  
tr.v. in·grained, in·grain·ing, in·grains
1. To fix deeply or indelibly, as in the mind:
, and Whiteman (2000) for an example in which a substantially less well-tailored importance density was used successfully.

(8.) The natural way to do this for the flat prior case is to re-estimate the VAR each period to obtain the moments to be used in the Normal-Wishart posterior. In practice, we actually did this in parallel with the informative prior calculation described in the previous section, using the same 50,000 drawings of the parameters, but weighting by the flat prior alone.

F. Douglas Foster [dagger]

Charles H. Whiteman [section]

[dagger] Professor of Finance, University of New South Wales The University of New South Wales, also known as UNSW or colloquially as New South, is a university situated in Kensington, a suburb in Sydney, New South Wales, Australia. , Sydney, NSW NSW New South Wales

Noun 1. NSW - the agency that provides units to conduct unconventional and counter-guerilla warfare
Naval Special Warfare
 2052. Email: fd.foster@unsw.edu.au

[section] Senior Associate Dean; Director, Economic Research Institute; and Stanley M. Howe Chair in Leadership, Tippie College of Business The Tippie College of Business at The University of Iowa, established as the College of Commerce in 1921, is one of the oldest business schools in the United States. The College was the first academic division at the University of Iowa to be named for an alumnus, Henry B. , The University of Iowa Not to be confused with Iowa State University.
The first faculty offered instruction at the University in March 1855 to students in the Old Mechanics Building, situated where Seashore Hall is now. In September 1855, the student body numbered 124, of which, 41 were women.
.
COPYRIGHT 2006 Australian Graduate School Of Management
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2006, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:price of soybeans from 1993-1997 at Board of Trade of the City of Chicago
Author:Whiteman, Charles H.
Publication:Australian Journal of Management
Geographic Code:1U3IL
Date:Dec 1, 2006
Words:6363
Previous Article:1 Should executive stock options be abandoned?
Next Article:3 Investigating the performance of alternative default-risk models: option-based versus accounting-based approaches.
Topics:



Related Articles
Options plays.(understanding puts and calls)(Brief Article)
Bayesian cross hedging: an example from the soybean market.
Bayesian inference of nanoparticle-broadened x-ray line profiles.

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles