Printer Friendly
The Free Library
19,573,952 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

A single-equation study of U.S. petroleum consumption: the role of model specification.


I. Introduction

The price responsiveness of U.S. petroleum consumption began to attract a great deal of attention following the unexpected and substantial oil price increases of 1973-74. There have been a number of large, multi-equation econometric e·con·o·met·rics  
n. (used with a sing. verb)
Application of mathematical and statistical techniques to economics in the study of problems, the analysis of data, and the development and testing of theories and models.
 studies of U.S. energy demand since then which have focused primarily on estimating short run and long run price and income elasticities of individual energy resources (coal, oil, natural gas & electricity) for various consumer sectors (residential, industrial, commercial).(1) Following these early multi-equation studies there have been several single-equation studies of aggregate U.S. petroleum consumption |4; 3; 9; 5; 6~. These single-equation studies have found that U.S. demand for petroleum products can be quite price inelastic inelastic

Of or relating to the demand for a good or service when quantity purchased varies little in response to price changes in the good or service.
 in the short run (estimates range from -0.04 to -0.08) but exhibits long lags of anywhere from 6 to 10 years that yield long run price elasticities Price elasticities

The percentage change in quantity divided by a percentage change in the price. Answers the question: How much will the demand for my product decrease if I raise prices by 10%?
 ranging from -0.25 to -0.56. Income or GNP GNP

See: Gross National Product
 elasticities are usually found to be much higher (0.69 to 1.13) with the complete response often assumed to occur in the current period.

Single-equation studies are often justified as efficient shortcuts See Win Shortcuts. , or reduced forms In social science and statistics, particularlly econometrics, a reduced form equation is a method of dealing with endogeneity. A reduced form equation is defined by James Stock & Mark Watson (2007) in the following way:  for identifying the central behavioral behavioral

pertaining to behavior.


behavioral disorders
see vice.

behavioral seizure
see psychomotor seizure.
 aspects of a particular market. The main issue addressed in this paper is the extent to which the existing empirical results from single-equation studies of aggregate U.S. petroleum consumption have been influenced by the researchers' choice of dynamic model specification. This question is relevant because the econometric methodology usually followed in such studies is to start with a relatively simple specification, such as a partial adjustment model or some kind of a distributed lag structure on price alone. This initial specification choice is rarely justified, except possibly to cite its previous success in similar work. Given that models which produce insignificant or perverse per·verse  
adj.
1. Directed away from what is right or good; perverted.

2. Obstinately persisting in an error or fault; wrongly self-willed or stubborn.

3.
a.
 coefficient coefficient /co·ef·fi·cient/ (ko?ah-fish´int)
1. an expression of the change or effect produced by variation in certain factors, or of the ratio between two different quantities.

2.
 estimates are not usually publishable, the reported specification is almost certain to provide "reasonable," statistically significant estimates with the correct signs.

The primary diagnostic test of the chosen specification is the Durbin-Watson test statistic statistic,
n a value or number that describes a series of quantitative observations or measures; a value calculated from a sample.


statistic

a numerical value calculated from a number of observations in order to summarize them.
, or perhaps Durbin's "h" statistic in the presence of lagged dependent variables. When this test gives evidence of serial correlation serial correlation

The relationship that one event has to a series of past events. In technical analysis, serial correlation is used to test whether various chart formations are useful in projecting a security's future price movements.
 in the residuals, the usual response is to "correct" for this by estimating a first or second-order autoregressive Autoregressive

Using past data to predict future data.

Notes:
Essentially it's forecasting, similar to the weather... Sometimes even the weatherman can be caught in an unexpected downpour.
 model (AR(1) or AR(2)). Should this easily-implemented correction not solve the problem, there is a tendency to chalk it up to some kind of unknown specification error or deficiency in the data, leaving any further investigation as a suggestion for future research.

Hendry |12, 223~ argues that this type of specification search (from simple to general) characterizes much of applied work, and is "a reasonably certain path to concluding with a mis-specified relationship" if it is not accompanied by rigorous diagnostic testing Diagnostic testing
Testing performed to determine if someone is affected with a particular disease.

Mentioned in: Von Willebrand Disease
.(2) Additionally, he believes that a simple-to-general modeling approach often involves "excessive presimplification with inadequate testing" |12, 222~ or what Leamer has called a "prejudiced search for an acceptable model" |13, 126~.

Mis-specification of the dynamic adjustment process for U.S. petroleum consumption could lead to biased elasticity estimates with corresponding inaccurate forecasts. A mis-specified model might appear to suffer structural shifts or "changes in regimes" when in reality the true underlying structural economic relationship has remained unchanged while one or more of the exogenous variables Exogenous variable

A variable whose value is determined outside the model in which it is used. Related: Endogenous variable
 have somehow changed their behavior |12, 219~. Similarly, a mis-specified model may provide biased evidence of parameter (1) Any value passed to a program by the user or by another program in order to customize the program for a particular purpose. A parameter may be anything; for example, a file name, a coordinate, a range of values, a money amount or a code of some kind.  symmetry symmetry, generally speaking, a balance or correspondence between various parts of an object; the term symmetry is used both in the arts and in the sciences. , leading to further inaccuracies in forecasting. Structural change and/or parameter symmetry have been the focus of many single-equation studies of U.S. petroleum consumption |3; 9; 5; 6~.

Given the importance of oil to the U.S. economy and the high degree of uncertainty associated with the future path of world oil prices, it seems appropriate to stop and closely investigate the issue of possible model mis-specification in the U.S. demand for petroleum products. Following Hendry's suggestions |12~, we first pursue a general-to-simple dynamic specification search, sequentially testing various restrictions on a deliberately overparameterized general model. The aim is to obtain a data-based simplification of a general model that provides a parsimonious par·si·mo·ni·ous  
adj.
Excessively sparing or frugal.



parsi·mo
 representation of the underlying data generation process. Among other topics |7; 15~, Hendry's modeling approach has been used to analyze aggregate OECD OECD: see Organization for Economic Cooperation and Development.  energy demand |1~, yielding price elasticity estimates comparable to existing studies but a long run income elasticity twice as large as the consensus estimate of unity.

Next, rather than to just report "one more set" of elasticity estimates, we use the same data set to estimate the parameters of several typical alternative single-equation models, including the widely-used partial adjustment model, a simple static model, and the popular polynomial polynomial, mathematical expression which is a finite sum, each term being a constant times a product of one or more variables raised to powers. With only one variable the general form of a polynomial is a0xn+a  distributed lag (PDL See page description language.

1. PDL - Page Description Language.
2. PDL - Program Design Language.
3. PDL - Push Down List.
4. PDL - Dave Lebling, one of the co-authors of Zork.
) on price. These alternative estimates over the same sample period provide some indication of how our results differ from those of other researchers who might have chosen a different specification by following a simple-to-general approach.

Besides revealing the impact of model specification on our elasticities, all but one of these alternative models (the PDL) can be seen as nested or restricted versions of a more general dynamic specification, and therefore can be tested for their acceptability using standard statistical tests. In each case, we find that the restrictions implied by the alternative models are not supported by the data, suggesting that their selection and use to analyze U.S. petroleum consumption is inappropriate. Furthermore, a comparison of forecast errors (ex post and historical) shows substantial variation in forecasting performance across the various models. Hendry's general-to-simple modeling approach is seen to provide a data-acceptable restricted model that outperforms the alternatives and is free of the prior subjective prejudices of the researcher for a particular specification.

II. Data

The data set used for the following estimations contained annual observations over the period 1947-89. U.S. petroleum consumption (in thousand barrels per day Barrels per day (abbreviated BPD, bbl/d, bpd, bd or b/d) is a measurement used to describe the amount of crude oil (measured in barrels) produced or consumed by an entity in one day. ) was measured by "petroleum products supplied" as reported in Table 50 of Annual Energy Review 1989 by the Energy Information Administration of the U.S. Department of Energy (DOE/EIA). The price of oil (in current dollars per barrel) was measured by refiner re·fine  
v. re·fined, re·fin·ing, re·fines

v.tr.
1. To reduce to a pure state; purify.

2. To remove by purifying.

3.
 acquisition cost (composite) as calculated by DOE/EIA for 1968-89, published in Table 68 of Annual Energy Review 1989. Oil prices prior to 1968 were generously supplied by Dermot Gately, calculated as the domestic average wellhead well·head  
n.
1. The source of a well or stream.

2. A principal source; a fountainhead.

3. The structure built over a well.


wellhead
Noun

1.
 price plus a 10 percent markup (text) markup - In computerised document preparation, a method of adding information to the text indicating the logical components of a document, or instructions for layout of the text on the page or other information which can be interpreted by some automatic system.  for transportation costs. All prices were converted to 1982 dollars using the implicit GNP deflator Deflator

A statistical factor used to convert current dollar purchasing power into inflation-adjusted purchasing power. Enables the comparison of prices while accounting for inflation in two different time periods.
 (1982 = 100) published in Survey of Current Business by the U.S. Department of Commerce. U.S. GNP (in billions of 1982 dollars) was also obtained from the Survey of Current Business.

III. General-to-Simple Modeling Approach

An application of Hendry's general-to-simple modeling approach to the study of U.S. petroleum consumption would involve first estimating an unrestricted autoregressive distributed lag model (denoted ADL (|m.sub.0~, |m.sub.1~, |m.sub.2~)) of the form:

|q.sub.t~ = |Alpha~ + |summation summation n. the final argument of an attorney at the close of a trial in which he/she attempts to convince the judge and/or jury of the virtues of the client's case. (See: closing argument)  of~ ||Delta~.sub.j~|q.sub.t-j-1~ where j=0 to |m.sub.0~ + |summation of~ ||Beta~.sub.j~|p.sub.t-j~ where j=0 to |m.sub.1~ + |summation of~ ||Gamma~.sub.j~|y.sub.t-j~ where j=0 to |m.sub.2~ + |u.sub.t~ t = 1,2,...,T (1)

where the three lag lengths (|m.sub.0~, |m.sub.1~, |m.sub.2~) are initially set at the same maximum value (in this case, 4). All variables are measured in natural logarithms Natural logarithm

Logarithm to the base e (approximately 2.7183).
, with |q.sub.t~ denoting U.S. petroleum consumption in year t, |p.sub.t~ denoting the real price of oil, and |y.sub.t~ denoting real GNP Noun 1. real GNP - a version of the GNP that has been adjusted for the effects of inflation
real gross national product

GNP, gross national product - former measure of the United States economy; the total market value of goods and services produced by all
. As others have noted |1~, theory often does not provide us with any strict guidelines guidelines,
n.pl a set of standards, criteria, or specifications to be used or followed in the performance of certain tasks.
 on the length of these three lags. However, a maximum of 4 lags on each variable will allow for a large number of different possible lag distributions.(3)

This unrestricted general model is then progressively simplified by sequential testing of individual parameter restrictions suggested by the data. Potential restrictions are usually identified by looking at the magnitudes of individual parameter estimates, as well as their standard errors. Parameters whose estimates involve large standard errors can usually be set to zero. Often the estimated coefficient for one lagged value of a variable will be close in magnitude to the estimate for a neighboring neigh·bor  
n.
1. One who lives near or next to another.

2. A person, place, or thing adjacent to or located near another.

3. A fellow human.

4. Used as a form of familiar address.

v.
 lag, suggesting the use of a single common parameter. Should the estimates be of similar magnitude yet opposite in sign, then perhaps differencing may be acceptable. Restrictions are tested one at a time, and if accepted, all subsequent testing will include those restrictions in the model.

The criteria for accepting any particular restriction are: (i) it must lower the estimated standard error of the regression regression, in psychology: see defense mechanism.
regression

In statistics, a process for determining a line or curve that best represents the general trend of a data set.
 (SE), (ii) it cannot induce in·duce
v.
1. To bring about or stimulate the occurrence of something, such as labor.

2. To initiate or increase the production of an enzyme or other protein at the level of genetic transcription.

3.
 non-randomness in the residuals (as measured by the Lagrange multiplier multiplier

In economics, a numerical coefficient showing the effect of a change in one economic variable on another. One macroeconomic multiplier, the autonomous expenditures multiplier, relates the impact of a change in total national investment on the nation's total
 (LM) test for serial correlation proposed by Godfrey |11~); and (iii) it cannot cause "predictive failure" outside the estimation estimation

In mathematics, use of a function or formula to derive a solution or make a prediction. Unlike approximation, it has precise connotations. In statistics, for example, it connotes the careful selection and testing of a function called an estimator.
 period (as measured by the Chow TABULAR tab·u·lar
adj.
1. Having a plane surface; flat.

2. Organized as a table or list.

3. Calculated by means of a table.



tabular

resembling a table.
 DATA OMITTED test for predictive stability or parameter constancy con·stan·cy  
n.
1. Steadfastness, as in purpose or affection; faithfulness.

2. The condition or quality of being constant; changelessness.

Noun 1.
 |12, 222~). In order to implement this third test, we hold back the last four observations (1986-89), basing all subsequent results on the estimation period 1961-85.(4)

Results from the OLS OLS Ordinary Least Squares
OLS Online Library System
OLS Ottawa Linux Symposium
OLS Operation Lifeline Sudan
OLS Operational Linescan System
OLS Online Service
OLS Organizational Leadership and Supervision
OLS On Line Support
OLS Online System
 estimation of the unrestricted general model of equation (1) are presented in Table I, along with the relevant diagnostic test statistics. LM(s) is Godfrey's LM test for auto correlation of order s against a null hypothesis null hypothesis,
n theoretical assumption that a given therapy will have results not statistically different from another treatment.

null hypothesis,
n
 of serial independence, which is asymptotically distributed as ||Chi~.sup.2~(s), and F(4, 25 - k) is the Chow test The Chow test is an econometric test of whether the coefficients in two linear regressions on different data are equal. The Chow test is most commonly used in time series analysis to test for the presence of a structural break.  for predictive stability, where k is the number of estimated coefficients in the model (k = 15 in the unrestricted model). Sums of the estimated coefficients for each variable (q, p, y) are also shown for easy calculation of the long run elasticities. In particular, the short run elasticities for p and y are given by the estimated coefficients on the zeroth (jargon) zeroth - First.

Since zero is the lowest value of an unsigned binary integer, which is one of the most fundamental types in programming and hardware design, it is often natural to count from zero rather than one, especially when the integer is actually an index, as
 lag term (|Mathematical Expression A group of characters or symbols representing a quantity or an operation. See arithmetic expression.  Omitted~ and |Mathematical Expression Omitted~), while the long run elasticities are calculated as:

|Mathematical Expression Omitted~

and

|Mathematical Expression Omitted~.

These unrestricted estimates serve at least two purposes: (i) they provide a statistical benchmark for evaluating the acceptability of restricted variants of the general model of equation (1); (ii) they give us unrestricted elasticity estimates that will serve as a guide against imposing invalid Null; void; without force or effect; lacking in authority.

For example, a will that has not been properly witnessed is invalid and unenforceable.


INVALID. In a physical sense, it is that which is wanting force; in a figurative sense, it signifies that which has no effect.
 restrictions in the paring-down process of model selection. The estimated price elasticities are comparable to existing single-equation estimates for the short run, yet are considerably smaller for the long run, reflecting a somewhat shorter lag in the full price response. Both the short run and long run GNP elasticity estimates are well within the range of existing results.

The two diagnostic tests for serial correlation and post-sample parameter constancy reveal TABULAR DATA OMITTED a stable model that probably suffers from "over-fitting" or over-parameterization. Progressive reduction of the LM test statistic in the restricted models which follow confirm this suspicion.

We now proceed to simplify the general model by imposing data-instigated restrictions on the set of coefficients. Table II provides a sequential list of acceptable parameter restrictions and the corresponding values of SE, LM (4) and F(4, 25 - k). The final restricted model is (with t-statistics in parentheses See parenthesis.

parentheses - See left parenthesis, right parenthesis.
):

|Mathematical Expression Omitted~

T = 25, SE = 0.0089098, Adj. |R.sup.2~ = 0.999252, LM(4) = 7.217, F(4, 18) = 2.775.

This restricted ADL model displays no evidence of serial correlation in its residuals, is stable over the post-sample period, and involves a reduction in SE of 20 percent over the general model.

The unscrambled coefficients from the restricted ADL model in (2) are given in Table III. From these, the short run and long run elasticities are easily seen to be:

|Mathematical Expression Omitted~ |Mathematical Expression Omitted~

|Mathematical Expression Omitted~ |Mathematical Expression Omitted~

which are very close to those found in the unrestricted model, suggesting that the restrictions inherent in the final model are valid.

As a point of interest, the restricted model was re-estimated over the longer sample period of 1961-89. The summary statistics over this longer sample period were T = 29, SE = 0.0102473, adjusted |R.sup.2~ = 0.998872 and LM(4) = 4.823. The resulting unscrambled coefficients are also shown in Table III, yielding the following implied elasticities:

|Mathematical Expression Omitted~ |Mathematical Expression Omitted~

|Mathematical Expression Omitted~ |Mathematical Expression Omitted~

TABULAR DATA OMITTED

which show a slightly more responsive price effect in the long run but otherwise are essentially the same as those obtained over 1961-85.

In conclusion, the general-to-simple model selection procedure has produced a rather parsimonious (only 7 parameters to be estimated) representation of the data generation process that appears to adequately capture the dynamics of U.S. petroleum consumption. Rather than to continue to compare these estimates to those obtained in other studies using different specifications and different data bases, in the next section we pursue the simple-to-general modeling approach over the same sample period. This will isolate isolate /iso·late/ (i´sah-lat)
1. to separate from others.

2. a group of individuals prevented by geographic, genetic, ecologic, social, or artificial barriers from interbreeding with others of their kind.
 the role that model specification, apart from data differences, plays in estimating price and GNP elasticities for U.S. petroleum consumption.

IV. Simple-to-General Modeling Approach

Given the same set of annual observations on U.S. petroleum consumption, the real price of oil and real GNP described in the data section above, most applied researchers seeking a parsimonious, single-equation econometric model Econometric models are used by economists to find standard relationships among aspects of the macroeconomy and use those relationships to predict the effects of certain events (like government policies) on inflation, unemployment, growth, etc.  would not have followed the general-to-simple selection process of the previous section. Instead, they would have chosen some familiar specification they have been trained to use or find easy to implement. Some would begin to determine an "optimal" PDL structure for the price term, assuming away any lags on the GNP response |4; 5; 6; 9; 17~. Others might appeal to other researchers' success with a popular specification, such as the partial adjustment model |3~. Still others might want to start out by estimating a simple static model, with the strong expectation of having to upgrade it to at least an AR(1) model.

PDL Model

We begin by estimating a simple model of U.S. petroleum consumption with a PDL (n, r) structure on price but an instantaneous in·stan·ta·ne·ous  
adj.
1. Occurring or completed without perceptible delay: Relief was instantaneous.

2.
 response to GNP:

|q.sub.t~ = |Alpha~ + |Gamma~|y.sub.t~ + |summation of~ ||Beta~.sub.i~|p.sub.t-1~ where i=0 to n + |u.sub.t~ (3)

where |Mathematical Expression Omitted~

and n = lag length and r = order of the polynomial distributed lag structure on price are two values to be determined. The instantaneous GNP elasticity is given by |Gamma~ and the price elasticity is given by ||Beta~.sub.0~ for the short run and |summation~||Beta~.sub.i~ for the long run.
Table IV. AR(2) PDL(12, 1) Model Results, 1961-1985

Parameter                        Estimate           t-statistic

|Alpha~                            1.141               (6.25)
|Gamma~                            1.029              (19.20)
||Beta~.sub.0~                    -0.024               (3.48)
||Beta~.sub.1~                    -0.027               (4.62)
||Beta~.sub.2~                    -0.029               (6.25)
||Beta~.sub.3~                    -0.031               (8.50)
||Beta~.sub.4~                    -0.033              (10.87)
||Beta~.sub.5~                    -0.035              (11.24)
||Beta~.sub.6~                    -0.037              (10.24)
||Beta~.sub.7~                    -0.040               (8.56)
||Beta~.sub.8~                    -0.042               (7.22)
||Beta~.sub.9~                    -0.044               (6.25)
||Beta~.sub.10~                   -0.046               (5.53)
||Beta~.sub.11~                   -0.048               (4.99)
||Beta~.sub.12~                   -0.051               (4.58)
|summation~||Beta~.sub.i~         -0.487                 --
||Theta~.sub.0~                   -0.024               (3.48)
||Theta~.sub.1~                   -0.051               (4.58)
||Rho~.sub.1~                      1.195                 --
||Rho~.sub.2~                     -0.622                 --

T = 25
SE = 0.0155485
Adj. |R.sup.2~ = 0.964346
DW = 2.140 Q*(1) = 0.18 Q*(4) = 3.24
F(4, 21) = 1.609


As before, we held back the last 4 observations (1986-89) for post-sample testing. This left a maximum possible estimation sample of 1947-85 (T = 39), or a maximum unconstrained lag on price of 17 years. The optimal lag length (n |is less than or equal to~ 17) was determined by finding the number of lags which gave the highest value of adjusted |R.sup.2~ |14, 357~. In the present case, an unconstrained lag of 12 years gave the highest value of adjusted |R.sup.2~ = 0.982517, with an inconclusive INCONCLUSIVE. What does not put an end to a thing. Inconclusive presumptions are those which may be overcome by opposing proof; for example, the law presumes that he who possesses personal property is the owner of it, but evidence is allowed to contradict this presumption, and show who is  value for the DW statistic of 1.182. Next, the optimal order of the polynomial lag structure (r |is less than or equal to~ n - 1) was determined by starting with an 11th order polynomial (with no endpoint restrictions) and progressively reducing the order by one until we were unable to reject the hypothesis that the last term in the polynomial was zero |14, 357~. This procedure was followed using the previously determined optimal 12 year lag length over the longest possible sample period of 1959-85 and resulted in an optimal first-degree polynomial (r = 1).

Unfortunately, this PDL (12, 1) model showed clear evidence of serial correlation in its residuals: the DW statistic was only 0.472 and the Ljung-Box (Q*(s)) portmanteau test In statistics, a portmanteau test tests whether any of a group of autocorrelations of a time series are different from zero. The term portmanteau test refers both to the Ljung-Box test and to the (now obsolete) Box-Pierce test.  statistics for serial independence (distributed as ||Chi~.sup.2~(s)) were significant from the very first lag (s = 1).(5) Subsequent re-estimation using a Cochrane-Orcutt iterative it·er·a·tive  
adj.
1. Characterized by or involving repetition, recurrence, reiteration, or repetitiousness.

2. Grammar Frequentative.

Noun 1.
 AR(2) procedure seemed to correct this problem, yielding insignificant Q* statistics out to the fourth lag and a respectable DW statistic of 2.140.

All parameter estimates were of reasonable magnitudes, had the "correct" or expected signs and were statistically significant at the 1 percent level. The short run price elasticity was very small (-0.024), but with a lengthy 12 year lag was able to rise to -0.487 in the long run. The instantaneous GNP elasticity was once again close to unity (1.029).(6)

By assuming no change in the autoregressive parameters ||Rho~.sub.1~ and ||Rho~.sub.2~ over 1986-89, a post-sample Chow test of parameter constancy was also performed.(7) The test statistic of 1.609 is distributed as F(4, 21) and is not significant at the 5 percent level, showing that the model does not appear to suffer from "predictive failure" as defined by Hendry. Thus, from the perspective of the simple-to-general approach, we seem to have found an acceptable model.

Partial Adjustment Model

However, not everyone wanting to analyze U.S. petroleum consumption with a single-equation model would have chosen to fit a simple PDL structure on price. The partial adjustment model is another very popular dynamic specification that we could have used. A simple partial adjustment model would take the form:

|q.sub.t~ = ||Phi~.sub.0~ + ||Phi~.sub.1~|p.sub.t~ + ||Phi~.sub.2~|y.sub.t~ + ||Phi~.sub.3~|q.sub.t-1~ + |u.sub.t~ (4)

where 0 |is less than~ ||Phi~.sub.3~ |is less than~ 1. In this well-known specification, the short run price elasticity is ||Phi~.sub.1~, the short run GNP elasticity is ||Phi~.sub.2~ and the long run elasticities are ||Phi~.sub.1~/(1 - ||Phi~.sub.3~) and ||Phi~.sub.2~/(1 - ||Phi~.sub.3~), respectively. Unlike the PDL model, changes in both price and GNP cause lagged responses in consumption.

OLS estimation of (4) over 1961-85 yielded the following results:

|Mathematical Expression Omitted~

T = 25, SE = 0.0247197, Adj. |R.sup.2~ = 0.983455, LM(1) = 4.029, F(4, 21) = 1.469.

While the parameter estimates seem reasonable and are all statistically significant, the residuals show some evidence of first order autocorrelation Autocorrelation

The correlation of a variable with itself over successive time intervals. Sometimes called serial correlation.
 since Godfrey's LM test statistic exceeds the critical value of 3.84 for a ||Chi~.sup.2~(1) variate at the 5 percent level.

"Correcting" for first-order autocorrelation using the Cochrane-Orcutt iterative procedure produced a value for ||Rho~.sub.1~ = 0.896. With this transformation the LM test statistic is now insignificant, so the problem seems to be solved. The AR(1) partial adjustment model is:

|Mathematical Expression Omitted~

T = 25, SE = 0.0212301, Adj. |R.sup.2~ = 0.686825, LM(1) = 2.366, F(4, 21) = 1.355

where the post-sample F-test of parameter constancy assumes no change in ||Rho~.sub.1~ over 1986-89.

The final parameter estimates seem acceptable, although the speed of adjustment parameter (0.590) suggests a much faster adjustment process than that of the OLS model. The short run price elasticity is -0.114, somewhat larger than other estimates, with a comparatively low long run price elasticity of -0.278. On the other hand, the short run GNP elasticity is low at 0.416 although the long run estimate is very close to the consensus value of unity at 1.015. Finally, the parameters appear to be constant through the post-sample period, giving the model predictive stability. Once again, we seem to have obtained an acceptable model by an entirely different approach.

Static Model

The last simple-to-general modeling alternative we follow begins with OLS estimation of a purely static specification, yielding the following results:

|Mathematical Expression Omitted~

T = 25, SE = 0.0884227, Adj. |R.sup.2~ = 0.788308, DW = 0.142, F(4, 22) = 3.781.

Although the parameter estimates seem reasonable and are significant, the extremely low DW statistic suggests serially correlated cor·re·late  
v. cor·re·lat·ed, cor·re·lat·ing, cor·re·lates

v.tr.
1. To put or bring into causal, complementary, parallel, or reciprocal relation.

2.
 residuals if not outright mis-specification, and the significant F-statistic for post-sample stability indicates predictive failure.

In an attempt to correct this problem we again used the Cochrane-Orcutt iterative procedure to determine the value of the first order autoregressive parameter ||Rho~.sub.1~. However, this yielded a value for ||Rho~.sub.1~ in excess of one (||Rho~.sub.1~ = 1.112), which violates the stationarity condition that ||Rho~.sub.1~ lie inside the unit circle. A first differences model was then estimated as an alternative:

|Mathematical Expression Omitted~

T = 25, SE = 0.0295252, Adj. |R.sup.2~ = 0.510185, DW = 0.454, F(4, 22) = 0.158

which represents some improvement over the static model (7) but still appears to suffer from serial correlation. Adjusting this first differences model for first-order autocorrelation gives ||Rho~.sub.1~ = 0.800 and the following results:

|Mathematical Expression Omitted~

T = 25, SE = 0.0341131, Adj. |R.sup.2~ = 0.704120, DW = 2.186, F(4, 22) = 0.628

where the post-sample test again assumes a constant value of ||Rho~.sub.1~ over the 1986-89 period. By all appearances, this AR(1) first differences model is thus also an acceptable specification.

V. Model Comparisons

At this point, we have selected one restricted ADL model (equation (2)) using the general-to-simple approach and three very different dynamic specifications ((i) AR(2) PDL(12, 1) model; (ii) AR(1) partial adjustment model (equation (6)); and (iii) AR(1) first differences TABULAR DATA OMITTED TABULAR DATA OMITTED model (equation (9)) by following different paths in a simple-to-general approach. All four chosen models appear to be free of serial correlation and exhibit post-sample stability. All four models yielded statistically significant parameter estimates with the correct signs and reasonable magnitudes. However, the implied elasticities and lengths of lag responses differed substantially across the four models. Which model should we use?

As an initial comparison we looked at the ex post forecasting properties of the four chosen models. Table V reports five commonly used measures of forecast error for ex post forecasts over the post-sample period 1986-89: (1) root mean squared error In statistics, the mean squared error or MSE of an estimator is the expected value of the square of the "error." The error is the amount by which the estimator differs from the quantity to be estimated.  (RMSE RMSE Root Mean Square Error
RMSE Root Mean Squared Error
); (2) root mean squared percent error (RMSPE); (3) mean absolute error (MAE (1) (Metropolitan Area Exchange) Originally known as Metropolitan Area Ethernets, MAEs are junction points on the Internet where data is exchanged between carriers. See IXP and NAP. ); (4) mean absolute percent error (MAPE MAPE Mean Absolute Percentage Error
MAPE Minnesota Association of Professional Employees
MAPE Multinational Advisory Police Element (UN - Albania) 
); and Theil's inequality inequality, in mathematics, statement that a mathematical expression is less than or greater than some other expression; an inequality is not as specific as an equation, but it does contain information about the expressions involved.  coefficient (U).(8) Over this brief period of only 4 observations, the mean forecast errors were only about 0.2 to 0.3 percent for all but the partial adjustment model, which produced forecast errors of over 9 percent as well as a substantially higher value for Theil's U. On this basis we might prefer any one of the first three models, with either the PDL or first differences model having a slight edge over the restricted ADL model.

To further discriminate dis·crim·i·nate  
v. dis·crim·i·nat·ed, dis·crim·i·nat·ing, dis·crim·i·nates

v.intr.
1.
a.
 between these four models, we performed historical simulations over the estimation sample period of 1961-85 and report the same five error measures as before in Table VI. Over this longer period the partial adjustment model still has the poorest performance, with mean errors almost identical to those over 1986-89. However, the other three models' performances are no longer so similar. The restricted ADL model now yields mean errors roughly half the size of those for the PDL model and one-third the size of those for the first differences model, as well as the lowest value for Theil's U. The restricted ADL model now clearly outperforms all other specifications considered here.

As a final comparison, it is possible to recognize the partial adjustment and first differences models as special cases of the general ADL(4, 4, 4) model of equation (1) and test whether or not the restrictions implicit in Adj. 1. implicit in - in the nature of something though not readily apparent; "shortcomings inherent in our approach"; "an underlying meaning"
underlying, inherent
 each specification are supported by the data. The long 12 year lag on price in the PDL model makes it impossible to test in this way, since it is not a nested case of the general model. Nevertheless, for the same number of observations (25) the chosen PDL model has a SE of 0.0155485, which is almost 40 percent higher than the SE of the unrestricted general model of 0.0111443, suggesting that it would not be a data-acceptable simplification.

The AR (1) partial adjustment model of equation (6) is observationally equivalent to a nested case of the general ADL(4, 4, 4) model, in particular the ADL(2, 1, 1) model:

|q.sub.t~ = |Alpha~ + ||Delta~.sub.1~|q.sub.t-1~ + ||Delta~.sub.2~|q.sub.t-2~ + ||Beta~.sub.0~|p.sub.t~ + ||Beta~.sub.1~|p.sub.t-1~ + ||Gamma~.sub.0~|y.sub.t~ + ||Gamma~.sub.1~|y.sub.t-1~ + |u.sub.t~. (10)

This equivalence can be shown by taking quasi-first differences of the basic partial adjustment model of equation (4):

|q.sub.t~ = ||Phi~.sub.0~(1-||Rho~.sub.1~) + (||Phi~.sub.3~ + ||Rho~.sub.1~)|q.sub.t-1~ - ||Phi~.sub.3~||Rho~.sub.1~|q.sub.t-2~ + ||Phi~.sub.1~|p.sub.t~ - ||Phi~.sub.1~||Rho~.sub.1~|p.sub.t-1~ + ||Phi~.sub.2~|y.sub.t~ - ||Phi~.sub.2~||Rho~.sub.1~|y.sub.t-1~ + |u.sub.t~ (11)

which has 5 parameters (||Phi~.sub.0~, ||Phi~.sub.1~, ||Phi~.sub.2~, ||Phi~.sub.3~, ||Rho~.sub.1~) to be estimated against 7 parameters in equation (10). This implies that the AR(1) partial adjustment model imposes 2 nonlinear A system in which the output is not a uniform relationship to the input.

nonlinear - (Scientific computation) A property of a system whose output is not proportional to its input.
 restrictions on the ADL(2, 1, 1) model of equation (10). To test these 2 nonlinear restrictions, we use a likelihood ratio test statistic (LR), which is asymptotically distributed as ||Chi~.sup.2~(2). The calculated value of LR = 17.06 exceeds the critical value of 9.21 at the 1 percent level, so the restrictions implied by the AR(1) partial adjustment model are rejected.

Of course, this test is immaterial Not essential or necessary; not important or pertinent; not decisive; of no substantial consequence; without weight; of no material significance.


immaterial adj.
 if the underlying restrictions implied by the ADL(2, 1, 1) model of equation (10) are not first found to be data-acceptable. This set of 8 linear restrictions (||Beta~.sub.2~ = ||Beta~.sub.3~ = ||Beta~.sub.4~ = ||Gamma~.sub.2~ = ||Gamma~.sub.3~ = ||Gamma~.sub.4~ = ||Delta~.sub.3~ = ||Delta~.sub.4~ = 0) is easily tested with a standard F-test, being distributed as F(8, 10). The calculated value is 3.57, which exceeds the critical value of 3.07 at the 5 percent level. Furthermore, the ADL(2, 1, 1) model has a SE = 0.0163026, which is almost 50 percent greater than that for the general ADL(4, 4, 4) model, so equation (10) is not itself a data-acceptable simplification. We conclude that the AR(1) partial adjustment model is a data-rejectable specification for this sample.

Similarly, the AR(1) first differences model of equation (9) can be seen as observationally equivalent to an ADL(2, 2, 2) model. This nested case would have 9 separate parameters while the AR(1) first differences model has only 4, implying 5 nonlinear restrictions that can again be tested by a LR test statistic (assuming the ADL(2, 2, 2) model is itself data-acceptable). The calculated value of LR is 14.02, being distributed as ||Chi~.sup.2~(5), which exceeds the critical value of 11.1 at the 5 percent level, so these 5 restrictions can be rejected. However, the nested ADL(2, 2, 2) is not an acceptable simplification, as its SE of 0.0164392 is again roughly 50 percent above the SE for the general model, and an F-test of the 6 linear restrictions implied by an ADL(2, 2, 2) model (||Beta~.sub.3~ = ||Beta~.sub.4~ = ||Gamma~.sub.3~ = ||Gamma~.sub.4~ = ||Delta~.sub.3~ = ||Delta~.sub.4~ = 0) gives a statistic of F(6, 10) = 4.14, surpassing the critical value of 3.22 at the 5 percent level. Therefore, the AR(1) first differences model must also be data-rejectable, comprising a set of invalid parameter A DOS error message that means DOS does not understand the command line. It indicates that a switch is used incorrectly. If you're typing path names, be sure to use a backslash (\), not a forward slash (/). The forward slash is used to enter parameters (see DOS switch and DOS abc's).  restrictions.(9)

VI. Conclusions

When choosing an econometric model specification for a single-equation study of aggregate U.S. petroleum consumption, the researcher seeks a parsimonious, easily estimated model that will provide unbiased price and income elasticity estimates and yield accurate forecasts. In contrast to existing studies, we have used Hendry's general-to-simple specification search technique and annual data (1961-89) to obtain a restricted, data-acceptable simplification of a general ADL model. This restricted ADL model yielded GNP and short run price elasticities near the consensus estimates, but a long run price elasticity (-0.17) that is substantially smaller than existing estimates.

Comparisons with three other seemingly seem·ing  
adj.
Apparent; ostensible.

n.
Outward appearance; semblance.



seeming·ly adv.
 acceptable alternative models that were chosen via the simple-to-general modeling approach showed that popular model specifications often involve untested parameter restrictions that cannot be accepted. In addition, such models may also have poorer forecasting performance, with the widely used partial adjustment model found to have the largest mean forecast errors (nearly 10 percent) of all those considered here.

Therefore, the untested acceptance of a popular dynamic specification which yields "reasonable" and significant estimates can lead to the selection of a model which is not consistent with the data (i.e., is data-rejectable). Parameter estimates from such data-rejectable models may give very misleading indications of the dynamic nature of the behavioral relationships being modeled. In the present case, the long run price elasticity appears to have been over-estimated by such models, while the speed of the adjustment process may have been under-estimated. Energy policies which assumed a lingering lin·ger  
v. lin·gered, lin·ger·ing, lin·gers

v.intr.
1. To be slow in leaving, especially out of reluctance; tarry. See Synonyms at stay1.

2.
, substantial consumption response to an oil price change could be confounded by an actual response that was much smaller and largely complete in just a few years.

Finally, forecasts of future U.S. petroleum consumption levels are often required for policy planning purposes. While the accuracy of any petroleum consumption forecast will depend heavily upon the assumptions made about future world oil prices and GNP levels, using a forecasting model that is not data-acceptable can lead to sizable siz·a·ble also size·a·ble  
adj.
Of considerable size; fairly large.



siza·ble·ness n.
 errors. Selection of a data-acceptable model that does not suffer from predictive failure over existing out-of-sample data points can help to minimize forecasting errors for policy makers and other analysts. Based on the results presented here, the general-to-simple approach appears to offer a satisfying new methodology for generating superior forecast models of petroleum consumption and other energy use patterns.

Appendix

TABULAR DATA OMITTED

1. An excellent survey of these results may be found in Bohi |2, 159~.

2. For a comparison of Hendry's econometric methodology with the traditional, simple-to-general ("North American North American

named after North America.


North American blastomycosis
see North American blastomycosis.

North American cattle tick
see boophilusannulatus.
") approach, see Gilbert |10~.

3. A formal F-test of |H.sub.O~ : ADL(4, 4, 4) vs |H.sub.A~ : ADL(5, 5, 5) yielded a test statistic of 0.701, which is distributed as F(3, 7), falling well short of the critical value of 4.35 at the 5 percent level.

4. With four lags, this implies using observations reaching back to 1957. Obviously, with data going as far back as 1947, we could have begun the estimation period in 1951. We started in 1961 so that these results would be comparable to those in the next section using alternative specifications requiring longer lags, in particular the AR(2) PDL model.

5. We can use the familiar DW and Ljung-Box portmanteau test statistics here rather than Godfrey's LM test since the PDL model has no lagged dependent variables. Otherwise, we report the LM test statistic, since the Ljung-Box statistics have been shown to be inappropriate for models with lagged dependent variables |8~.

6. For comparison, Gately and Rappoport |9~ estimated an AR(2) PDL(10, 3) model from annual data over 1949-85 and found a much larger (although still small) short run price elasticity of -0.068, a smaller long run price elasticity of -0.364, and a smaller GNP elasticity of 0.689. However, our elasticities are very similar to those reported by Walls |17~ that estimated a nearly identical AR(2) PDL(10, 1) model from annual data over 1946-87.

7. Complete re-estimation over 1961-89 yielded almost identical values for these two parameters (||Rho~.sub.1~ = 1.182 and ||Rho~.sub.2~ = -0.597), so this necessary assumption seemed reasonable.

8. An explanation of each of these forecast error measures may be found in Pindyck and Rubinfeld |16, 338-40~.

9. For the sake of curiosity, the OLS partial adjustment model and the OLS first differences model were also found to be data-rejectable.

References

1. Beenstock, Michael and Patrick Willcocks, "Energy Consumption and Economic Activity in Industrialized in·dus·tri·al·ize  
v. in·dus·tri·al·ized, in·dus·tri·al·iz·ing, in·dus·tri·al·iz·es

v.tr.
1. To develop industry in (a country or society, for example).

2.
 Countries: The Dynamic Aggregate Time Series Relationship." Energy Economics, October 1981, 225-32.

2. Bohi, Douglas R. Analyzing Demand Behavior: A Study of Energy Elasticities. Baltimore Baltimore, city (1990 pop. 736,014), N central Md., surrounded by but politically independent of Baltimore co., on the Patapsco River estuary, an arm of Chesapeake Bay; inc. 1745. : John Hopkins Hopkins, city (1990 pop. 16,534), Hennepin co., SE Minn., a suburb of Minneapolis; inc. as West Minneapolis 1893, name changed 1928. The city manufactures machinery, computer and electronic parts, steel products, air pollution equipment, ophthalmic lenses, tools,  University Press, 1981.

3. Bopp, Anthony E., "Tests for Structural Change in U.S. Oil Consumption, 1967-82." Energy Economics, October 1984, 223-30.

4. Brown, Scott B., "An Aggregate Petroleum Consumption Model." Energy Economics, January 1983, 27-30.

5. Brown, Stephen P. A. and Keith R. Phillips, "Oil Demand and Prices in the 1990s." Federal Reserve Bank of Dallas The Federal Reserve Bank of Dallas covers the Eleventh Federal Reserve District, which includes Texas, northern Louisiana and southern New Mexico. It has branch offices in El Paso, Houston, and San Antonio.  Economic Review, January 1989, 1-8.

6. ----- and -----, "U.S. Oil Demand and Conservation." Contemporary Policy Issues, January 1991, 67-72.

7. Cuthbertson, Keith and Paul Richards Paul Richards can refer to different people:
  • Paul Richards (baseball) (1908-1986), baseball player, manager, scout and executive
  • Paul Richards (actor) (1924-1974), American actor who starred in the ABC-TV 1963-64 series Breaking Point
  • Paul W.
, "An Econometric Study of the Demand for First and Second Class Inland Letter Services." Review of Economics and Statistics, November 1990, 640-48.

8. Dezhbakhsh, Hashem, "The Inappropriate Use of Serial Correlation Tests in Dynamic Linear Models." Review of Economics and Statistics, February 1990, 126-32.

9. Gately, Dermot and Peter Rappoport, "The Adjustment of U.S. Oil Demand to the Price Increases of the 1970s." The Energy Journal, April 1988, 93-107.

10. Gilbert, Christopher L., "Professor Hendry's Econometric Methodology." Oxford Bulletin of Economics and Statistics, August 1986, 283-307.

11. Godfrey, Leslie G., "Testing for Higher Order Serial Correlation in Regression Equations Regression equation

An equation that describes the average relationship between a dependent variable and a set of explanatory variables.
 When the Regressors Include Lagged Dependent Variables." Econometrica, November 1978, 1303-10.

12. Hendry, David F. "Predictive Failure and Econometric Modelling in Macroeconomics macroeconomics

Study of the entire economy in terms of the total amount of goods and services produced, total income earned, level of employment of productive resources, and general behaviour of prices.
: The Transactions Demand Transactions demand is the demand for financial assets, e.g., securities, money or foreign currency. It is used for purposes of business transactions and personal consumption.

The need to accommodate a firm's expected cash transactions.
 for Money," in London Business School Around 800 degree students, from 70 countries, graduate from the school each year. Over 80 percent of students, and over 70 percent of faculty, come from outside the UK. A further 6,000 executives attend the school executive education programmes each year.  Conference on Economic Modelling, edited by P. Ormerod. London: Heinemann, 1979, 217-42.

13. Leamer, Edward E., "False Models and Post-Data Model Construction." Journal of the American Statistical Association Established in 1888 and published quarterly in March, June, September, and December, the Journal of the American Statistical Association (JASA) has long been considered the premier journal of statistical science. , March 1974, 122-31.

14. Maddala, G. S. Introduction to Econometrics econometrics, technique of economic analysis that expresses economic theory in terms of mathematical relationships and then tests it empirically through statistical research. . New York New York, state, United States
New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of
: Macmillan, 1988.

15. Mizon, Grayham E. and David F. Hendry, "An Empirical Application and Monte Carlo Monte Carlo (môNtā` kärlō`), town (1982 pop. 13,150), principality of Monaco, on the Mediterranean Sea and the French Riviera.  Analysis of Tests of Dynamic Specification." Review of Economic Studies, 47(1980), 21-45.

16. Pindyck, Robert S Robert, Henry Martyn 1837-1923.

American army engineer and parliamentary authority. He designed the defenses for Washington, D.C., during the Civil War and later wrote Robert's Rules of Order (1876).

Noun 1.
. and Daniel L. Rubinfeld. Econometric Models and Economic Forecasts. New York: McGraw-Hill, 1991.

17. Walls, Margaret A., "Dynamic Firm Behavior and Regional Deadweight Losses Deadweight Loss

The costs to society created by an inefficiency in the market.

Notes:
Mainly used in economics, the term "deadweight loss" can be applied to any deficiency due to an inefficient allocation of resources.
 from a U.S. Oil Import Fee." Southern Economic Journal, October 1990, 772-88.
COPYRIGHT 1993 Southern Economic Association
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 1993, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:Jones, Clifton T.
Publication:Southern Economic Journal
Date:Apr 1, 1993
Words:5864
Previous Article:Effects of quotas under variable returns to scale: the large country case.
Next Article:Habit formation as a resolution to the equity premium puzzle: what is in the data, what is not.
Topics:



Related Articles
Monetary innovations, capital taxation, and real wage movements: some new evidence.
Philippine electric demand and equivalence scales.
Testing an alternative habit persistence model.
On the estimation of short- and long-run elasticities in U.S. petroleum consumption: comment.
On the estimation of short- and long-run elasticities in U.S. petroleum consumption: reply.
The demand for nominal and real money balances in a large macroeconomic system.
Broadcast advertising and U.S. demand for alcoholic beverages.
Beer taxation and alcohol-related traffic fatalities.
Estimating the Effects of Earnings Uncertainty on Families' Saving and Insurance Decisions.
The Demand For Money: A Structural Econometric Investigation.

Terms of use | Copyright © 2012 Farlex, Inc. | Feedback | For webmasters | Submit articles