An Exponential Family of Lorenz Curves.Jose-Maria Sarabia [*] Enrique Castillo [+] Daniel J. Slottje [++] A new method for building parametric-functional families of Lorenz curves The Lorenz curve is a graphical representation of the cumulative distribution function of a probability distribution; it is a graph showing the proportion of the distribution assumed by the bottom y% of the values. , generated from an initial Lorenz curve (which satisfies some regularity conditions), is presented. The method is applied to the exponential family In probability and statistics, an exponential family is any class of probability distributions having a certain form. This special form is chosen for mathematical convenience, on account of some useful algebraic properties; as well as for generality, as exponential families are in since they use the exponential 1. (mathematics) exponential - A function which raises some given constant (the "base") to the power of its argument. I.e. f x = b^x If no base is specified, e, the base of natural logarthims, is assumed. 2. Lorenz curves as their generating curves. Several properties of these families are analyzed an·a·lyze tr.v. an·a·lyzed, an·a·lyz·ing, an·a·lyz·es 1. To examine methodically by separating into parts and studying their interrelations. 2. Chemistry To make a chemical analysis of. 3. , including the population function, inequality inequality, in mathematics, statement that a mathematical expression is less than or greater than some other expression; an inequality is not as specific as an equation, but it does contain information about the expressions involved. measures, and Lorenz orderings. Finally, an application is presented for data from various countries. The family is shown to perform well in fitting the data across countries. The results are very robust across data sources. 1. Introduction The purpose of this paper is to introduce a parametric See parametric modeling, parametric symbol and PTC. family of Lorenz curves that are obtained by a general method. In a recent paper, Sarabia, Castillo, and Slottje (1999) (SCS SCS, n strain/counterstrain, an approach of applying pressure to certain tender points in the muscles or joints to decrease or remove the pain sensed at the point of palpation. ) introduced a method that allowed for the building of hierachies of Lorenz curves when some regularity conditions are satisfied. They introduced the Pareto family, which was found to be a flexible form and which fits actual income distribution data well. This paper introduces another family, the exponential family, which also has interesting characteristics. The exponential family involves more complex estimation estimation In mathematics, use of a function or formula to derive a solution or make a prediction. Unlike approximation, it has precise connotations. In statistics, for example, it connotes the careful selection and testing of a function called an estimator. with a form that is somewhat less flexible but in return gives a robust performance in fitting actual data across countries, as we will show here. The researcher or policy maker is provided another effective tool in the ongoing effort to quantify Quantify - A performance analysis tool from Pure Software. , analyze, and understand economic inequality
Economic inequality refers to disparities in the distribution of economic assets and income. . The strategy used here is to apply a Lorenz curve hierarchy that contains (as special cases) Lorenz curves derived from this general method. In section 2 we introduce the notation notation: see arithmetic and musical notation. How a system of numbers, phrases, words or quantities is written or expressed. Positional notation is the location and value of digits in a numbering system, such as the decimal or binary system. and some necessary background information. The general method is presented in section 3, which starts from an initial Lorenz curve [L.sub.0](p) (which is called the generating curve) and builds a family with an increasing number of parameters. These in turn can be interpreted in terms of elasticities of [L.sub.0](p). Also in section 3 we introduce the exponential family of Lorenz curves and discuss some of its properties as population functions and inequality measures and for undertaking Lorenz orderings. In section 4 we present a method for estimating Lorenz curves and apply it to the two families specified previously. Since the goodness of fit Goodness of fit means how well a statistical model fits a set of observations. Measures of goodness of fit typically summarize the discrepancy between observed values and the values expected under the model in question. Such measures can be used in statistical hypothesis testing, e. is one important criterion in the evaluation of these (and any) models, we use a method due to Gastwirth (1972) and actually incorporate his procedure into the estimation process, as will b e clear in section 4. An example of an application of our new methodology is presented in section 5. Finally, in section 6 we conclude the paper. 2. Notation and Previous Results In this section we use the Lorenz curve as defined by Gastwirth (1971). That is, DEFINITION 1. Given a distribution function F(x) with support in the subset A group of commands or functions that do not include all the capabilities of the original specification. Software or hardware components designed for the subset will also work with the original. of the positive real numbers and with finite finite - compact expectation [micro], we define a Lorenz curve as [L.sub.F](p) = [[micro].sup.-1] [[[integral].sup.p].sub.0] [F.sup.-1](x) dx, 0 [less than or equal to] p [less than or equal to] 1, (1) where [F.sup.-1](x) = sup{y: F(y) [less than or equal to] x}. A characterization A rather long and fancy word for analyzing a system or process and measuring its "characteristics." For example, a Web characterization would yield the number of current sites on the Web, types of sites, annual growth, etc. of the Lorenz curve that is attributed to Gaffney and Anstis by Pakes (1981) is given by the following theorem theorem, in mathematics and logic, statement in words or symbols that can be established by means of deductive logic; it differs from an axiom in that a proof is required for its acceptance. : THEOREM 1. Assume that L(p) is defined and continuous in the interval [0,1] with second derivative L"(p). The function L(p) is a Lorenz curve i.f.f. L(0) = 0, L(1) = 1, L'([0.sup.+]) [greater than or equal to] 0 for p [epsilon] (0, 1) [L.sup.n](p) [greater than or equal to] 0. (2) Lorenz curves allow establishing a ranking in a set of distributions functions. If two distribution functions have associated Lorenz curves that do not intersect In a relational database, to match two files and produce a third file with records that are common in both. For example, intersecting an American file and a programmer file would yield American programmers. , then they can be ordered without ambiguity Ambiguity Delphic oracle ultimate authority in ancient Greece; often speaks in ambiguous terms. [Gk. Hist.: Leach, 305] Iseult’s vow pledge to husband has double meaning. [Arth. in terms of welfare functions that are symmetric No difference in opposing modes. It typically refers to speed. For example, in symmetric operations, it takes the same time to compress and encrypt data as it does to decompress and decrypt it. Contrast with asymmetric. (mathematics) symmetric - 1. , increasing, and quasiconcave (Atkinson 1970; Dasgupta, Sen, and Sarret 1973; Shorrocks 1983). A distribution function [F.sub.x](x) is said to have less inequality in the Lorenz sense than a distribution function [G.sub.Y](y) if their Lorenz curves [L.sub.F](p) and [L.sub.G](p) satisfy the condition [L.sub.F](p) [greater than or equal to] [L.sub.G](p) for all p, where the sign [greater than] applies for at least one p [epsilon] (0, 1)'. In this case we write X [[less than or equal to].sub.L] Y. From the definition of the Lorenz curve (Eqn. 1), it is evident that the Lorenz partial order is invariant (programming) invariant - A rule, such as the ordering of an ordered list or heap, that applies throughout the life of a data structure or procedure. Each change to the data structure must maintain the correctness of the invariant. with respect to scale transformations, that is, X [[less than or equal to].sub.L] Y i.f.f. [lambda]X [[less than or equal to].sub.L] vY for all [lambda], v [grea ter than] 0. THEOREM 2. Let L(p) be a Lorenz curve and consider the transformation [L.sub.[alpha]](p) = [P.sup.[alpha]]L(p), [alpha] [greater than or equal to] 0. (3) Then, if [alpha] [greater than or equal to] 1, [L.sub.[alpha]](p) is a Lorenz curve, too. In addition, if 0 [less than or equal to] [alpha] [less than] 1 and [L.sup.m](p) [greater than or equal to] 0, [L.sub.[alpha]](p) is also a Lorenz curve. THEOREM 3. If L(p) is a Lorenz curve, [L.sub.[gamma]](p) = L[(p).sup.[gamma]], [gamma] [greater than or equal to] 1 (4) is a Lorenz curve. Since [L.sub.[gamma]](p) is an increasing convex Convex Curved, as in the shape of the outside of a circle. Usually referring to the price/required yield relationship for option-free bonds. transform of L(p) and [L.sub.[gamma]](0) = 0 and [L.sub.[gamma]](1) = 1, [L.sub.[gamma]](p) is a Lorenz curve as well. We now present several examples to demonstrate the usefulness of these theorems This is a list of theorems, by Wikipedia page. See also
One well-known form of the Lorenz curve is that attributable to Rasche et al. (1980). Other forms are due to Kakwani and Podder (1973) and Kakwani (1980). Rasche et al. (1980) showed that Kakwani's Lorenz curve does not satisfy all the requirements for a Lorenz curve. Using our Theorem 1, we find a modified Lorenz curve: L(p; a, [beta]) = p - ap[(1 - p).sup.[beta]], 0 [less than or equal to] a [less than or equal to] 1; 0 [less than] [beta] [less than or equal to] 1. (5) Then, using Theorems 2 and 3, we generate a new family of Lorenz curves: [L.sub.a,[alpha],[beta],[gamma]] (p) = [p.sup.[alpha]+[gamma]][[1 - a[(1 - p).sup.[beta]]].sup.[gamma]]; 0 [less than or equal to] a [less than or equal to] 1, [alpha] [greater than or equal to] 0, 0 [less than] [beta] [less than or equal to] 1, [gamma] [greater than or equal to] 1. (6) 3. Hierarchical A structure made up of different levels like a company organization chart. The higher levels have control or precedence over the lower levels. Hierarchical structures are a one-to-many relationship; each item having one or more items below it. Families of Lorenz Curves The previous theorems suggest a method for obtaining hierarchical families of Lorenz curves. Towards this aim, we start with an initial generating Lorenz curve [L.sub.0](p) and consider the following parametric hierarchy: [L.sub.1](p; [alpha]) = [p.sup.[alpha]][L.sub.0](p), ([alpha] [greater than or equal to] 1) or [0 [less than or equal to] [alpha], 1, [[L.sup.m].sub.0](p) [greater than or equal to] 0] (7) [L.sub.2](p; [gamma]) = [L.sub.0][(p).sup.[gamma]], [gamma] [greater than or equal to] 1 (8) [L.sub.3](p; [alpha], [gamma]) = [p.sup.[alpha]][L.sub.0][(p).sup.[gamma]], ([alpha], [gamma] [greater than or equal to] 1) or [0 [less than or equal to] [alpha] [less than] 1, [gamma] [greater than or equal to] 1, [[L.sup.m].sub.0](p) [greater than or equal to] 0]. (9) Families 7 and 8 were obtained using Theorems 2 and 3 and Family 9 arises by combining both results. Note that Families 7 and 8 are ordered with respect to their parameters [alpha] and [gamma]. It is clear that (a) [L.sub.1] is ordered with respect to [alpha] since if [[alpha].sub.1] [greater than or equal to] [[alpha].sub.2] [greater than] 0, then [L.sub.1](p, [[alpha].sub.1]) [less than or equal to] [L.sub.1](p, [[alpha].sub.2]). (b) [L.sub.2] is ordered with respect to [gamma] since if [[gamma]..1] [greater than or equal to] [[gamma].sub.2] [greater than] 0, then [L.sub.2](p, [[gamma].sub.1]) [less than or equal to] [L.sub.2](p, [[gamma].sub.2]). (c) If [L.sub.0](p) = [L.sub.0](p; k) is ordered with respect to parameter (1) Any value passed to a program by the user or by another program in order to customize the program for a particular purpose. A parameter may be anything; for example, a file name, a coordinate, a range of values, a money amount or a code of some kind. k, that is, if [k.sub.1] [less than or equal to] [k.sub.2], we have [L.sub.0](p; [k.sub.1]) [less than or equal to] [L.sub.0](p; [k.sub.2]). (10) Then (i) If [[alpha].sub.1] [greater than or equal to] [[alpha].sub.2], then [p.sup.[[alpha].sub.1]][L.sub.0](p; [k.sub.1]) [less than or equal to] [p.sup.[a.sub.1]][L.sub.0](p; [k.sub.2]) [less than or equal to] [p.sup.[a.sub.2]][L.sub.0](p; [k.sub.2]; (11) that is, we have new ordering with respect to [alpha]. (ii) If [[gamma].sub.1] [greater than] [[gamma].sub.2], then [[L.sup.[[gamma].sub.1]].sub.0](p; [k.sub.1]) [less than or equal to] [[L.sup.[[gamma].sub.1]].sub.0](p; [k.sub.2]) [less than or equal to] [[L.sup.[[gamma].sub.2]].sub.0](p; [k.sub.2]); (12) that is, we have new ordering with respect to y. (d) Combining the previous results, we can also obtain a new ordering for family [L.sub.3]. The new parameters that are sequentially incorporated in the hierarchy can be interpreted in terms of the curve elasticities. For example, [epsilon]([L.sub.3]; p) = [alpha] + [gamma][epsilon]([L.sub.0]; p), (13) where [epsilon](L; p) represents the elasticity of L. The Exponential Lorenz Curve Family The family we discuss is the exponential Lorenz curve family. This family is generated from the initial Exponential Lorenz curve. [L.sub.0](p; k) = [C.sub.k]([e.sup.kp] - 1), 0 [less than or equal to] p [less than or equal to] 1, (14) with [[c.sup.-1].sub.k] = [e.sup.k] - 1, which satisfies Theorem 2. This curve is called the exponential Lorenz curve since it is generated from the suitably normalized exponential function exponential function In mathematics, a function in which a constant base is raised to a variable power. Exponential functions are used to model changes in population size, in the spread of diseases, and in the growth of investments. g(p; k) = exp exp abbr. 1. exponent 2. exponential (kp), k [greater than] 0, and yields the Lorenz curve [L.sub.0](p, k) = [g(p; k) - g(0; k)]/g(1; k) - g(0; k)]. This curve has been recently proposed by Chotikapanich (1993) and gives excellent fitting results with grouped data. The model [L.sub.0](p, k) includes as a particular case the egalitarian e·gal·i·tar·i·an adj. Affirming, promoting, or characterized by belief in equal political, economic, social, and civil rights for all people. model L(p) = p. This is a limiting case for k going to zero, that is, [L.sub.0](p; k) = p. The model [L.sub.0](p; k) can also be interpreted as a linear convex combination A convex combination is a linear combination of data points (which can be vectors, scalars, or more generally points in an affine space) where all coefficients are non-negative and sum up to 1. of an infinite set (mathematics) infinite set - A set with an infinite number of elements. There are several possible definitions, e.g. (i) ("Dedekind infinite") A set X is infinite if there exists a bijection (one-to-one mapping) between X and some proper subset of X. of potential Lorenz curves, [p.sup.i], j = 1, 2, with weights decreasing with i, that is, [L.sub.0](p; k) = [e.sup.kp] -1/[e.sup.k] -1 = [[[sigma].sup.[infinity infinity, in mathematics, that which is not finite. A sequence of numbers, a1, a2, a3, … , is said to "approach infinity" if the numbers eventually become arbitrarily large, i.e. ]].sub.i=1] [w.sub.i][p.sup.i] (15) where [w.sub.i] = [k.sup.i]/i!([e.sup.k] - 1)' [w.sub.i] [greater than or equal to] 0, [[[sigma].sup.[infinity]].sub.i=1] [w.sub.i] = 1. (16) In some cases the fit is even better than that associated with some biparametric families. Using previous results again, we can consider the hierarchy of exponential Lorenz curves: [L.sub.1](p; k, [alpha]) = [c.sub.k][p.sup.[alpha]]([e.sup.kp] - 1); k [greater than] 0, [alpha] [greater than or equal to] 0 (17) [L.sub.2](p; k, [gamma]) = [c.sub.k,[gamma]][([e.sup.kp] - 1).sup.[gamma]]; k [greater than] 0, [gamma] [greater than or equal to] 1 (18) [L.sub.3](p; k, [alpha], [gamma]) = [c.sub.k,[gamma]][p.sup.[alpha]][([e.sup.kp] - 1).sup.[gamma]]; k [greater than] 0, [alpha] [greater than or equal to] 0, [gamma] [greater than or equal to] 1, (19) where [c.sub.k,[gamma]] - [([e.sup.k] - 1).sup.-[gamma]]. Population Functions The quantile functions In theory of probability, a quantile function of a probability distribution is the inverse of its cumulative distribution function. Simple example For example, the quantile function for Exponential(λ) is [X.sub.0](p; k, [micro]) = [micro]k[C.sub.k][e.sup.kp] (20) [X.sub.1](p; k, [alpha][micro]) = [micro][C.sub.k][[alpha][P.sup.[alpha]-1]([e.sup.kp] - 1) + [kp.sup.[alpha]][e.sup.kp]] (21) [X.sub.2](p; k, [gamma][micro]) = [micro][gamma]k[C.sub.k,[gamma]][e.sup.kp][([e.sup.kp] - 1).sup.[gamma]-1] (22) [X.sub.3](p; k, [alpha], [gamma], [micro]) = [micro][c.sub.k,[gamma]][[alpha][p.sup.[alpha]-1][([e.sup.kp] - 1).sup.[gamma]] + k[gamma][p.sup.[alpha]][e.sup.kp][([e.sup.kp] - 1).sup.[gamma]-1]]. (23) In some particular cases we can obtain closed-form expressions In mathematics, an equation or system of equations is said to have a closed-form solution if, and only if, at least one solution can be expressed analytically in terms of a bounded number of certain "well-known" functions. for the distribution functions, as with [L.sub.0]. Again we can prove that the distribution function for Equation 20 becomes [F.sub.0](x; k, [micro]) = 0 if x [less than or equal to] [micro]u(k), [F.sub.0](x; k, [micro]) = 1 if x [greater than or equal to] [micro]v(k) and [F.sub.0](x; k, [micro]) = 1/k log[x/[micro]u(k)] if [micro]u(k) [less than or equal to] x [less than or equal to] [micro]v(k), (24) where, u(k) = k/([e.sup.k] - 1) and v(k) = k[e.sup.k]/([e.sup.k] - 1). For the remaining families we also can obtain results. For example, for [L.sub.2] with [gamma] = 2, we obtain [F.sub.2](x; k, 2, [micro]) = 1/k log[1/2(1 + [square root]1 + 4x/c)] if 0 [less than or equal to] x [less than or equal to] 2v(k)[micro], k [greater than] 0, and [alpha] [greater than or equal to] 0 c = 2k[micro]/[([e.sup.k] - 1).sup.2] and [F.sub.2](x; k, 2, [micro]) = 0 if x [less than or equal to] 0 and [F.sub.2](x; k, 2, [micro]) = 1 if x [greater than or equal to] 2v(k)[micro]. (25) We present some inequality measures that correspond to these Lorenz curves in the Appendix. We now discuss estimation of these models. 4. Estimation In inequality studies, several types of data are normally utilized: grouped data and micro data. Micro data can consist of a set of individual observations or a set of points on the empirical Lorenz curve, for example, income deciles. The estimation method that is presented here can be used for any of the three types of data. For estimating the parameters of Families 17 to 19, least squares is the most direct method to be applied. In all cases, we need to minimize a nonlinear A system in which the output is not a uniform relationship to the input. nonlinear - (Scientific computation) A property of a system whose output is not proportional to its input. function of the parameters. This method presents some well-known problems, such as the need for proving the existence of an absolute minimum and the need from initial values of the estimates for the iterative it·er·a·tive adj. 1. Characterized by or involving repetition, recurrence, reiteration, or repetitiousness. 2. Grammar Frequentative. Noun 1. process to converge con·verge v. con·verged, con·verg·ing, con·verg·es v.intr. 1. a. To tend toward or approach an intersecting point: lines that converge. b. . We discuss these problems and propose solutions now. The Proposed Method The merits of parametric methods, as opposed to nonparametric methods, for the construction of indices and inequality measures for income probability distributions Many probability distributions are so important in theory or applications that they have been given specific names. Discrete distributions With finite support
["CONSTRAINTS: A Language for Expressing Almost-Hierarchical Descriptions", G.J. Sussman et al, Artif Intell 14(1):1-39 (Aug 1980)]. that should be satisfied by the Gini index of any parametric family of Lorenz curves. Consequently, any estimation method for the exponential family should lead to parameter values whose Gini indices satisfy Gastwirth's bounds. The usual estimation method consists of minimizing with respect to [theta Theta A measure of the rate of decline in the value of an option due to the passage of time. Theta can also be referred to as the time decay on the value of an option. If everything is held constant, then the option will lose value as time moves closer to the maturity of the option. ] the sum of squares: [[[sigma].sup.n].sub.i=1] [[[q.sub.i] - L([p.sub.i]; [theta])].sup.2]; [theta] [epsilon] [theta], (26) where [theta] is the set of feasible parameters. Unfortunately, an estimation method based on Equation 26 does not guarantee a Gini satisfying the Gastwirth bounds. An empirical study on this problem has been done by Schader and Schmid (1994), who arrived at conclusions similar to those in Slottje (1990). One possible solution to this problem consists of incorporating the Gastwirth bounds directly into the programming problem as one more constraint Constraint A restriction on the natural degrees of freedom of a system. If n and m are the numbers of the natural and actual degrees of freedom, the difference n - m is the number of constraints. . Thus, we propose to minimize the function Equation 26 subject to GL [less than or equal to] 2 [[[integral].sup.1].sub.0] [p - L(p; [theta])] dp [less than or equal to] GU (27) where GL and GU are the Gastwirth bounds associated with the set of data ([p.sub.i],[q.sub.i]), i - 1,...,n, that is GL = 1 - [[[sigma].sup.k+1].sub.j=1] ([p.sub.j] - [p.sub.j-1])([q.sub.j] + [q.sub.j-1]), GU = GL + [m.sup.-1] [[[sigma].sup.k+1].sub.j=1] [([p.sub.j] - [p.sub.j-1]).sup.2]([a.sub.j] - [m.sub.j])([m.sub.j] - [a.sub.j-1])[([a.sub.j] - [a.sub.j-1]).sup.-1] where [p.sub.0] = [q.sub.0] = 0, [p.sub.k+1] = [q.sub.k+1] = 1[[a.sub.j-1][a.sub.j]], are the limits of the income intervals, [m.sub.j] is the mean income of the interval, and m is the overall mean. Constraint (Eqn. 27) can be incorporated with other alternative estimation methods, as, for example, that proposed in Castillo, Hadi, and Sarabia (1995, 1998). 5. Some Examples To illustrate the method proposed here, we apply it to income distribution data on national samples of income recipients across countries. The data are from Shorrocks (1983). The data correspond to figures for cumulated income shares for 19 countries derived from Jain (1975). The 19 countries selected for analysis were chosen because they cover samples with relatively high, middle, and low income groups with varying degrees of inequality. Using our approach, the Gastwirth lower bounds associated with the different countries are shown in Table 1. As can be seen, the lower bound varies significantly across the countries scrutinized in our study. These should be viewed in light of the overall estimates. In Tables 2 to 3, we give the parameter estimates and the mean square error (MSE MSE Mouse (computer) MSE Materials Science & Engineering MSE Mean Squared Error MSE Mean Square Error MSE Master of Science in Engineering MSE Manufacturing Systems Engineering MSE Mechanically Stabilized Earth ), the mean absolute error (MAE (1) (Metropolitan Area Exchange) Originally known as Metropolitan Area Ethernets, MAEs are junction points on the Internet where data is exchanged between carriers. See IXP and NAP. ), the maximum absolute error (MAXABS) and the Gini index for each country, where MSE = [[[sigma].sup.n].sub.i=1] [[q.sub.i], - L[([P.sub.i], k, [alpha], [gamma])].sup.2]/n (28) is the mean squared error In statistics, the mean squared error or MSE of an estimator is the expected value of the square of the "error." The error is the amount by which the estimator differs from the quantity to be estimated. and MAE = [[[sigma].sup.n].sub.i=1] [absolute val. of [q.sub.i] - L([P.sub.i], k, [alpha], [gamma])]/n (29) is the mean absolute error. The maximum absolute error is MAXABS = [max.sub.i=1....,n] [absolute val. of [q.sub.i] - L([p.sub.i]; k, [alpha], [gamma])].(30) As can be seen in Tables 2 and 3, across all countries, the first model ([L.sub.1]) gives lower MAE, MSE, and MAXABS. The order of magnitude A change in quantity or volume as measured by the decimal point. For example, from tens to hundreds is one order of magnitude. Tens to thousands is two orders of magnitude; tens to millions is three orders of magnitude, etc. of the coefficients, however, is virtually the same. The differences in the measures of goodness of fit are not different until the fourth or fifth decimal place decimal place n. The position of a digit to the right of a decimal point, usually identified by successive ascending ordinal numbers with the digit immediately to the right of the decimal point being first: . In sum, [L.sub.1] appears to be a slightly better fitting model than [L.sub.2]. The Gini coefficients The Gini coefficient is a measure of statistical dispersion most prominently used as a measure of inequality of income distribution or inequality of wealth distribution. It is defined as a ratio with values between 0 and 1: the numerator is the area between the Lorenz curve of the for the [L.sub.1] and [L.sub.2] models are essentially the same in both cases, but the Ginis are slightly higher in [L.sub.2]. In fact, it appears that [L.sub.1] is giving better precision of the model's description of inequality, yet [L.sub.2] yields Gini measures that are more sensitive to inequality. Thus, [L.sub.2] and [L.sub.1] appear to flip-flop across countries with respect to their relative Ginis vis-a-vis their goodness-of-fit measures. 6. Conclusions and Recommendations In this paper we have introduced a new family of Lorenz curves that are generated from the exponential family. Several parameters are incorporated sequentially, keeping the Lorenz character of the resulting families of curves. Several properties of this family are analyzed, and a general estimation method has been proposed that guarantees the existence of unique estimates. The exponential models appear to be very good approximations to actual income distribution data. The results are robust to different data sets for different countries from various parts of the world. Perhaps the most attractive feature of the proffered estimation method is that it is robust. The only cost of this method is some loss of flexibility. (*.) Department of Economics, University of Cantabria The University of Cantabria (Spanish Universidad de Cantabria, UC) is a university located in Santander and Torrelavega in Cantabria, Spain. It was founded in 1972 and is organized in 12 Faculties. , Avda. de los Castros s/n, 39005-Santander, Spain (+.) Department of Applied Mathematics and Computational Sciences | Computational science (or scientific computing) is the field of study concerned with constructing mathematical models and numerical solution techniques and using computers to analyze and solve scientific, social scientific and engineering problems. , University of Cantabria Avda. de los Castros s/n, 39005-Santander, Spain (++.) Department of Economics, Southern Methodist University Southern Methodist University, at Dallas, Tex.; United Methodist; coeducational; chartered 1911. The school's facilities include laboratories for electron microscopy and stable isotopes, a museum of paleontology, and a graduate research center. , Dallas, TX 75275, USA; E-mail dslottje@mail.smu. edu; corresponding author. Paper presented at the American Statistical Association The American Statistical Association (ASA) is a scientific and educational society in the United States with the stated mission to promote excellence in the application of statistical science across the wealth of human endeavor. meetings, August 9 to 13, 1998, Dallas, Texas “Dallas” redirects here. For other uses, see Dallas (disambiguation). The City of Dallas (pronounced [ˈdæl.əs] or [ˈdæl. . The authors are grateful to the Direccion General de Investigacion Cientifica y Tecnica (DGICYT) (project PB96-1261) for partial support of this work. Comments by two anonymous referees have greatly improved the paper. The usual caveat holds. Received June 1998; accepted March 2000. References Abramowitz, Moses, and I. A. Stegum. 1970. Handbook of mathematical functions In mathematics, several functions or groups of functions are important enough to deserve their own names. This is a listing of pointers to those articles which explain these functions in more detail. . New York New York, state, United States New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of : Dover Publications. Atkinson, Anthony B Anthony B is the stage name of Keith Blair (born March 31, 1976), a Jamaican musician. Biography Early life Blair grew up in rural Clarks Town in the northwestern parish of Trelawny. . 1970. On the measurement of inequality. Journal of Economic Theory 2:244-63. Castillo, Enrique, Ali S. Hadi, and Jose-Maria Sarabia. 1995. A method for estimating Lorenz curves. Actas del XXII Congreso Nacional de Estadistica e Investigacion Operativa: 131-2. Castillo, Enrique, Ali S. Hadi, and Jose-Maria Sarabia. 1998. A method for estimating Lorenz curves. Communications in Statistics, Theory and Methods 27:2037-63. Chotikapanich, Duankamong. 1993. A comparison of alternative functional forms for the Lorenz curve. Economics Letters Economics Letters is a scholarly peer-reviewed journal of economics that publishes concise communications (letters) that provide a means of rapid and efficient dissemination of new results, models and methods in all fields of economic research. Published by Elsevier. 41:129-38. Dasgupta, P., Amartya K. Sen, and David Starret. 1973. Notes on the measurement of inequality. Journal of Economic Theory 6:180-7. Gastwirth, Joseph L. 1971. A general definition of the Lorenz curve. Econometrica 39:1037-9. Gastwirth, Joseph L. 1972. The estimation of the Lorenz curve and Gini index. Review of Economics and Statistics 54: 306-16. Jain, s. 1975. Size distribution of income. Washington, DC: World Bank Publications. Kakwani, Nanak C. 1980. On a class of poverty measures. Econometrica 48:437-46. Kakwani, Nanak C., and Nripesh Podder. 1973. On estimation of Lorenz curves from grouped observations. International Economic Review 14:278-92. Pakes, Ariel G. 1981. On income distributions and their Lorenz curves. Technical Report, Department of Mathematics, University of Western Australia Western Australia, state (1991 pop. 1,409,965), 975,920 sq mi (2,527,633 sq km), Australia, comprising the entire western part of the continent. It is bounded on the N, W, and S by the Indian Ocean. Perth is the capital. , Nedlands, WA. Rasche, R. H., J. Gaffney, A. Koo, and N. Obst. 1980. Functional forms for estimating the Lorenz curve. Econometrica 48:1061-2. Sarabia, Jose-Maria, Enrique Castillo, and Dan J. Slottje. 1999. An ordered family of Lorenz curves. Journal of Econometrics econometrics, technique of economic analysis that expresses economic theory in terms of mathematical relationships and then tests it empirically through statistical research. 91:43-60. Schader, M., and F. Schmid. 1994. Fitting parametric Lorenz curves to grouped income distributions: A critical note. Empirical Economics 19:361-70. Shorrocks, Anthony F. 1983. Ranking income distributions. Economica 50:3-17. Slottje. Dan J. 1990. Using grouped data for constructing inequality indices: Parametric vs. non-parametric methods. Economics Letters 32:193-7.
Gastwirth Lower Bounds for Different Countries
Country Lower Bound to G
Brazil 0.62105
Columbia 0.54710
Denmark 0.36215
Finland 0.46585
India 0.44925
Indonesia 0.43575
Japan 0.30660
Kenya 0.60635
Malaysia 0.50345
Netherlands 0.44210
New Zealand 0.36580
Norway 0.35740
Panama 0.44085
Sri Lanka 0.40395
Sweden 0.38205
Tanzania 0.52615
Tunisia 0.49645
United Kingdom 0.35790
Uruguay 0.49135
Goodness-of-Fit Measures and Gini
Indices Corresponding to Model [L.sub.1]
Country [kappa] MAE MSE MAXABS Gini Index
Brazil 6.11303 0.0672026 0.00670074 0.184059 0.677267
Columbia 4.40909 0.0446958 0.00353165 0.136717 0.571024
Denmark 2.36837 0.0103202 0.00014356 0.0223345 0.36215
Finland 3.27372 0.0184473 0.000568854 0.0520662 0.467785
India 3.23994 0.0499807 0.00423375 0.149348 0.46423
Indonesia 3.15391 0.0633097 0.00652695 0.184605 0.455043
Japan 1.96496 0.0146695 0.000358936 0.0371705 0.308185
Kenya 6.00083 0.0766118 0.00835894 0.202144 0.671678
Malaysia 3.7724 0.0343096 0.00203517 0.102055 0.51691
Netherlands 3.0776 0.023943 0.000997398 0.0704869 0.446732
New Zealand 0.39684 0.0132102 0.000240519 0.0297536 0.3658
Norway 0.33157 0.0150283 0.000283704 0.042860 0.3574
Panama 3.06741 0.0247178 0.00106703 0.0728711 0.445611
Sri Lanka 2.73551 0.0172091 0.000491571 0.0462995 0.407594
Sweden 2.5308 0.0146085 0.000328096 0.03686 0.382693
Tanzania 4.32115 0.0635603 0.00610748 0.173073 0.564087
Tunisia 3.72563 0.0276664 0.00104612 0.0608432 0.512564
United Kingdom 2.34177 0.0215614 0.000809624 0.0627512 0.35872
Uruguay 3.57844 0.0159616 0.000408415 0.0394658 0.498539
Goodness-of-Fit Measures and Gini
Indices Corresponding to Model [L.sub.1]
Country [kappa] [gamma] MAE MSE MAXABS Gini Index
Brazil 6.11300 1.00019 0.0672139 0.00670082 0.184018 0.677324
Columbia 4.41021 1.00000 0.0447035 0.00353165 0.136675 0.571111
Denmark 1.96676 1.12001 0.0107581 0.00016163 0.025593 0.363753
Finland 3.26489 1.00201 0.0184466 0.00056943 0.052112 0.467757
India 3.24050 1.00000 0.0499908 0.00423375 0.149326 0.464290
Indonesia 3.15433 1.00000 0.0633176 0.00652695 0.184589 0.455088
Japan 0.08593 1.89580 0.0342035 0.00136918 0.054208 0.323708
Kenya 6.00158 1.00004 0.0766221 0.00835896 0.202107 0.671729
Malaysia 3.77466 1.00004 0.0343362 0.00203522 0.101962 0.517135
Netherlands 3.07786 1.00003 0.0239475 0.00099742 0.070474 0.446772
New Zealand 1.47461 1.30012 0.0137192 0.00030297 0.035858 0.365800
Norway 1.44674 1.28799 0.0141477 0.00029580 0.035039 0.357400
Panama 3.06767 1.00001 0.0247216 0.00106704 0.072860 0.445644
Sri Lanka 2.73138 1.00148 0.0172602 0.0049237 0.046265 0.407800
Sweden 2.52290 1.00157 0.0145834 0.00032861 0.036961 0.382464
Tanzania 4.32084 1.00000 0.0635580 0.00610748 0.173085 0.564062
Tunisia 3.72630 1.00016 0.0276857 0.00104624 0.060793 0.512692
United Kingdom 2.34042 1.00086 0.0215970 0.00081023 0.062703 0.358960
Uruguay 3.57863 1.00009 0.0159672 0.00040845 0.039445 0.498597
Appendix The Gini index of the exponential hierarchy can be expressed in terms of the confluent hypergeometric function In mathematics, there are two types of functions known as confluent hypergeometric functions. One is the family of solutions to a differential equation known as Kummer's equation; these are called Kummer's confluent hypergeometric function, or simply Kummer's function , whose integral representation is given by (b [greater than] a): [gamma](b - a)[gamma](a)/[gamma](b) M(a, b, z) = [[[integral].sup.1].sub.0] [e.sup.u][t.sup.a-1][(1 - t).sup.b-a-1] dt. (A.1) [gamma](b - a)[gamma](a)/[gamma](b) M(a, b, z) = [[[integral].sup.1].sub.0] [e.sup.u][t.sup.a-1][(1 - t).sup.b-a-t]dt. (A.1) The most important properties of the confluent con·flu·ent adj. 1. Flowing together; blended into one. 2. Merging or running together so as to form a mass, as sores in a rash. hypergeometric can be found in Abramowitz and Stegum (1970, p. 503). We have the following theorem. THEOREM Al. The Gini indices of the exponential hierarchy are given by [G.sub.0](K) = k([e.sup.k] + 1) - 2([e.sup.k] - 1)/k([e.sup.k] - 1) (A.2) [G.sub.1](k, [alpha]) = 1 - 2 [k.sup.c]/[alpha] + 1 [M([alpha] + 1, [alpha] + 2, k) - ] (A.3) [G.sub.2](k, [gamma]) = 1 - 2[c.sub.k,[gamma]] [[[sigma].sup.[infinity]].sub.i=0] [gamma](i - [gamma])[[e.sup.k([gamma]-i)] - 1]/[gamma](i + l)[gamma](-[gamma])k([gamma] - i) (A.4) [G.sub.3](k, [alpha], [gamma]) = 1 - 2[c.sub.k,[gamma]] [[[sigma].sup.[infinity]].sub.i=0] [gamma](i - [gamma])/[gamma](i + l)[gamma](-[gamma]) M[[alpha] + 1, [alpha] + 2, k([gamma] - i)], (A.5) where B( ) and [gamma]( ) are the well-known beta and gamma functions In mathematics, the Gamma function (represented by the capitalized Greek letter Γ) is an extension of the factorial function to real and complex numbers. For a complex number z with positive real part it is defined by PROOF, The index [G.sub.0](k) is given by Chotikapanich (1993). For [G.sub.1](k, [alpha]), we have [G.sub.1](k, [alpha]) = 1 - 2[c.sub.k] [[[integral].sup.1].sub.0] ([p.sup.a][e.sup.kp] - [p.sup.a]) dp = 1 - 2[c.sub.k] [[gamma](1)[gamma]([alpha] + 1)/[gamma]([alpha] + 2)M([alpha] + 1, [alpha] + 2, k) - 1/[alpha] + 1] = 1 - 2[k.sup.c]/[alpha] + 1][M([alpha] + 1, [alpha] + 2, k) - 1], and for the index [G.sub.2], we can write [L.sub.2](p; k, [gamma]) = [([c.sub.k,[gamma]][e.sup.kp[gamma]]).sup.[gamma]] = [c.sub.k,[gamma]][e.sup.kp[gamma]][(1 - [e.sup.-kp]).sup.[gamma]] [[[sigma].sup.[infinity]].sub.i=0] [gamma](i - [gamma])/[gamma](i + 1)[gamma](-[gamma])[e.sup.-kpi], and integrating, term by term, we obtain the index [G.sub.2]. Finally, the index [G.sub.3] can be obtained in a similar form. QED QED abbr. Latin quod erat demonstrandum (which was to be demonstrated) QED which was to be shown or proved [Latin quod erat demonstrandum] Noun 1. . |
|
||||||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion