Methodology for the estimation of annual population stocks by citizenship group, age and sex in the EU and EFTA countries.The paper addresses selected computational Having to do with calculations. Something that is "highly computational" requires a large number of calculations. issues related to the challenge of dealing with poor statistics on international migration. Partial results of the ongoing Adj. 1. ongoing  currently happening; "an ongoing economic crisis"ongoing current  occurring in or belonging to the present time; "current events"; "the current topic"; "current negotiations"; "current psychoanalytic theories"; "the ship's current position" Eurostatfunded project on "Modelling of statistical data on migration and migrant mi·grant n. 1. One that moves from one region to another by chance, instinct, or plan. 2. An itinerant worker who travels from one area to another in search of work. adj. Migratory. population" (MIMOSA Mimosa, in astronomy Mimosa or Beta Crucis (bā`tə kr`sĭs), bright star in the constellation Crux (Southern Cross); 1992 position R.A. ) are presented. The focus is on the data on population stocks by broad group of citizenship, sex and age. After a brief overview of the main problems with data on population by citizenship for 31 European European emanating from or pertaining to Europe. European bat lyssavirus see lyssavirus. European beech tree fagussylvaticus. European blastomycosis see cryptococcosis. countries (27 European Union European Union (EU), name given since the ratification (Nov., 1993) of the Treaty of European Union, or Maastricht Treaty, to the European Community countries, Iceland Iceland, Icel. Ísland, officially Republic of Iceland, republic (2005 est. pop. 297,000), 39,698 sq mi (102,819 sq km), the westernmost state of Europe, occupying an island in the Atlantic Ocean just S of the Arctic Circle, c. , Liechtenstein Liechtenstein (lĭkh`tənshtīn'), officially Principality of Liechtenstein, principality (2005 est. pop. 33,700), 62 sq mi (160 sq km), W central Europe. , Norway Norway, Nor. Norge, officially Kingdom of Norway, constitutional monarchy (2005 est. pop. 4,593,000), 125,181 sq mi (324,219 sq km), N Europe, occupying the western part of the Scandinavian peninsula. and Switzerland Switzerland (swĭt`sərlənd), Fr. Suisse, Ger. Schweiz, Ital. Svizzera, officially Swiss Confederation, federal republic (2005 est. pop. 7,489,000), 15,941 sq mi (41,287 sq km), central Europe. ), a range of computational methods is proposed including cohortwise interpolation interpolation In mathematics, estimation of a value between two known data points. A simple example is calculating the mean (see mean, median, and mode) of two population counts made 10 years apart to estimate the population in the fifth year. , cohortcomponent projections, cohortwise weights propagation The transmission (spreading) of signals from one place to another. and proportional proportional values expressed as a proportion of the total number of values in a series. proportional dwarf the patient is a miniature without disproportionate reductions or enlargements of body parts. fitting methods, as well as other, auxiliary auxiliary In grammar, a verb that is subordinate to the main lexical verb in a clause. Auxiliaries can convey distinctions of tense, aspect, mood, person, and number. methods. The algorithm algorithm (ăl`gərĭth'əm) or algorism (–rĭz'əm) [for AlKhowarizmi], a clearly defined procedure for obtaining the solution to a general type of problem, often numerical. for choosing the best method for estimating missing data on population stock by broad citizenship (nationals, foreigners Foreigners alienage the condition of being an alien. androlepsy Law. the seizure of foreign subjects to enforce a claim for justice or other right against their nation. gypsyologist, gipsyologist Rare. EU27 citizens, foreignersnon EU27 citizens), fiveyear age group (up to 85+) and sex on 1st January January: see month. 20022006 is proposed and illustrated by examples of its application for selected countries. Povzetek: Opisane so razlicne metode za ocenjevanje demografskih podatkov. Keywords: population estimates, stocks by citizenship, Europe Europe (yr`əp), 6th largest continent, c.4,000,000 sq mi (10,360,000 sq km) including adjacent islands (1992 est. pop. 512,000,000). , missing data, cohortwise methods, fitting methods 1 Introduction The deficiencies of statistical information on migrationrelated variables, such as population flows or stocks, are wellknown wellknown adj. 1. Widely known; familiar or famous: a wellknown performer. 2. Fully known: wellknown facts. and widely discussed in the literature [1, 7]. The aim of the paper is to contribute to the works on dealing with these shortcomings A shortcoming is a character flaw. Shortcomings may also be:
n. A stepbystep problemsolving procedure, especially an established, recursive computational procedure for solving a problem in a finite number of steps. procedure of selecting the best one, for the estimation estimation In mathematics, use of a function or formula to derive a solution or make a prediction. Unlike approximation, it has precise connotations. In statistics, for example, it connotes the careful selection and testing of a function called an estimator. of population stocks as of 1st January in a breakdown by sex, age group and broad citizenship category, for the countries for which information is unavailable or incomplete. The study was carried out within a Eurostatfunded project on "Modelling of statistical data on migration and migrant population" (MIMOSA). It covers 31 European countries, of which 27 belong to the European Union (as per 1st January 2007), and further fourto the EFTA EFTA: see European Free Trade Association. (Iceland, Liechtenstein, Norway and Switzerland). The period of interest is 20022006. The citizenship groups under study are: nationals, European Union (EU27) foreigners and nonEU27 foreigners, while the age groups are fiveyear, with the last, openended category being 85 years or more. Generally, the proposed estimation methods aim to combine data from different sources (population census, vital statistics, data on acquisition of citizenship, specific surveys, etc.). In principle, the data that are already available are not modified (for example, in order to harmonise definitions, or for any other reason), unless in the case of inconsistencies between the sources. In the latter cases, the demographic data, provided to Eurostat by national statistical institutes (NSIs), are given priority. Apart from the Introduction, the paper is structured in four sections. Section 2 contains summary information on the availability and quality of the 20022006 data on population stocks for 31 countries under study. In Section 3, the proposed methodology for estimating population stocks by sex, age and citizenship groups is discussed. This section presents such tools as estimation of data in single years of age from fiveyear agegroups, cohortwise interpolation of population stocks, cohortcomponent projections, cohortwise propagation of weights, proportional fitting, as well as other, auxiliary methods. Subsequently, Section 4 contains recommendations concerning the procedure of selecting appropriate estimation methods for each of the countries under study, presented in the form of a decision algorithm and accompanied by several illustrative il·lus·tra·tive adj. Acting or serving as an illustration. il·lustra·tive·ly adv. Adj. 1. examples for selected countries. The discussion is summarised in Section 5. The study is based on the data available in the Eurostat databases, supplemented by additional information obtained from national statistical institutes, whenever required and feasible. Throughout the paper, the abbreviation abbreviation, in writing, arbitrary shortening of a word, usually by cutting off letters from the end, as in U.S. and Gen. (General). Contraction serves the same purpose but is understood strictly to be the shortening of a word by cutting out letters in the middle, 'NSI' is used to denote de·note tr.v. de·not·ed, de·not·ing, de·notes 1. To mark; indicate: a frown that denoted increasing impatience. 2. the national statistical institute of the respective country, 'JMQ' stands for the Joint Questionnaire on International Migration Statistics (hereafter In the future. The term hereafter is always used to indicate a future time—to the exclusion of both the past and present—in legal documents, statutes, and other similar papers. : Joint Migration Questionnaire) of Eurostat, UN Statistical Division, UN Economic Commission for Europe Noun 1. Economic Commission for Europe  the commission of the Economic and Social Council of the United Nations that is concerned with economic development in Europe , the Council of Europe Council of Europe, international organization founded in 1949 to promote greater unity within Europe and to safeguard its political and cultural heritage by promoting human rights and democracy. The council is headquartered in Strasbourg, France. and the International Labour Office. 'LFS' depicts the Labour Force Survey. 2 Availability of the 20022006 data on population stocks for 31 European countries Annual statistics on usually resident population by citizenship, sex and age are collected by Eurostat from the NSIs via the Joint Questionnaire on International Migration, together with migration flow data. Population statistics for 37 European countries, collected through the JMQ JMQ Java Message Queue (iPlanet ECommerce Solutions) JMQ Jorvik Message Queues (IP based message queuing system) are checked and subsequently loaded into Eurostat's online database Noun 1. online database  (computer science) a database that can be accessed by computers computer database, electronic database, electronic information service , NewCronos. The data are located under the Population and Social conditions theme, in the International Migration and Asylum asylum (əsī`ləm), extension of hospitality and protection to a fugitive and the place where such protection is offered. The use of temples and churches for this purpose in ancient and medieval times was known as sanctuary. domain (MIGR), tables migr_st_popctz (population by sex and citizenship) and migr_st_popage (population by age group, citizenship and sex). The data for 20002006 come from the following tables in the 20002006 JMQs: * Table 7a (for 20002003, 8a): Usually resident population by citizenship and age, both sexes; * Table 7b (for 20002003, 8b): Usually resident population by citizenship and age, males; * Table 7c (for 20002003, 8c): Usually resident population by citizenship and age, females. A detailed analysis of statistics on population stocks by citizenship provided by the 31 countries covered the JMQs for the reference period 20022006. Selected results of the analysis of the data availability Refers to the degree to which data can be instantly accessed. The term is mostly associated with service levels that are set up either by the internal IT organization or that may be guaranteed by a third party datacenter or storage provider. for particular countries are summarised in Table 1, providing an overview of the situation for all 31 countries. The information on the lack of data, marked as 'not available' in Table 1, was based on the information provided in the JMQ or on information obtained during the THESIM project (1). Other missing data were marked as 'not provided to Eurostat'. In addition to missing data, a number of other problems were detected, for example the presence of provisional Temporary; not permanent. Tentative, contingent, preliminary. A provisional civil service appointment is a temporary position that fills a vacancy until a test can be properly administered and statutory requirements can be fulfilled to make a permanent appointment. data, some citizenship categories only, broad age groups, or a different reference date than 1st January. Data on total population stock on 1st January, not disaggregated Broken up into parts. by citizenship, are also collected by Eurostat within the framework of the Annual Demographic Statistics Among the kinds of data that national leaders need are the demographic statistics of their population. Records of births, deaths, marriages, immigration and emigration and a regular census of population provide information that is key to making sound decisions about national policy. data collection. These data, disaggregated by sex and age, are located under the Population and social conditions theme, in the Demography demography (dĭmŏg`rəfē), science of human population. Demography represents a fundamental approach to the understanding of human society. (DEMO) domain of the database, table demo_pjan. The results of the review of the availability of these data for the years 20022006 revealed that the data on total population stock by sex and fiveyear age group (up to 85+) are available for all 31 countries, with the following exceptions: there is no 2006 data by age for Belgium and Italy, while for Romania the highest age group in 2004 data is 80+. In addition to the annual data, Eurostat also collects and disseminates statistics on population by citizenship, sex and age obtained by the countries during population censuses. Like other statistics, the census data are located under the Population and social conditions theme, in the Census (CENS CENS Censor CENS Center for Embedded Networked Sensing (UCLA NSF) CENS Censorship CENS Centre d'Etudes Nucleaires de Seclay ) domain of the database, table cens_nsctz. Unlike annual population figures, the census data on population by citizenship, sex and age are available for almost all 31 countries, with the notable exceptions of the United Kingdom, Germany and Malta. A supplementary source of data on population stock by citizenship is the Labour Force Survey. However, the availability of data from the LFS LFS Linux from Scratch LFS Labour Force Survey (UK) LFS Live for Speed (computer racing simulation) LFS London Film School LFS LogStructured File System (Unix, BSD) in the Eurostat database is very limited and the reliability of data is probably not high, due to the nature of the data source. By definition, the LFS statistics are estimates and thus bear certain errors, which tan be relatively high for disaggregated categories (e.g., for population broken down by age, sex and citizenship groups). However, some use of the LFS data could be considered as an alternative to the proposed methods in the countries where data on total nationals and total foreigners are missing. In the Eurostat database, the LFS tables are located under the Population and social conditions theme, the Labour market (LABOUR) domain, in the table with population data containing the nationality nationality, in political theory, the quality of belonging to a nation, in the sense of a group united by various strong ties. Among the usual ties are membership in the same general community, common customs, culture, tradition, history, and language. dimension (population by sex, age groups, nationality and labour status, table lfsa_pganws). However, the table does not contain data on the level of individual countries of citizenship and only data on total population and on nationals could be useful for this project. Estimates of the 20022006 stock of the EU27 citizens cannot be prepared using the LFS tables in the Eurostat database. These considerations need to be taken into account when proposing computation Computation is a general term for any type of information processing that can be represented mathematically. This includes phenomena ranging from simple calculations to human thinking. methods for the current study. 3 Proposed methods of estimating population stocks by citizenship, sex and age The current section presents a theoretical background of the methods proposed for the calculations of the missing elements in population stocks by age, sex and citizenship group. After a brief summary of the notation notation: see arithmetic and musical notation. How a system of numbers, phrases, words or quantities is written or expressed. Positional notation is the location and value of digits in a numbering system, such as the decimal or binary system. , the following methods are discussed: interpolation of fiveyear into oneyear age groups, regarded as a preparatory pre·par·a·to·ry adj. 1. Serving to make ready or prepare; introductory. See Synonyms at preliminary. 2. Relating to or engaged in study or training that serves as preparation for advanced education: method (Section 3.2), followed by cohortwise interpolation of population stocks (3.3), cohortcomponent projections, traditionally used in demography (3.4) and cohortwise weights propagation (3.5). Further, Section 3.6 describes selected proportional fitting methods, which category encompasses three approaches, depending on the availability of information: the proportional adjustment, direct proportional fitting and iterative it·er·a·tive adj. 1. Characterized by or involving repetition, recurrence, reiteration, or repetitiousness. 2. Grammar Frequentative. Noun 1. proportional fitting. Section 3 concludes by presenting some auxiliary methods for dealing with the Unknown categories, and for the estimation of missing elements of age distributions (3.7). 3.1 Notation and basic concepts Throughout the paper, the notation used for population variables follows a common convention presented below. In all cases, the superscript Any letter, digit or symbol that appears above the line. For example, 10 to the 9th power is written with the 9 in superscript (10^{9}). Contrast with subscript. n indicates one of the three broad groups of citizenship: nationals, EU27 foreigners or nonEU27 foreigners, abbreviated as N, EU and nEU, respectively, thus reflecting the composition of the European Union as of 1st January 2007. The nonEU27 group includes also the stateless persons Noun 1. stateless person  a person forced to flee from home or country displaced person, DP refugee  an exile who flees for safety Civilian who has been denationalized or whose country of origin cannot be determined or who cannot establish a . An abbreviation FOR is used for all foreign population (EU27 and nonEU27 together). For the transparency (1) The quality of being able to see through a material. The terms transparency and translucency are often used synonymously; however, transparent would technically mean "seeing through clear glass," while translucent would mean "seeing through frosted glass." See alpha blending. of presentation, the country index is skipped, as all calculations proposed in the paper are always countryspecific. The variables in question are as follows: Stock variables: [P.sup.n](x, t)Population in broad citizenship group n, in the age of x years on 1st January, year t. [P.sup.n](x, c)  Population in broad citizenship group n, in the age of x years at the census date c. Event variables: [B.sup.n](t)Births during calendar year t in citizenship group n; [D.sup.n](x, t)Deaths of persons aged x years, belonging to citizenship group n, during calendar year t; [l.sup.n](x, t)Registered immigration immigration, entrance of a person (an alien) into a new country for the purpose of establishing permanent residence. Motives for immigration, like those for migration generally, are often economic, although religious or political factors may be very important. of persons in citizenship group n, aged x years, during calendar year t, regardless of the country of origin; [E.sup.n](x, t)Registered emigration emigration: see immigration; migration. of persons in citizenship group n, aged x years, during calendar year t, regardless of the country of destination; [R.sup.n](x, t)Outcome of the regularisation Noun 1. regularisation  the condition of having been made regular (or more regular) regularization condition, status  a state at a particular time; "a condition (or state) of disrepair"; "the current status of the arms negotiations" 2. of the status of formerly irregular HEIR, IRREGULAR. In Louisiana, irregular heirs are those who are neither testamentary nor legal, and who have been established by law to take the succession. See Civ. Code of Lo. art. 874. residents (cf. [4]) aged x, in year t, by definition referring only to foreigners, n [member of] {EU, nEU}, thus with [R.sup.N](x, t) [equivalent to] 0; [S.sup.n](x, t)Statistical adjustment (official correction) of the size of population aged x, in year t, due to the reasons other than regularisations; [A.sup.n](x, t)Acquisitions of citizenship by the population aged x, in year t, by definition referring only to foreigners, n [member of] {EU, nEU}, with [A.sup.N](x, t) [equivalent to] 0. Unless noted otherwise, age is reported in years reached during a given calendar year, and thus the events in question (deaths, migrations, citizenship changes, etc.) correspond to parallelograms with vertical sides on the Lexis Lexis® An online legal information service that provides the full text of opinions and statutes in electronic format. Subscribers use their personal computers to search the Lexis database for relevant cases. They may download or print the legal information they retrieve. diagram diagram /di·a·gram/ (di´ahgram) a graphic representation, in simplest form, of an object or concept, made up of lines and lacking pictorial elements. . An illustration of the relevant concepts on a Lexis plane is shown in Figure 2, in Section 3.4. Whenever necessary, the index denoting sex is added as an additional subscript (1) In word processing and scientific notation, a digit or symbol that appears below the line; for example, H_{2}O, the symbol for water. Contrast with superscript. (2) In programming, a method for referencing data in a table. g [member of] {m, f} for males and females, respectively, e.g. [P.sup.n.sub.f] (x, t) refers to female population stock, and [D.sub.m.sup.n](x, t) to deaths among males. In order to distinguish fiveyear age groups, an additional lefthand side lefthand side n → izquierda lefthand side left n → linke Seite f lefthand side n → lato or subscript '5' is added. For example, [sub.5][P.sub.m.sup.n](x, t) refers to male population belonging to broad citizenship group n which was in the age of [x, x+5) years on 1st January of year t. The same principle applies to almost all event variables (D, I, E, R, S and A), with a clear exception of B. In some instances, for the clarity of presentation, the summation summation n. the final argument of an attorney at the close of a trial in which he/she attempts to convince the judge and/or jury of the virtues of the client's case. (See: closing argument) of a particular variable over a given index is indicated by an asterisk (1) See Asterisk PBX. (2) In programming, the asterisk or "star" symbol (*) means multiplication. For example, 10 * 7 means 10 multiplied by 7. The * is also a key on computer keypads for entering expressions using multiplication. in a respective place, e.g. [A.sup.nEU](*, t) = [[summation].sub.x] [A.sup.nEU] (x, t) refers to all acquisitions of citizenship by nonEU27 foreigners in year t, regardless of age. Similarly, [I.sup.*] (x, t) = [[summation].sub.n][I.sup.n](x, t) denotes all immigrants aged x, in year t, irrespective of irrespective of prep. Without consideration of; regardless of. irrespective of preposition despite their citizenship, and [D.sup.*](*, t) = [[summation].sub.n] [[summation].sub.x] [D.sup.n](x, t) refers to all deaths registered in year t, without respect to nationality or age. It has to be noted that in several cases the summation over n involves only two components, e.g. n [member of] {EU, nEU} for [R.sup.n](x, t) and [A.sup.n](x, t). 3.2 Interpolation of fiveyear age groups into oneyear groups Among the preparatory steps for the estimation of missing data, the most frequent problem concerns disaggregation dis·ag·gre·ga·tion n. 1. A breaking up into component parts. 2. An inability to coordinate various sensations and a failure to observe their mutual relations. of fiveyear age groups of population (or events) into single years. This has to be performed in order to enable cohortwise interpolations, cohortcomponent projections with yearly steps, or cohortwise weights propagation, as described in Sections 3.3, 3.4 and 3.5. If auxiliary information is available from a different source (e.g. from a census, from the previous or next year, etc.), the population size or the number of events can be disaggregated using a 'Prorating' method [11, p. 561], whereby the relative distribution from the auxiliary source is imposed on the data in question. The obtained distribution might need to be further adjusted to marginal totals, by means of proportional fitting procedures, described in Section 3.6. If the data on population stocks by sex, broad citizenship group and fiveyear age group [sub.5][P.sup.n](x, t) are available and the stocks by sex and oneyear age group [P.sup.*](x, t) are also known, then, assuming no other information about the distribution by single years, we can estimate the missing distributions for particular citizenship groups proportionally pro·por·tion·al adj. 1. Forming a relationship with other parts or quantities; being in proportion. 2. Properly related in size, degree, or other measurable characteristics; corresponding: , that is as: [P.sup.n](x+i, t) = [sub.5][P.sup.n](x, t) x [P.sup.*](x+i, t)// [sup.5] [P.sup.*](x, t). This is an example of the application of the direct proportional fitting described in Section 3.6.2. If none of the above information is available, the proposed methodology is to use the wellknown interpolative fourterm thirddifference solution of Karup and King [11, p. 565]. For each fiveyear group, the disaggregation into fifths is done via applying multiplicative mul·ti·pli·ca·tive adj. 1. Tending to multiply or capable of multiplying or increasing. 2. Having to do with multiplication. mul coefficients to the global value of this group and the neighbouring ones. Different multipliers are used for the first group, the middle groups and the last group, as set forth in Table 2. For example, if we want to split a middle fiveyear group with population [N.sub.i] into five singleyear groups [n.sub.1], [n.sub.2], [n.sub.3], [n.sub.4], [n.sub.5], then: [n.sub.1] = 0.064 [N.sub.i1] + 0.152 [N.sub.i]  0.016 [N.sub.i+1], [n.sub.2] = 0.008 [N.sub.i1] + 0.224 [N.sub.i]  0.032 [N.sub.i+1], etc. When KarupKing multipliers are used, the condition [N.sub.i] = [n.sub.i] + [n.sub.2] + [n.sub.3] + [n.sub.4] + [n.sub.5] is automatically fulfilled ful·fill also ful·fil tr.v. ful·filled, ful·fill·ing, ful·fills also ful·fils 1. To bring into actuality; effect: fulfilled their promises. 2. . As an alternative to the KarupKing interpolation, the sixterm fifthdifference interpolative formulae of Sprague or Beers can be applied, which however use information from more surrounding sur·round tr.v. sur·round·ed, sur·round·ing, sur·rounds 1. To extend on all sides of simultaneously; encircle. 2. To enclose or confine on all sides so as to bar escape or outside communication. n. groups. Methodological details can be found in Shryock et al. [11]. In our case, the KarupKing interpolation is recommended for the sake of simplicity. For variables depicting nonvital events, like migration or citizenship acquisitions, the estimates for particular cohorts can be obtained from two neighbouring periodage estimates yielded by the KarupKing formula, split equally by half. For the first cohort cohort /co·hort/ (ko´hort) 1. in epidemiology, a group of individuals sharing a common characteristic and observed over time in the group. 2. , we can assume that a half of the relevant periodage events concern the cohort in question, while for the last, openended cohort, we can add up the periodage estimate for the openended group and a half of the events concerning the age group immediately preceding the last one. The underlying rationale rationale (rash´nal´), n the fundamental reasons used as the basis for a decision or action. is an assumption that nonvital events are equally spread over the periodage squares of the Lexis diagram. In any case, the estimates for the eldest ELDEST. He or she who has the greatest age. 2. The laws of primogeniture are not in force in the United States; the eldest child of a family cannot, therefore, claim any right in consequence of being the eldest. cohorts would be close to zero for all practical migrationrelated applications. Regardless of the method, if the disaggregation is performed on data broken down by sex or citizenship, the final estimate might need to be obtained by proportional fitting methods (described in Section 3.6), in order to ensure the summation to available marginal totals. 3.3 Cohortwise interpolation of population stocks Given the information on the age structures of the population for two nonadjacent moments of time, a simple idea to obtain the missing figures for inbetween moments would be to apply interpolation techniques. In this case, we propose cohortwise interpolation for all cohorts apart from the youngest and oldest one, which are discussed separately. Overall, this method requires much less information on input than the cohortcomponent projections presented in the next section, but it requires information about population both before and after the moment for which the estimates are to be done. The interpolative approach is recommended for the cases where (a) the span between two points with available data is not wide (say, twothree years), and (b) no information on the distribution of vital and migratory migratory /mi·gra·to·ry/ (mi´grahtor?e) 1. roving or wandering. 2. of, pertaining to, or characterized by migration; undergoing periodic migration. migratory emanating from or pertaining to migration. events by citizenship is available. In practical applications, as the ones described in Section 4, it often happens that the data are available for year t from the census conducted at time c (t [less than or equal to] c < t+1), and for 1st January of the year t+k, not immediately following the census. Such situations can be put in a general framework illustrated on a Lexis diagram in Figure 1, where [alpha] denotes the fraction of a year remaining after the census until 31st December. Figure 1 and the methodology proposed below cover also the situations when data come from other sources than the census, and the situations when the reference date of the data for year t is 1st January. In the latter case it suffices to set [alpha] = 1. For the cohorts already existent ex·is·tent adj. 1. Having life or being; existing. See Synonyms at real^{1}. 2. Occurring or present at the moment; current. n. One that exists. Adj. 1. at the census date c, the interpolation can follow various patterns, but an arithmetic and geometric pattern of growth [3, 10] will be the most frequent choices. As noted by Rowland [10, p. 50], "under arithmetic growth, successive population totals differ from one another by a constant amount [, while] under geometric growth they differ by a constant ratio". For shortperiod interpolations, both approaches should yield similar results, although this is an empirical issue, and there is no convincing argument in favour of either of them. Hence, a selection of appropriate methods should rely on casespecific judgements. [FIGURE 1 OMITTED] It has to be noted that the cohort aged x completed years on 1st January t+k was split at the census date between two age groups: the younger one (aged x completed years) and the older (aged x+1), as shown in Figure 1. Therefore, the interpolative estimate of [P.sup.n](x, t+i) depends on [P.sup.n](x, c), [P.sup.n](x + 1, c) and [P.sup.n](x, t+k). Given the above, the formula for an interpolative estimate of population sizes belonging to a particular age group x+i and citizenship group n, assuming the linear pattern of change, is as follows: [P.sup.n](x+i, t+i) = (ki) / (k1+[alpha]) x [[alpha] x [P.sup.n](x, c) + (1[alpha]) x x [P.sup.n](x+1, c)] + (i1+[alpha]) / (k1+[alpha]) x [P.sup.n](x+k, t+k), (1a) while for the geometric change: [P.sup.n] (x+i, t+i) = [{[[[alpha] x [P.sup.n](x, c) + (1[alpha]) x [P.sup.n](x+1, c)].sup.ki]. x [P.sup.n][(x+k, t+k).sup.i1+[alpha]]}.sup.1/(k1+[alpha]). (1b) For the youngest and oldest cohorts, for which interpolation as proposed above is not possible, a simplified solution is proposed. In such cases, we suggest to take the average shares (proportions) of the sizes of the respective age groups in the total population, calculated from the data available for neighbouring periods, weighted by the distance between the available data points and estimation point. In order to ensure consistency of the results and summation of the agespecific estimates to the marginal totals by sex or citizenship group, whenever available, the estimates have to be adjusted by the means of proportional fitting, presented in Section 3.6. The framework presented above can be easily generalised Adj. 1. generalised  not biologically differentiated or adapted to a specific function or environment; "the hedgehog is a primitive and generalized mammal" generalized biological science, biology  the science that studies living organisms to a much less frequent situation with interpolation between two censusesin such case, a fraction [beta] of a year between the 1st January of the year of the second census and the second census date, c', should be additionally accounted for. However, the estimates obtained in such cases would be only very approximate, due to a usually large time span between the censuses. It should be noted that an identical solution as shown above in (1a), or in (1b) can be used for extrapolating cohort sizes beyond the available data points, in whichever direction. In either case, it would suffice suf·fice v. suf·ficed, suf·fic·ing, suf·fic·es v.intr. 1. To meet present needs or requirements; be sufficient: These rations will suffice until next week. to put an appropriate integer integer: see number; number theory i [less than or equal to] 0 for the backward extrapolation (mathematics, algorithm) extrapolation  A mathematical procedure which estimates values of a function for certain desired inputs given values for known inputs. If the desired input is outside the range of the known values this is called extrapolation, if it is inside then (in particular, following the example from Figure 1, set i = 0 to obtain values for the beginning of the census year), or i > k for the forward extrapolation. The methods discussed above resemble to some extent the ones presented in the Human Mortality Database Methods Protocol [15], with the exception of the oldest age groups, where the quoted study suggests more sophisticated extinct cohort and survivor ratios approaches. Direct application of the methods proposed by Wilmoth et al. [15] would be, however, difficult. This is not because of computational reasons, but rather due to the lack of yearly estimates of deaths, births and migratory events broken down by citizenship groups, which has been listed at the beginning of the current section as a precondition pre·con·di·tion n. A condition that must exist or be established before something can occur or be considered; a prerequisite. tr.v. for selecting cohortwise interpolation method. 3.4 Cohortcomponent projections As concerns projections, let us denote by [X.sup.n](x, t) a sum of all event variables not related to the natural change of population stocks (i.e. all but births and deaths), thus: [X.sup.n](x, t) = [I.sup.n](x, t)  [E.sup.n](x, t) + [S.sup.n](x, t) + [[summation].sub.k[member of]]{EU, nEU}] [A.sup.k](x, t), for n = N; (2a) [X.sup.n](x, t) = [I.sup.n](x, t)  [E.sup.n](x, t) + [S.sup.n](x, t) + [R.sup.n](x, t)  [A.sup.n](x, t), for n [not equal to] N. (2b) Given (2a) and (2b), the population accounting equations for each broad citizenship group are: [P.sup.n](0, t+ 1) = [B.sup.n](t)  [D.sup.n](0, t) + [X.sup.n](0, t); (3a) [P.sup.n](x, t+1) = [P.sup.n](x1, t)  [D.sup.n](x, t) + [X.sup.n](x, t), for x [member of] {1, 2, ..., [x.sub.max]1 }; (3b) [P.sup.n]([x.sub.max], t+1) = [[P.sup.n]([x.sub.max]1, t) + [P.sup.n]([x.sub.max], t)]  [D.sup.n]([x.sub.max], t) + + [X.sup.n]([x.sub.max], t). (3C) In (3c), [x.sub.max] stands for the highest (openended) age group for which information is available. Note also that deaths and other event variables in age group [x.sub.max] refer to the trapezoid trapezoid, closed plane figure bounded by four line segments, or sides, two of which are parallel and two of which are nonparallel. The parallel sides of a trapezoid are called bases and the nonparallel sides legs; in an isosceles trapezoid the legs are of equal on the Lexis diagram rather than to a parallelogram parallelogram, closed plane figure bounded by four line segments, or sides, with opposite pairs of sides parallel and equal in length. The rhombus, rectangle, and square are special types of parallelograms. , while for age group 0to a right triangle, as shown in Figure 2. [FIGURE 2 OMITTED] Under the assumptions presented above, the projection is made following the equations (3a), (3b) and (3c) for consecutive years, on the basis of information available for singleyear age groups, decomposed de·com·pose v. de·com·posed, de·com·pos·ing, de·com·pos·es v.tr. 1. To separate into components or basic elements. 2. To cause to rot. v.intr. 1. from the fiveyear groups, if needed. Note that the default citizenship of a newborn newborn /new·born/ (noo´born?) 1. recently born. 2. newborn infant. new·born adj. Very recently born. n. A neonate. child can differ between the countries, either following the ius soli principle, whereby a child acquires the citizenship of the country of birth, or ius sanguinis, according to according to prep. 1. As stated or indicated by; on the authority of: according to historians. 2. In keeping with: according to instructions. 3. which a child inherits the citizenship of its parent(s), or finally a mixture of those two, for example differentiating between the generations of migrants, taking into account the length of stay in the country, etc. The general rules are as follows: a) lus sanguinis If the child gets citizenship of any of the parents, then [B.sup.n](t) in equation (3a) may be assumed to be roughly proportional to [P.sup.n](t). If the child acquires citizenship of the mother and we have no separate estimate of fertility fertility: see infertility. fertility Ability of an individual or couple to reproduce through normal sexual activity. About 80% of healthy, fertile women are able to conceive within one year if they have intercourse regularly without contraception. for nationals and foreigners, then [B.sup.n](t) may be assumed to be roughly proportional to [P.sup.n.sub.f](t). If the estimates of fertility by broad citizenship and age of mother exist then a better estimate may be obtained using the formula: [B.sup.n](t) = [B.sup.*](t) [[summation].sub.x] [f.sup.n](x) [P.sub.f.sup.n] (x, t) / [[summation].sub.k,x] [f.sup.k](x) [P.sub.f.sup.k](x, t), (4) where [f.sup.n](x) denotes agespecific fertility rates Noun 1. fertility rate  the ratio of live births in an area to the population of that area; expressed per 1000 population per year birth rate, birthrate, fertility, natality for women in age group x, belonging to the group of citizenship n. If the estimates of fertility are available by broad citizenship group, but not by the age of mother, the formula (4) would have to be modified, so as the summation over age reflects only the female population aged 1549 years. b) Ius soli If the child automatically acquires the citizenship of a given country, then the balance equation for the youngest age group, (3a), becomes, depending on the citizenship in question: [P.sup.N] (0, t+1) = [B.sup.*](t)  [D.sup.N] (0, t) + [X.sup.N](0, t), for n = N; (3a') [P.sup.n] (0, t+1) = [X.sup.n](0, t)  [D.sup.n](0, t), for n [not equal to] N. (3a") In mixed cases, it is recommended to project one part of births according to formulas for ius soli and another part according to the ius sanguinis principle. Note also that losses of citizenship are not accounted for, as they in most instances concern persons in reality either already living abroad, or emigrating (and counted in E). For acquisitions of citizenship, we assume that nonnationals fall in the category of nationals upon naturalization naturalization, official act by which a person is made a national of a country other than his or her native one. In some countries naturalized persons do not necessarily become citizens but may merely acquire a new nationality. , in order to count the same people only once, regardless of the number of citizenships they have. If the breakdown by citizenship group of all variables referring to vital and migratory events can be assumed proportional to the citizenship structure of the population at the beginning of each year, then the projection methodology can be often de facto [Latin, In fact.] In fact, in deed, actually. This phrase is used to characterize an officer, a government, a past action, or a state of affairs that must be accepted for all practical purposes, but is illegal or illegitimate. simplified to proportional adjustment / decomposition decomposition /de·com·po·si·tion/ (dekom?pahzish´un) the separation of compound bodies into their constituent principles. de·com·po·si·tion n. 1. , whereby the citizenship distribution of the considered cohort in the previous year would directly apply to all cohorts except the first and the last one in each year. In particular, this situation applies if the following four conditions hold: 1. Total population by age, [sub.5] [P.sup.*](x, t), is known for successive years, but the citizenship structure is missing; 2. We may assume that the distribution of deaths and migration flows by broad citizenship is the same as the citizenship composition of the population; 3. Acquisitions of citizenship may be ignored; 4. There was no regularization reg·u·lar·ize tr.v. reg·u·lar·ized, reg·u·lar·iz·ing, reg·u·lar·iz·es To make regular; cause to conform. reg , or it may be ignored. In such cases, the projection equation (3b) combined with proportional fitting is equivalent to proportional decomposition of [sub.5] [P.sup.*](x, t) by citizenship group described in Section 3.6.1. The estimations can be performed using the formula: [sub.5][P.sup.n] (x, t) = [sub.5] [P.sup.*](x, t) x [sub.5] [P.sup.n](x1, t1) / [sub.5] [P.sup.*](x1, tl). (5) The first and the last cohort may be disaggregated using the citizenship composition of the first and last age group in the previous year. In such cases, the following formulas apply: [sub.5][P.sup.n] (0, t) = [sub.5][P.sup.*] (0, t) x [sub.5][P.sup.n] (0, t1) / [sub.5][P.sup.*] (0, t1), or: (6a) [sub.5][P.sup.n] ([x.sub.max], t) = [sub.5][P.sup.*] ([x.sub.max], t) x [sub.5][P.sup.n] ([x.sub.max], t1) / [sub.5][P.sup.*] ([x.sub.max], t1). (6b) 3.5 Cohortwise weights propagation In some cases, too much information on the agesexcitizenship distribution of the components of population change is missing, which renders projections too dubious with respect to the number of assumptions that need to be made. In practice, in such instances the only reliable information comes from the population census and from annual population stocks available in the DEMO domain of the NewCronos database. Hence, the proposed procedure is as follows. For the census population, apply the structure by citizenship, taken from each fiveyear age group, to the respective singleyear age groups (i.e. from age group 04 to single ages 0, 1 ..., 4; from 59 to 5, 6, ..., 9 etc.). Let [w.sup.n](x, c) = [P.sup.n](x, c) / P(x, c) denote the agespecific shares ('weights') of citizenship group n in the census. Further, set [alpha] as a fraction of the calendar year before the census date. It is implicitly assumed that the census population in singleyear age groups can be divided between 'older' and 'younger' cohorts using the [alpha] and (1[alpha]) partition A reserved part of disk or memory that is set aside for some purpose. On a PC, new hard disks must be partitioned before they can be formatted for the operating system, and the Fdisk utility is used for this task. . For the census date, use the following formula to calculate the share of citizenship group n in the cohort that was aged x years on 1st January of the census year: [w.sup.n](x+[alpha], c) = [(1  [alpha]) x [P.sup.n](x, c) + [alpha] x [P.sup.n](x+1, c)] // [(1  [alpha]) x [P.sup.*](x, c) + [alpha] x [P.sup.*](x+1, c)], for x < [x.sub.max]; (7a) [w.sup.n]([x.sub.max] + [alpha], c) = [P.sup.n]([x.sub.max, c) / [P.sup.*]([x.sub.max], c). (7b) For the 1st January of the census year assume that the weights [w.sup.n](x, t) = [w.sup.n]([x+[alpha], c). For the 1st January of the year following the census year (t > c), assume in turn: [w.sup.n](x, t) = [w.sup.n](x1+[alpha], c), for 0 < x <[x.sub.max]; (8a) [w.sup.n]([x.sub.max], t) = [[P.sup.n]([x.sub.max]1, c) (1[alpha]) + [P.sup.n]([x.sub.max], c)] // [[P.sup.*]([x.sub.max]1, c) (1[alpha]) + [P.sup.*]([x.sub.max], c)]. (8b) For the youngest age group assume [w.sup.n](x, t) = [w.sup.n](0, c), or alternatively that the shares are the same as the shares of citizenship group n in the births during the census year, so as: [w.sup.n](0, t) = [B.sup.n](t1) / [B.sup.*](t1). For consecutive years calculate: [w.sup.n](x, t)= [w.sup.n](x1, t1), for x = 1, ..., [x.sub.max]1; (9a) [w.sup.n]([x.sub.max], t) = [[P.sup.n]([x.sub.max]1, t1) + [P.sup.n]([x.sub.max], t1)] // [[P.sup.*] ([x.sub.max]1, t1) + [P.sup.*]([x.sub.max], t1)]; (9b) [w.sup.n](0, t) = [w.sup.n](0, t1), (9c) or, as an alternative to (9c): [w.sup.n] (0, t) = [B.sup.n] (t1) /[B.sup.*](t1). Subsequently, calculate populations for all years using the above shares and total populations (available e.g. from DEMO), as: [P.sup.n](x, t) = [P.sup.*](x, t) x [w.sup.n](x, t). Finally, aggregate singleyear age groups into fiveyear ones. 3.6 Proportional fitting methods In the proportional fitting methods, the general task is to estimate [P.sub.g.sup.n](x, t), i.e. the elements of a threedimensional cube cube, in geometry, regular solid bounded by six equal squares. All adjacent faces of a cube are perpendicular to each other; any one face of a cube may be its base. The dimensions of a cube are the lengths of the three edges which meet at any vertex. (with the dimensions being sex, age and citizenship). The choice of a particular method depends on which marginal information See: marginal data. (cube's edges or faces) is known and if an initial estimate of the cube elements are available. Below, the examples of frequent situations are presented. The formulas have been given for population by single years of age but the analogical an·a·log·i·cal adj. Of, expressing, composed of, or based on an analogy: the analogical use of a metaphor. an formulas apply to population by fiveyear age group. For more information on multiproportional techniques sec for example the studies of Willekens [12, 13], Willekens et al. [14], Rees [8] and Norman [6]. Note that proportional fitting methods presented below are known under various names in the scientific literature. In addition to a potential application as the main estimation method, proportional fitting may be used, in almost all the countries for which estimations are needed, as the final stage of the estimation procedure, in order to adjust the initial estimates to known aggregates or marginal totals. The initial estimate might be obtained for example using interpolation or projection, or assumed to be the same as at some different time (e.g. the same as at the census date). Such an initial estimate has to be subsequently adjusted for example to the known total population size by age and sex. 3.6.1 Proportional adjustment / decomposition Among the proportional fitting methods, the simplest one tan be applied to situations, when a population can be directly disaggregated by a variable (sex, age or citizenship), according to the pattern observed in an auxiliary source. In general, the idea is the same as in the Prorating prorating (prōrā´ting), n a clause in a contract with participating dental professionals wherein they agree to accept a percentage reduction in their billings to offset the amount by which the total cost of method [11, p. 561] mentioned in Section 3.2. For example, if the aggregates [P.sub.g.sup.*] (x, t) and an initial estimate of the citizenship structure [P'.sub.g.sup.n](x, t) are known, then the final estimate [P".sub.g.sup.n](x, t) may be obtained as: [P".sub.g.sup.n](x, t) = [P'.sub.g.sup.n] (x, t) x [P.sub.g.sup.*](x, t)/[P'.sub.g.sup.*](x, t). (10) In particular, if one wants to estimate the breakdown by citizenship using the citizenship structure taken from the census, [P.sub.g.sup.n](x, c), then (10) becomes: [P.sub.g.sup.n](x, t) = = [P.sub.g.sup.*] (x, t) x [P.sub.g.sup.n](x, c) / [P.sub.g.sup.*](x, c). 3.6.2 Direct proportional fitting The estimation problem becomes slightly more complicated, if one wants to estimate [P.sub.g.sup.n] (x, t), but does not have any initial estimate of it. One possible situation is that at least some fragments of the data cube For the data mining concept, see, see . In computer programming contexts, a data cube is a three (or higher) dimensional array of values, commonly used to describe a time series of image data. (faces and/or edges) are available and provide coherent information The coherent information is an entropy measure used in quantum information theory. It is a property of a quantum state ρ and a quantum channel infrequently in·fre·quent adj. 1. Not occurring regularly; occasional or rare: an infrequent guest. 2. , because there usually are some initial estimates of the population structures, for example from the census. Willekens et al. [14, p. 97] noted that general formulae of a form akin to (11) for a one faceone edge problem, as well as similar closedform solutions for the cases with three edges or two faces are the solutions of the entropymaximisation problems in research tasks aimed at reconstructing the elements of a threedimensional arrays, given the available marginal totals. 3.6.3 Iterative proportional fitting In a general case, a closedform solution (11) may not exist due to possible incoherence incoherence Not understandable; disordered; without logical connection. See Schizophrenia. between various data at hand. Such problems call for a multistep iterative proportional fitting (IPF (Itanium Processor Family) See Itanium. ) method, whereby the solutions are sought stepwise, through iterative adjustments of their successive approximations successive approximation n. A method for estimating the value of an unknown quantity by repeated comparison to a sequence of known quantities. to marginal totals available from the faces or edges of the data cube. In particular, this method can be used for adjusting the preliminary joint distributions to the known marginal distributions In probability theory, given two jointly distributed random variables X and Y, the marginal distribution of X is simply the probability distribution of X ignoring information about Y . For example, let the initial estimate of the citizenship structure [P'.sub.g.sup.n](x, t) be known, as well as the sexage face and the citizenship edge of the data cube, respectively [P.sub.g.sup.*](x, t) and [P.sub.[??].sup.n] (*, t). By the IPF algorithm, the initial estimates are iteratively corrected by proportional adjustment. An additional superscript (k) in [P.sub.g.sup.(k)n] (x, t) denotes the iteration One repetition of a sequence of instructions or events. For example, in a program loop, one iteration is once through the instructions in the loop. See iterative development. (programming) iteration  Repetition of a sequence of instructions. step (for k [greater than or equal to] 1). The starting value k = 1 defines also the initial estimate of the joint sexagecitizenship distribution, [P.sub.g.sup.(1)n](x, t) = [P'.sub.g.sup.n](x, t). Subsequent steps are computed as follows: [P.sub.g.sup.(2k)n](x, t) = [P.sub.g.sup.(2k1)n](x, t) x [P.sub.g.sup.*](x, t) / [P.sub.g.sup.(2k1)*](x, t); (12a) [P.sub.g.sup.(2k+1)n](x, t) = [P.sub.g.sup.(2k)n](x, t) x [P.sub.[??].sup.n]](*, t) / [P.sub.[??].sup.(2k)n] (*, t). (12b) The procedure defined by (12a) and (12b) is repeated iteratively till some convergence criterion is achieved. For example, the estimates yielded by consecutive steps should differ by no more than by an arbitrarilyselected small number [epsilon]. More details of the method have been discussed by Willekens [13, pp. 6971], Willekens et al. [14], Rees [8] and Norman [6]. Although the IPF method is purely mechanical, its main advantage is that it does not require any additional information (such as data on vital events or migration) or excessive labour resources, and the obtained results (in terms of joint distributions by all variables under study) are automatically coherent with marginal distributions of particular variables. Moreover, under some general assumptions, the IPF estimates can be interpreted from a statistical viewpoint as joint probability joint probability n. The probability that two or more specific outcomes will occur in an event. Noun 1. joint probability  the probability of two events occurring together distributions obtained using the maximum likelihood or entropy entropy (ĕn`trəpē), quantity specifying the amount of disorder or randomness in a system bearing energy or information. Originally defined in thermodynamics in terms of heat and temperature, entropy indicates the degree to which a given maximisation methods [2, pp. 8397; after: 13, p. 70]. 3.7 Auxiliary methods Among the auxiliary methods proposed in the current study, the foremost one is the decomposition of the Unknown category wherever it appears (i.e., with respect to age, citizenship, or even sex, as in the case of Greece for 2005). The universal solution proposed in such cases is a proportional disaggregation: population belonging to the Unknown category is broken down proportionally to the existing, well defined categories (citizenship groups, age groups, etc.) and the resulting parts are attached to these categories. For example, if total population P consists of n welldefined groups [P.sub.1], ..., [P.sub.n], and the Unknown category, [P.sub.unk], such that P = [[summation]sub.i] [P.sub.i] + [P.sub.unk], where i = 1, ..., n, then the following corrections apply: [P'.sub.j] = [P.sub.j] + [P.sub.unk] x [P.sub.j] / [[summation].sub.i] [P.sub.i] = [P.sub.j] (1 + [P.sub.unk] / [[summation].sub.i] [P.sub.i]), for all j, with i = 1, ..., n. (13) If some elements of age structures are missing (e.g. tails of respective age distributions, or a breakdown into fiveyear groups given the availability of broader ones), we may either use a structure from a different year or fit a mathematical function A rule for creating a set of new values from an existing set; for example, the function f(x) = 2x creates a set of even numbers (if x is a whole number). to available data. For example, we can assume that foreign population stocks are a doubleexponential function of age, as originally proposed for the intensity of migration flows by Rogers and Castro [5, 9]. The number of foreign population aged x, [phi](x), would then be given by the following equation: [phi](x) = C + [a.sub.1] x exp exp abbr. 1. exponent 2. exponential ([[alpha].sub.1] x x) + [a.sub.2] x exp{[[alpha].sub.2] x (x  [[mu].sub.2]) +  exp[[[lambda].sub.2] x (x  [[mu].sub.2])]}. (14) The parameters c, [a.sub.1], [[alpha].sub.1], [a.sub.2], [[alpha].sub.2], [[lambda].sub.2] and [[mu].sub.2] can be estimated separately for each sex, for example using the ordinary least squares method least squares method Statistical method for finding a line or curve—the line of best fit—that best represents a correspondence between two measured quantities (e.g., height and weight of a group of college students). (OLS OLS Ordinary Least Squares OLS Online Library System OLS Ottawa Linux Symposium OLS Operation Lifeline Sudan OLS Operational Linescan System OLS Online Service OLS Organizational Leadership and Supervision OLS On Line Support OLS Online System ) on the basis of the data for the available age groups (for example, below 65 years of age). Technically, the calculations can be done in a spreadsheet spreadsheet Computer software that allows the user to enter columns and rows of numbers in a ledgerlike format. Any cell of the ledger may contain either data or a formula that describes the value that should be inserted therein based on the values in other cells. (e.g. MS Excel A fullfeatured spreadsheet for Windows and the Macintosh from Microsoft. It can link many spreadsheets for consolidation and provides a wide variety of business graphics and charts for creating presentation materials. ) using a solverlike tool, controlling for sensitivity of the algorithm to the choice of initial input values. Based on the obtained parameter (1) Any value passed to a program by the user or by another program in order to customize the program for a particular purpose. A parameter may be anything; for example, a file name, a coordinate, a range of values, a money amount or a code of some kind. estimates, formula (14) yields approximations of [phi](x) for the remaining age groups. The last, openended group (85+) can be obtained by subtraction subtraction, fundamental operation of arithmetic; the inverse of addition. If a and b are real numbers (see number), then the number a−b is that number (called the difference) which when added to b (the subtractor) equals of all other figures from the total. To avoid negative numbers in the 85+ category, appropriate constraints CONSTRAINTS  A language for solving constraints using value inference. ["CONSTRAINTS: A Language for Expressing AlmostHierarchical Descriptions", G.J. Sussman et al, Artif Intell 14(1):139 (Aug 1980)]. should be set during the estimation procedure. In either case, when adjustment to broader age groups is needed in order to ensure summation to respective totals (e.g. for functional age groups), it can be done via proportional fitting presented in Section 3.6. 4 Estimating population stock for EU27 and EFTA countries The current section briefly summarises the algorithm for the selection of an appropriate method of computations for a given country (Section 4.1), followed by a brief illustration of the proposed approach employed for the 31 countries under study, and a selection of the results (4.2). 4.1 Procedure for selecting an estimation method In the light of the overview of data availability presented in Section 2 and the methodological discussion presented in Section 3, it is suggested to inspect the following general options of data availability, in order to apply the relevant data estimation procedures: Option 1. All the required data are available in the Eurostat database Whenever all data are available in the Eurostat database, the following fivestep procedure is recommended: 1. Organize the data in a database; 2. Verify (1) To prove the correctness of data. (2) In data entry operations, to compare the keystrokes of a second operator with the data entered by the first operator to ensure that the data were typed in accurately. See validate. the data (perform data validation In computer science, data validation is the process of ensuring that a program operates on clean, correct and useful data. It uses routines, often called validation rules, that check for correctness or meaningfulness of data that are input to the system. and internal consistency In statistics and research, internal consistency is a measure based on the correlations between different items on the same test (or the same subscale on a larger test). It measures whether several items that propose to measure the same general construct produce similar scores. checks); 3. Deal with the Unknown categories (if applicable); 4. Calculate the required aggregates; 5. Check the results. This option includes cases when there is a need for combining data from various parts of the Eurostat database (e.g. in DEMO and in JMQ), and the cases where there is an 'Unknown' category, which has to be disaggregated proportionally among the well specified categories, as described in Section 3.7 on 'Auxilliary methods'. Option 2. Some of the data missing in the Eurostat database can be obtained from the respective NSI See Network Solutions. NSI  Network Solutions, Inc. or from other sources In this case, two situations are possible: Option 2a. All the missing data may be obtained without contacting the NSI If all the missing information is publicly available, for example from the NSI webpage, it should be downloaded and combined with the Eurostat data. Such an overall dataset should be then subject to a procedure described under Option 1, points 2. through 5. Option 2b. Some (or all) missing data are (or are suspected to be) available either from the NSI or from other sources If some missing information is downloadable from sources like the NSI webpage, it should be collected and merged with the Eurostat data. Nonetheless, there are cases when data are not publicly available but it can be suspected that either some, or even all the missing information is in the possession of the NSI. In such case, the undertaken actions should be as follows: 1. Contact the NSI in order to obtain the missing data. If successful, proceed as in Option 2a; 2. For the data that are still unavailable, but can be estimated, proceed as in Option 3; 3. For the data that are still not available and cannot be estimated, look at Option 4. Option 2 includes cases when data from various national sources has to be combined, for example aggregated data obtained using the component method and data on citizenship composition from the register of foreigners. Option 3. Some data are not available anywhere but can be estimated Even if some auxiliary data can be collected from whichever source, in many instances the available information can be still incomplete. In a vast majority of such cases, the missing information can be still estimated, either following the methodological guidelines guidelines, n.pl a set of standards, criteria, or specifications to be used or followed in the performance of certain tasks. and techniques outlined in Section 3, or by means of more straightforward and easytoapply solutions. The order of undertaken actions is then as follows: 1. Organize the available data (from Eurostat and other sources); 2. Verify the data (perform data validation and internal consistency checks); 3. Deal with the Unknown categories (if applicable); 4. Calculate the required aggregates for the available data; 5. For each year for which data are missing select the best method to estimate missing data; 6. Collect supplementary data needed for the estimations; 7. Estimate the missing data; 8. Check the results. For the estimations (item 7), various methods can be used, depending on the range and type of the missing information and data availability. In general, fsfive broad groups of methods can be distinguished here, following the outline presented in Section 3: a. Proportional fitting methods (Section 3.6); b. Cohortwise weights propagation (Section 3.5); c. Cohortwise interpolation of population stocks (Section 3.3); d. Cohortcomponent projections (Section 3.4); e. Other solutions, not listed above, or combined approaches. The methodology of interpolating the fiveyear into oneyear age groups, presented in Section 3.2, as well as some methods described in Section 3.7 should be treated as auxiliary to all the remaining ones rather than constituting separate estimation methods per se. Option 4. Data are not available, and no or only very rough estimates can be produced In principle, this should be a very infrequent in·fre·quent adj. 1. Not occurring regularly; occasional or rare: an infrequent guest. 2. option. If no information is available that would enable estimation under Option 3, none or only very rough approximations can be performed, such as for example the 5050 division of all foreigners into the EU27 and nonEU27 categories. Under Option 3, in all the cases where several methods could be alternatively applied, preference is given to the more straightforward ones, and definitely to the ones having less judgemental elements, thus less potential sources of error. This approach conforms to the Occam's razor (philosophy) Occam's Razor  The English philosopher, William of Occam (13001349) propounded Occam's Razor: Entia non sunt multiplicanda praeter necessitatem. (Latin for "Entities should not be multiplied more than necessary"). principle, stating that "entities are not to be multiplied mul·ti·ply^{ 1} v. mul·ti·plied, mul·ti·ply·ing, mul·ti·plies v.tr. 1. To increase the amount, number, or degree of. 2. Mathematics To perform multiplication on. beyond necessity" (2), which in this case means that the proposed models should hot be more sophisticated than necessary, due to various possible sources of error. For example, given complete data on stocks by age and citizenship group for two moments of time (e.g. nom successive population censuses), if the data on flows (I and E), natural change (B and D) and citizenship acquisitions (A) are not available by citizenship and/or age, and require estimation, then the intermediate values are recommended to be calculated using cohortwise interpolation rather than projection. In the former case, the only source of possible error is the composition of population as such, whereas in the latter, judgemental assumptions on the relevant distributions of all components of the balance equation are likely to result in higher uncertainty of the ultimate results, which within the deterministic 1. (probability) deterministic  Describes a system whose time evolution can be predicted exactly. Contrast probabilistic. 2. (algorithm) deterministic  Describes an algorithm in which the correct next step depends only on the current state. framework of the project is impossible to assess. Figure 3 presents a decision tree summarising the procedure for selecting the estimation methodology, taking into account all the above options. 4.2 Application of the methodology, examples, selected results The decision tree presented in Figure 3 has been used to select the best estimation method for each of the 31 EU and EFTA countries, accounting for the availability of data in the Eurostat database (either online or in the JMQs), in the NSI databases, and at other sources. It turned out that complete data needed to estimate population by broad group of citizenship, sex and age on 1st January 20022006 were available in the JMQs for nine countries: Austria, the Czech Republic Czech Republic, Czech Česká Republika (2005 est. pop. 10,241,000), republic, 29,677 sq mi (78,864 sq km), central Europe. It is bordered by Slovakia on the east, Austria on the south, Germany on the west, and Poland on the north. , Denmark, Finland, Hungary (3), Norway, Slovenia and Sweden. For additional four countries it was possible either to collect all the missing data from the NSI websites (Belgium and Iceland), or to get them by contacting directly the NSI (Lichtenstein and Switzerland). [FIGURE 3 OMITTED] For the remaining 18 countries some estimations were necessary. The method that proved to useful in the largest number of cases was some sort of proportional fitting (one of the three versions presented in Section 3.6). It was used as the main method for estimating population by broad citizenship in Cyprus, France, Germany, Greece, Italy, Latvia, Luxembourg, Malta, Slovakia, Spain and the UK. In all cases the total population was assumed to be as reported by the NSI in their demographic statistics, while the citizenship structure was taken from varied sources, for example the JMQ data for the same year, data taken from the NSI website (Italy), the census data (Cyprus, France), the data for another year (Romania, Spain), the LFS data (Cyprus, France) or the data from the register of foreigners (Germany) (see also examples below). The cohortwise interpolation method was used for Ireland, Lithuania and Portugal. For Bulgaria, Estonia and Poland, where only data from the census were available, the cohortwise weight propagation was applied. For Cyprus, Lithuania, Luxembourg and Poland it was originally planned to use a projection method, however it was decided that it would require too many assumptions that would be difficult to justify, and that the final result would not be reliable enough to justify the additional effort required when using this method. Estimations done for Romania do not fit any of the above groups. They involved simple combination of data coming from various sources. Below, more details about the estimation procedures are provided for selected countries. In doing so, we have tried to give an example for each estimation method. The resulting numbers in terms of the estimated citizenship structures of the populations of 18 European countries (all being EU Member States) on 1st January 2006, are presented in Table 3. Table 3: Estimated population by broad group of citizenship in 18 EU countries, as of 1st January 2006. As concerns particular examples: in Germany, data on foreigners come from two different sources. The component method (Bevolkerungsfortschreibung), based on the last traditional German census of 25th May 1987, is used by the NSI to produce annual figures on total population, total nationals and total foreigners, as well as nationals and foreigners by sex and age. The other source is the Central Register on Foreigners which contains data on foreigners by citizenship, sex and age. The total numbers of foreigners and their sex and age structures differ between both sources. In order to obtain a single set of estimates, the total number of German citizens, the total number of foreigners, as well as the age structures of Germans and foreigners were taken, following the NSI procedures, from the Bevolkerungsfortschreibung data. The distribution of foreigners into EU27 and nonEU27 foreigners was done in proportion to their shares in respective age groups according to the data from the Central Register of Foreigners. Thus, all in all, the proportional decomposition method In constraint satisfaction, a decomposition method translates a constraint satisfaction problem into another constraint satisfaction problem that is binary and acyclic. Decomposition methods work by grouping variables into sets, and solving a subproblem for each set. was used. In Latvia, no joint distribution of population by citizenship and age was available for 1st January 2002, only the structures by age and by citizenship separately. However, the full joint distribution was available for 2003. The iterative proportional fitting method was selected to deal with this case. The joint distribution by citizenship group and age on 1st January 2003 was taken as the starting point Noun 1. starting point  earliest limiting point terminus a quo commencement, getgo, offset, outset, showtime, starting time, beginning, start, kickoff, first  the time at which something is supposed to begin; "they got an early start"; "she knew from the for estimating the 2002 structure of population, which was then iteratively adjusted to the known marginal totals. Lithuania is an example for the application of a cohortwise interpolation method. In this country, the joint distribution of population by sex, age and citizenship was available for the Census date (6th April 2001), as well as for 1st January 2005. The cohortwise interpolation, as described in Section 3.3, was used to obtain the initial estimates of males and females on 1st January 2002, 2003 and 2004. In the next step those initial estimates where proportionally adjusted to the known numbers of males and females by age, taken from the Eurostat demographic database. In Bulgaria, annual data on population by citizenship were not available. The only information on citizenship structure came from the census of 1st March 2001. There are also annual data on population by age and sex prepared by the NSI using the component method, available from the Eurostat database. The estimates of annual 20022006 population by citizenship, sex and age were prepared using the cohortwise weight propagation method. The census data were used as the starting point for calculating the initial shares (weights) of citizenship groups in each age cohort. These shares were iteratively propagated forward as described in Section 3.5 and the resulting weights were combined with the available data on population by sex and age to calculate the required joint distribution by citizenship, sex and age. 5 Conclusion As it can be seen from the countryspecific overview of problems with data on population stocks by age, sex and citizenship, there is no universal solution for estimating the missing pieces of information in the European countries under study. Nevertheless, depending on the availability of data at hand, either in the Eurostat / JMQ, or in the respective national statistical institutes, several estimation procedures can be proposed and applied, as mentioned in Sections 3 and 4. The methods and algorithm we proposed for this purpose do not, however, consider the issue of the harmonisation Noun 1. harmonisation  a piece of harmonized music harmonization musical harmony, harmony  the structure of music with respect to the composition and progression of chords of the data and definitions, as mentioned in Section 1. More work would be needed in order to recalculate re·cal·cu·late tr.v. re·cal·cu·lat·ed, re·cal·cu·lat·ing, re·cal·cu·lates To calculate again, especially in order to eliminate errors or to incorporate additional factors or data. the population stocks into a common definition (cf. [1, 7]), and make them consistent with the (also reestimated) statistics on migration flows. These very important research tasks are still to be performed in the subsequent tasks of the MIMOSA research project, of which the current study forms a part. Acknowledgement The paper was prepared within the framework of the research project on "Modelling of statistical data on migration and migrant population" (MIMOSA), commissioned by Eurostat (contract no. 2006/S 100106607/EN; Lot 2) to the Netherlands Interdisciplinary in·ter·dis·ci·pli·nar·y adj. Of, relating to, or involving two or more academic disciplines that are usually considered distinct. interdisciplinary Adjective Demographic Institute (NIDI NIDI Nederlands Interdisciplinair Demografisch Instituut (Dutch) NIDI Nickel Development Institute NIDI National Internet Diagnostics Infrastructure ) and conducted jointly by NIDI, Central European Forum for Migration and Population Research (CEFMR), Groupe d'etude de demographie appliquee, Universite Catholique de Louvain (GeDAP UCL UCL University College London UCL Université Catholique de Louvain UCL UEFA Champions League UCL Upper Confidence Limit UCL University of Central Lancashire UCL Upper Control Limit UCL Unfair Competition Law UCL Ulnar Collateral Ligament ) and Southampton Statistical Sciences Research Institute of the University of Southampton In the most recent RAE assessment (2001), it has the only engineering faculty in the country to receive the highest rating (5*) across all disciplines.^{[3]} According to The Times Higher Education Supplement (S3RI). We are grateful to our colleagues for their comments and discussions. Special credits go to Peter Ekamper from NIDI for his assistance in the computational part of the current study. Received: March 9, 2008 References [1] Bilsborrow, R., Hugo, G., Oberai, A.S. and Zlotnik, H. (1997). International migration statistics. Guidelines for improving data collection systems. International Labour Organisation, Geneva Geneva, canton and city, Switzerland Geneva (jənē`və), Fr. Genève, canton (1990 pop. 373,019), 109 sq mi (282 sq km), SW Switzerland, surrounding the southwest tip of the Lake of Geneva. . [2] Bishop, Y.M.M., Fienberg, E.F. and Holland, P.W. (1975). Discrete multivariate analysis multivariate analysis, n a statistical approach used to evaluate multiple variables. multivariate analysis, n a set of techniques used when variation in several variables has to be studied simultaneously. . MIT MIT  Massachusetts Institute of Technology Press, Cambridge, MA. [3] Calot, G. and Sardon, J.P. (2003). Methodology for the calculation of Eurostat's demographic indicators. Eurostat Working Papers working papers pl.n. Legal documents certifying the right to employment of a minor or alien. Noun 1. working papers and Studies, Population and Social Conditions 3/2003/F/no 26. Eurostat, Luxembourg. [4] Cangiano, A. (2008). Foreign Migrants in Southern European Countries: Evaluation of Recent Data. In: J. Raymer, F. Willekens (eds.), International Migration in Europe: Data, Models and Estimates. John Wiley John Wiley may refer to:
[5] Castro, L.J. and Rogers, A. (1983). Patterns of Family Migration: Two Methodological Approaches. Environment and Planning The Environment and Planning journals are four influential academic journals. They are described as as 'interdisciplinary', though they have a highly spatial focus, meaning that they are often of most interest to human geographers. A, 15 (2), pp. 237254. [6] Norman, P. (1999). Putting Iterative Proportional Fitting on the Researcher's Desk. Working Paper 99/03. School of Geography, University of Leeds Organisation Faculties The various schools, institutes and centres of the University are arranged into nine faculties, each with a dean, prodeans and central functions:
[7] Poulain, M., Perrin, N. and Singleton sin·gle·ton n. An offspring born alone. singleton Medtalk One baby. Cf Triplet, Twin. , A. (eds.) (2006), THESIM: Towards Harmonised Adj. 1. harmonised  involving or characterized by harmony consonant, harmonical, harmonized, harmonic harmonious  musically pleasing European Statistics on International Migration, Presses Universitaires de Louvain, LouvainlaNeuve. [8] Rees, P. (1994). Estimating and projecting the populations of urban communities. Environment and Planning A, 26 (11), pp. 16711697. [9] Rogers, A. and Castro, L.J. (1981). Model Migration Schedules. IIASA IIASA International Institute for Applied Systems Analysis (Austria) IIASA Institute of Islamic and Arabic Sciences in America (Fairfax, VA) Report RR8130. International Institute for Applied Systems Analysis, Laxenburg. [10] Rowland, D.T. (2006). Demographic methods and concepts. Oxford University Press, Oxford. [11] Shryock, H.S h.s., n Latin phrase for “at bedtime”; used in writing prescriptions. ., Siegel, J.S. and Associates (1993). Interpolation: Selected General Methods. In: D.J. Bogue, E.E. Arriaga, D.L. Anderton and G.W. Rumsey (eds.), Readings in Population Research Methodology. Vol. 1: Basic Tools. Social Development Center / UN Population Fund, Chicago, pp. 548572. [12] Willekens, F. (1977). The Recovery of Detailed Migration Patterns from Aggregate Data: An Entropy Maximizing Approach. IIASA Report RR8130. International Institute for Applied Systems Analysis, Laxenburg. [13] Willekens, F. (1982). Multidimensional mul·ti·di·men·sion·al adj. Of, relating to, or having several dimensions. multi·di·men Population Analysis with Incomplete Data. In: K.C. Land and A. Rogers (eds.), Multidimensional Mathematical Demography. Academic Press, New York New York, state, United States New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of , pp. 43111. [14] Willekens F.J., Por A. and Raquillet R. (1981). Entropy, multiproportional, and quadratic quadratic, mathematical expression of the second degree in one or more unknowns (see polynomial). The general quadratic in one unknown has the form ax^{2}+bx+c, where a, b, and c are constants and x is the variable. techniques for inferring patterns of migration from aggregate data. In: A. Rogers (ed.), Advances in Multiregional Demography. IIASA Report RR816. International Institute for Applied Systems Analysis, Laxenburg, pp. 83124. 15] Wilmoth, J.R., Andreev, K., Jdanov, D., Glei, D.A., with the assistance of C. Boe, M. Bubenheim, D. Philipov, V. Shkolnikov and P. Vachon (2005). Methods Protocol for the Human Mortality Database, version 4. Department of Demography, University of California, Berkeley The University of California, Berkeley is a public research university located in Berkeley, California, United States. Commonly referred to as UC Berkeley, Berkeley and Cal . Available from: http://www.mortality.org/Public/Docs/MethodsProtocol.pdf (accessed on 17th May 2007). (1) Research project THESIM: Towards Harmonised European Statistics on International Migration, funded by the European Commission European Commission, branch of the governing body of the European Union (EU) invested with executive and some legislative powers. Located in Brussels, Belgium, it was founded in 1967 when the three treaty organizations comprising what was then the European Community through the Sixth Framework Programme The Sixth Framework Programme (abbreviated FP6) was the Framework Programme for Research and Technological Development from 2002 till 2006 set up by the European Union (EU) in order to fund and promote European research and technological development. and executed by a research consortium led by Groupe d'etude de demographie appliquee (GeDAP), Universite Catholique de Louvain. (2) After: 'Ockham's razor', in: Encyclopoedia Britannica Online, http://www.britannica.com/eb/article9056716, accessed on 21st May 2007. (3) For Hungary, data on total population and on the number of Hungarian citizens were hot always provided in the JMQ and therefore hot available in the migration part of the Eurostat database. However, data on total population were available in the demographic part of the Eurostat database and the number of Hungarian citizens could be calculated directly as a difference between total population and total foreigners, the latter taken from the JMQ. Jakub Bijak and Dorota Kupiszewska Central European Forum for Migration and Population Research (CEFMR) ul. Twarda 51/55, 00818 Warsaw Warsaw (wôr`sô), Pol. Warszawa, city (1993 est. pop. 1,655,700), capital of Poland and of Mazowieckie prov., central Poland, on both banks of the Vistula River. , Poland Email: j.bijak@cefmr.pan.pl, d.kupisz@cefmr.pan.pl www.cefmr.pan.pl Table 1: Availability of data on population stock by citizenship, sex and age in the JMQ, 31 countries, as of 1st January 20022006. Country 2002 2003 2004 Austria + + + Belgium + x  Bulgaria    Cyprus dref   Czech Republic + + + Denmark + + + Estonia na na na Finland + + + France    Germany for, broad age, [+ or ] age, i for, i [+ or ] agesex, i Greece   i Hungary for for + Ireland p, [+ or ] ctz, p, +ctz, p, [+ or ] broad age, broad age, ctz, broad dref dref age, dref Italy dref   Latvia age + + Lithuania  ctz ctz Luxembourg   tot, nat Malta    Netherlands + + + Poland dref   Portugal p, for, age p, for  Romania dref  + Slovakia   for Slovenia + + + Spain +  p Sweden + + + United Kingdom  [+ or ] ctz, [+ or ] ctz, dref dref, a70 Iceland + +  Lichtenstein    Norway + + + Switzerland + + + Country 2005 2006 Austria +  Belgium   Bulgaria   Cyprus   Czech Republic + + Denmark + + Estonia na  Finland + + France   Germany i P, i Greece for, [+ or ] sex for Hungary + + Ireland p, [+ or ] ctz, p, +ctz, broad age, broad age, d ref dref Italy age age, w Latvia + + Lithuania + + Luxembourg [+ or ] ctz, [+ or ] ctz, [+ or ] age, [+ or ] age, [+ or ] sex [+ or ] sex Malta   Netherlands + + Poland ctz  Portugal   Romania + + Slovakia i i Slovenia + + Spain + + Sweden + + United Kingdom [+ or ] ctz,  broad age, dref Iceland   Lichtenstein   Norway + + Switzerland + + + data provided to Eurostat; data not provided to Eurostat; age no disaggregation by age; ctz no disaggregation by citizenship; [+ or ] age age disaggregation only for a few citizenship categories; [+ or ] agesex disaggregation by age not provided for Males and Females; [+ or ] ctz data provided for a few citizenship categories; [+ or ] sex disaggregation by sex provided for a few citizenship categories only; a70 age provided only until 70 years, with the openended group 70+; broad age data disaggregated by broad age groups; dref reference date different than 1st January; for data provided for foreigners only; i data inconsistency problems; na data not available, nat data provided for nationals; p provisional data; x problems detected in the data sent by the NSI, tot data provided for Total. Table 2: Coefficients for the KarupKing interpolation formula. First group, [N.sub.0] [N.sub.0] [N.sub.1] [N.sub.2] First fifth +0.344 0.208 +0.064 Second fifth +0.248 0.056 +0.008 Third fifth +0.176 +0.048 0.024 Fourth fifth +0.128 +0.104 0.032 Last fifth +0.104 +0.112 0.016 Middle groups, [N.sub.i] [N.sub.i1] [N.sub.i] [N.sub.i+1] First fifth +0.064 +0.152 0.016 Second fifth +0.008 +0.224 0.032 Third fifth 0.024 +0.248 0.024 Fourth fifth 0.032 +0.224 +0.008 Last fifth 0.016 +0.152 +0.064 Last group, [N.sub.K] [N.sub.K2] [N.sub.K1] [N.sub.K] First fifth 0.016 +0.112 +0.104 Second fifth 0.032 +0.104 +0.128 Third fifth 0.024 +0.048 +0.176 Fourth fifth +0.008 0.056 +0.248 Last fifth +0.064 0.208 +0.344 Source: [11], Table C1, p. 569. Table 3: Estimated population by broad group of citizenship in 18 EU countries, as of 1st January 2006. Country Total Nationals EU27 NonEU27 foreigners foreigners Bulgaria 7 718 750 7 693 214 3 855 21 681 Cyprus 766 414 678 114 52 217 36 084 Estonia 1 344 684 1 082 605 3 961 258 118 France 61 166 822 58 208 155 1 148 691 1 809 976 Germany 82 437 995 75 148 846 2 448 113 4 841 036 Greece 11 125 179 10 165 903 180 282 778 994 Ireland 4 209 019 3 779 755 295 165 134 099 Italy 58 751 711 56 081 197 538 853 2 131 661 Latvia 2 294 590 1 837 832 5 527 451 231 Lithuania 3 403 284 3 370 422 1 962 30 900 Luxembourg 469 086 280 938 171 876 16 273 Malta 404 962 392 850 7 022 5 090 Poland 38 157 055 38 115 920 18 660 22 476 Portugal 10 569 592 10 293 686 80 039 195 867 Romania 21 610 213 21 584 220 6 058 19 935 Slovakia 5 389 180 5 368 255 12 289 8 636 Spain 43 758 250 39 755 741 1 326 128 2 676 381 United Kingdom 60 393 100 56 990 704 1 365 190 2 036 807 Source: Own calculations based on the Eurostat and NSI data. 

Reader Opinion