Printer Friendly
The Free Library
5,675,427 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Identifying hierarchical choice structures: a comparison of methods.


ABSTRACT

Consumers make product choice decisions based on a personal evaluation of the product's attributes. The hierarchical A structure made up of different levels like a company organization chart. The higher levels have control or precedence over the lower levels. Hierarchical structures are a one-to-many relationship; each item having one or more items below it.  theory posits that a choice is first made on the most important attribute dimension, followed by the next most important, etc, until a unique product is chosen. This paper compares three methods for identifying the hierarchical structure See hierarchical.  based on product switching matrices. To use multidimensional scaling Multidimensional scaling (MDS) is a set of related statistical techniques often used in data visualisation for exploring similarities or dissimilarities in data. MDS is a special case of ordination.  and hierarchical cluster analysis Cluster analysis

A statistical technique that identifies clusters of stocks whose returns are highly correlated within each cluster and relatively uncorrelated across clusters. Cluster analysis has identified groupings such as growth, cyclical, stable, and energy stocks.
 the paper develops a dissimilarity measure from the switching matrix. The third method uses entropy entropy (ĕn`trəpē), quantity specifying the amount of disorder or randomness in a system bearing energy or information. Originally defined in thermodynamics in terms of heat and temperature, entropy indicates the degree to which a given  to unfold unfold - inline  the hierarchy directly from the switching matrix. A numerical numerical

expressed in numbers, i.e. Arabic numerals of 0 to 9 inclusive.


numerical nomenclature
a numerical code is used to indicate the words, or other alphabetical signals, intended.
 example compares the relative effectiveness of the three methods.

Keywords: Market Research, Market Partitioning To divide a resource or application into smaller pieces. See partition, application partitioning and PDQ. , Entropy, Cluster Analysis, Multidimensional Scaling

1. INTRODUCTION

The problem of market partitioning is to find a way to group products into subgroups that directly compete with each other. Products can be considered to compete when potential purchasers perceive them as satisfying the same requirements. An example of the market for table spreads might include all products that are spread on bread, including cheeses, butter, margarine margarine, manufactured substitute for butter. It consists of a blend of vegetable oils or meat fats (or a combination of both) mixed with milk and salt. It was developed in the late 1860s by the French chemist Hippolyte Mège-Mouries in a contest sponsored by  and margarine substitutes. Each of these product classifications may be seen as a partition A reserved part of disk or memory that is set aside for some purpose. On a PC, new hard disks must be partitioned before they can be formatted for the operating system, and the Fdisk utility is used for this task.  of the market for table spreads. Products within a partition, such as margarine, would be expected to compete more closely with other margarine products than with products in other partitions, such as cheeses. Within the margarine subset A group of commands or functions that do not include all the capabilities of the original specification. Software or hardware components designed for the subset will also work with the original. , products can be further partitioned par·ti·tion  
n.
1.
a. The act or process of dividing something into parts.

b. The state of being so divided.

2.
a.
 into competing subsets by other attribute dimensions such as brand, form, saltiness salt·y  
adj. salt·i·er, salt·i·est
1. Of, containing, or seasoned with salt.

2. Suggestive of the sea or sailing life.

3. Witty; pungent; earthy: salty humor.
, consistency, oil type, etc.

Product design, advertising, promotion and pricing are examples of strategic decisions that are based on how a market is partitioned. In the margarine market, the total calories in a serving might determine the competing subset for some potential purchasers. Reducing the number of calories from the low 90's to the high 70's might allow the product to be considered by a large number of new consumers looking for Looking for

In the context of general equities, this describing a buy interest in which a dealer is asked to offer stock, often involving a capital commitment. Antithesis of in touch with.
 a diet alternative. Similarly, if a firm knows which product variants are most closely competing with its own variant variant /var·i·ant/ (var´e-ant)
1. something that differs in some characteristic from the class to which it belongs.

2. exhibiting such variation.


var·i·ant
adj.
, it will be able to more effectively choose advertising themes and particular product attributes to emphasize in advertisements.

Product attributes may be subjective or objective. Brand name itself is an important attribute dimension that has a variety of connotations to customers or potential customers. General Motors sells Chevrolet, Pontiac and Buick models that differ little except by brand, yet potential purchasers do not view them as closely competing products. The subjective associations that go with brand name may well result in their being perceived to be in different market partitions. Products only belong in a competitive subset if potential customers see them as competitive. It is therefore insufficient to assemble a list of easily measurable attributes for a set of products and then group product variants on the basis of attribute commonality com·mon·al·i·ty  
n. pl. com·mon·al·i·ties
1.
a. The possession, along with another or others, of a certain attribute or set of attributes: a political movement's commonality of purpose.
. Because consumers consider both objective and subjective attributes, it is not sufficient to group products by commonality of attribute dimensions. Product utility for any given attribute for one consumer may differ from the utility derived by that attribute for a different consumer. Furthermore, the same consumer may value the attributes of a given product variant differently on another purchase occasion.

The purpose of market partitioning is to define market in terms of a hierarchical structure of attributes. The starting point Noun 1. starting point - earliest limiting point
terminus a quo

commencement, get-go, offset, outset, showtime, starting time, beginning, start, kickoff, first - the time at which something is supposed to begin; "they got an early start"; "she knew from the
 is to define all product alternatives as a vector of attributes, using all relevant objective and subjective attribute dimensions. For example, margarines could be defined by the attributes: brand, consistency, type of oil, form, size, and degree of "dietness." One product might be the vector (Mazola, solid, corn oil corn oil
n.
A pale yellow liquid obtained from the embryos of corn grains, used especially as a cooking and salad oil and in the manufacture of margarines.

Noun 1.
, bar, 16 oz., non-diet.)

Attribute dimensions exhibiting decreasing degrees of customer loyalty determine the attribute hierarchy. The attribute dimension exhibiting the greatest degree of consumer loyalty occupies the top level of the hierarchy. Product variants in top-level partitions are most closely competing, in the sense that there is strong resistance to switching from one partition to another. For the margarine example, if oil type were at the top of the hierarchy, the Mazola product described above might not have to compete on the basis of oil type. Consumers would more readily substitute different brands, sizes, or even "dietness" than they would different oil types. The second level in the hierarchy is the attribute dimension that exhibits the next highest loyalty within the primary partitions The first division of a hard disk drive. The primary partition is often the only one on the disk, and it occupies the entire disk volume. If there are multiple partitions, the primary partition is the one that holds the operating system and has to be made "active" in order to do so. See partition. . This analysis continues for lower level partitions.

To illustrate the attribute hierarchical model In a hierarchical data model, data are organized into a tree-like structure. The structure allows repeating information using parent/child relationships: each parent can have many children but each child only has one parent.  of a market, consider a market with five product variants defined on three attribute dimensions: brand, color and size. Assume that there are three brands (a, b, c), two colors (x, y) and four sizes (r, s, t, u). Further assume that the five product variants are as follows:

An attribute hierarchy with the order: size, color, and brand partitions the market into four competing subsets by size. Consumers in that market switch relatively little among sizes. Because of the particular colors available within sizes, there is no opportunity for consumers to switch among different colors unless they are prepared to change to a different package size. At the lowest level in the hierarchy, consumers have the least resistance to switching. If one brand is not available, another will do almost as well or as well. The attribute hierarchy may be shown as a graph.

2. SOURCES OF DATA

In order to empirically identify how a set of product variants is partitioned, data reflecting the relationship among the variants is needed. We have already argued that objective similarity Similarity is some degree of symmetry in either analogy and resemblance between two or more concepts or objects. The notion of similarity rests either on exact or approximate repetitions of patterns in the compared items.  may not correlate well with perceived substitutability. A common way of trying to measure perceived relationships among attributes is to ask a structured set of questions of a sample of consumers. Aside from the high cost of obtaining such data for a large sample of individuals, one weakness of this approach is that respondents In the context of marketing research, a representative sample drawn from a larger population of people from whom information is collected and used to develop or confirm marketing strategy.  are placed in a hypothetical Hypothetical is an adjective, meaning of or pertaining to a hypothesis. See:
  • Hypothesis
  • Hypothetical
  • Hypothetical (album)
 situation and reported relationship data may well be at odds with relationships revealed by actual purchase behavior. The extent to which attributes are substitutable is more reliably measured by recording the attributes actually chosen to meet a particular need. For example, if a person is observed to eat at a coffee shop, then a fast-food restaurant, followed by a steak house steak house or steak·house
n.
A restaurant that specializes in beefsteak dishes.
, it would be fallacious to conclude that the three types of restaurant are closely competitive or close substitutes. Ideally then, context should be controlled for when collecting data on substitutability patterns. The methodologies compared in this paper are relevant for frequently purchased consumer products. They are not suitable for expensive, durable goods durable goods

Goods, such as appliances and automobiles, that have a useful life over a number of periods. Firms that produce durable goods are often subject to wide fluctuations in sales and profits. Also called consumer durables.
 since the purchaser's utility function probably changes from one purchase occasion to the next. Diary panel data, though imperfect imperfect: see tense. , are readily obtainable and usually record use context.

2.1 Sample Data

Continuing with the example in the section 1, assume that a diary panel recording 100 sequential purchases of a product produced the switching matrix in Table 2. For example, for 17 transaction pairs beginning with the purchase of variant A, 10 transactions were repeat purchases of A, while 1 shows a switch to B, 1 to D and 5 to E.

3. DEVELOPING A DISSIMILARITY MEASURE

The first two methods used to identify the hierarchical structure of a market are multidimensional scaling and hierarchical clustering. Both methods begin with a matrix of "distances" between product pairs. A similarity index is first calculated for each pair of products by finding the ratio of the proportion of consumers switching from i to j to the expected proportion switching from i to j. The expected proportion switching from i to j assumes independence of choice on two successive purchase occasions.

[MATHEMATICAL EXPRESSION A group of characters or symbols representing a quantity or an operation. See arithmetic expression.  NOT REPRODUCIBLE re·pro·duce  
v. re·pro·duced, re·pro·duc·ing, re·pro·duc·es

v.tr.
1. To produce a counterpart, image, or copy of.

2. Biology To generate (offspring) by sexual or asexual means.
 IN ASCII ASCII or American Standard Code for Information Interchange, a set of codes used to represent letters, numbers, a few symbols, and control characters. Originally designed for teletype operations, it has found wide application in computers. ]

where: [S.sub.i,j] = the similarity index,

[n.sub.i,j] = the number of switches from product i to product j,

[n.sub.i] = the number of purchases of item i on the first choice occasion,

[n.sub.j] = the number of purchases of item j on the second choice occasion,

n = the total number of purchases.

The distance measure between products i and j is given by:

[D.sub.i,j] = max([S.sub.i,j]) - [S.sub.i,j]

Because the distance measures may not be symmetrical symmetrical

equally on both sides.


symmetrical multifocal encephalopathy
inherited disease in two forms: Limousin form appears at about a month old with blindness, forelimb hypermetria, hyperesthesia, nystagmus, aggression, weight
, i.e., [X.sub.1,2] [not equal to] [X.sub.2,1], an average distance measure is found by the following:

[D'.sub.i,j] = [D.sub.i,j] + [D.sub.j,i]/2

This gives us the following dissimilarity matrix for our example:
TABLE 3: DISSIMILARITY MATRIX

        A         B         C         D         E

A     1.526     4.102     4.467     4.363     3.607
B     4.102     0.000     4.458     4.184     4.385
C     4.467     4.458     1.681     3.160     4.206
D     4.363     4.184     3.160     1.808     4.541
E     3.607     4.385     4.206     4.541     2.109


These dissimilarities are ratio scaled, with larger numbers reflecting greater dissimilarities between products based on switching behavior.

4. ATTRIBUTE HIERARCHY METHODS

Methods for identifying an attribute hierarchy may be classified as either empirical or based on a model of market behavior. In the latter case, exemplified by the Hendry system, the market is assumed to be in equilibrium and expected switching frequencies computed for a candidate hierarchical structure. If the empirical data are consistent with a particular proposed hierarchy, the problem has been solved. If not, another structure is proposed until the structure of best fit is obtained.

The present study compares three empirical approaches to uncovering the attribute hierarchy. Multidimensional scaling and hierarchical cluster analysis are two traditional multivariate The use of multiple variables in a forecasting model.  statistical methods. The third method is based on the information theoretic concept of entropy.

4.1 Multidimensional Scaling

Multidimensional scaling creates a perceptual per·cep·tu·al
adj.
Of, based on, or involving perception.
 map, which is a spatial description of a respondent's perception about a product, service or other object of interest. With multidimensional scaling, items that are perceived to be similar will fall close together in multidimensional mul·ti·di·men·sion·al  
adj.
Of, relating to, or having several dimensions.



multi·di·men
 space and items that are perceived to be dissimilar will be further apart. In the present context, the purpose is to identify two or more dimensions that show the positioning of each product in relation to the others. Based on how the products are positioned on each of the derived dimensions, we may be able to give meaningful labels to the dimensions. For example, if the products were to appear as two distinct groups {C, D} and {A, B, E} along the horizontal axis, we might deduce de·duce  
tr.v. de·duced, de·duc·ing, de·duc·es
1. To reach (a conclusion) by reasoning.

2. To infer from a general principle; reason deductively:
 that the product set was partitioned by the color attribute, knowing that the first group has color x and the second color y. The question is whether, given switching proximities between all product pairs, we can uncover the underlying hierarchical structure of attribute dimensions for the set of products.

In this study, the ALSCAL multidimensional scaling module in the SPSS A statistical package from SPSS, Inc., Chicago (www.spss.com) that runs on PCs, most mainframes and minis and is used extensively in marketing research. It provides over 50 statistical processes, including regression analysis, correlation and analysis of variance.  program was applied to the proximity matrix. The algorithm algorithm (ăl`gərĭth'əm) or algorism (–rĭz'əm) [for Al-Khowarizmi], a clearly defined procedure for obtaining the solution to a general type of problem, often numerical.  seeks the spatial pattern that best reproduces the original order of the data. Referring to the proximity matrix, table 3, products C and D have the smallest dissimilarity score and should appear closest together in multidimensional space. By contrast, D and E are most dissimilar and should be farthest apart.

The objective is to use the fewest dimensions to provide a good fit between the original data and the distances in the derived multidimensional space. Kruskars stress index is computed to indicate the goodness of fit Goodness of fit means how well a statistical model fits a set of observations. Measures of goodness of fit typically summarize the discrepancy between observed values and the values expected under the model in question. Such measures can be used in statistical hypothesis testing, e. , with 0 indicating a perfect fit and 1 indicating the worst fit. In effect, the badness-of-fit is measured as the square root of the differences between the optimally transformed data and the squared distances, as a proportion of the total sum of squares. Similarly, R-SQ measures the proportion of variance of the scaled data (disparities) in the partition, which is accounted for by their corresponding distances.

Analysis of the proximity matrix in the ALSCAL multidimensional scaling routine of SPSS produced a Kruskal's stress index of 0.1971 and an R-SQ of 0.9380 for the matrix. The relatively high stress index indicates that the derived distances in 2-dimensional space provide a poor fit to the original proximities. Given the proximity matrix with ten distances, this is the maximal max·i·mal
adj.
1. Of, relating to, or consisting of a maximum.

2. Being the greatest or highest possible.
 dimensionality obtainable by ALSCAL. The resulting configuration is shown in figure 1.

The grouping on dimension 1 in the plot in figure 2 indicates that products C and D are similar to one another and dissimilar from products A, E and B. Referring to the product descriptions in table 1, color is the only attribute products C and D have in common. Products A, E and B all have a different color. This suggests that dimension 1 might be interpreted as a color dimension. The grouping on dimension 2 is less clear. Projecting the points onto the vertical axis reveals three groups: {C, E}, {A, D} and on its own, {B}. Again referring to table 1, neither the first nor second subgroup sub·group  
n.
1. A distinct group within a group; a subdivision of a group.

2. A subordinate group.

3. Mathematics A group that is a subset of a group.

tr.v.
 has any attribute in common. Thus, it is not possible to give a meaningful interpretation to dimension 2. It is therefore evident that at best, the color attribute is the only one to be associated with the multidimensional configuration.

[FIGURE 2 OMITTED]

4.2 Hierarchical Cluster Analysis

Hierarchical cluster analysis is a traditional statistical method used for grouping objects into clusters in such a way that a particular object within one cluster is more similar to objects within that cluster than to objects in other clusters. In this way we can see which products are most similar to other products. Objects are grouped based on their similarities on predetermined pre·de·ter·mine  
v. pre·de·ter·mined, pre·de·ter·min·ing, pre·de·ter·mines

v.tr.
1. To determine, decide, or establish in advance:
 attributes.

The agglomerative ag·glom·er·ate  
tr. & intr.v. ag·glom·er·at·ed, ag·glom·er·at·ing, ag·glom·er·ates
To form or collect into a rounded mass.

adj.
Gathered into a rounded mass.

n.
1.
 hierarchical cluster method starts with a separate cluster for each product and combines the closest products into a cluster. Then the next closest product is either added to the first cluster or starts a separate cluster. At each successive step, a product is added to an existing cluster or forms another separate cluster. Clusters can be combined but can never be split apart.

When a cluster consists of more than one point, a measure of closeness between clusters must be chosen. The nearest neighbor See point sampling.  or single linkage linkage

In mechanical engineering, a system of solid, usually metallic, links (bars) connected to two or more other links by pin joints (hinges), sliding joints, or ball-and-socket joints to form a closed chain or a series of closed chains.
 method is one common method. The closest point in one cluster is measured to the closest point in a second cluster to get the Euclidean distance In mathematics, the Euclidean distance or Euclidean metric is the "ordinary" distance between two points that one would measure with a ruler, which can be proven by repeated application of the Pythagorean theorem. . This is shown in figure 3, below. This method only uses one point in each cluster to define the distance between clusters. The average linkage between clusters uses information about all pairs of distances. The distance between each point in one cluster to each point in the second cluster is calculated and averaged to get the between cluster distance. This method is preferred over the single point method and is the one used for this analysis.

[FIGURE 3 OMITTED]

In our problem, we wish to find a product attribute hierarchy based on actual customer behavior instead of on a predetermined list of product attributes. In addition, attributes can be non-metric such as color, taste, smell, etc. In order to accommodate these differences, we base our proximity matrix on product switching data from consumer panel data (see table 2.) These data show the frequency of transitions from one product to another (including the same product) made by consumers. As discussed in the previous section, the first variant starts as an individual cluster. Each new variant is then added to an existing cluster or forms a new cluster. As each new variant is added, clusters are created and grow in size. As necessary, clusters can be combined to form one larger cluster. The process continues until there is one large cluster. The results are shown in an agglomeration ag·glom·er·a·tion  
n.
1. The act or process of gathering into a mass.

2. A confused or jumbled mass:
 schedule and on a dendrogram A dendrogram is a tree diagram frequently used to illustrate the arrangement of the clusters produced by a clustering algorithm (see cluster analysis). Dendrograms are often used in computational biology to illustrate the clustering of genes. . Using the above proximity matrix in the SPSS hierarchical cluster analysis module produced the following results:
FIGURE 4: AGGLOMERATION SCHEDULE

Agglomeration Schedule

              Cluster Combined

Stage     Cluster 1     Cluster 2     Coefficients

1             3             4            3.160
2             1             5            3.607
3             1             2            4.244
4             1             3            4.370

            Stage Cluster Firs
                Appears

Stage     Cluster 1     Cluster 2      Next Stage

1             0             0              4
2             0             0              3
3             2             0              4
4             3             1              0


The agglomeration schedule depicted de·pict  
tr.v. de·pict·ed, de·pict·ing, de·picts
1. To represent in a picture or sculpture.

2. To represent in words; describe. See Synonyms at represent.
 in figure 4 shows how the clusters were formed and the values of the average distance between clusters. When the coefficient coefficient /co·ef·fi·cient/ (ko?ah-fish´int)
1. an expression of the change or effect produced by variation in certain factors, or of the ratio between two different quantities.

2.
 is small, fairly similar clusters are being merged. Large coefficients indicate that dissimilar clusters are being merged. In our example, merging points 3 and 4 resulted in a small coefficient so the new cluster is fairly homogeneous The same. Contrast with heterogeneous.

homogeneous - (Or "homogenous") Of uniform nature, similar in kind.

1. In the context of distributed systems, middleware makes heterogeneous systems appear as a homogeneous entity. For example see: interoperable network.
. The result is similar for points 1 and 5. However, when point 2 is merged with the cluster containing points 1 and 5, the coefficient rises, indicating that the newly formed cluster is less homogeneous. This would be a reasonable point to stop merging points and clusters.

The dendrogram in figure 5 is a pictorial representation of the agglomeration schedule. It is scaled so that the coefficients represent values from 0 to 25. As can be seen, clusters {3, 4} and {1, 5} were formed relatively early in the process. Point 2 was added near the end and the last cluster to form a combination of all clusters. A reasonable place to stop merging points or clusters would be after the first two clusters were formed leaving us with clusters made up of {3, 4} and {1, 5} with point {2} as a separate cluster. In our example points 1 through 5 represent products A through E. We are left with the three clusters {C, D}, {A, E} and {B}. Figure 6 depicts the product clusters with their associated attributes. Cluster {C, D} has no switching on brand. Also, cluster {A, E} has no switching on brand. Therefore, brand is selected as being the highest attribute. Since cluster {C, D} also has no switching on color, it is selected as the second highest attribute. Size would be the lowest attribute.

[FIGURE 5 OMITTED]

The attribute hierarchy is pictured in figure 7.

[FIGURE 7 OMITTED]

4.3 Entropy-based Partitioning

Shannon's definition of the entropy H{[p.sub.i]} of a stochastic By guesswork; by chance; using or containing random values.

stochastic - probabilistic
 system with n states, having probability [p.sub.i] is given by:

H{[p.sub.i]} = -k [n.summation summation n. the final argument of an attorney at the close of a trial in which he/she attempts to convince the judge and/or jury of the virtues of the client's case. (See: closing argument)  over (i=1)] [p.sub.i] log [p.sub.i]

where k is an arbitrary constant (Math.) a quantity of function that is introduced into the solution of a problem, and to which any value or form may at will be given, so that the solution may be made to meet special requirements. . Choosing k = 1, the function has its maximum value of log n when the n states are equally likely. The entropy theory holds that all systems will be at the maximum entropy consistent with the constraints CONSTRAINTS - A language for solving constraints using value inference.

["CONSTRAINTS: A Language for Expressing Almost-Hierarchical Descriptions", G.J. Sussman et al, Artif Intell 14(1):1-39 (Aug 1980)].
 on the system. One of the strengths of this measure is that it can be used to study dependencies in sequences of outcomes. First, second and third order dependencies may be analyzed an·a·lyze  
tr.v. an·a·lyzed, an·a·lyz·ing, an·a·lyz·es
1. To examine methodically by separating into parts and studying their interrelations.

2. Chemistry To make a chemical analysis of.

3.
 quite readily. In the present context, a first order process is assumed, where the last item purchased depends only on the one purchased the time before last. By dividing switching frequencies in a row by the total row frequency, row-conditional relative frequencies may be computed. These will be treated as conditional probabilities conditional probability

the probability that event A occurs, given that event B has occurred. Written P(AB).
. From these conditional probabilities and market share probabilities, a measure of switching entropy can be computed.

[H.sub.s] = [summation over (i)][p.sub.i] [summation over (j)] [p.sub.j\i] log [p.sub.j\i]

where [p.sub.i] = state i's share of the market and [p.sub.j|i] = the conditional probability of choosing j next time given that i was chosen last time

The assumption underlying the attribute hierarchy is that consumers view product variants in a competitive subgroup as relatively close substitutes. A high level attribute exhibits relatively low switching, and therefore switching entropy, across states (for example, from one size to another.) Similarly, low level attributes exhibit relatively high switching entropy.

Level 1

* The first step is to compute To perform mathematical operations or general computer processing. For an explanation of "The 3 C's," or how the computer processes data, see computer.  attribute switching matrices, which are as follows:
FIGURE 8: ATTRIBUTE SWITCHING MATRICES

                    BRAND

                      TO

               A      B      C

FROM     A     39      2      4
         B      3     11      3
         C      4      1     33

                 COLOR

                  TO

               X      Y

FROM     X     46      9
         Y     13     32

                        SIZE

                         TO

               R      S      T      U

FROM     R     25      2      6      5
         S      4     11      1      1
         T      6      0      9      2
         U      6      1      2     19


* The next step is to compute the switching entropy for each attribute dimension. The switching entropy value for Brand is 0.490, for Size it is 0.688 and for Color it is 0.744. For example, the calculations for Color are as follows:
FIGURE 9: ENTROPY CALCULATION EXAMPLE

Example: Color

                             TO

                     Color x     Color y     [P.sub.i]

FROM     Color x      0.836       0.164        0.55
         Color y      0.289       0.711        0.45

[H.sub.s] = 0.516

Adjusted [H.sub.s] = [H.sub.s]/log(2): 0.744

Notes: (1.) All logs are natural logs.

(2.) The adjustment is for the number of states


* At the third step, the attribute dimension with the lowest entropy is selected as the highest level attribute in the hierarchy. In the example, Brand is at the top level.

Level 2

* Since this is a brand primary market, only transaction pairs with common brand names are considered in computing computing - computer  brand-conditional switching matrices for Color and Size within each brand.

* A weighted average switching entropy is computed using as weights the market shares of the brands. The conditional switching entropy values for the two second-level candidates is 0.363 for Color and 0.712 for Size, thus indicating a Brand--Color--Size hierarchy.

[FIGURE 10 OMITTED]

This attribute hierarchy indicates that consumers show relatively little loyalty to size variants, so that A and E are strongly competitive, as are D and C. Consumers are less likely to switch to B from the other variants, so is less competitive and more immune to marketing efforts to attract consumers of B. Note that for this example, the entropy-based method gave the same attribute hierarchy structure as the hierarchical cluster analysis shown in figure 4.

5. SUMMARY AND CONCLUSIONS

This paper has compared three empirical methods Empirical method is generally taken to mean the collection of data on which to base a theory or derive a conclusion in science. It is part of the scientific method, but is often mistakenly assumed to be synonymous with the experimental method.  for identifying competing sub-groups of products from the analysis of product switching matrices. All methods require the pre-specification of product variants as vectors of their attributes. Hierarchical cluster analysis is normally applied to data recording attribute values on metric scales, while multidimensional scaling can proceed from either metric or non-metric similarity or dissimilarity data. For metric data, it is simple to construct a distance measure (often Euclidean) to compute a semi-matrix of distances to serve as the input to these algorithms. However, many product attributes of interest to marketers are non-metric and it is less clear how to construct distances among product alternatives. In this paper we demonstrated how to construct a distance measure between pairs of product variants from a non-symmetrical transition matrix showing frequency of purchases of one variant following purchases of each other variant.

Multidimensional scaling was able to reveal little about the attribute structure. Although the configuration plot in two dimensions suggested that color might be important in explaining the "closeness" of the products, the second dimension had no evident interpretation. With the provision of so little attribute-related information, the conclusion is that multidimensional scaling is not a viable approach to uncover the competitive attribute structure of a market.

The benefit of using cluster analysis to uncover competitive partitions is that the procedures are well recognized. There are several difficulties in using cluster analysis. First, the type of cluster analysis must be selected, along with the appropriate measure of between-cluster distances. Second, the degree of agglomeration in defining partitions is judgmental judg·men·tal  
adj.
1. Of, relating to, or dependent on judgment: a judgmental error.

2. Inclined to make judgments, especially moral or personal ones:
. Third, the hierarchy of attributes is deduced by examining the attribute composition of clusters at each level of agglomeration. Thus, different investigators could find various attribute structures based on their selection of method, judgment of when a new cluster should be combined to an existing cluster and how to assign attributes to clusters. However, for the experimental data analyzed in this study, hierarchical cluster analysis was able to identify a meaningful attribute structure.

The proximity matrix used in the entropy method is the product variant switching matrix. The main premise of the hierarchical structure is that the lower the observed switching across attribute states, the stronger the bond among variants within a partition. As each level is determined, the degree of switching within higher level partitions is evaluated for the remaining attributes. A significant advantage of the process is that it is entirely automated au·to·mate  
v. au·to·mat·ed, au·to·mat·ing, au·to·mates

v.tr.
1. To convert to automatic operation: automate a factory.

2.
. The method takes full advantage of the known attribute composition of each variant in the product set. There are no subjective decisions to make so the analysis will always be consistent.

The study has shown that two methods are capable of using the same data to identify the same competitive structure of a simple market. Before judging them equal in this quest, the two methodologies need to be applied to real data, where there are often many attributes of interest and hundreds of product variants. To be able to identify the hierarchical structure in such cases requires large-scale diary panel data that are often readily available. Such an investigation is the subject of future research.

REFERENCES

Carter, J and Silverman, F, "An Empirical Approach to Market Partitioning: Application to the Cigarette Market," Journal of Targeting, Measurement and Analysis for Marketing, 12 (2004): 366-378

Cooper, D. and Emory, C., Business Research Methods. Irwin, 1995

Gardner, M.., "Advertising Effects on Attributes Recalled and Criteria Used for Brand Evaluations," Journal of Consumer Research, 10 (1983): 310-318

Garner, W. and McGill, W., "The Relationship Between Information and Variance Analysis," Pschometrika, 21 (1956): 219-228

Green, P., Analyzing Multivariate Data, Dryden Press, 1978

Kalwani, M. and Morrison, D., "A Parsimonious par·si·mo·ni·ous  
adj.
Excessively sparing or frugal.



parsi·mo
 Description of the Hendry System," Management Science, 23 (1977): 467-477

Lehmann, D., "Judged Similarity and Brand-Switching Data as Similarity Measures," Journal of Marketing Research, 9, (1972): 331-334

Kumar, A. and Sahi, C., "Confirmatory Analysis of Aggregate Hierarchical Market Structures> Inferences from Brand Switching Behavior," Journal of Marketing Research, 26 (1989): 444-453

Rao, V. and Sabavala, D., "Inference (logic) inference - The logical process by which new facts are derived from known facts by the application of inference rules.

See also symbolic inference, type inference.
 of Hierarchical Choice Processes from Panel Data," Journal of Consumer Research, 8 (1981): 85-96

Shannon, C. E., "A Mathematical Theory of Communication The article entitled "A Mathematical Theory of Communication", published in 1948 by mathematician Claude E. Shannon, was one of the founding works of the field of information theory. ," Bell System Technical Journal 27 (1948) 379-423

John C. Carter, Pace University, Lubin School of Business The Joseph I. Lubin School of Business is the business school of Pace University. It was named after Joseph I. Lubin, an alumnus and benefactor of the school. The school was established in 1906 as the Pace School of Accountancy to prepare men and women for the CPA exam. , New York New York, state, United States
New York, Middle Atlantic state of the United States. It is bordered by Vermont, Massachusetts, Connecticut, and the Atlantic Ocean (E), New Jersey and Pennsylvania (S), Lakes Erie and Ontario and the Canadian province of
, New York, USA

Fred N. Silverman, Pace University, Lubin School of Business, New York, New York, USA
TABLE 1: PRODUCT DESCRIPTIONS

Variant     Brand     Color     Size

A             a         x        r
B             c         x        t
C             a         y        u
D             b         y        s
E             c         x        r

TABLE 2: SWITCHING MATRIX

                                       TO

                A           B           C           D           E
            (a, x, r)   (b, y, s)   (c, x, t)   (c, x, r)   (a, y, u)

A
(a, x, r)      10           1           0           1           5

B
(b, y, s)       2          11           1           2           1

C
(c, x, t)       1           0           9           5           2

D
(c, x, r)       1           1           6          13           0

E
(a, y, u)       5           1           2           1          19

TABLE 3: DISSIMILARITY MATRIX

        A         B         C         D         E

A     1.526     4.102     4.467     4.363     3.607
B     4.102     0.000     4.458     4.184     4.385
C     4.467     4.458     1.681     3.160     4.206
D     4.363     4.184     3.160     1.808     4.541
E     3.607     4.385     4.206     4.541     2.109

FIGURE 1: STIMULUS COORDINATES FROM SPSS

                               Dimension
Stimulus     Stimulus
Number       Name            1           2

1            A            -1.3730     -0.0055
2            B            -0.2092     -1.5489
3            C             1.1927      0.6716
4            D             1.3064     -0.2148
5            E            -0.9168      1.0976

FIGURE 4: AGGLOMERATION SCHEDULE

Agglomeration Schedule

                      Cluster Combined

Stage     Cluster 1     Cluster 2     Coefficients

1             3             4            3.160
2             1             5            3.607
3             1             2            4.244
4             1             3            4.370

                     Stage Cluster Firs
                          Appears

Stage     Cluster 1     Cluster 2      Next Stage

1             0             0              4
2             0             0              3
3             2             0              4
4             3             1              0

FIGURE 6: CLUSTER-ATTRIBUTE RELATIONSHIP

      Brand     Color     Size

C       c         x        t
D       c         x        r
A       a         x        r
E       a         y        u
B       b         y        s
COPYRIGHT 2005 International Academy of Business and Economics
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2005, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Author:Silverman, Fred N.
Publication:Journal of Academy of Business and Economics
Geographic Code:1USA
Date:Jan 1, 2005
Words:4644
Previous Article:Brand as a reliability reference point: a test of prospect theory in the used car market.
Next Article:Organizational practices facilitating patent commercialization.
Topics:



Related Articles
Implementing a computerized job-matching program with a hierarchical occupational structure.
Effects of navigation tool information on hypertext navigation behavior: a configural analysis of page-transition data.
Assessing complex problem-solving skills and knowledge assembly using web-based hypermedia design.
IMS DATABASES IMPROVE EFFICIENCIES OF DATA QUERIES.
Assessing learning from hypertext: an individual differences perspective.
The ubiquitous hierarchy: an army to overcome the threat of a mob.
COMPANIES RECEIVE INSIGHT INTO THEIR WEBSITE'S PERFORMANCE.
Effects of navigation tools and computer confidence on performance and attitudes in a hypermedia learning environment.
Developing a more conceptual understanding of matrices & systems of linear equations through concept mapping and vee diagrams.

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles