CLUSTER ANALYSIS 2014 Edition - Statistical Associates

Transcription

CLUSTER ANALYSISCopyright @c 2014 by G. David Garson and Statistical Associates Publishing2014 EditionPage 1

CLUSTER ANALYSIS2014 Edition@c 2014 by G. David Garson and Statistical Associates Publishing. All rights reservedworldwide in all media. No permission is granted to any user to copy or post this work inany format or any media.ISBN: ISBN: 978-1-62638-030-1The author and publisher of this eBook and accompanying materials make norepresentation or warranties with respect to the accuracy, applicability, fitness, orcompleteness of the contents of this eBook or accompanying materials. The author andpublisher disclaim any warranties (express or implied), merchantability, or fitness for anyparticular purpose. The author and publisher shall in no event be held liable to any party forany direct, indirect, punitive, special, incidental or other consequential damages arisingdirectly or indirectly from any use of this material, which is provided “as is”, and withoutwarranties. Further, the author and publisher do not warrant the performance,effectiveness or applicability of any sites listed or linked to in this eBook or accompanyingmaterials. All links are for information purposes only and are not warranted for content,accuracy or any other implied or explicit purpose. This eBook and accompanying materials is copyrighted by G. David Garson and Statistical Associates Publishing. No part of this maybe copied, or changed in any format, sold, or used in any way under any circumstancesother than reading by the downloading individual.Contact:G. David Garson, PresidentStatistical Publishing Associates274 Glenn DriveAsheboro, NC 27205 USAEmail: gdavidgarson@gmail.comWeb: www.statisticalassociates.comCopyright @c 2014 by G. David Garson and Statistical Associates PublishingPage 2

CLUSTER ANALYSIS2014 EditionTable of ContentsOverview . 10Data examples in this volume . 10Key Concepts and Terms. 12Terminology . 12Distances (proximities) . 12Cluster formation. 12Cluster validity . 12Types of cluster analysis. 14Types of cluster analysis by software package . 14Disjoint clustering . 15Hierarchical clustering . 15Overlapping clustering. 16Fuzzy clustering . 16Hierarchical cluster analysis in SPSS . 16SPSS Input for hierarchical clustering . 16Example . 16The main “Hierarchical Cluster Analysis” dialog . 17Statistics button . 18Plots button . 19Methods button. 20SPSS output for hierarchical cluster analysis . 21Proximity table. 21Cluster membership table . 22Agglomeration Schedule . 22Dendogram . 24Icicle plots . Error! Bookmark not defined.Summary measures . Error! Bookmark not defined.Hierarchical cluster analysis in SAS . Error! Bookmark not defined.SAS input for hierarchical cluster analysis\ . Error! Bookmark not defined.Example . Error! Bookmark not defined.Data setup. Error! Bookmark not defined.SAS syntax . Error! Bookmark not defined.SAS output for hierarchical cluster analysis . Error! Bookmark not defined.Simple statistics table . Error! Bookmark not defined.Eigenvalues of the covariance matrix table . Error! Bookmark not defined.Root mean square coefficients . Error! Bookmark not defined.Copyright @c 2014 by G. David Garson and Statistical Associates PublishingPage 3

CLUSTER ANALYSIS2014 EditionCluster history table . Error! Bookmark not defined.Dendogram . Error! Bookmark not defined.Icicle Plots . Error! Bookmark not defined.Cluster membership table . Error! Bookmark not defined.Saving data to file . Error! Bookmark not defined.Hierarchical cluster analysis in Stata . Error! Bookmark not defined.Stata input for hierarchical cluster analysis . Error! Bookmark not defined.Stata output for hierarchical cluster analysis . Error! Bookmark not defined.Agglomeration coefficients . Error! Bookmark not defined.Dendogram . Error! Bookmark not defined.Saving cluster membership values . Error! Bookmark not defined.Cluster membership table . Error! Bookmark not defined.K-means cluster analysis . Error! Bookmark not defined.Overview . Error! Bookmark not defined.Example . Error! Bookmark not defined.K-means cluster analysis in SPSS. Error! Bookmark not defined.SPSS input . Error! Bookmark not defined.Main K-means dialog . Error! Bookmark not defined.The Iterate button . Error! Bookmark not defined.The Save button . Error! Bookmark not defined.The Options button . Error! Bookmark not defined.SPSS Output for K-Means cluster analysis . Error! Bookmark not defined.The Anova table . Error! Bookmark not defined.Number of cases in each cluster. Error! Bookmark not defined.Getting different clusters . Error! Bookmark not defined.Cluster membership table . Error! Bookmark not defined.K-Means cluster analysis in SAS . Error! Bookmark not defined.Overview . Error! Bookmark not defined.Example . Error! Bookmark not defined.SAS input for k-means cluster analysis. Error! Bookmark not defined.SAS output for k-means cluster analysis . Error! Bookmark not defined.The “Statistics for Variables” table . Error! Bookmark not defined.Criteria for determining k . Error! Bookmark not defined.The “Cluster Summary” table . Error! Bookmark not defined.Cluster membership and distance values. Error! Bookmark not defined.Crosstabulation tables . Error! Bookmark not defined.Cluster separation plots. Error! Bookmark not defined.K-Means cluster analysis in Stata. Error! Bookmark not defined.Copyright @c 2014 by G. David Garson and Statistical Associates PublishingPage 4

CLUSTER ANALYSIS2014 EditionExample . Error! Bookmark not defined.Stata input for k-means cluster analysis . Error! Bookmark not defined.The main kmeans clustering command. Error! Bookmark not defined.Obtaining descriptive statistics. Error! Bookmark not defined.Obtaining distance information . Error! Bookmark not defined.Obtaining cluster separation plots . Error! Bookmark not defined.Comparing kmeans and kmedian solutions . Error! Bookmark not defined.Stata output for k-means cluster analysis. Error! Bookmark not defined.Cluster membership assignments . Error! Bookmark not defined.Descriptive statistics . Error! Bookmark not defined.Distance coefficients. Error! Bookmark not defined.Cluster separation plots. Error! Bookmark not defined.Comparing kmeans and kmedians solutions . Error! Bookmark not defined.Two-step cluster analysis in SPSS . Error! Bookmark not defined.Overview . Error! Bookmark not defined.Cluster feature tree (CF tree) . Error! Bookmark not defined.Proximity . Error! Bookmark not defined.Example . Error! Bookmark not defined.SPSS input for two-step clustering . Error! Bookmark not defined.The main two-step clustering dialog . Error! Bookmark not defined.Options button dialog. Error! Bookmark not defined.Output button dialog . Error! Bookmark not defined.SPSS output for two-step clustering . Error! Bookmark not defined.Autoclustering table . Error! Bookmark not defined.Cluster distribution table . Error! Bookmark not defined.Centroids (cluster profiles) table . Error! Bookmark not defined.Model summary. Error! Bookmark not defined.The “Cluster Quality” graph. Error! Bookmark not defined.The “Cluster Sizes” pie chart . Error! Bookmark not defined.The “Predictor Importance” chart . Error! Bookmark not defined.The “Clusters” table . Error! Bookmark not defined.The “Cell Distribution” chart . Error! Bookmark not defined.The “Cluster Comparison” chart. Error! Bookmark not defined.Nearest neighbor analysis in SPSS . Error! Bookmark not defined.Overview . Error! Bookmark not defined.Target variables . Error! Bookmark not defined.Selecting k . Error! Bookmark not defined.Feature variables . Error! Bookmark not defined.Copyright @c 2014 by G. David Garson and Statistical Associates PublishingPage 5

CLUSTER ANALYSIS2014 EditionFocal cases . Error! Bookmark not defined.Case labels . Error! Bookmark not defined.Partitions and cross-validation . Error! Bookmark not defined.Example . Error! Bookmark not defined.SPSS input . Error! Bookmark not defined.The user interface . Error! Bookmark not defined.The “Variables” tab. Error! Bookmark not defined.The “Neighbors” tab . Error! Bookmark not defined.The “Features” tab. Error! Bookmark not defined.The “Partitions” tab . Error! Bookmark not defined.The “Save” tab . Error! Bookmark not defined.The “Output” tab . Error! Bookmark not defined.The “Options” tab . Error! Bookmark not defined.SPSS output . Error! Bookmark not defined.Overview . Error! Bookmark not defined.The “Case Processing Summary “ table . Error! Bookmark not defined.The “Predictor Space” plot . Error! Bookmark not defined.The “Peers Chart” . Error! Bookmark not defined.The “k Nearest Neighbors and Distances” table . Error! Bookmark not defined.“k and Predictor Selection” plots . Error! Bookmark not defined.“Quadrant Map” maps . Error! Bookmark not defined.The “Error Summary” table . Error! Bookmark not defined.SAS PROC ACECLUS: Pre-processing for elliptical clusters . Error! Bookmark not defined.Overview . Error! Bookmark not defined.Example . Error! Bookmark not defined.SAS input . Error! Bookmark not defined.Overview . Error! Bookmark not defined.Set-up. Error! Bookmark not defined.Plot of original data . Error! Bookmark not defined.Using PROC ACECLUS to transform the data . Error! Bookmark not defined.Plot of transformed data . Error! Bookmark not defined.K-means clustering of transformed data . Error! Bookmark not defined.K-means clustering of original data . Error! Bookmark not defined.SAS output . Error! Bookmark not defined.Plot of untransformed data . Error! Bookmark not defined.Data transformation with PROC ACECLUS . Error! Bookmark not defined.Plot of transformed data . Error! Bookmark not defined.Copyright @c 2014 by G. David Garson and Statistical Associates PublishingPage 6

CLUSTER ANALYSIS2014 EditionK-means (PROC FASTCLUS) results with original vs. transformed data Error! Bookmark notdefined.SAS PROC VARCLUS : Oblique principal components cluster analysis . Error! Bookmark notdefined.Overview . Error! Bookmark not defined.The PROC VARCLUS default method . Error! Bookmark not defined.PROC VARCLUS variations . Error! Bookmark n

Jun 08, 2014 · Hierarchical cluster analysis in Stata . Error! Bookmark not defined. Stata input for hierarchical cluster analysis . Error! Bookmark not defined. Stata output for hierarchical clu