Your assignment is to prepare and submit a paper on the classification of a statistical ward. The objective is to choose the smallest number of variables that adequately represents the major dimensions in the data. The variables used are demographic and socio-economic and cover six dimensions: household composition, demographic structure, socio-economic characteristics, housing, industry sector, and employment. The variables are selected in a procedure with several steps. The first step considers variables from the Key Statistics tables. The second step merges variables to create composite variables. The third step examines the correlation matrix and removes strongly correlated variables; this is necessary so that no single aspect of the census data has undue influence on the result. The last step excludes variables previously identified as badly behaved or as having a high proportion of zeros. The Advisory Board was consulted and proposed conducting a principal component analysis to aid variable selection. The objective of variable selection was to choose the smallest possible number of variables that adequately represents the main dimensions of the 2001 Census data.
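The third and last steps of the procedure above can be illustrated with a minimal sketch. It is not the method actually used for the classification: the variable names, the sample values, and both thresholds (0.9 for correlation, 0.5 for the share of zeros) are assumptions chosen only for illustration, and the rule of keeping the first variable of each correlated pair is likewise a simplification.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant variable has no defined correlation
    return cov / sqrt(var_x * var_y)


def drop_correlated(data, threshold=0.9):
    """Keep a variable only if it is not strongly correlated
    with any variable that has already been kept."""
    kept = []
    for name, values in data.items():
        if all(abs(pearson(values, data[k])) <= threshold for k in kept):
            kept.append(name)
    return kept


def drop_sparse(data, names, max_zero_share=0.5):
    """Exclude variables whose proportion of zero values is too high."""
    result = []
    for name in names:
        values = data[name]
        zero_share = sum(1 for v in values if v == 0) / len(values)
        if zero_share <= max_zero_share:
            result.append(name)
    return result


# Hypothetical percentages for five output areas (illustrative only).
data = {
    "pct_age_0_4": [5.0, 6.0, 7.0, 8.0, 9.0],
    "pct_age_5_14": [10.0, 12.1, 13.9, 16.0, 18.2],  # tracks pct_age_0_4
    "pct_unemployed": [3.0, 9.0, 2.0, 7.0, 4.0],
    "pct_rare_group": [0.0, 0.0, 0.0, 1.0, 0.0],     # mostly zeros
}

after_correlation = drop_correlated(data)
selected = drop_sparse(data, after_correlation)
print(selected)
```

In this toy data set the second age variable is removed because it moves almost in step with the first, and the mostly-zero variable is removed in the final step. In practice the thresholds would need to be justified against the actual census variables.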
Five main domains were identified, with the intention that together they would fully represent the census data within the classification. The domains identified include demographic makeup, household composition, socio-economic characteristics and employment. The preliminary data set consisted of the output-area-level Key Statistics table variables, which represent the most important variables in the published census data. Following a detailed assessment of each variable, this initial set was reduced so that it represented the main dimensions of the census data with a minimum number of variables. The process eliminated any variable that added nothing to the classification, and in some cases a composite variable was used to reduce the number of variables. Variables representing very small sectors of the population were removed. Migration indicators were omitted because the data were not available.