Learning Goal: I’m working on a statistics practice test / quiz and need an explanation and answer to help me learn.
JEANS |
STOREID |
FASHION |
LEISURE |
STRETCH |
ORIGINAL |
Examine the distribution of the variables.
- Are there any unusual data values?
- Are there missing values that should be replaced?
- Assign the variable STOREID the model role ID and the variable SALESTOT the model role Rejected. Make sure that the remaining variables have the Input model role and the Interval measurement level. Why should the variable SALESTOT be rejected?
- Add an Input Data Source node to the diagram workspace and select the DUNGAREE data table as the data source.
- Add a Cluster node to the diagram workspace and connect it to the Input Data node.
- Select the Cluster node. Leave the default setting as Internal Standardization ð Standardization. What would happen if inputs were not standardized?
- Run the diagram from the Cluster node and examine the results.
Does the number of clusters created seem reasonable? - Specify a maximum of six clusters and rerun the Cluster node. How does the number and quality of clusters compare to that obtained in part h?
- Use the Segment Profile node to summarize the nature of the clusters.Discuss.