This document describes a professor's proposal to analyze factors that influence diamond prices. It involves:
1) Coding diamond characteristic variables like carat, color, clarity on numeric scales.
2) Grouping variable values through frequency analysis to make distributions more homogeneous.
3) Analyzing scatter plots between each characteristic and price to identify clusters. All but clarity showed clusters.
4) Performing cluster analysis to identify 5 optimal clusters. Regression models were developed for each cluster to predict price based on characteristics.
5) Using the regression models to predict prices for 3 sample diamonds purchased from different wholesalers, finding the quoted prices were higher than predicted in each case.