I have a morphometric dataset of 430 fish, with 27 various "shape" measurements, for example: total length, head length, fin distances, etc.
My goal is multi-fold: (BIG GOAL: cluster by "shape")
Hypotheses:
H0a: No groups can be found in the dataset
H0b: No groups can be imposed on the dataset
H1: Pre-specified well defined groups do “not” a priori exist; though the characteristics of the data can be used to assign entities into artificial groups by morphometric "shape" characteristics (independent of and/or secondary to size) (i.e. Cluster analysis)
H2: Pre-specified well defined groups “do” exist; pre-specified a prioi “expert knowledge” (validated stock origins based on DNA chemistry) well-defined groups exist. Dependent relationships between one set of discriminating variables (morpho-variables, date) and a single grouping variable (DNA stock origin) can be made to define the relationship between dependent and independent variables.
Where I need your help. I would like to run non-hierarchical cluster analysis. If I were to run the analysis on the morphometric variables I have, the clustering would likely be driven purely by size. When I say "size" I mean length.
In multivariate morphometrics, if PCA is run and all the variables load eveningly in the same direction on PC1, typically that PC1 component can be thought of as a "size or length" component and that component is typically "cleaved" from the analysis (not my language, but lit). And the rest of the analysis can be run on the other PC scores.
I'm not totally familar with what "cleave" means. But I guess my overall questions comes down to:
Can I run a PCA on the dataset, get PC1-PCn indidividual unit PC scores, and run cluster analysis on those new PC scores >PC1, i.e. PC2, PC3,... and identify "shape" groups that are relatively independent of size.
I do know I have a number of assumptions I have to meet, or stay within reason with, for distribution, outliers, etc..
If you need more clarity on my objectives, I can provide more information.
Thank you!!
Image below is example of fish with morphometric measurements: