I am analyzing a dataset with 5 factors(Y1,Y2,Y3,Y4,Y5).
ID Y1 Y2 Y3 Y4 Y5
1 5 1 2 9 40
2 6 1 17 9 49
3 5 1 6 10 25
4 5 1 14 6 69
5 7 1 19 15 66
6 5 1 6 7 24
. . . . . .
. . . . . .
300 6 1 2 12 28
The mean and standard deviation of each factor(y_i) is as follows
avg1 avg2 avg3 avg4 avg5
5.39 1.02 11.8 9.61 42.1
sd1 sd2 sd3 sd4 sd5
1.22 0.145 10.1 3.61 14.5
I have two new observations
ID Y1 Y2 Y3 Y4 X5
* 6 1 18 7 36
** 3 5 1 3 37
What statistical method should I use if my goal is to determine which out of these two observations are closer or most similar to the sample. Thanks.