My question is very much related to this one:
How to apply standardization/normalization to train- and testset if prediction is the goal?
However, my testing data is not a single observation that I want to predict on but rather it is a set of new observations. My training set is about 300 observations and my testing set about 30. So using method 2 from the above link would not be good as it would include bias (sample size of 30 is not large enough to give a good representation of the data distribution).
In that case, it seems to me that method 1 is preferable over method 3? Or am I missing something.