The sample standard deviation and standard error of the mean are biased estimators for their corresponding population population parameters. As explained here for the sample standard deviation this is due to the fact that the expectation of a square root does not equal the square root of an expectation, as can be shown by Jensens equality.
Nevertheless, we are inclined to give an interpretation to standard error estimates, in the sence that they give the mean distance of all sample estimators (means) to the population parameter (mean). However, this seems only to be legitimate if the bias is commonly small or vanishes in large samples under quite general conditions (e.g., distributions). In the proof references above, this seems to be the case for normal data. Is this generally so?
Furthermore: the sample standard error is commonly used as estimator of population standard error in a t-statistic. $$t=\frac{\hat{\theta}-\theta}{s.e.({\hat{\theta}})}$$. Is it true that we would actually need an unbiased estimator of the standard error in this formula or is the use of a biased estimator allowed. What is the role of the sample size again (for normal / non-normal data)?