I was wondering if you can help me clarifying some concepts (if it is possible providing references to papers or books) that I will write in the form of ideas rather than questions. Consider this situation:
we draw a stratified random sampling method, first by randomizing the selection of regions within a country and then randomizing the selection of households within the regions. But, we collect data at the individual level.
Idea 1: If I don't have any information about the population, I can only assume that if the selection was drawn at random (each observation had the same probability of being selected) and if the sample size is big enough, It will be "representative" of a certain population parameter(s).
Idea 2 If I have information about the population, I can test statistically whether or not the sample is "representative" of a certain population parameter.
Idea 3 If the sample resulted not representative, do not necessarily means that the selection was not at random. Rather, that it is not representative of a certain population parameter and it will produce a biased estimator(s). For instance, if I randomly select $n$ individuals and collect information about two parameters $X_{n}$ and $Y_{n}$, i.e, age and gender. With information about the population $N$, $X_{N}$ and $Y_{N}$, we could compute some weights $W_{X}=\frac{X_{N}*n}{X_{n}*N}$ and $W_{Y}=\frac{Y_{N}*n}{Y_{n}*N}$ that will give us a rough indicator of the representativeness of our sample. It is possible that none, one or both estimators $\hat{X}=X_{n}/n$ and $\hat{Y}=Y_{n}/n$ resulted in bias. I can imagine two scenarios to explain the source of the bias, one is that the sampler purposely decided to bias the sample, which will make the selection process deterministic instead of random. If it is not the case, and the selection was done by change, I will consider that the sample was not big enough and that the estimator(s) could be overrepresented or underrepresented. This does not mean that the selection process had no uncertainty (it was drawn by choice rather than chance), nor independence between observations or that they didn't come from the same uniform distribution. But, rather that we need more information in order to get closer to the distribution of the population. I guess what I want to propose is that random sampling can still produce bias estimator(s) if the sample size is small.
Idea 4 The method of weights that can only make sense "after sampling", when I know that the sample is not representative according to the information that I have about the population. In other words, it is not a weakness of the stratified sampling method in itself, but rather a method that can be used in order to correct for bias in the sample.