It seems to me that some authors use different definitions on what is random and what is not? This confuses me so my question is which of these things are random variables.
- Data: does data refer to the actual measurements $x_1,\dots,x_n$ (realizations) or to the random variables $X_1,\dots,X_n$ that generated these values? For me both viewpoints are valid (present data and future data). However I saw the term data defined specifically for the non-random $x_1,\dots,x_n$.
- Population: is the population just a big quantity full of random variables $X_1,\dots,X_n$ where one takes out a sample $X_1,\dots,X_k$ for $k<n$ or is the population a giant set full of realizations from $X_1,\dots,X_n$?