Background:
I am currently surveying a number of articles related to subject of urban sprawl. One of the things that I have come across multiple times in the literature (a thing which seems to bother me), is a somewhat strange relationship between the data that have been used to create the dependent and the independent variables.
The relationship:
Imagine that:
The dependent variable (y) = the size of the central city population.
A control variable (x) = the size of the metropolitan population.
... and that the size of the metropolitan population is a sum of central city population and the suburban population (i.e. x = y + (x - y)).
Question:
Does the above-stated relationship not induce some kind of bias (given that the obtained covariance is partially based on the size of y, which ultimately makes x endogenous to y)?
Disclaimer:
Sorry if the question is too simple - I could not find a post with a similar question, nor any textbook examples.