Sorry if this question has been asked before; I'd love to read any discussion around this. There's got to be a better way to summarize this question as well.
I've got covariates $X$ and response $Y$, and suppose I know that when $X$ is high (or low), so is the variance in $Y$, though perhaps not the expectation. Is there a standard approach to modeling this?
Thinking of examples where this could pop up:
Maybe Y is a stock price, and X is the number of articles written about the stock that day - we don't know if the news is good or bad, but we know the stock probably did something interesting that day if so many articles were written about it. This is assuming you're not interested in forecasting into the future, I guess.
Maybe $X = Z^2$, $Z \sim N(0, \sigma)$, and $Y$ has a partial correlation with $Z$, but you don't have access to $Z$ directly.
You have some advanced archers, and you're interested in the technique they use given their builds/limb lengths/injuries, but for some reason you gave them varying levels of caffeine beforehand, ranging from 0 to 10 cups of coffee (which you recorded dutifully). So their technique is still unbiased, just more jittery, and the noise in your measurements is greater for more caffeinated archers.
These are contrived examples, but this is just for fun right now.
One kind of exploratory approach that I can think of goes something like this:
fit a linear regression on all the other covariates, leaving out your "variance predictor" $X_v$
plot the squared residuals against your variance predictor to decide on some functional form of the relationship between the two, eg $V(Y)$ proportional to $exp(X_v)$
Use $\frac{1}{\exp(X_v)}$ as weights and run a weighted linear regression
This makes some sense to my little bird brain, but I'm sure there's a better choice than an exponential link function, or even WLS.