How can I approximate the median with a linear function?

Question

From this thread I learned why the median is a nonlinear function. In the context in which I'm working I need to use a linear function.

Googling "approximate the median"+"linear" didn't reveal anything usable, but somehow it seems intuitive to me that some reasonable approximation should be possible with a linear function.

EDIT: I was asked in one of the comments to provide a fuller explanation of the context behind the question. The context is that I'm taking in a bunch of estimates from people about some quantity, and then generating an aggregated prediction them. One of the most obvious things to do is to linearly combine the estimates according to some predetermined weights.

The framework is provided by Davis-Stober et al (2014), who write:

Consider a set of N-many decision makers (DMs), where each DM makes a judgment about the unknown value of a criterion. [...] We take this criterion value to be a random variable, $Y$, with mean $\mu_{y}$ and variance $\sigma^{2}$. [...] A crowd prediction, denoted $C$, is defined as the random variable formed by linearly combining the DMs according to predetermined weights $w_{i}$, $C = \sum_{i=1}^{N} w_{i}X_{i}$, with the restriction that all $w_{i}$ are non-negative and, to ensure uniqueness, $\sum_{i=1}^{N} w_{i} = 1$. The weights $w_{i}$ are not random variables, but rather fixed choices of how to combine crowd member judgments. [...]

Let $\mu_{x}$ by the $N \times 1$ vector of the DM's mean predictions. Let $\Sigma_{xx}$ be the covariance matrix of the $X_{i}$, $i \in \{1, > 2, ..., N\}$ random variables. Let $\sigma_{xy}$ denote the $N \times > 1$ vector of covariances of $Y$ with each $X_{i}$, $i \in > \{1,2,...,N\}$. It is straightforward to show that $E[(C-Y)^2]$ is equal to the following

$E[(C-Y)^{2}]=(u{'}_{x}w - u_{y})^{2} + w{'}\Sigma_{xx}w - 2w{'}\sigma_{xy} + \sigma^{2}_{y}$,

where $w$ is the $N \times 1$ vector of weights, $w_{i}, i \in \{1,2, > ..., N\}$, defining $C$.

So I want to 'break down' $E[(C-Y)^{2}]$ in the way described in the aforementioned paper, but in the context in which I'm working it's more common to use the median instead of the mean. It's been asserted to me that I could get around this problem by using a linear aggregate to approximate the median.

Davis-Stober, C. P., Budescu, D. V., Dana, J., & Broomell, S. B. (2014). When is a crowd wise?. Decision, 1(2), 79. Chicago

Can you expand on your context? Essentially, for any data set there is some set of weights which make the weighted mean equal to the median. However the weights depend on the data (i.e. the median is nonlinear). In many contexts a sequence of linear problems can be solved, if that is suitable (e.g. [Iteratively Reweighted Least Squares](https://en.wikipedia.org/wiki/Iteratively_reweighted_least_squares)). — GeoMatt22, Aug 30 '16 at 01:37
Linear function of what exactly? If it's the original observations, without regard to their size/order I think the unweighted mean is as good as you can do. If you can do it with regard to order (i.e. if you can apply it to the order statistics), then there is a linear function already. If you can do it with regard to size, there are various approaches. Is iteration allowed? (i.e. can I derive weights based on a sequence of weighted approximations?) — Glen_b, Aug 30 '16 at 03:05
@GeoMatt22 Thanks for the tip. I've edited the OP to provide some additional context. — user1205901 - Reinstate Monica, Aug 30 '16 at 04:06
It seems to me you could create a related vector of "crowd member judgments" by including various nonlinear functions of their original judgments. For instance, a set of judgments $(x_1,\ldots,x_n)$ could be augmented to $(x_1,\ldots,x_n;x_1^2,\ldots,x_n^2)$, which would be treated as a set of $2n$ "judgments". This would open up useful possibilities, especially if you also adopt some assumptions about the likely distribution of the judgments. Otherwise you're stuck with the conclusions given by @Glen_b. — whuber, Aug 30 '16 at 13:51
When you say "median instead of mean" is this referring to $C$, $E[]$ or both? — GeoMatt22, Aug 30 '16 at 14:23
I will admit to a lack of understanding why a "linear" estimate of the median is required. One reason the median is not a "sufficient" statistic and the mean is, is that only two values are required for its estimation -- the min and the max to find the 50th percentile (as noted by David Cox in his book *Theoretical Statistics*). That said, there are "shortcut" algorithms to estimation, e.g., Tukey wrote a monograph on his "ripper" approach to median estimation (can't find a citation for this article). Is that what you're looking for? — Mike Hunter, Aug 30 '16 at 23:56
Would the Hodges-Lehman median estimator work? https://en.wikipedia.org/wiki/Hodges%E2%80%93Lehmann_estimator — Mike Hunter, Sep 02 '16 at 16:34

How can I approximate the median with a linear function?

0 Answers0