If there is no autocorrelation within each data set, then this could be handled by a mixed model. As the nature of each row is evidently the same in all the data sets, the 2000 rows would be included as random effects (e.g., the row number could serve the same function as a subject ID in other applications of mixed models) and the group and data set (location) would be treated as fixed effects.
It might be simplest to reformat your 8 separate data sets (one per location) into one large data frame. Within each data set, add a column for rowID and another column with an identifier of the location from which that data set was taken, giving you four data columns. Then just combine them all into a 16000-row by 4-column data frame.
A simple mixed regression model would then be:
value ~ group + location + (1|rowID)
where the (1|rowID)
symbol means that each rowID will be associated with a potentially different intercept on the value
scale. This model assumes that the value
for any individual observation for a rowID is the sum of its intercept and additive contributions from the group
and location
of the observation, plus a random error, and that there are no interactions beyond these strictly additive effects among group
, location
, or rowID
in terms of determining the value
.
As with any regression you might need to transform the values
to meet linearity and equal-error-variance assumptions, or you might need to allow for interactions between group
and location
(that is, the effect of group
might depend on the location
, and thus vice-versa). In principle you could allow for random effects other than just in terms of the intercept associated with each rowID
; see this page for some examples.
Your estimate of the effect of group
will then be the regression coefficient estimated for group
(if there is no interaction specified between group
and location
). There can be some issues with properly determining p-values for fixed-effect coefficients in mixed models, but my sense is that with all rowIDs
represented once in each location
and each rowID
having the same group
assignment in all cases that shouldn't be a problem. See this page and its links for discussion.