I'm working with land surface models. These models basically take a bunch of meteorological forcing data (downward radiation, wind, rain, humidity, etc), and run it through some biogeochemical/physical equations, and spit out a bunch of output data ($CO_2$ update/release, net radiation, evaporation, runoff, etc.). There are some problems with the models, and I'm trying to use clustering methods to split the input variables up into subdomains, so I can look at the responses in the output in a non-linear manner.
My question is whether it is valid and/or sensible to cluster on structural variables (e.g. time and space related variables). The input variables, like downward radiation, are highly dependent on these variables (e.g. the amount of sunshine is largely determined by latitude, time of year, and time of day). So I guess it doesn't make much sense to cluster on both the structural and the input variables. But clustering on the structural variables may offer things that clustering on the input variables doesn't - the ability to categorise model output by time of day, for example. Is there any reason why using structural variables should be avoided? Or is it entirely dependent on the model in question?