I am trying to do regression analysis with level of a chemical in blood as dependent variable and age, gender and weight of children as predicting variables. The sample size is about 5000. Age and weight are highly correlated in children. My doubts are:
Should I use z-scores or percentiles for weight rather than raw values?
Should I use some other technique rather than ordinary linear regression?
Do I need to check if data has normal distribution at this sample size?
Edit: I want to clarify regarding z-score or percentile here: I have ages as 5,6,7,8 etc with no fractional ages. I thought for each age I can calculate z-score or percentile of weight for that individual child and use it instead of raw weight. By this I can answer the question that 'Is being overweight for age has any effect on blood level of the chemical'? Is this reasonable argument? Also, this question differs from the earlier question and is not a duplicate. My questions 2 and 3 do not figure in the title.
Regarding a comment on biological issues by @DLDahly: The ages are 5-15 years. Biologically, I want to determine if the weight is a predictor of blood level of chemical, independent of age? Chemical level rises with age, but it is not clear if being overweight increases it further. Actually, one cannot rule out the possibility that this rise may be related mainly to weight and not to age as such.