How should be statistics on scientific papers read?

Question

Let's take this research published on Plos ONE

Increasing carbohydrate intake was associated with increasing stroke risk (HR = 2.01, 95%CI = 1.04–3.86 highest vs. lowest quintile; p for trend 0.025).

Multivariable Cox modeling estimated adjusted hazard ratios (HRs) of stroke with 95% confidence intervals (95%CI).

How should people read those values? What does it mean confidence interval for example?

Do you know any resource explaining the interpretation of the statistical part of those scientific papers?

On one hand, I think this is a great question and I'd like to have a good resource handy I could link people to - one that goes beyond "p is the conditional probability of obtaining a test statistic as or more extreme blah blah blah", but actually explains what typical stats *mean* in a way that's useful to non-geeks. On the other hand, I don't think this is the right board to ask for that. CrossValidated is. — jona, Sep 24 '14 at 10:23
@Glen_b: I've put a citation from the article I linked. It speaks of HR, Cl and so on.. what is it speaking of? — Revious, Sep 24 '14 at 13:36
Sorry, I meant the referent of the word "that" in your question, which in that context would refer to a single item. Did you mean "those", which would refer to more than one (and imply all the values)? Are you simply after a translation of the abbreviations? (HR=hazard ratio, CI =confidence interval, p="p-value") — Glen_b, Sep 24 '14 at 13:39
Yes, sorry, I meant those. But Are them abbreviations universally used or not? Have also a look to what Jona says. Many of this papers speaks of "p".. This papers give this stuff as understood, without any further explanation. I will edit the question.. — Revious, Sep 24 '14 at 13:42
[Hazard ratio](http://en.wikipedia.org/wiki/Hazard_ratio), [Confidence interval](http://en.wikipedia.org/wiki/Confidence_interval), [p-value](http://en.wikipedia.org/wiki/P-value) — Glen_b, Sep 24 '14 at 13:46
I think now it's two questions - one asking for the interpretation of these specific lines in this specific paper; the other about a general resource for understanding common stats terminology and reporting styles. — jona, Sep 24 '14 at 14:33
@jona: I think we could also change a bit the question to become more generic.The wikipedia entries are indeed very useful, but for a common person would be enough one line of explanation for each of those concepts. — Revious, Sep 24 '14 at 14:53

score 3 · Accepted Answer · answered Sep 24 '14 at 15:30

I'll take a stab at the portion of the question asking about interpreting these specific results (and I'm going to try to be non-technical about it):

The hazard ratio (HR) of 2.01 suggests that over the course of the study, those in the focal group (in this case, those with increased carbohydrate intake) were about 2 times as likely to experience the outcome of interest (in this case, stroke). Put another way, they had twice the stroke risk of those in the comparison group.

The confidence interval suggests that we can conclude, with 95% certainty, that the true hazard rate in the population could fall anywhere between 1.04 and 3.86. In the broader population, the stroke risk associated with increased carbohydrate consumption could be as high as 3.86 times or as low as 1.04 times that of the comparison group. One thing to note is that this range does not include 1 (although it comes close), suggesting that there is probably a relationship between carbohydrate intake and stroke in the broader population. If 1 were included, this would mean that we would not be justified in concluding that the hazard ratio was different from equal (1:1).

The p value can be interpreted as the probability of finding a result equal to, or more extreme, than these results in the population by chance alone. Typically, a p value lower than .05 suggests that the results are significant- they are extreme enough to suggest that it is an actual effect, not chance, that is accounting for the results. In this case, the p value is .025, which would allow you to conclude that these results are significant and suggest a relationship between your variables in the population.

Thanks a lot. I will open another question for asking how a statistical search may arrive at those probabilistic conclusion. As a profane it's hard for me to understand concept like this one: "The confidence interval suggests that we can conclude, with 95% certainty, that the true hazard rate in the population could fall anywhere between 1.04 and 3.86" — Revious, Sep 25 '14 at 07:30
Here it is: http://stats.stackexchange.com/questions/116711/how-are-statistics-on-scientific-papers-inferred — Revious, Sep 25 '14 at 07:44
There's an issue with the interpretation of "confidence interval" (CI) in this answer, as now explained in http://stats.stackexchange.com/questions/116711/how-are-statistics-on-scientific-papers-inferred. If you had the same population and performed the same study a large number of times, then 95% of hazard ratios would fall within the CI. That's not necessarily the same thing as having 95% certainty that the true hazard ratio is contained in that range. — EdM, Sep 25 '14 at 16:54

How should be statistics on scientific papers read?

1 Answers1

Linked