How can I do Stepwise Method if only independent variables are given?

Question

This is the data set

enter image description here

How can I determine if this 4 groups differed significantly in their performance

Do you know the letters to which your numbers correspond, like 84 was in group A, 90 was in group C, etc? — Dave, Dec 23 '21 at 01:11
What are the trials? Is this just four measurements in each of four groups, that’s different than if you track four subjects over time, for instance. — Dave, Dec 23 '21 at 01:27

Alexis · Answer 1 · 2022-01-14T04:29:58.273

6

Do.

Not.

Use.

Step-wise.

Regression.

Models.

Unless you want:

$p$ values biased to be small
$\beta$ estimates biased away from zero
equal likelihood of including false predictors as true predictors
equal likelihood of excluding false predictors as true predictors
$F$ statistics biased upward
$R^2$ biased upward
a series of nested "condition on the decisions about models using such-and-such variables" which are virtually never presented to the audience

edited Jan 14 '22 at 04:29

answered Dec 23 '21 at 03:46

Alexis

26,219
5
78
131

How is stepwise variable selection related to this situation of a single (categorial) predictor? – cdalitz Dec 23 '21 at 06:06
2

@cdalitz Perhaps you missed the OP's title? And the only tag they selected for their question? – Alexis Dec 23 '21 at 08:16

DifferentialPleiometry · Accepted Answer · 2021-12-23T06:31:42.243

0

How can I determine if this 4 groups differed significantly in their performance

Based on the stripplot of your data below, I had two thoughts:

Formal statistical inference aside, these groups don't look substantially different. But I also don't know the measure scale we are dealing with here, so it is difficult for me to judge.
Your sample size is small. Not so small that you cannot do the calculations for a hypothesis test, but small enough that I have doubts about stability of inferences subject to sampling error. Even so, consider reading through this post for a discussion of the smallest sample size one might do with an ANOVA.

But for what it is worth, an ANOVA of your data had an $F=1.176$ and $p=0.3510$. Under the usual confidence limit of $\alpha=0.05$, one would consider there to not be a significant difference among your groups.

edited Dec 23 '21 at 06:31

answered Dec 23 '21 at 05:37

DifferentialPleiometry

2,274
1
11
27

1

[Kruskal-Wallis](https://en.wikipedia.org/wiki/Kruskal%E2%80%93Wallis_one-way_analysis_of_variance) is a highly-similar approach, but would have even less statistical power. – DifferentialPleiometry Dec 23 '21 at 06:38
If their data are repeated measures design (not saying they are, they did not describe with much detail), they may have enough power to detect a difference in treatment means just going visually by the distribution of green dots. – Alexis Dec 23 '21 at 08:20

score 0 · Answer 3 · answered Dec 23 '21 at 06:22

If I interpret your table correctly, you have one predictor (inpendent variable) which is the category (or group) A-D, and one response (dependent variable) that is the score value in the table and is measured on different independent trials.

In this situation, there are two standard statistical tests that try to answer different questions:

ANOVA (see this answer), which tests whether there is any difference in the mean values between all pairs of groups.
Two sample Kolmogorov-Smirnov test (KS), which tests whether the distribution of the response is different between two groups.

ANOVA automatically takes care of the multiple testing problem, while KS only compares a single pair.

How can I do Stepwise Method if only independent variables are given?

This is the data set

3 Answers3

Linked