Questions tagged [spss]

IBM SPSS Statistics is a statistical software package. Use this tag for any on-topic question that (a) involves SPSS either as a critical part of the question or expected answer and (b) is not just about how to use SPSS.

SPSS (Statistical Package for the Social Sciences) is a proprietary, cross-platform, general-purpose statistical software package. Its current official name is IBM SPSS Statistics.

SPSS has both a well-developed GUI and a command syntax. One distinctive aspect of SPSS Statistics compared to other popular proprietary software packages (such as Stata or SAS) is the built-in functionality to call Python or R commands within syntax (in addition to SPSS's own commands). Otherwise it is largely comparable to other general-purpose proprietary and free packages (such as R), although it differs in some advanced statistical capabilities and aspects of data manipulation.

Suggested readings on using SPSS and learning the command syntax are two online PDFs.

Other useful print versions are

Forums and groups entirely devoted to the software (and suggested places to search when encountering a problem with SPSS) are:

Other suggested webpages are

PSPP is a free, open-source alternative that largely mimics the look and functionality of SPSS.

1922 questions
376 votes, 26 answers

Python as a statistics workbench

Lots of people use a main tool like Excel or another spreadsheet, SPSS, Stata, or R for their statistics needs. They might turn to some specific package for very special needs, but a lot of things can be done with a simple spreadsheet or a general…
Fabian Fagerholm
147 votes, 3 answers

When is R squared negative?

My understanding is that $R^2$ cannot be negative as it is the square of R. However I ran a simple linear regression in SPSS with a single independent variable and a dependent variable. My SPSS output give me a negative value for $R^2$. If I was to…
Anne
69 votes, 8 answers

Is PCA followed by a rotation (such as varimax) still PCA?

I have tried to reproduce some research (using PCA) from SPSS in R. In my experience, principal() function from package psych was the only function that came close (or if my memory serves me right, dead on) to match the output. To match the same…
Roman Luštrik
50 votes, 7 answers

Why is "statistically significant" not enough?

I have completed my data analysis and got "statistically significant results" which is consistent with my hypothesis. However, a student in statistics told me this is a premature conclusion. Why? Is there anything else needed to be included in my…
38 votes, 1 answer

Doing principal component analysis or factor analysis on binary data

I have a dataset with a large number of Yes/No responses. Can I use principal components (PCA) or any other data reduction analyses (such as factor analysis) for this type of data? Please advise how I go about doing this using SPSS.
Cathy
34 votes, 4 answers

Difference between binomial, negative binomial and Poisson regression

I am looking for some information about the difference between binomial, negative binomial and Poisson regression and for which situations are these regression best fitted. Are there any tests I can perform in SPSS that can tell me which of these…
32 votes, 1 answer

Best factor extraction methods in factor analysis

SPSS offers several methods of factor extraction: Principal components (which isn't factor analysis at all) Unweighted least squares Generalized least squares Maximum Likelihood Principal Axis Alpha factoring Image factoring Ignoring the first…
Placidia
32 votes, 5 answers

Fisher's Exact Test in contingency tables larger than 2x2

I was taught to only apply Fisher's Exact Test in contingency tables that were 2x2. Questions: Did Fisher himself ever envision this test to be used in tables larger than 2x2 (I am aware of the tale of him devising the test while trying to guess…
pmgjones
26 votes, 3 answers

Choice between Type-I, Type-II, or Type-III ANOVA

We have a dataset with three variables (dV: self-reported measure on scale 1-5, assumed to be metric; iV1: factor with 4 levels; iV2: factor with 8 levels). We are interested whether the dV differs in regard to both iVs and whether there is an…
phx
23 votes, 2 answers

CHAID vs CRT (or CART)

I am running a decision tree classification using SPSS on a data set with around 20 predictors (categorical with few categories). CHAID (Chi-squared Automatic Interaction Detection) and CRT/CART (Classification And Regression Trees) are giving me…
Placidia
23 votes, 1 answer

Methods to compute factor scores, and what is the "score coefficient" matrix in PCA or factor analysis?

As per my understanding, in PCA based on correlations we get factor (= principal component in this instance) loadings which are nothing but the correlations between variables and factors. Now when I need to generate factor scores in SPSS, I can…
Kartikeya Pandey
22 votes, 2 answers

How to cluster time series?

I have a question about cluster analysis. There are 3000 companies, which have to be clustered according to their power usage over 5 years. Each company has values for every hour during 5 years. I would like to find out if some companies have the…
user89686
22 votes, 1 answer

After bootstrapping regression analysis, all p-values are multiple of 0.001996

I'm running various multiple regression analyses in SPSS 27, and with those that are not bootstrapped, the p-values vary such that I do not find the same p-value twice within a regression (e.g., the p-values will be 0.000012435, 0.0053868,…
cjfcjf
19 votes, 4 answers

Visualizing Likert responses using R or SPSS

I have 82 respondents in 2 groups (43 in Group A and 39 in Group B) that completed a survey of 65 Likert questions each ranging from 1 – 5 (strongly agree - strongly disagree). I therefore have a dataframe with 66 columns (1 for each question + 1…
Adam
19 votes, 5 answers

Interpreting discrepancies between R and SPSS with exploratory factor analysis

I am a graduate student in computer science. I have been doing some exploratory factor analysis for a research project. My colleagues (who are leading the project) use SPSS, while I prefer to use R. This didn't matter until we discovered a major…
Oliver