Hello folks. Our university just started using v18. A colleague who was attempting to perform Mann-Whitney U tests on a couple of variables asked me why SPSS was advising her to "retain the null hypothesis" when the p-value was well below .05. I said I didn't know, and asked to see the data.
It turns out that there are now so-called "legacy dialogs" under Analyze > Nonparametric Tests. If you fail to notice that, and you use the newfangled dialogs, your analysis is done via NPTESTS rather than NPAR TESTS. The output (the default output, at least) for NPTESTS is VERY sparse: it does not give any descriptive statistics (including the sample sizes), nor does it report the value of the test statistic. It only gives a statement of the null hypothesis, says which test was used, gives a p-value, and a decision (i.e., reject or retain H0).
IMO, this is a big step in the wrong direction. Does IBM SPSS Statistics really want to market itself by saying things like, "You don't even have to understand statistics to use it!"? The term GIGO comes to mind.
And there's another problem: For one of my colleague's two variables, the "decision" was to retain H0, despite the fact that the p-value was .013. See below. (And yes, I do know that there are far too many ties here, which is very problematic for rank-based tests. But will the targeted user who "doesn't even have to understand statistics" know that? I doubt it.)
data list list / Group Y1 Y2 (3f5.0).
begin data
0 0 34
0 0 16
0 0 33
0 0 0
0 0 22
0 0 0
0 46 22
0 0 0
0 0 0
0 0 0
1 0 0
1 0 0
1 0 0
1 0 0
1 0 0
1 0 0
1 0 0
1 0 0
1 0 0
1 0 0
end data.
variable level Y1 Y2 (scale).
* The default GUI options for Analyze > Nonparametric Tests
* now generate NPTESTS syntax.
*Nonparametric Tests: Independent Samples.
NPTESTS
/INDEPENDENT TEST (Y1 Y2) GROUP (Group)
/MISSING SCOPE=ANALYSIS USERMISSING=EXCLUDE
/CRITERIA ALPHA=0.05 CILEVEL=95.
* Notice that the output is VERY sparse.
* There is no reporting of descriptive stats, not even
* the sample sizes; nor is the test statistic reported.
* A "decision" is reported (reject or retain H0); but
* for variable Y2, the "decision" does not agree with
* the reported p-value.
* To obtain the old-fashioned NPAR TESTS syntax, one
* must now use the "legacy dialogs" under Analyze > Nonparametric Tests.
NPAR TESTS M-W = Y1 Y2 BY group(0,1).


Hi Bruce:
Given the small sample sizes, you must pay attention to the exact
p-value (although it will be somewhat biased towards non-significance
due to the high number of ties), and it is p=0.063. The asymptotic
approximation is not reliable when both sample sizes are below 20, as
in this case.
Anyway, the Mann-Whitney U test requires that both sample distributions
be similar in shape and spread. This is clearly not the case with your
data. I would recommend using the median test instead:
NPAR TESTS
/MEDIAN=Y1 Y2 BY Group(0 1)
/MISSING ANALYSIS.
The result is significant (using an exact p-value, BTW).
I DON'T like NPTESTS either. I use a collection of macros that are more
complete (IMHO).
HTH,
Marta GG
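For anyone who wants to see where a median-test result like this comes from, here is a minimal sketch in Python (standard library only, not SPSS syntax). It assumes the median test amounts to splitting each variable at the grand median and running a two-sided Fisher exact test on the resulting 2x2 table; whether NPAR TESTS computes its exact p-value in precisely this way is an assumption, and the function names are made up for illustration.

```python
from math import comb
from statistics import median

# Data copied from the DATA LIST block at the top of the thread.
y1 = {0: [0, 0, 0, 0, 0, 0, 46, 0, 0, 0], 1: [0] * 10}
y2 = {0: [34, 16, 33, 0, 22, 0, 22, 0, 0, 0], 1: [0] * 10}

def median_table(x, y):
    """2x2 table of counts (above grand median, not above) for each group."""
    m = median(x + y)
    above_x = sum(v > m for v in x)
    above_y = sum(v > m for v in y)
    return [[above_x, len(x) - above_x], [above_y, len(y) - above_y]]

def fisher_two_sided(table):
    """Two-sided Fisher exact p: sum of all tables no more likely than the observed one."""
    (a, b), (c, d) = table
    r1, r2, c1 = a + b, c + d, a + c

    def weight(k):
        # Unnormalised hypergeometric probability of k 'above' cases in row 1.
        return comb(r1, k) * comb(r2, c1 - k)

    observed = weight(a)
    k_values = range(max(0, c1 - r2), min(r1, c1) + 1)
    extreme = sum(weight(k) for k in k_values if weight(k) <= observed)
    return extreme / comb(r1 + r2, c1)

p_y1 = fisher_two_sided(median_table(y1[0], y1[1]))
p_y2 = fisher_two_sided(median_table(y2[0], y2[1]))
print(f"Y1: p = {p_y1:.3f}   Y2: p = {p_y2:.3f}")
```

With the posted data this gives p = 1 for Y1 and roughly p = .033 for Y2.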
Re: RANT: NPTESTS vs NPAR TESTS
I recommend the legacy dialogs for nonparametric tests to all my friends and students.
The new part is complicated to use for INPUT and seriously lacking for OUTPUT.
Yet another example of PASW letting their designers drive the interface.
Typically they have no idea about statistics [as the example shows] and no idea about HCI, as evidenced by the lack of user testing.
Best
Diana
Hi Bruce,
With regard to the disagreement between
the significance displayed in the initial output table and the decision
recommended, there's a bug, as described in the resolution included below.
David Nichols
Statistical Support
SPSS, an IBM Company
Resolution #90896
Problem Summary:
NPTESTS Mann-Whitney test output significance and decision don't agree
Problem Description:
I'm running a Mann-Whitney test in the NPTESTS procedure in IBM SPSS Statistics.
In the resulting output, I sometimes see a p value below my specified critical
level in the Sig. column, along with Retain the null hypothesis in the
Decision column, or vice versa. Why is this happening?
Resolution Summary:
This problem has been reported to SPSS Development. It occurs because
significance is based on asymptotic results, decision on exact results.
Resolution Description:
If the sample size is sufficiently small, you get both asymptotic and exact
results for the Mann-Whitney test. The results shown in the main initial
output in the Viewer should be consistent in using the exact significance
levels, but are currently using significance levels from the asymptotic
test and basing the decision on the exact test. The problem has been reported
to SPSS Development and will be fixed in a future release. In the meantime,
you can double-click on the output to activate the Viewer and look at the
results in the right-hand screen for the variable(s) of interest and see
both asymptotic and exact significance levels. We apologize for any resulting
inconvenience.
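The two numbers behind the discrepancy can be reproduced by hand. The sketch below (Python, standard library only) computes the Mann-Whitney U for Y2 from the data posted earlier, then the tie-corrected normal approximation and the exact p-value from the untied null distribution of U. Assumptions: no continuity correction, and the exact p-value is taken as twice the one-tailed probability with no correction for ties. These choices happen to reproduce the .013 and .063 quoted in this thread, but they are not taken from SPSS documentation, and the function names are made up for illustration.

```python
from collections import Counter
from functools import lru_cache
from math import comb, erfc, floor, sqrt

# Y2 by group, copied from the DATA LIST block earlier in the thread.
group0 = [34, 16, 33, 0, 22, 0, 22, 0, 0, 0]
group1 = [0] * 10

def u_statistic(x, y):
    """Return the smaller Mann-Whitney U and the tie-group sizes, using midranks."""
    counts = Counter(x + y)
    rank, next_rank = {}, 1
    for value in sorted(counts):
        t = counts[value]
        rank[value] = next_rank + (t - 1) / 2  # average rank of a tie group
        next_rank += t
    n1, n2 = len(x), len(y)
    u1 = sum(rank[v] for v in x) - n1 * (n1 + 1) / 2
    return min(u1, n1 * n2 - u1), list(counts.values())

def asymptotic_p(u, n1, n2, tie_sizes):
    """Two-sided p from the normal approximation with tie-corrected variance."""
    n = n1 + n2
    tie_term = sum(t**3 - t for t in tie_sizes) / (n * (n - 1))
    variance = n1 * n2 / 12 * ((n + 1) - tie_term)
    z = (u - n1 * n2 / 2) / sqrt(variance)
    return erfc(abs(z) / sqrt(2))

@lru_cache(maxsize=None)
def arrangements(n1, n2, u):
    """Number of untied orderings of n1 + n2 values that give exactly this U."""
    if u < 0:
        return 0
    if n1 == 0 or n2 == 0:
        return 1 if u == 0 else 0
    # The largest value is either from sample 1 (adds n2 to U) or from sample 2.
    return arrangements(n1 - 1, n2, u - n2) + arrangements(n1, n2 - 1, u)

def exact_p(u, n1, n2):
    """Twice the lower-tail probability of U, ignoring ties."""
    lower_tail = sum(arrangements(n1, n2, k) for k in range(floor(u) + 1))
    return min(1.0, 2 * lower_tail / comb(n1 + n2, n1))

u, ties = u_statistic(group0, group1)
p_asymptotic = asymptotic_p(u, len(group0), len(group1), ties)
p_exact = exact_p(u, len(group0), len(group1))
print(f"U = {u:g}, asymptotic p = {p_asymptotic:.3f}, exact p = {p_exact:.3f}")
```

At ALPHA=0.05 the first p-value falls below the cutoff and the second does not, which is the mismatch between the Sig. and Decision columns described above.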
Bruce Weaver
Thanks David. I didn't know about the "model viewer" that pops up when you double-click on the output. The output shown there is actually quite nice: I like the plot showing the distributions for the two groups, for example. If this had appeared as part of the standard output, I would have avoided a rant. ;)
Cheers,
Bruce
Hi Marta. As it happens, I ended up at the same place you did. I suggested converting things to 2x2 tables, with Group in the rows, and None vs Some in the columns. Given that the grand median = 0 in both cases, my 2x2 tables are exactly the same as those used for the median test. One difference is that I recommended using the N-1 chi-square in the case where all expected counts were large enough.
http://www.iancampbell.co.uk/twobytwo/calculator.htm
Cheers,
Bruce
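To make the arithmetic concrete, here is a rough Python sketch (standard library only) of the None vs Some table for Y2 and the N-1 chi-square, computed as the usual Pearson statistic multiplied by (N - 1)/N. The data are those posted at the top of the thread; the function name is made up for illustration, and what counts as "large enough" expected counts is left to the reader.

```python
from math import erfc, sqrt

# Y2 by group, copied from the DATA LIST block at the top of the thread.
group0 = [34, 16, 33, 0, 22, 0, 22, 0, 0, 0]
group1 = [0] * 10

# Rows = Group, columns = (None, Some).
table = [
    [sum(v == 0 for v in group0), sum(v > 0 for v in group0)],
    [sum(v == 0 for v in group1), sum(v > 0 for v in group1)],
]

def n_minus_1_chi_square(tbl):
    """Return (statistic, p-value on 1 df, smallest expected count) for a 2x2 table."""
    (a, b), (c, d) = tbl
    r1, r2, c1, c2 = a + b, c + d, a + c, b + d
    n = r1 + r2
    smallest_expected = min(r1, r2) * min(c1, c2) / n
    # Pearson chi-square is n * (ad - bc)^2 / (r1 r2 c1 c2); here n is replaced by n - 1.
    statistic = (n - 1) * (a * d - b * c) ** 2 / (r1 * r2 * c1 * c2)
    # Upper tail of chi-square with 1 df, via the normal distribution.
    p_value = erfc(sqrt(statistic / 2))
    return statistic, p_value, smallest_expected

statistic, p_value, smallest_expected = n_minus_1_chi_square(table)
print(table)
print(f"N-1 chi-square = {statistic:.2f}, p = {p_value:.3f}, "
      f"smallest expected count = {smallest_expected}")
```

For Y2 the statistic is about 6.33 on 1 df (p of about .012), with a smallest expected count of 2.5.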
There's another minor angle on that new command, which is that all the model
graphics, which look very nice, do not print directly. They print, but only
after you have double-clicked on the result box and then selected Print in the
right-hand pane. I'm not interested in ranting on this (and I won't say
anything more on this), but I vote against hidden output. I prefer everything
to be visible in the output file, whether viewed on a screen or on paper.
Gene Maguin


