
Maria and Eric meet z-scores

By AnnMaria De Mars, December 12, 2013

One of the problems many students have when first learning statistics is deciding when to reject the null hypothesis. The z-score is small, and a low probability means something is not likely to occur, so you reject, right? (Wrong!) You reject the null hypothesis when you have a large z-score, and p = .86 is a large number, so with p = .86 you reject, right? (Wrong!)

Enter Maria and Eric to help us explain z-scores. Eric is 6 foot 4, or 76 inches tall. That is a high number, both in mathematical terms and off of the ground. I want to determine if Eric’s height is significantly different from the mean. I use the heart data set included with the SAS Web Editor to compute the mean and standard deviation of height for adult males, like so:

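/* Sort by sex so the BY statement below works */
proc sort data=sashelp.heart out=temp ;
by sex ;
run ;
/* Descriptive statistics for height, separately for each sex */
proc univariate data=temp ;
var height ;
by sex ;
run ;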

I find that the mean is 67.6 and the standard deviation is 2.7. I then compute my z-score, which is the obtained value of 76 inches minus the mean of 67.6, with that difference divided by the standard deviation of 2.7: (76 - 67.6) / 2.7. This gives me a z-score of 3.1, which tells me that Eric is 3.1 standard deviations above the mean.
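If you want to check that arithmetic yourself, here is a minimal sketch. It is in Python rather than SAS, purely for illustration, and the function name `z_score` is my own invention. The mean and standard deviation are the rounded values quoted above, not values pulled from the data set.

```python
def z_score(value, mean, sd):
    """Return how many standard deviations `value` lies from `mean`."""
    if sd <= 0:
        raise ValueError("standard deviation must be positive")
    return (value - mean) / sd

# Eric: 76 inches, compared with the rounded male mean and SD from the post.
eric_z = z_score(76, 67.6, 2.7)
print(round(eric_z, 1))  # 3.1
```

A positive result means the value is above the mean; a negative one means it is below.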

Listen carefully here — there is a SMALL probability of LARGE differences from the mean.

A z-score at least as extreme as 1.96 in either direction, which is about two standard deviations from the mean, occurs only 5% of the time. How often does a z-score as extreme as 3.1 occur? Less than 0.2% of the time, p < .002. So Eric is a LARGE difference from the average height, and people who differ from the average by that much represent a SMALL proportion of the population.
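Those probabilities can be looked up in a z table, or computed. The sketch below uses `NormalDist` from Python's standard `statistics` module and assumes a two-tailed question (how often is a z-score at least this far from zero in either direction?). The helper name `two_tailed_p` is hypothetical.

```python
from statistics import NormalDist

def two_tailed_p(z):
    """Probability of a z-score at least this far from 0 in either direction."""
    return 2 * (1 - NormalDist().cdf(abs(z)))

print(round(two_tailed_p(1.96), 3))  # 0.05
print(two_tailed_p(3.1) < 0.002)     # True
```

Notice the direction of the relationship: as the z-score gets bigger, the probability gets smaller.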

We would therefore REJECT the null hypothesis that there is NO difference between Eric’s height and the average and conclude that he is significantly taller than average.
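The decision rule itself fits in a few lines. This is only a sketch: it assumes a two-tailed test and the conventional cutoff of .05, which I have not stated explicitly anywhere above, and `reject_null` is a made-up name.

```python
from statistics import NormalDist

ALPHA = 0.05  # assumed significance level, the conventional choice

def reject_null(z, alpha=ALPHA):
    """Reject the null hypothesis when the two-tailed p-value is below alpha."""
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return p < alpha

print(reject_null(3.1))  # True: large z, small p, reject
print(reject_null(0.2))  # False: small z, large p, do not reject
```

The second call uses a hypothetical z-score of 0.2 just to show the other branch. You reject when p is SMALL, which happens when z is LARGE.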

Since Maria just sniffed disrespectfully,

I could have told you that!

(I can hear you over the Internet) … we will now examine Maria.

She is 5 foot 4, or 64 inches. The average height for a woman is 62.6 inches and the standard deviation is 2.5. Her z-score is (64 - 62.6) / 2.5 = .56, and the probability of a z-score at least that far from the mean in either direction is almost 60%, p ≈ .58. So, she differs a SMALL amount from the average, and that will happen a LARGE proportion of the time.
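Here is a quick check of Maria's numbers, again as a Python sketch rather than SAS. It prints both the one-tailed probability (a z-score this high or higher) and the two-tailed probability (this far from the mean in either direction), since the two are easy to mix up. The variable names are mine, and the mean and standard deviation are the rounded values above.

```python
from statistics import NormalDist

# Maria: 64 inches, compared with the rounded female mean and SD from the post.
maria_z = (64 - 62.6) / 2.5

upper_tail = 1 - NormalDist().cdf(maria_z)  # this tall or taller
two_tailed = 2 * upper_tail                 # this far from the mean, either side

print(round(maria_z, 2), round(upper_tail, 2), round(two_tailed, 2))
# 0.56 0.29 0.58
```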

SO … would you accept or reject the null hypothesis that Maria is no different than the average height for women? Discuss.



2 Comments

sylver says:

December 13, 2013 at 12:07 am

If I understand the terms properly, a z-score indicates how different a data point is from the mean, measured in standard deviations, and P indicates how likely it is for a data point to be this different.

z-score = 0 : perfectly average
-1< z-score < 1 : different from average but within normal deviation: in engineering terms, I think we call that "pretty average"
z-score < -1 or z-score > 1 : outside the norm

For Maria, with a z-score at 0.56, her height is well within the standard deviation and P indicates that it is pretty common, so the null hypothesis should be accepted:

Maria’s height is not unusual at all (and I could have told you that without learning what a z-score is, thank you very much 😉)

sylver says:

December 17, 2013 at 3:51 am

PS: Did I get it right?
