A SAS Mystery Solved – When FREQ and MEANS disagree

ByAnnMaria De Mars June 15, 2015June 15, 2015

I’m preparing a data set for analysis and since the data are scored by SAS I am double-checking to make sure that I coded it correctly. One check is to select out an item and compare the percentage who answered correctly with the mean score for that item. These should be equal since items are scored 0=wrong, 1=correct.

When I look at the output for my PROC MEANS it says that 31% of the respondents answered this item correctly, that is, mean = .310.

However, the correct answer is D and when I look at the results from my PROC FREQ it shows that 35% of the respondents gave ‘D’ as the correct answer.

What is going on here? Is my program to score the tests off somewhere? Will I need to score all of these tests by hand?

I am sure those of you who are SAS gurus thought of the answer already (and if you didn’t, you’re going to be slapping your head when you read the simple solution).

By default, PROC FREQ gives you the percentage of non-missing records. Since many students who did not know the answer to the question left it blank, they were (rightfully) given a zero when the test was automatically scored. To get your FREQ and MEANS results to match, use the MISSING option, as so

PROC FREQ DATA =in.score ;
TABLES item1 / MISSING ;

You will find that 31% of the total (including those who skipped the question) got the answer right.

Sometimes it’s the simplest things that give you pause.

Local SAS User Groups, mostly in Tweets

ByAnnMaria De Mars December 11, 2012

Go to your local users group. If you don’t know if you have a local users group in your area, check the sascommunity.org page that lists bunches of them. There are six in California listed on their site and I heard of two others that started very recently that aren’t listed. LABSUG is the Los…

statistics

SHOWING students statistics

ByAnnMaria De Mars June 26, 2011June 26, 2011

Science is boring! Math is boring! This is the whine of the world’s most spoiled 13-year-old as she does her homework, and I find it hard to argue with her because I have read her textbooks and all of them could be put to a better use as a cure for insomnia or starting a…

Dr. De Mars General Life Ramblings | Open data | statistics

Why I Don’t Have Minions

ByAnnMaria De Mars September 20, 2011September 20, 2011

Admit it, more than once you have thought to yourself, Wouldn’t it be convenient about now to have some mindless minions to do my bidding? I’d always thought if this whole statistical consulting thing didn’t work out, I could be an evil scientist. I mean, I already went to the trouble to get a Ph.D….

Algebra | statistics

Math and Computer Programming through Black Belt Eyes

ByAnnMaria De Mars December 10, 2010

In my misspent youth, I was the first American to win the world judo championships. This came about since I had a propensity to run my mouth off, which often led to fights. Those people who said I better be able to “walk the walk if I was going to talk the talk”. Well, I…

Software

Mistakes with SAS Today

ByAnnMaria De Mars June 11, 2008June 11, 2008

Every now and then I post a mistake I made using either statistical software or statistics. Students often get discouraged feeling they make so many mistakes and they will never get it all right. No one gets it all right all the time. Obvious mistake of the day …. I was making minor cosmetic changes…

Open data | statistics

Census in Black & White: What I wondered about lately

ByAnnMaria De Mars August 22, 2011

The census now allows more than one race to be checked. For many years, friends of mine in inter-racial couples when they registered their children for school would check the “Other” box for race, rather than pick black or white. Although an individual’s census form responses are confidential, you certainly are free to tell anyone…

2 Comments

Prashant says:

July 30, 2015 at 11:44 am

Ann,

Can you please provide a small SAS code example to illustrate what you mentioned in this blog ie the difference of PROC MEANS and PROC FREQ wrt Missing Values. Shouldn’t both exclude missing values ?

Thanks.
AnnMaria says:

August 8, 2015 at 2:53 am

Both by default exclude missing values. However, if you score your tests like this:

If answer = “D” then correct = 1 ;
else correct = 0 ;

All of those missing an answer will be scored 0, as it should be, since they did not give the correct answer. So, when you compute the mean,all of those will no longer be missing, they will have had their answer on that item scored as a 0.

Similar Posts

2 Comments

Leave a Reply