
Standardized Testing in Plain Words (continued)

By AnnMaria De Mars, November 20, 2016

Last post I wrote a little about local norms versus national norms and gave the example of how the best-performing student in the area can still be below grade level.

Today, I want to talk a little about tests. As I mentioned previously, when we conducted the pretest prior to students playing our game, Spirit Lake, the average student scored 37% on a test of mathematics standards for grades 2-5. These were questions that required them to, say, subtract one three-digit number from another or multiply two one-digit numbers.

Originally, we had written our tests to model the state standardized tests, which, at the time, were multiple choice. This ended up presenting quite a problem. Here is a bit of test theory for you. The variance in test scores is made up of two parts: true score variance and error variance.

True score variance exists when Bob gets an answer right and Fred gets it wrong because Bob really knows more math (and the correct answer) compared to Fred.

Error variance occurs when, for some reason, Bob gets the answer right and Fred gets it wrong even though there really is no difference between the two. That is, the variance between Fred and Bob is an error. (If you want to be picky about it, you would say it is actually the variance from the mean that is an error, but just hush.)

How could this happen? Well, the most likely explanation is that Bob guessed and happened to get lucky. (It could happen for other reasons – Fred really knew the answer but misread the question, etc.)

If very little guessing occurs on a test, or if guesses have very little chance of being correct, then you don’t have to worry too much.

However, the test we used initially had four answer choices for each multiple-choice question. The chance of guessing correctly was 1 in 4, that is, 25%. Because students turned out to be substantially further below grade level than we had anticipated, they did a LOT of guessing. In fact, for several of the items, the percentage of correct responses was close to the 25% students would get from randomly guessing.
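To put a rough number on the guessing problem, here is a minimal Python sketch (not the SAS we used for the analysis). It assumes a deliberately simple model in which a student either knows an item or guesses blindly among four choices. The function names and the model itself are illustrative assumptions, not something we fit to our data.

```python
def expected_score(prop_known, n_options=4):
    """Expected proportion correct if a student knows `prop_known` of the
    items and guesses blindly among `n_options` choices on the rest."""
    chance = 1 / n_options
    return prop_known + (1 - prop_known) * chance


def implied_knowledge(observed, n_options=4):
    """Invert the model: proportion of items actually known, given an
    observed proportion correct. Floors at 0 for scores at or below chance."""
    chance = 1 / n_options
    return max(0.0, (observed - chance) / (1 - chance))


# A student who knows nothing and guesses on everything:
print(expected_score(0.0))                # 0.25

# Under this toy model, a 37% score implies far less than 37% known:
print(round(implied_knowledge(0.37), 2))  # 0.16
```

Under this all-or-nothing assumption, a score of 37% on four-choice items would correspond to really knowing only about 16% of the material, with the rest coming from lucky guesses. Real students can often rule out a wrong choice or two, so treat this as a back-of-the-envelope figure.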

When we computed the internal consistency reliability coefficient (Cronbach's alpha), which measures the degree to which items in a test correlate with one another, it was a measly .57. In case you are wondering, no, this is not good. It shows a relatively high degree of error variance. So, we were sad.
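If you are curious what goes into that coefficient, here is a minimal Python sketch of the standard formula: alpha = k / (k - 1) × (1 - sum of item variances / variance of total scores), where k is the number of items. The function name and the tiny data set are made up for illustration. They are not our Spirit Lake data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of student rows, each a list of item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    """
    k = len(scores[0])                       # number of items
    items = list(zip(*scores))               # transpose: one tuple per item
    item_var_sum = sum(variance(item) for item in items)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Made-up 0/1 responses: five students (rows) by four items (columns).
responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
print(round(cronbach_alpha(responses), 2))  # 0.8
```

In this toy example the items hang together neatly (a student who misses an easier item also misses the harder ones), so alpha comes out at .80. Random guessing scrambles that pattern, which pushes the item variances up relative to the total score variance and drags alpha down.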

SAS CODE FOR COMPUTING ALPHA

PROC CORR DATA = mydataset NOCORR ALPHA ;

VAR item1-item24 ;

RUN ;

The very simple code above will give you coefficient alpha as well as the descriptive statistics for each item. Since we very wisely scored our items 0 = wrong, 1 = right, a mean of, say, .22 would indicate that only 22% of students answered an item correctly.
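The same idea in a minimal Python sketch, for anyone without SAS handy: with 0/1 scoring, each item's mean is simply the proportion of students who got it right. The helper names, the made-up responses, and the 0.05 tolerance for "close to chance" are all illustrative assumptions.

```python
def item_difficulty(scores):
    """Mean of each 0/1 item column, i.e. the proportion of students correct."""
    n = len(scores)
    return [sum(col) / n for col in zip(*scores)]


def near_chance(p_values, chance=0.25, tolerance=0.05):
    """1-based numbers of items whose proportion correct is within
    `tolerance` of the chance level (an arbitrary cutoff for illustration)."""
    return [i for i, p in enumerate(p_values, start=1)
            if abs(p - chance) <= tolerance]


# Made-up 0/1 responses: four students (rows) by three items (columns).
responses = [
    [1, 0, 1],
    [1, 0, 0],
    [1, 1, 1],
    [0, 0, 0],
]
p = item_difficulty(responses)
print(p)               # [0.75, 0.25, 0.5]
print(near_chance(p))  # [2]
```

Here item 2 sits right at the 25% you would expect from blind guessing on a four-choice question, which is the kind of item that deserves a second look.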

To find out how we fixed this, read the next post.

To buy our games or donate one to a school, click here. Evaluated and developed based on actual data. How about that? Learn fractions, multiplication, statistics: take your pick!


One Comment

E-bone says:

November 23, 2016 at 4:23 pm

Ooooh- cliffhanger before Thanksgiving, no less!

I always thought my statistics professor was being a smartass when he would refer to multiple “choice” tests as multiple “guess”. Hmmmm
