Day 2: Start-up News – Boring, Important Measurement

Bar graph showing percentage correct by item grade level

It never ceases to amaze me that intelligent people will spend huge amounts of time doing a literature review, designing elaborate theories, generating elegant hypotheses, selecting a three-stage stratified random sample, performing multivariate analyses, and their measures on which this brilliant study rests are some questions they made up with their three best friends over Chardonnay during happy hour one Friday night. This is also known as the “panel of experts” method and it has the added benefit that it allows you to deduct the wine on your taxes. (Not actual tax advice. Consult your accountant. Of course, if you are doing your 1040 based on reading this blog, you are probably beyond help.)

We did not go with this approach. Our original idea was to use released items from the state standards test from North Dakota but, unfortunately, that is one of the states that never releases items. What we did was find standards that were the same, verbatim, as other states and then found items from those states that had been released. For example,

” Compute a given percent of a whole number”

and the problem would be

“What is 40% of 250?”

with the same four multiple choice options that had been used on the state test.

As someone pointed out, even if the same test had not been previously, since we pulled only the items that tested exactly what we included in the game, the individual items had been validated. So, we had content validity.

One bit of evidence for construct validity came from the item difficulty levels. Here is one of several charts. This shows what percentage of the fourth-grade students answered each item correctly. The items are broken down by grade level. It is also important to know that the state tests showed the majority of students at this school to be low-performing in mathematics. What we see is that as students go from second-grade level items, all of which the majority of the students answered correctly, to fifth-grade items, the percentage correct declines. We see that for the fifth-grade items, only one of them did the students exceed the 25% that would be answered correct by random guessing (remember, there were four multiple-choice options).

Since the state’s test have shown these students to be performing poorly, we should see that they generally are not at grade level, that is, they do not answer many of the fourth-grade items correctly at a rate exceeding chance. That, as you can see from the chart, is the exact situation.

Of course, we did more than this, beginning with replicating this identical chart with fifth-graders, who showed pretty much the same pattern but, as would be expected, answered a higher proportion correctly at each grade level than did the fourth-graders.

That’s the sort of thing that too many studies take for granted and never test. This isn’t the exciting part of creating a game, the part where you make an attack scene and the kid gets to shoot flaming arrows. So, what good does this do us? Well, the combination of the different analyses of the measure confirms that the measure we used for students to test whether or not their mathematics achievement increased is, in fact, a valid measure of mathematics achievement.

Also, this method has the advantage of not being required to share any of the wine with our best friend/ expert panel so we get to drink it all ourselves.

The Lies about Anchor Babies

ByAnnMaria De Mars April 26, 2011April 26, 2011

My father was born in New York City to two non-citizens who were in the U.S. for a few years, left and never returned. In his twenties, he returned to the U.S. and joined the military, I am pretty sure because it was the one thing he could think of that would most piss off…

20 Day Blogging | Software | statistics | Technology

Drinking and teaching statistics: Day 10 of the 20-day blogging challenge

ByAnnMaria De Mars February 13, 2014

There are multiple reasons that I haven’t gotten around to Day 10 of the 20-day blogging challenge. In part, because I have been really busy, and the other part is because I read this topic, “Share ideas that your classroom uses for brain breaks and/or indoor recess” and I thought I got nothin’ Anyone who…

Software | statistics | Technology

Mixed models with SAS Enterprise Guide – Not Really

ByAnnMaria De Mars February 13, 2013February 13, 2013

I was going to use SAS Enterprise Guide 4.3 with SAS On-Demand to do my mixed model analysis, but it did not quite work out. First of all, if like me you are used to doing PROC GLM where each subject is one record, you have to change your dataset to be one where each…

Dr. De Mars General Life Ramblings | statistics | The Julia Group

Failing Forward and regression lines

ByAnnMaria De Mars January 31, 2014

It’s been a really productive two weeks in North Dakota, installing our game in schools on two reservations, in tribal schools and public schools. I didn’t write this post to talk about that. Rather, in keeping with some of the really useful posts I’ve read about start-up failures, I wanted to share with you the…

computer games | Dr. De Mars General Life Ramblings | The Julia Group

Taking All of Your Children to Work Days: My take away from let’s move

ByAnnMaria De Mars February 27, 2014February 27, 2014

When I add up all of the ad revenue from this blog on top of the business it garners, in a good month it might average out to $30 an hour and in a not-so-good month maybe $10. Since my consulting rate is a heck of a lot more than $30 an hour you might…

Software | statistics | Technology

Know Thy Data: The Most Important Commandment in Statistics

ByAnnMaria De Mars January 7, 2016January 7, 2016

I was going to write about prevalence and incidence, and how so-called simple statistics can be much more important than people think and are vastly under-rated. It was going to be cool. Trust me. In the process, I ran across two things even more important or cooler (I know, hard to believe, right?) Here’s what…

One Comment

Similar Posts

One Comment

Leave a Reply