Free (as in beer) software for analyzing data with imputed values

ByAnnMaria De Mars July 1, 2011July 1, 2011

Any time you add another layer of complexity you better have a damn good reason.

I’m often skeptical of proponents of both Item Response Theory and multiple imputation procedures, not because either IRT or MI is a bad thing in itself but because its inclusion makes data analysis and reporting more complicated. At the National Center for Education Statistics (NCES) seminar this week, Dr. Emmanuel Sikali gave some damn good reasons why the National Assessment of Educational Progress (NAEP) makes use of both IRT and multiple imputation.

In very brief, he noted that it is only fair when conducting an assessment to measure what is taught in the curriculum. Your good old basic content validity. However, to do that for an area like mathematics or reading would take a very large number of questions. We not only want students to answer the test questions but we also want to get information like if they have a computer at home, how many minutes a night they spend on math homework and a few dozen similar questions. We can’t really expect schools to volunteer to have students take a four-hour long, 250 item test for each subject. We can’t expect students to be willing to spend that much time on testing without getting tired or sick of it. (The tests are done in grade 4 and grade 8. I can already hear the world’s most spoiled 13-year-old saying, “This stupid test sucks. I’m not doing it.”)

SO … how do you fairly assess the curriculum without giving students a zillion item test? You create say, four different tests and give each student 1/4 of a zillion items. Then, based on the answers students gave for the questions they did receive, you estimate the scores they would have gotten on the other items. Because students randomly get one version of a test, data really ARE missing at random. Of course, you don’t want to treat this data you imputed the same as the items the students actually answered because there is some uncertainty as to whether your estimate is correct. So, you do this imputation multiple times. Hence the name. You can read a pretty nice introduction to multiple imputation here by Joseph Schafer at the Penn State University Methodology Center. They provide a ton of useful information on their site. Between them being funded by DHHS and the Dept of Ed funding NCES, I have decided that I will not become a Republican this week because I have proof that the government DOES do some things right.

Okay, so we are convinced of the greater goodness of multiple imputation. Now what do we do with those plausible values? Also, I should throw in that students being sampled within schools, you need to account for the cluster in sampling. Oh, and it is not a simple random sample, you need to include student weights. You could use SAS. If you pay for the complex samples module, you can use SPSS.

The Department of Education funded development of AM Statistical Software (no, it was not named after me). You can download it for free and it is unbelievably simple to use. It is all pointing and clicking. As far as I know, it only runs on Windows. I used an earlier version that only imported SPSS datasets. The AM website says they now import SAS datasets also. It’s no problem if you have the older version. I just did the creation of factors, recoding, etc. that I wanted in SAS, then exported it as an SPSS file.

Importing your data is simple – FILE > Import Data > SPSS type then pick the file.
Analysis is also super simple. More on that tomorrow, though.

I’ve been gone for a week and the cat litter, guinea pig cage, frog tank , carpet and my clothes all need to be cleaned.

How do I write a statistical analysis paper: Step two

ByAnnMaria De Mars May 18, 2015May 18, 2015

In the last post, I posed the following null hypothesis as an example: There is no difference in obesity among Caucasians, African-Americans and Latinos. You can see the results from the statistical analyses here. Since my question only pertains to those three groups, let’s begin by creating a data set with just those subjects. libname…

Software | Technology

Coding Tools to Make Life Easier

ByAnnMaria De Mars July 2, 2013July 2, 2013

I was working on something for a client when The Invisible Developer walked into my office, looked over my shoulder at the code and said, “So, you’re a PHP programmer now?” I answered, “I’m a whatever-language-we-happen-to-need-at-the-moment programmer.” A year and a half ago, I took a look at Codecademy and was underwhelmed. It’s gotten mixed…

Dr. De Mars General Life Ramblings | Software | Technology

How to stop gaslighting women in tech

ByAnnMaria De Mars August 27, 2018August 27, 2018

According to that source of all knowledge on the interwebz, Wikipedia, “Gaslighting is a form of psychological manipulation that seeks to sow seeds of doubt in a targeted individual or in members of a targeted group, making them question their own memory, perception, and sanity.” Have you ever had a brilliant, super-competent friend who doubted her…

Software | Technology

Web Editor May Save SAS from Going the Way of COBOL

ByAnnMaria De Mars November 20, 2013

I am old. I remember punched cards, COBOL, dumb terminals and having to walk over to the computer center and load tapes on to the drive if I wanted to use large data sets – large back then meaning 100,000 records or more with a few hundred variables. We thought that was pretty big data….

statistics

What does everybody already know about categorical data?

ByAnnMaria De Mars October 3, 2011

I’m teaching a class on categorical data analysis after the Western Users of SAS Software conference next week. As always, I have WAY more information than I can cover. Handouts are limited to 40 pages so I sent the organizers 80 slides but I know I am going to cover way more than that. Why…

computer games | Software | Technology

The Secret Life of Evaluators, with SAS

ByAnnMaria De Mars July 20, 2016

At the Western Users of SAS Software conference (yes, they DO know that is WUSS), I’ll be speaking about using SAS for evaluation. “If the results bear any relationship at all to reality, it is indeed a fortunate coincidence.” I first read that in a review of research on expectancy effects, but I think it…

3 Comments

disgruntledphd says:

July 2, 2011 at 6:24 am

While I would agree that unnecessarily complicating a model is never a good thing, IRT does have a lot of advantages over other methods of analysing scales used in psychology.

In my experience, IRT is far more likely to fail to fit the data, whereas you can get a factor analysis to fit almost anything. This is a very good thing if you’re trying to confirm or reject a theory or a measure.
AnnMaria says:

July 4, 2011 at 1:46 am

I agree with you. My objection isn’t to IRT, or multiple imputation, for that matter, but rather the mis-use on one hand, and, unnecessary complexity on the other.

I’ve had people argue for collecting small sample size because “with Item Response Theory we don’t really need to collect much data”.

As I said, though, in this particular case, I think their design and analysis were well thought-out and appropriate. Not something you see nearly often enough.
Pingback: Choosing the Right Propensity Score Method: A statistics fable : AnnMaria’s Blog

Similar Posts

3 Comments

Leave a Reply