Plotting Agreement with Kappa Plots from PROC FREQ

ByAnnMaria De Mars August 10, 2015August 10, 2015

In assessing whether our Fish Lake game really works to teach fractions, we collect a lot of data, including a pretest and a post-test. We also use a lot of types of items, including a couple of essay questions. Being reasonable people, we are interested in the extent to which the ratings on these items agree.

To measure agreement between two raters, we use Kappa’s coefficient. PROC FREQ produces two types of Kappa coefficients. The Kappa coefficient ranges from -1 to 1, with 1 indicating perfect agreement, 1 indicating exactly the agreement that would be expected by chance and negative numbers indicating less agreement than would be expected by chance . When there are only two categories, PROC FREQ produces only the Kappa coefficient. When more than two categories are rated, a weighted Kappa is also produced which credits categories closer together as partial agreement and categories at the extreme ends as no agreement.

The code is really simple:

ODS GRAPHICS ON; PROC FREQ DATA =datasetname ; TABLES variable1*variable2 / PLOTS = KAPPAPLOT; TEST AGREE ;

Including the ODS GRAPHICS ON statement and the PLOTS = KAPPAPLOT option in your TABLES statement will give you a plot of both the agreement and distribution of ratings. Personally, I find the kappa plots, like the example below, to be pretty helpful.

This visual representation of the agreement shows that there was a large amount of exact agreement (dark blue shading) for incorrect answers, scored 0, with a small percentage partial agreement and very few with no agreement. With 3 categories, only exact agreement or partial agreement is possible for the middle category. Two other take-away points from this plot are that agreement is lower for correct and partially correct answers than incorrect ones and that the distribution is skewed, with a large proportion of answers scored incorrect. Because it is adjusted for chance agreement, Kappa is affected by the distribution among categories . If each rater scores 90% of the answers correct, there should be 81% agreement by chance, thus requiring an extremely high level of agreement to be significantly different from chance. The Kappa plot shows agreement and distribution simultaneously, which is why I like it.

———

Want to play the game ? You can download it here, as well as our game for younger players, Spirit Lake.

Software | statistics | Technology

PROC FREQ (and a LAG) for data validity

ByAnnMaria De Mars July 11, 2015July 11, 2015

I’m in the middle of data preparation on a research project on games to teach fractions. This is the part of a data analysis project that takes up 80% of the time. Fortunately, PROC FREQ from SAS can simplify things. 1. How many unique records ? There are multiple quizzes in the game, and you…

Dr. De Mars General Life Ramblings | Open data | statistics

Why I Don’t Have Minions

ByAnnMaria De Mars September 20, 2011September 20, 2011

Admit it, more than once you have thought to yourself, Wouldn’t it be convenient about now to have some mindless minions to do my bidding? I’d always thought if this whole statistical consulting thing didn’t work out, I could be an evil scientist. I mean, I already went to the trouble to get a Ph.D….

Software | statistics | Technology

SAS On-Demand making statistics professors’ lives better, since, oh, Tuesday

ByAnnMaria De Mars January 11, 2012January 13, 2012

The downward mobility of Ph.D. students is like domestic violence in many ways. That is, the people in the “family” all know it is a fact of life, but they don’t talk about it among themselves, and to outsiders they pretend it doesn’t exist. The fact is, far more people will graduate from Harvard than…

Dr. De Mars General Life Ramblings | statistics

30 Things I Learned in 30 Years as a Statistical Consultant – Part 1 of lots

Byannmaria September 16, 2019September 16, 2019

Never fear, I’m not going to post all 30 things in this post. This is a series. A LONG series. Get excited. I was invited to speak at SAS Global Forum next year and it occurred to me after thinking about it for 14.2 seconds that there are plenty of people at SAS and elsewhere…

Software | Technology

Watch me work: Finishing the test scoring with more SAS character functions

ByAnnMaria De Mars February 9, 2016

Recall that in the last post we were using SAS functions to score a test that had been completed by middle school and upper elementary students. Since we wanted to make it as easy as possible for students to enter their answers, we accepted just about any format. Picking up where we left off ……

Software | statistics

Failing Forward: My Excellent Adventure with Microdata Continues

ByAnnMaria De Mars March 22, 2011

I was very thrilled to be invited to speak to six classes of seventh- and eighth-grade students at an urban school. Actually, they wanted me to speak to seven classes but there is no way on earth I am getting up at 6:30 a.m. or whatever ungodly hour would be required for me to make…

One Comment

Pingback: SAS Global Forum Random Post 1: Statistics : AnnMaria's Blog

Similar Posts

One Comment

Leave a Reply