Computing Kappa is a Piece of Cake

ByAnnMaria De Mars February 8, 2015February 8, 2015

Kappa is a useful measure of agreement between two raters. Say you have two radiologists looking at X-rays, rating them as normal or abnormal and you want to get a quantitative measure of how well they agree. Kappa is your go-to coefficient.

How do you compute it? Well, personally, I use SAS because this is the year 2015 and we have computers.

Let’s take this table, where 100 X rays were rated by two different raters as an example:

Rating by Physician 1

————-Abnormal | Normal

Physician 2
————————————–

Abnormal 40 20

Normal 10 30

So ….. the first physician rated 60 X-rays as Abnormal. Of those 60, the second physician rated 40 abnormal and 20 normal, and so on.
If you received the data as a SAS data set like this, with an abnormal rating = 1 and normal = 0, then life is easy and you can just do the PROC FREQ.

Rater1 Rater2

1 1
1 1

and so for 50 lines.

However, I very often get not an actual data set but a table like the one above. In this case, it is still relatively simple to code

DATA compk ;

INPUT rater1 rater2 nums ;

DATALINES ;

1 1 40
1 0 20
0 1 10
0 0 30
;

So, there were 40 x-rays coded as abnormal by both rater1 and rater2. When rater1 = 1 (abnormal) and rater2 = 0 (normal), there were 20, and so on.

The next part is easy

PROC FREQ DATA = compk ;

TABLES rater1*rater2/ AGREE ;

WEIGHT nums ;

That’s it. The WEIGHT statement is necessary in this case because I did not have 100 individual records, I just had a table, so the WEIGHT variable gives the number in each category.

This will work fine for a 2 x 2 table. If you have a table that is more than 2 x 2, at the end, you can add the statement

TEST WTKAP ;

This will give you the weighted Kappa coefficient. If you include this with a 2 x2 table nothing happens because the weighted kappa coefficient and the simple Kappa coefficient are the same in this case.

See, I told you it was simple.

Software

Fixing data the easy way, part 2

ByAnnMaria De Mars July 18, 2011

I have learned not to be too smart for my own good. Yesterday was an example. My client provides many different types of services to the consumers who use their program. There are about 15 different options, from counseling to on-the-job training to assistive technology. We want to get the total number of services each…

Software | statistics

There is no substitute for real data

ByAnnMaria De Mars August 6, 2014

The second time I taught statistics, I supplemented the textbook with assignments using real data, and I have been doing it in the twenty-eight years since. The benefits seem so obvious to me that it’s hard to believe that everyone doesn’t do the same. The only explanation I can imagine is that they are not…

Software | Technology

SAS On-Demand Needs You to be Reasonable

ByAnnMaria De Mars July 24, 2012July 24, 2012

Tried reading in a file with 360,000+ records times 279 variables with SAS On-Demand using SAS Enterprise Guide. It was on one of my office computers that has a pretty slow Internet connection. I was using a different computer at the time so I just let it run. After 29 minutes, I gave up, did…

Software | Technology

SAS Tip: Preventing Disaster When Variable Lengths Differ

ByAnnMaria De Mars November 2, 2015November 2, 2015

Over the weekend, I wrote a post showing how SAS can be used to make what appears to be a complex problem quite simple. First of all, am I just being dramatic? Seriously, how can having your variable lengths differ be a disaster? Simple. You are merging by a variable that is a unique user…

Dr. De Mars General Life Ramblings | Software

Statistical Software – The Secret Documents, Part 1

ByAnnMaria De Mars April 1, 2009April 1, 2009

All right, well maybe they are not that secret, but there are some really great resources out there that you may want to check out. If you didn’t get to the SPSS Higher Education Road Show at UCLA because rush hour in Los Angeles is from 7-9, 4-6 and any time you are on the…

Software | statistics

Statistics Guru Predicts Republican Sweep! With Proc GMAP

ByAnnMaria De Mars April 2, 2016April 2, 2016

Esteemed statistics guru, Dr. Nathaniel Golden has some sobering news for Democrats. His latest models predict a Republican blow out. As can be seen by the map below, the Republican front-runner has tapped into the mood of resentment in the country’s non-elites. When the dust has settled, only the two highest earning states in the…

Similar Posts

Leave a Reply