SAS Proc Transpose – how have I not written about this before?

When I was young and knew everything, I would frequently see procedures or statistics and think, “When am I ever going to use THAT?” That was my thought when I learned about this new procedure to transpose a data set. (It was new then. Keep in mind, I learned SAS when I was pregnant with my first child. She is now CEO of an educational game company and the mother of three children.)

PROC TRANSPOSE is super-useful. You might think it is only useful for restructuring data that was set up for PROC GLM so that you can use it with PROC MIXED, or you might have no idea what the hell that means, and it is still super-useful.

Let me give you today’s example. I’m looking for data to use in a biostatistics class I’m teaching next month. It’s a small data set, with data on eight states included in the Centers for Disease Control and Prevention’s Autism and Developmental Disabilities Monitoring Network.

The data looks like this:

As you can see, each state is a column. I would like to know, for example, what percentage of people with autism also have a physical disability. There is a way to do it by finding the mean across variables but I want to use this data set for a few examples and it would be much easier for me if each of those categories was a variable.
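For comparison, the mean-across-variables route might look something like the sketch below. The names state1-state8 are placeholders I am assuming for the eight state columns, so swap in whatever your columns are actually called. The output data set name is also made up.

```sas
/* Sketch only: state1-state8 are placeholder names for the eight state columns */
DATA work.autism_rowmeans;
  SET mydata.autism;
  /* Mean across the eight states for the category on this row */
  row_mean = MEAN(OF state1-state8);
RUN;

PROC PRINT DATA=work.autism_rowmeans;
  VAR eligibility row_mean;
RUN;
```

That works for a single statistic, but every new question means another line of DATA step code, which is why restructuring the data is the easier path here.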

The code is super simple:

PROC TRANSPOSE DATA=mydata.autism OUT=mydata.autism2 NAME=state;
ID eligibility ;
RUN;

Neither the NAME= option nor the ID statement is required, but they will make your life easier. First, let’s take a look at our new data.

Now, instead of each state being a variable, we have one record for each state. The percent with an autism diagnosis only is one variable, the percent with emotional disturbance is another, and so on. What the NAME= option does is give a name to the new variable that holds the names of the original columns. If you don’t use that option, the first column would be named _NAME_. Now, with these data it would still be pretty obvious that this variable is the state, but in some cases it wouldn’t be obvious at all.

The ID statement really matters in this case because otherwise the new columns are going to be named “COL1”, “COL2”, etc. Personally, I found the ID statement here confusing because I normally think of an ID as the individual identifier for each record, like a social security number or student ID. In this case, the values of the variable you give in the ID statement are used to name the new variables. So, as you can see above, the first column is named Autism(%), the second is named Emotional Disturbance (%) and so on.
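If you want to see the difference for yourself, here is a small sketch that runs the transpose both ways. The data set, the state names and the numbers are all made up for illustration. They are NOT the CDC figures.

```sas
/* Toy data: invented values, NOT the CDC figures */
DATA work.toy;
  LENGTH eligibility $ 12;
  INPUT eligibility $ StateA StateB StateC;
  DATALINES;
Autism 40 35 50
Emotional 10 12 8
Physical 5 7 6
;
RUN;

/* No NAME= option and no ID statement:
   the output columns are _NAME_, COL1, COL2, COL3 */
PROC TRANSPOSE DATA=work.toy OUT=work.toy_default;
RUN;

/* NAME= names the column holding the old variable names;
   ID uses the values of eligibility to name the new variables */
PROC TRANSPOSE DATA=work.toy OUT=work.toy_named NAME=state;
  ID eligibility;
RUN;

PROC PRINT DATA=work.toy_default; RUN;
PROC PRINT DATA=work.toy_named; RUN;
```

The second data set has one row per state, with columns named state, Autism, Emotional and Physical, which is a lot easier to read than COL1 through COL3.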

So, that’s it. All I need to do to get means, standard deviation, minimum and maximum is:

PROC MEANS DATA=mydata.autism2;
RUN;
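By default, PROC MEANS reports N, mean, standard deviation, minimum and maximum for every numeric variable. If you would rather spell that out, a sketch like the following should give the same statistics. The MAXDEC=1 setting is just my choice for rounding the printed output.

```sas
/* Same statistics as the default, requested explicitly */
PROC MEANS DATA=mydata.autism2 N MEAN STD MIN MAX MAXDEC=1;
  VAR _NUMERIC_;  /* all numeric variables, i.e. each transposed category */
RUN;
```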

So, that’s it.

By the way, I get this data set and a few others from SAS Curriculum Pathways. Nice source for small data sets to start off a course.

I live in opposite world, where my day job is making games and I teach statistics and write about programming for fun. You can check out our games here. You’re probably already pretty good with division but you’ll learn about the Lakota language and culture with Making Camp Lakota, a bilingual (English-Lakota) game that teaches math.
